{"id":261,"date":"2019-10-08T23:42:41","date_gmt":"2019-10-08T23:42:41","guid":{"rendered":"http:\/\/nanocad.ee.ucla.edu\/?page_id=261"},"modified":"2025-10-22T17:57:56","modified_gmt":"2025-10-22T17:57:56","slug":"waferscale-integration","status":"publish","type":"page","link":"https:\/\/nanocad.ee.ucla.edu\/?page_id=261","title":{"rendered":"Waferscale Integration"},"content":{"rendered":"\n<p><strong>Students: Saptadeep Pal<\/strong><\/p>\n\n\n\n<p>With the emergence of new applications, new business models, and new data processing techniques (e.g., deep learning), the need for parallel hardware<br>has never been stronger. Unfortunately, there is an equally strong countervailing trend. Parallel hardware necessitates low overhead communication between different computing nodes. However, the overhead of communication has been increasing at an alarming pace and is expected to be worse in future as communication energy, latency, and bandwidth scale much worse than computation. As a result, increasing communication overhead is already threatening computer system scaling.<\/p>\n\n\n\n<p> <\/p>\n\n\n\n<div class=\"wp-block-media-text alignwide has-media-on-the-right is-stacked-on-mobile\" style=\"grid-template-columns:auto 48%\"><figure class=\"wp-block-media-text__media\"><img loading=\"lazy\" decoding=\"async\" width=\"1662\" height=\"676\" src=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/uploads\/2020\/06\/image-2.png\" alt=\"\" class=\"wp-image-637\"\/><\/figure><div class=\"wp-block-media-text__content\">\n<p>One approach to dramatically reduce communication overheads is waferscale (WS) processing, where a processor chip is of the size of an entire 300mm wafer. However, WS processors have been historically deemed impractical due to yield issues of building a large monolithic WS chip. Most yield loss in chipmaking comes from defects in the transistor layers or in the high-density lower metal layers.<\/p>\n\n\n\n<p>We take a different route towards realizing WS processors using the <a href=\"https:\/\/chips.ucla.edu\/research\/project\/4\">Silicon-Interconnection Fabric (Si-IF)<\/a> technology. Here smaller pre-manufactured dies are directly bonded onto a silicon wafer, enables one to build a WS system without the corresponding yield issues. The WS substrate only contains relatively low-density metal layers on the wafer, roughly the same density as the upper layers of a&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/System_on_a_chip\">system on a chip<\/a>, and is only used for high-density inter-die interconnections.<\/p>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-media-text alignwide is-stacked-on-mobile\" style=\"grid-template-columns:48% auto\"><figure class=\"wp-block-media-text__media\"><img loading=\"lazy\" decoding=\"async\" width=\"1170\" height=\"636\" src=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/uploads\/2020\/06\/image-1.png\" alt=\"\" class=\"wp-image-636\"\/><\/figure><div class=\"wp-block-media-text__content\">\n<p>Related to Si-IF based WS-Processors, we specifically study and explore computer architecture solutions for massively parallel WS processors. We also develop system design solutions for dielet assembly on high performance interconnect substrates such as Si-IF, interposers etc. Our recent proposal of a waferscale-GPU can achieve up to 18.9x speedup and 143x EDP benefit compared against equivalent MCM-GPU based implementation on PCB. We are currently working on a massively parallel general purpose waferscale processor prototype built using ARM processors.<\/p>\n<\/div><\/div>\n\n\n\n<p class=\"has-text-align-center\">Collaborators: <a href=\"https:\/\/chips.ucla.edu\/\">CHIPS-UCLA<\/a>, <a href=\"http:\/\/passat.crhc.illinois.edu\/\">Rakesh Kumar (UIUC)<\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Publications<\/h2>\n\n\n\n<p><ul class=\"papercite_bibliography\">          [1]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/ip19.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           M. Niemier, Z. Enciso, M. M. Sharifi, X. S. Hu, I. O&#8217;Conner, A. Graening, R. Sharma, P. Gupta, J. Castrillon, J. P. C. Lima, N. Afroze, A. Khan, and J. Ryckaert, &#8220;Smoothing Disruption Across the Stack: Tales of Memory, Heterogeneity, &#038; Compilers,&#8221; in <span style=\"font-style: italic\">IEEE\/ACM Design, Automation and Test in Europe<\/span>,  2024, p. 10  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_8\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_8_block\"><pre><code class=\"tex bibtex\">@inproceedings{IP19,\nauthor = {Niemier, Michael and Enciso, Zephan and Sharifi, Mohammed Mehdi and Hu, Xiaobo Sharon and O'Conner, Ian and Graening, Alexander and Sharma, Ravit and Gupta, Puneet and Castrillon, Jeronimo and Lima, J.P.C. and Afroze, Nashrah and Khan, Asif and Ryckaert, Julien},\nbooktitle = {{IEEE\/ACM Design, Automation and Test in Europe}},\ndoi = {},\nkeywords = {chiplets,wsi},\nmonth = {March},\nnote = {},\npages = {10},\ntitle = {{Smoothing Disruption Across the Stack: Tales of Memory, Heterogeneity, & Compilers}},\nyear = {2024}\n}<\/code><\/pre><\/div>   <\/br>          [2]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c124.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           A. Graening, S. Pal, and P. Gupta, &#8220;Chiplets: How Small is too Small?,&#8221; in <span style=\"font-style: italic\">Proc. ACM\/IEEE Design Automation Conference (DAC)<\/span>,  2023, p. 6  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_5\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_5_block\"><pre><code class=\"tex bibtex\">@inproceedings{C124,\nauthor = {Graening, Alexander and Pal, Saptadeep and Gupta, Puneet},\nbooktitle = {{Proc. ACM\/IEEE Design Automation Conference (DAC)}},\ndoi = {},\nkeywords = {chiplets,wsi},\nmonth = {July},\nnote = {},\npages = {6},\ntitle = {{Chiplets: How Small is too Small?}},\nyear = {2023}\n}<\/code><\/pre><\/div>   <\/br>          [3]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c116.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Pal, J. Liu, I. Alam, N. Cebry, H. Suhail, S. Bu, S. S. Iyer, S. Pamarti, R. Kumar, and P. Gupta, &#8220;Designing a 2048-Chiplet, 14336-Core Waferscale Processor,&#8221; in <span style=\"font-style: italic\">Proc. ACM\/IEEE Design Automation Conference (DAC)<\/span>,  2021  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_4\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_4_block\"><pre><code class=\"tex bibtex\">@inproceedings{C116,\nauthor = {Pal, Saptadeep and Liu, Jingyang and Alam, Irina and Cebry, Nicholas and Suhail, Haris and Bu, Shi and Iyer, Subramanian S. and Pamarti, Sudhakar and Kumar, Rakesh and Gupta, Puneet},\nbooktitle = {{Proc. ACM\/IEEE Design Automation Conference (DAC)}},\ndoi = {},\nkeywords = {wsi},\nmonth = {December},\nnote = {},\npages = {},\ntitle = {{Designing a 2048-Chiplet, 14336-Core Waferscale Processor}},\nyear = {2021}\n}<\/code><\/pre><\/div>   <\/br>          [4]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c114.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Pal, K. Sahoo, I. Alam, H. Suhail, R. Kumar, S. Pamarti, P. Gupta, and S. S. Iyer, &#8220;I\/O Architecture, Substrate Design, and Bonding Process for a Heterogeneous Dielet-Assembly based Waferscale Processor,&#8221; in <span style=\"font-style: italic\">IEEE Electronic Components and Technology Conference (ECTC)<\/span>,  2021  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_3\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_3_block\"><pre><code class=\"tex bibtex\">@inproceedings{C114,\nauthor = {Pal, Saptadeep and Sahoo, Krutikesh and Alam, Irina and Suhail, Haris and Kumar, Rakesh and Pamarti, Sudhakar and Gupta, Puneet and Iyer, Subramanian S.},\nbooktitle = {{IEEE Electronic Components and Technology Conference (ECTC)}},\ndoi = {},\nkeywords = {wsi},\nmonth = {June},\nnote = {},\npages = {},\ntitle = {{I\/O Architecture, Substrate Design, and Bonding Process for a Heterogeneous Dielet-Assembly based Waferscale Processor}},\nyear = {2021}\n}<\/code><\/pre><\/div>   <\/br>          [5]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c111.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Pal and P. Gupta, &#8220;Pathfinding for 2.5D Interconnect Technologies,&#8221; in <span style=\"font-style: italic\">System-Level Interconnect &#8211; Problems and Pathfinding Workshop<\/span>, New York, NY, USA,  2020  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_2\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_2_block\"><pre><code class=\"tex bibtex\">@inproceedings{C111,\naddress = {New York, NY, USA},\nauthor = {Pal, Saptadeep and Gupta, Puneet},\nbooktitle = {{System-Level Interconnect - Problems and Pathfinding Workshop}},\nkeywords = {wsi},\nlocation = {San Diego, California},\nmonth = {November},\nnumpages = {8},\npublisher = {ACM},\nseries = {SLIP '20},\ntitle = {{Pathfinding for 2.5D Interconnect Technologies}},\nyear = {2020}\n}<\/code><\/pre><\/div>   <\/br>          [6]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/j66.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>   <a href='http:\/\/dx.doi.org\/https:\/\/ieeexplore.ieee.org\/document\/8998304' class='papercite_doi' title='View document on publisher site'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/external.png' width='10' height='10' alt='[DOI]' \/><\/a>        S. Pal, D. Petrisko, R. Kumar, and P. Gupta, &#8220;Design Space Exploration for Chiplet Assembly Based Processors,&#8221; <span style=\"font-style: italic\">IEEE Transactions on Very Large Scale Integration Systems<\/span>, 2020.  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_9\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_9_block\"><pre><code class=\"tex bibtex\">@article{J66,\nauthor = {Pal, Saptadeep and Petrisko, Daniel and Kumar, Rakesh and Gupta, Puneet},\ndoi = {https:\/\/ieeexplore.ieee.org\/document\/8998304},\njournal = {{IEEE Transactions on Very Large Scale Integration Systems}},\nkeywords = {{2.5-D integration, chiplet assembly, micro-architectural design space exploration (DSE), multichiplet optimization, wsi}},\npublisher = {{IEEE}},\ntitle = {{Design Space Exploration for Chiplet Assembly Based Processors}},\nyear = {2020}\n}<\/code><\/pre><\/div>   <\/br>          [7]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c107.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>   <a href='http:\/\/dx.doi.org\/10.1109\/HPCA.2019.00042' class='papercite_doi' title='View document on publisher site'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/external.png' width='10' height='10' alt='[DOI]' \/><\/a>        S. Pal, D. Petrisko, M. Tomei, S. S. Iyer, P. Gupta, and R. Kumar, &#8220;Architecting Waferscale Processors &#8211; A GPU Case Study,&#8221; in <span style=\"font-style: italic\">IEEE International Symposium on High-Performance Computer Architecture (HPCA)<\/span>,  2019, pp. 250-263  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_1\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_1_block\"><pre><code class=\"tex bibtex\">@inproceedings{C107,\nauthor = {Pal, Saptadeep and Petrisko, Daniel and Tomei, Matthew and Iyer, Subramanian S. and Gupta, Puneet and Kumar, Rakesh},\nbooktitle = {{IEEE International Symposium on High-Performance Computer Architecture (HPCA)}},\ndoi = {10.1109\/HPCA.2019.00042},\nissn = {1530-0897},\nkeywords = {wsi},\nmonth = {February},\npages = {250-263},\npaperurl = {http:\/\/nanocad.ee.ucla.edu\/pub\/Main\/Publications\/C106_paper.pdf},\ntitle = {{Architecting Waferscale Processors - A GPU Case Study}},\nyear = {2019}\n}<\/code><\/pre><\/div>   <\/br>          [8]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c100.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Pal, D. Petrisko, A. Bajwa, S. S. Iyer, R. Kumar, and P. Gupta, &#8220;A Case for Packageless Processors,&#8221; in <span style=\"font-style: italic\">IEEE International Symposium on High-Performance Computer Architecture (HPCA)<\/span>,  2018  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_0\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_0_block\"><pre><code class=\"tex bibtex\">@inproceedings{C100,\nauthor = {Pal, Saptadeep and Petrisko, Daniel and Bajwa, Adeel and Iyer, Subramanian S. and Kumar, Rakesh and Gupta, Puneet},\nbooktitle = {{IEEE International Symposium on High-Performance Computer Architecture (HPCA)}},\nkeywords = {wsi},\nmonth = {February},\npaperurl = {http:\/\/nanocad.ee.ucla.edu\/pub\/Main\/Publications\/C100_paper.pdf},\ntitle = {{A Case for Packageless Processors}},\nyear = {2018}\n}<\/code><\/pre><\/div>   <\/br>          [9]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/c96.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Jangam, S. Pal, A. Bajwa, S. Pamarti, P. Gupta, and S. S. Iyer, &#8220;Latency, Bandwidth and Power Benefits of the SuperCHIPS Integration Scheme,&#8221; in <span style=\"font-style: italic\">IEEE Electronic Components and Technology Conference (ECTC)<\/span>,  2017  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_6\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_6_block\"><pre><code class=\"tex bibtex\">@inproceedings{C96,\nauthor = {Jangam, SivaChandra and Pal, Saptadeep and Bajwa, Adeel and Pamarti, Sudhakar and Gupta, Puneet and Iyer, Subramanian S.},\nbooktitle = {{IEEE Electronic Components and Technology Conference (ECTC)}},\nkeywords = {Silicon Interconnect Fabric; Thermal Compression Bonding; Fine Pitch Interconnect, SuperCHIPS, wsi},\nmonth = {May},\npaperurl = {http:\/\/nanocad.ee.ucla.edu\/pub\/Main\/Publications\/C96_paper.pdf},\nslideurl = {http:\/\/nanocad.ee.ucla.edu\/pub\/Main\/Publications\/C96_slides.pdf},\ntitle = {{L}atency, {B}andwidth and {P}ower {B}enefits of the {S}uper{CHIPS} {I}ntegration {S}cheme},\nyear = {2017}\n}<\/code><\/pre><\/div>   <\/br>          [10]         <a href=\"https:\/\/nanocad.ee.ucla.edu\/wp-content\/papercite-data\/pdf\/ip17.pdf\" title='Download PDF' class='papercite_pdf'><img src='https:\/\/nanocad.ee.ucla.edu\/wp-content\/plugins\/papercite\/img\/pdf.png' alt=\"[PDF]\"\/><\/a>           S. Pal, S. S. Iyer, and P. Gupta, &#8220;Advanced packaging and heterogeneous integration to reboot computing,&#8221; in <span style=\"font-style: italic\">IEEE International Conference on Rebooting Computing<\/span>,  2017  <br\/>    <a href=\"javascript:void(0)\" id=\"papercite_7\" class=\"papercite_toggle\">[Bibtex]<\/a>    <div class=\"papercite_bibtex\" id=\"papercite_7_block\"><pre><code class=\"tex bibtex\">@inproceedings{IP17,\nauthor = {Pal, Saptadeep and Iyer, Subramanian S. and Gupta, Puneet},\nbooktitle = {{IEEE International Conference on Rebooting Computing}},\nkeywords = {wsi},\npaperurl = {http:\/\/nanocad.ee.ucla.edu\/pub\/Main\/Publications\/IP17_paper.pdf},\ntitle = {Advanced Packaging and Heterogeneous Integration to Reboot Computing},\nyear = {2017}\n}<\/code><\/pre><\/div>   <\/br>           <\/ul><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Students: Saptadeep Pal With the emergence of new applications, new business models, and new data processing techniques (e.g., deep learning), the need for parallel hardwarehas&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":160,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-261","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/pages\/261","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=261"}],"version-history":[{"count":4,"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/pages\/261\/revisions"}],"predecessor-version":[{"id":850,"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/pages\/261\/revisions\/850"}],"up":[{"embeddable":true,"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=\/wp\/v2\/pages\/160"}],"wp:attachment":[{"href":"https:\/\/nanocad.ee.ucla.edu\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=261"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}