{"id":25487,"date":"2024-12-03T13:26:42","date_gmt":"2024-12-03T07:41:42","guid":{"rendered":"https:\/\/www.revoscience.com\/en\/?p=25487"},"modified":"2024-12-03T13:49:55","modified_gmt":"2024-12-03T08:04:55","slug":"photonic-processor-could-enable-ultrafast-ai-computations-with-extreme-energy-efficiency","status":"publish","type":"post","link":"https:\/\/www.revoscience.com\/en\/photonic-processor-could-enable-ultrafast-ai-computations-with-extreme-energy-efficiency\/","title":{"rendered":"Photonic processor could enable ultrafast AI computations with extreme energy efficiency"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><em><strong>This new device uses light to perform the key operations of a deep neural network on a chip, opening the door to high-speed processors that can learn in real-time.\u00a0<\/strong><\/em><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"675\" height=\"450\" src=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-675x450.jpg\" alt=\"\" class=\"wp-image-25488\" style=\"width:822px;height:auto\" title=\"\" srcset=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-675x450.jpg 675w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-600x400.jpg 600w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-768x512.jpg 768w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg 900w\" sizes=\"auto, (max-width: 675px) 100vw, 675px\" \/><\/figure>\n\n\n<div class=\"wp-block-post-author\"><div class=\"wp-block-post-author__content\"><p class=\"wp-block-post-author__name\">Adam Zewe<\/p><\/div><\/div>\n\n\n<p class=\"wp-block-paragraph\">CAMBRIDGE, Mass. 
&#8212;&nbsp;The deep neural network models that power today\u2019s most demanding machine-learning applications have grown so large and complex that they are pushing the limits of traditional electronic computing hardware.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Photonic hardware, which can perform machine-learning computations with light, offers a faster and more energy-efficient alternative. However, there are some types of neural network computations that a photonic device can\u2019t perform, requiring the use of off-chip electronics or other techniques that hamper speed and efficiency.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Building on a decade of research, scientists from MIT and elsewhere have developed a new photonic chip that overcomes these roadblocks. They demonstrated a fully integrated photonic processor that can perform all the key computations of a deep neural network optically on the chip.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The optical device was able to complete\u00a0the key computations for a\u00a0machine-learning classification task in less than half a nanosecond while achieving more than 92 percent accuracy\u2014performance that is on par with traditional hardware.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The chip, composed of interconnected modules that form an optical neural network, is fabricated using commercial foundry processes, which could enable the scaling of the technology and its integration into electronics.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the long run, the photonic processor could lead to faster and more energy-efficient deep learning for computationally demanding applications like lidar, scientific research in astronomy and particle physics, or high-speed telecommunications.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThere are a lot of cases where how well the model performs isn\u2019t the only thing that matters, but also how fast you can get an answer. 
Now that we have an end-to-end system that can run a neural network in optics, at a nanosecond time scale, we can start thinking at a higher level about applications and algorithms,\u201d says Saumil Bandyopadhyay \u201917, MEng \u201918, PhD \u201923, a visiting scientist in the Quantum Photonics and AI Group within the Research Laboratory of Electronics (RLE) and a postdoc at NTT Research, Inc., who is the lead author of a paper on the new chip.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Bandyopadhyay is joined on the paper by Alexander Sludds \u201918, MEng \u201919, PhD \u201923, Nicholas Harris PhD \u201917, and Darius Bunandar PhD \u201919; Stefan Krastanov, a former RLE research scientist who is now an assistant professor at the University of Massachusetts at Amherst; Ryan Hamerly, a visiting scientist at RLE and senior scientist at NTT Research; Matthew Streshinsky, a former silicon photonics lead at Nokia who is now co-founder and CEO of Enosemi; Michael Hochberg, president of Periplous, LLC; and senior author Dirk Englund, a professor in the Department of Electrical Engineering and Computer Science, principal investigator of the Quantum Photonics and Artificial Intelligence Group and of RLE. 
The research appears in&nbsp;<a href=\"https:\/\/link.mediaoutreach.meltwater.com\/ls\/click?upn=u001.aGL2w8mpmadAd46sBDLfbHIsRYeR84h7Gvm-2BeIBvl911jH7rNM5jY0j1d6TfSD2ndOP6RJLUOoKnyecEjb6aGw-3D-3DWWp4_Gmh-2FjktplCfWo1o-2BFbkY3J9eYBJUJc-2BSUmMkHo42Dqe4Z0qTEKCmSFnQfWCe8-2B8jgXgQQcW-2Fb1rLKfKZRu-2BLLGScwMYc-2FOCX9RDmpXEBR4BY9i7y-2BNgpMuREG7n76alZeZgfTc6MhYXKfTvfscI8w2BSlml2nNr8dnt4n9PibWx-2BT3A8QPo6JBZuHutkIUWJRDqofu-2FZCdTCD53UgxE14gKZJlEnMLchAIi3314gTqeDA65bICQxA1obAfhjq98n0-2FkOoFOHSgPTrPhSaXiwtLehd-2BjQrnOOqE2b57Rl2Izm7ld3Mhpn2UmMa9zEEr-2FpeacEAJuG0wQRED12pQnE2sYnBEPABl958reDH8Z26an8SO0wYEqciZnnGrZzPCO8z6i1ZglH3NtggtTyqhwtkQ-3D-3D\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Nature Photonics.<\/em><\/a><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Machine learning with light<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Deep neural networks are composed of many interconnected layers of nodes, or neurons, that operate on input data to produce an output. One key operation in a deep neural network involves the use of linear algebra to perform matrix multiplication, which transforms data as it is passed from layer to layer.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">But in addition to these linear operations, deep neural networks perform nonlinear operations that help the model learn more intricate patterns. 
Nonlinear operations, like activation functions, give deep neural networks the power to solve complex problems.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In 2017, Englund\u2019s group, along with researchers in the lab of Marin Solja\u010di\u0107, the Cecil and Ida Green Professor of Physics,&nbsp;<a href=\"https:\/\/link.mediaoutreach.meltwater.com\/ls\/click?upn=u001.aGL2w8mpmadAd46sBDLfbLA2LwfXUthDPHEKwZ5J7wJRpXnFlOMY6A0HP8qM5XyV5CSbhWNsKEzN01FU5fVrPZT3-2FRXS1kNNqQGV-2FZ7klQparntLdkksWjrBnWUI6JNGEB86XSwaFhYIxVTSE7pkMVp70B7fqwKR7cdCoRdSUHhP74fi38F3HyR-2FRKy4RPnKGam-2FN0aT6w9fGXoqEaaNllmST31tLgSpx3LKr6eiu5-2F4jxXfg3Ygaj9xOljEFFjbtozZ5QS3RbPfH7fvSylVG111M6f-2FmKxgaGMNVU4Ns4IzF1u4NHqVlxa3PMvHVWGM2nFXWLttGFBydDp3L8KjXDdoOxWBzmIYbC9GtQBo1R20bppnjvAz5WwvP7j0FKC0OYqWxLzcMEOen0tK04JQ3uMLay7qqa2j0AvIfyveOM8pp-2Bk28-2BFn782nYlwSwpSpJBUnOMC3Z-2F23UcGSLVfrHntr4i3o9I-2BHjY3IKF6xIhStS514M-2FOepH3ieApSxaavyIN8LjphUmX3RCO-2BvBJm6sCMMsIuZmSgdt49a8PIMonE5G4fGvD11yOEfk6uP7gXJ7T-2F8lcRTbpsopE8xAs2x6yOvcT2O-2FHtt44nZUpSEkbUPEzUXfFVyJ-2BEt0xgb-2FzloqfS5WfC268CGQQ2c3nlo6s1e5fyH-2FQ-2FyEfLlnUaQC8jUjn2upi3sCORVa3YEYN-2BMiT-2Fe1EN-2B4k1z09L7r2k8XXC-2FRZXWdBDxQu-2FTvQEcpwKEoPQBCzR0ScTxsQUvaSADtc-2B0DP4Zl-2BR7-2BgrpcOgVw-3D-3DMG3t_Gmh-2FjktplCfWo1o-2BFbkY3J9eYBJUJc-2BSUmMkHo42Dqe4Z0qTEKCmSFnQfWCe8-2B8jgXgQQcW-2Fb1rLKfKZRu-2BLLGScwMYc-2FOCX9RDmpXEBR4BY9i7y-2BNgpMuREG7n76alZeZgfTc6MhYXKfTvfscI8w2BSlml2nNr8dnt4n9PibWx-2BT3A8QPo6JBZuHutkIUWJRDqofu-2FZCdTCD53UgxE14gKZJlEnMLchAIi3314gTqcS3ttz-2FaouByYAQfNZ3bz92BqidVFoDuLZpDfW5p7uVlzXg-2B52pQiWu2L-2FVWGTNm6gMi175DZW0NxFdrpnngNPUQ-2F6k-2B0JeRY7hfg0YGZXuiYnvIjnQ-2BM1OJgtlD3rbsdJrd1TojVF93wNNgZpYbeOEyf1DHTQCKV2VYRPnjOSGA-3D-3D\" target=\"_blank\" rel=\"noreferrer noopener\">demonstrated an optical neural network on a single photonic chip<\/a>&nbsp;that could perform matrix multiplication with light.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">But at the time, the device couldn\u2019t perform nonlinear operations on the chip. 
Optical data had to be converted into electrical signals and sent to a digital processor to perform nonlinear operations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cNonlinearity in optics is quite challenging because photons don\u2019t interact with each other very easily. That makes it very power consuming to trigger optical nonlinearities, so it becomes challenging to build a system that can do it in a scalable way,\u201d Bandyopadhyay explains.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">They overcame that challenge by designing devices called nonlinear optical function units (NOFUs), which combine electronics and optics to implement nonlinear operations on the chip.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The researchers built an optical deep neural network on a photonic chip using three layers of devices that perform linear and nonlinear operations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>A fully integrated network<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At the outset, their system encodes the parameters of a deep neural network into light. Then, an array of programmable beamsplitters, which was demonstrated in the 2017 paper, performs matrix multiplication on those inputs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The data then pass to programmable NOFUs, which implement nonlinear functions by siphoning off a small amount of light to photodiodes that convert optical signals to electric current. This process, which eliminates the need for an external amplifier, consumes very little energy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cWe stay in the optical domain the whole time, until the end when we want to read out the answer. 
This enables us to achieve ultra-low latency,\u201d Bandyopadhyay says.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Achieving such low latency enabled them to efficiently train a deep neural network on the chip, a process known as <em>in situ<\/em> training that typically consumes a huge amount of energy in digital hardware.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThis is especially useful for systems where you are doing in-domain processing of optical signals, like navigation or telecommunications, but also in systems that you want to learn in real time,\u201d he says.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The photonic system achieved more than 96 percent accuracy during training tests and more than 92 percent accuracy during inference, which is comparable to traditional hardware. In addition, the chip performs key computations in less than half a nanosecond.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThis work demonstrates that computing \u2014 at its essence, the mapping of inputs to outputs \u2014 can be compiled onto new architectures of linear and nonlinear physics that enable a fundamentally different scaling law of computation versus effort needed,\u201d says Englund.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The entire circuit was fabricated using the same infrastructure and foundry processes that produce CMOS computer chips. This could enable the chip to be manufactured at scale, using tried-and-true techniques that introduce very little error into the fabrication process.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Scaling up their device and integrating it with real-world electronics like cameras or telecommunications systems will be a major focus of future work, Bandyopadhyay says. 
In addition, the researchers want to explore algorithms that can leverage the advantages of optics to train systems faster and with better energy efficiency.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This research was funded, in part, by the National Science Foundation, the Air Force Office of Scientific Research, and NTT Research.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This new device uses light to perform the key operations of a deep neural network on a chip, opening the door to high-speed processors that can learn in real-time.\u00a0<\/p>\n","protected":false},"author":2,"featured_media":25488,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[17,14],"tags":[],"class_list":["post-25487","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research","category-innovation"],"featured_image_urls":{"full":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",900,600,false],"thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-200x200.jpg",200,200,true],"medium":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-600x400.jpg",600,400,true],"medium_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-768x512.jpg",750,500,true],"large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-675x450.jpg",675,450,true],"1536x1536":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",900,600,false],"2048x2048":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",900,600,false],"ultp_layout_landscape_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-C
omputation-stock_0.jpg",900,600,false],"ultp_layout_landscape":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-870x570.jpg",870,570,true],"ultp_layout_portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-600x600.jpg",600,600,true],"ultp_layout_square":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-600x600.jpg",600,600,true],"newspaper-x-single-post":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-760x490.jpg",760,490,true],"newspaper-x-recent-post-big":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-550x360.jpg",550,360,true],"newspaper-x-recent-post-list-image":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0-95x65.jpg",95,65,true],"web-stories-poster-portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",640,427,false],"web-stories-publisher-logo":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",96,64,false],"web-stories-thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2024\/12\/MIT-Optical-Computation-stock_0.jpg",150,100,false]},"author_info":{"info":["Adam Zewe"]},"category_info":"<a href=\"https:\/\/www.revoscience.com\/en\/category\/news\/research\/\" rel=\"category tag\">Research<\/a> <a href=\"https:\/\/www.revoscience.com\/en\/category\/innovation\/\" rel=\"category 
tag\">Innovation<\/a>","tag_info":"Innovation","comment_count":"0","_links":{"self":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/25487","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/comments?post=25487"}],"version-history":[{"count":2,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/25487\/revisions"}],"predecessor-version":[{"id":25490,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/25487\/revisions\/25490"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media\/25488"}],"wp:attachment":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media?parent=25487"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/categories?post=25487"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/tags?post=25487"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}