Introduction

Achieving full phase control of optical fields is a central challenge in optical engineering, with diverse applications in imaging, sensing, and augmented- and virtual-reality systems1,2. The past decades have seen rapid development of metasurface-based optical elements that exploit the collective scattering properties of subwavelength structures to phase-shape incoming fields and are significantly more compact and easier to integrate than conventional refractive optical elements3,4,5,6,7,8,9. The most commonly adopted metasurface-design strategy proceeds in two steps — first, a library of periodic meta-atoms with varying transmission amplitudes and phases is generated by varying a few geometric parameters specifying the meta-atom. Next, an aperiodic metasurface is generated by laying out the periodic meta-atoms corresponding to the target spatially varying phase profile10,11,12,13,14,15,16,17. This approach suffers from two major limitations — first, the resulting metasurface must be almost periodic, so this strategy cannot be used to reliably design rapidly varying phase profiles. Second, generating the metasurface library becomes increasingly difficult for multi-functional design problems. For instance, while it is usually not difficult to generate a library for a simple phase mask operating on a few input modes15,18,19, scaling up the number of modes becomes increasingly difficult because the same metasurface must simultaneously satisfy multiple design conditions corresponding to the different input modes.

Fully automating the design of metasurfaces can provide a potential solution to this problem. Gradient-based optimization has been successful in designing integrated optical elements that are more compact, robust, and higher performing than their classical counterparts20,21,22,23,24,25,26,27. A key ingredient in these approaches is the ability to rapidly simulate the full electromagnetic structure. This presents a challenge for metasurface design, since practical metasurfaces can be ~10²–10³λ in linear dimension, making it impractical to use general-purpose electromagnetic solvers such as Finite-Difference Time-Domain (FDTD)28, Finite-Difference Frequency-Domain (FDFD)29, or Finite Element Method (FEM)30. Inverse-design approaches that use such general-purpose electromagnetic solvers to simulate and design the full surface are limited to small design areas or a small number of optimization iterations31,32, or restrict the parameter space through a specific symmetry that allows for fast simulations33,34,35. Consequently, nearly all current methods for inverse-designing large-scale 3D metasurfaces rely on approximate electromagnetic simulations of the metasurface locally, using either periodic or radiation boundary conditions9,36,37,38,39,40,41,42,43,44,45,46,47, which do not accurately account for interactions between different meta-atoms. These approaches are thus fundamentally limited to designing metasurfaces with slow phase variations because of the implicit local approximation. A coupled-mode formalism can also be applied to metasurface simulation and optimization48, but this approach is not guaranteed to yield exact fields, particularly for metasurfaces with multiple low quality-factor modes.

In this paper, we propose and demonstrate a numerically accurate simulation strategy that can be used to design and analyze large-area metasurfaces. Our strategy relies on distributing the simulation across compute nodes so that the simulation time decreases linearly with the available compute resources. This is achieved by a Nyquist-sampling decomposition of the fields incident on the metasurface, similar to that used recently to characterize the discrete impulse response of aperiodic metasurfaces49. Our distribution strategy, by ensuring minimal communication between compute nodes, allows for a linear reduction in the simulation time with the number of compute nodes, indicating that arbitrarily large metasurfaces can be simulated in reasonable time with a sufficiently large number of compute nodes. On each compute node, we implement a GPU-based transition-matrix (T-matrix) simulation50,51,52. Though there are GPU-optimized FDTD implementations that allow fast simulation of unit cells up to 100λ × 100λ53, these approaches do not currently provide a low-overhead means of parallel simulation distribution. We demonstrate numerically accurate simulations of metasurfaces of size 1 mm × 1 mm at a wavelength of 1.55 μm (about 645λ × 645λ) on a cluster of 48 GPU nodes. Finally, we demonstrate the ability to efficiently compute gradients with respect to both the geometry and the positions of the meta-atoms, thus enabling the application of optimization-based design to large-scale metasurfaces.

Results

Low-overhead multi-GPU simulation strategy

To simulate millimeter-scale metasurfaces, it is essential to parallelize the simulation method across multiple compute nodes. To be scalable, however, the parallelization scheme should introduce only a modest communication overhead, since such overhead can otherwise offset any time savings achieved through parallelization54,55,56.

For metasurface simulations, however, the fact that incident fields generated by far-field sources lie within the light cone in k-space can be exploited to devise a parallelization strategy that requires minimal communication between the compute nodes. The fundamental principle behind this parallelization is to represent the bandlimited incident field by its samples using the Nyquist-sampling theorem57. More precisely, consider an incident field propagating along the z-direction — the transverse polarization of this field, \(\mathbf{E}_{\mathrm{inc}}^{\mathrm{T}}(x,y,z)\), at any z can be expressed as

$$\mathbf{E}_{\mathrm{inc}}^{\mathrm{T}}(x,y,z)=\sum_{i,j}\mathbf{E}_{\mathrm{inc}}^{\mathrm{T}}(x_{i},y_{j},z)\,f_{i,j}(x,y), \qquad (1)$$

where \(x_{i}=i\lambda/2\) and \(y_{j}=j\lambda/2\) with λ being the wavelength in the background medium, and \(f_{i,j}(x,y)\) is a sinc function58 centered at \((x_{i},y_{j})\). Each term in the Nyquist decomposition can be considered an independent source, which falls off to zero with distance (Fig. 1a), and the response of a metasurface to these individual sources can be obtained by considering only a spatially truncated portion of the metasurface in the simulation. This is numerically demonstrated in Fig. 1b, in which we consider the scattered power obtained on exciting a metasurface with a single sinc source as a function of the size of the metasurface included in the simulation. As the size of the metasurface is increased, the scattered power converges, indicating that a local simulation is sufficient to capture the metasurface response. The portion of the metasurface required to achieve a particular accuracy in the simulation is governed by diffraction of the sinc source as it propagates to the metasurface.
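
The decomposition in Eq. (1) is straightforward to verify numerically. The following is a minimal sketch, using an illustrative two-plane-wave field rather than any field from this work, that samples a bandlimited field on a λ/2 grid and reconstructs it at an off-grid point from separable sinc interpolants; all names are hypothetical.

```python
import numpy as np

lam = 1.55                     # wavelength in the background medium (e.g., microns)
dx = lam / 2.0                 # Nyquist spacing for a field bandlimited to the light cone
xs = np.arange(-32, 33) * dx   # sample positions x_i (and, identically, y_j)

def bandlimited_field(x, y):
    """Toy bandlimited field: two plane waves with transverse wavevectors inside the light cone."""
    k0 = 2 * np.pi / lam
    return np.exp(1j * 0.3 * k0 * x) + 0.5 * np.exp(1j * (0.2 * k0 * x - 0.4 * k0 * y))

# Samples E(x_i, y_j, z) on the Nyquist grid
Xs, Ys = np.meshgrid(xs, xs, indexing="ij")
samples = bandlimited_field(Xs, Ys)

def reconstruct(x, y):
    """Evaluate Eq. (1): sum_ij E(x_i, y_j) f_ij(x, y) with separable sinc interpolants f_ij."""
    fx = np.sinc((x - xs) / dx)   # np.sinc(u) = sin(pi*u)/(pi*u)
    fy = np.sinc((y - xs) / dx)
    return fx @ samples @ fy

# The interpolation error at an off-grid point is small and shrinks as the grid grows
x0, y0 = 1.37, -2.11
print(abs(reconstruct(x0, y0) - bandlimited_field(x0, y0)))
```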

Fig. 1: Nyquist sampling of bandlimited incident field.

a Schematic of Nyquist sampling of the incident electric field, which is bandlimited because it is propagating. b Percent error in scattered-field power versus spatial extent of the metasurface included in the simulation for a single sinc source placed 10 μm (green), 5 μm (blue), and 0.5 μm (black) from the metasurface. The full metasurface is 25 μm × 25 μm with a focal length of 10 μm, and the surface size on the x-axis of this convergence plot refers to the spatial extent around the center of this metasurface that is included in the simulation. The y-axis relative error is computed by taking the simulation that includes the full metasurface as the converged result.

To parallelize the simulation, we can then divide the sinc sources that compose the incident electric field into smaller groups, and simulate the local response of the metasurface to each source group by performing an independent solve on a single compute node (Fig. 2a). This parallelization strategy only requires communication between the compute nodes at the start and at the end of the simulation — once to distribute the simulation data corresponding to the local subregions, and then to consolidate the electric field data computed per subregion. On each compute node, we implement a GPU-parallelized transition-matrix (T-matrix) electromagnetic solver59,60 (see Supplementary Note 1 for details of the T-matrix method and our implementation of it). In order to accurately account for the diffraction of the sinc source while computing the local response of the metasurface, we include a padding region around the group of sources for each compute node. While, in principle, we should ensure that the performance metric being analyzed (e.g., metasurface efficiency) converges with respect to the padding, in practice and for typical metasurfaces, the padding thickness required for accurate simulations can be estimated simply by studying the response of a local patch of the metasurface to one source (similar to the study performed in Fig. 1b). After all the simulations have been performed, the resulting electric fields can be added together to compute the total electric field. Furthermore, because each compute node performs roughly the same amount of computation, the total simulation time scales as 1/Nnodes (Fig. 2b). Details of our sinc source computation for the T-matrix method single-node simulation can be found in Supplementary Note 2.
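
As a schematic illustration (not the authors' C++/CUDA implementation), the sketch below groups sinc-source coordinates into square subregions, attaches the padded metasurface window that each node must include in its local solve, and sums the per-subregion scattered fields at the detector points; `solve_local_patch` is a hypothetical placeholder for the single-node T-matrix solver.

```python
import numpy as np

def tile_sources(source_xy, region_size, padding, surface_extent):
    """Group sinc-source coordinates into subregions and attach the padded
    metasurface window each compute node must include in its local solve."""
    jobs = []
    n_tiles = int(np.ceil(surface_extent / region_size))
    for ix in range(n_tiles):
        for iy in range(n_tiles):
            x0, y0 = ix * region_size, iy * region_size
            in_tile = [(x, y) for (x, y) in source_xy
                       if x0 <= x < x0 + region_size and y0 <= y < y0 + region_size]
            if not in_tile:
                continue
            window = (x0 - padding, x0 + region_size + padding,
                      y0 - padding, y0 + region_size + padding)
            jobs.append({"sources": in_tile, "window": window})
    return jobs

def run_distributed(jobs, detector_points, solve_local_patch):
    """Each job is an independent solve (on its own GPU node in practice); the
    scattered fields are simply summed, since the incident field is the sum of
    the sinc sources in Eq. (1)."""
    total = np.zeros(len(detector_points), dtype=complex)
    for job in jobs:   # embarrassingly parallel; shown serially for clarity
        total += solve_local_patch(job["sources"], job["window"], detector_points)
    return total
```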

Fig. 2: Low-overhead parallelization scheme for simulation of arbitrarily large metasurfaces.

a Schematic of the simulation distribution scheme — the incident field is first sampled and represented as a superposition of sinc sources, and then smaller groups of sinc sources and the locally surrounding metasurface regions are simulated on independent GPUs. b Total simulation time versus number of V100 GPUs used for simulation of a 50 μm (black), 100 μm (blue), and 300 μm (green) metasurface. All metasurfaces have a focal length of 25 μm and are designed from a library of silicon cylinders with height 940 nm, radii range of 50–250 nm, lattice period of 1070 nm, air background, and source wavelength of 1550 nm (based on the scatterer library from Arbabi et al.61). c Computation time for the key stages of the large-area 1 mm × 1 mm metasurface simulation (metalens with focal length 0.4 mm designed with the same scatterer library used in (b)): top row – computing the Look-Up Tables (LUT) used to efficiently perform the T-matrix simulation (Supplementary Note 1); middle row – computing the T-matrices (Supplementary Note 1.b) and solving the resulting linear system of equations for the scattered field coefficients (Supplementary Note 1.c, Supplementary Eq. 23); bottom row – computing the E and H fields from the scattered field coefficients for each desired detector point (Supplementary Note 1.c, Supplementary Eq. 24).

Thus, given a sufficiently large number of compute nodes, we expect the simulation strategy to be able to handle large problems – on a compute cluster with 48 V100 GPU nodes, we were able to simulate a metasurface of size about 645λ × 645λ in ~10 hours. This total time is broken down into the compute times for the key simulation parts in Fig. 2c. The simulated surface is a 1 mm × 1 mm metalens with focal length 0.4 mm (NA = 0.78) designed from a library of silicon cylinders with height 940 nm, radii range of 50–250 nm, lattice period of 1070 nm, air background, and source wavelength of 1550 nm (based on scatterer library from Arbabi et al.61). The simulation is performed on 48 V100 GPUs and is distributed between these compute nodes using a subregion size of 20 μm × 20 μm and a padding of 6.5 μm, resulting in 2601 subregion simulations.

Comparison with locally periodic approximation

Approximate simulations of large-area metasurfaces often rely on the locally periodic approximation (LPA)19,37,61, wherein the local electromagnetic response of the metasurface is approximated by that of a periodic metasurface. To demonstrate that the full metasurface simulation approach captures meta-atom interactions beyond LPA, we compare the T-matrix simulation method with two commonly used LPA approaches in Fig. 3. The first is a simple transmission-mask approximation, in which we assume that the metasurface imparts a smooth position-dependent amplitude and phase to the incident field, as determined by the periodic simulation61, and ignore the variation of the fields within a single unit cell. The second is a more exact field-stitching method9, in which the field near the metasurface within each unit cell is approximated with the fields from the periodic simulation and this stitched field is then propagated. For high aspect-ratio scatterers, we find that while the transmission-mask method deviates significantly from the T-matrix method, the field-stitching method does not. However, for low aspect-ratio scatterers, which are expected to have stronger inter-meta-atom interactions, both the transmission-mask and field-stitching LPA methods deviate significantly from the T-matrix method62. These results are a strong indication of the ability of the T-matrix method to capture meta-atom interactions and accurately simulate the metasurface response.
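
For reference, the transmission-mask LPA baseline can be sketched as follows: the metasurface is reduced to a position-dependent complex transmission looked up from the periodic library, and the masked field is propagated to the focal plane with the angular-spectrum method. `lookup_transmission` and the assumption of a square sampling grid are illustrative choices, not the code behind Fig. 3.

```python
import numpy as np

def angular_spectrum_propagate(field, dx, lam, z, n_background=1.0):
    """Propagate a sampled complex field on a square grid by a distance z."""
    k0 = 2 * np.pi * n_background / lam
    n = field.shape[0]
    k = 2 * np.pi * np.fft.fftfreq(n, d=dx)
    KX, KY = np.meshgrid(k, k, indexing="ij")
    kz = np.sqrt((k0**2 - KX**2 - KY**2).astype(complex))  # evanescent components decay
    return np.fft.ifft2(np.fft.fft2(field) * np.exp(1j * kz * z))

def lpa_transmission_mask(incident, radii_map, dx, lam, focal_length, lookup_transmission):
    """Apply the locally periodic complex transmission and propagate to the focal plane."""
    t = lookup_transmission(radii_map)   # complex transmission per sample point, from the unit-cell library
    return angular_spectrum_propagate(incident * t, dx, lam, focal_length)
```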

Fig. 3: Comparison of T-matrix method simulations with locally periodic assumption (LPA) simulations.

a Efficiency versus focal length for 25 μm × 25 μm metasurfaces designed from a library of high aspect-ratio scatterers with a large period (silicon cylinders with height 940 nm, radii range of 50–250 nm, lattice period of 1070 nm, and air background; source wavelength of 1550 nm – transmission and phase response shown in Supplementary Fig. 4a, based on the scatterer library from Arbabi et al.61) — efficiencies are computed using the T-matrix approach (blue dots), the commonly used LPA transmission-mask phase-sampling approach (black curve), and the LPA field-stitching method (green curve). The metalens efficiency is defined as the ratio of the power within a circle of radius 3 × FWHM in the focal plane to the power incident on the metasurface. The T-matrix and LPA-stitching methods agree fairly well here because the scatterers are high aspect-ratio and the lattice constant is large, hence the interactions between neighboring scatterers are negligible. b Efficiency versus focal length for 15 μm × 15 μm metasurfaces designed from a library of low aspect-ratio scatterers with a small period (silicon cylinders with height 220 nm, radii range of 175–280 nm, lattice period of 666 nm, and background refractive index 1.66; source wavelength of 1340 nm – using the scatterer library from Gigli et al.62, scatterer transmission and phase response shown in Supplementary Fig. 4b) — efficiencies are computed using the T-matrix approach (blue dots), the commonly used LPA transmission-mask phase-sampling approach (black curve), and the LPA field-stitching method (green curve). The metalens efficiency is defined as the ratio of the power within a circle of radius 3 × FWHM in the focal plane to the power incident on the metasurface. The T-matrix and LPA-stitching methods do not agree here because the scatterers are low aspect-ratio and the lattice constant is small, hence the interaction between neighboring scatterers is significant.
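
A sketch of this efficiency metric is given below, assuming the focal-plane intensity is available on a uniform square grid; the FWHM estimator (taken from a cut through the peak) and the variable names are illustrative choices rather than the exact post-processing used here.

```python
import numpy as np

def lens_efficiency(intensity, dx, incident_power):
    """Power inside a circle of radius 3*FWHM around the focal spot, divided by
    the power incident on the metasurface."""
    n = intensity.shape[0]
    ic, jc = np.unravel_index(np.argmax(intensity), intensity.shape)
    # Estimate the FWHM from the horizontal cut through the peak
    cut = intensity[ic, :]
    above = np.where(cut >= 0.5 * cut[jc])[0]
    fwhm = (above[-1] - above[0] + 1) * dx
    # Integrate the intensity inside a circle of radius 3*FWHM around the peak
    x = (np.arange(n) - jc) * dx
    y = (np.arange(n) - ic) * dx
    Y, X = np.meshgrid(y, x, indexing="ij")
    mask = X**2 + Y**2 <= (3 * fwhm) ** 2
    focal_power = np.sum(intensity[mask]) * dx**2
    return focal_power / incident_power
```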

Distributed optimization-based design

An essential ingredient for optimization-based design of metasurfaces is an efficient evaluation of the gradient of the figure of merit with respect to the design parameters. A particularly useful method to evaluate gradients is adjoint-sensitivity analysis63,64, which analytically differentiates through Maxwell’s equations and computes the gradients with respect to all the design parameters at a cost proportional to only two electromagnetic simulations. The distributed T-matrix simulation method is also amenable to distributed adjoint-sensitivity analysis and allows for scalable evaluation of the gradient of a performance metric, defined on the electric fields scattered from the metasurface, with respect to both the meta-atom shapes and positions (see Supplementary Note 4 for details). Figure 4 demonstrates a distributed gradient-based optimization, with respect to the positions and radii of the cylindrical meta-atoms, of a cost function evaluating the power within a spot at the focal plane for a 30 μm × 30 μm metalens with focal length 20 μm, initially designed using the traditional periodic-approximation approach with the same scatterer library used in Fig. 3b. The distributed optimization was performed on 9 T4 GPUs with the metalens divided into 9 subregions (subregion size of 10 μm × 10 μm and padding size of 6 μm). The forward simulations took an average of about 120 GPU-minutes, and the gradient computations with respect to radius and position took an average of 150 GPU-minutes. The metalens has a very high NA of over 0.996, and the optimization improves the efficiency of the metalens by ~2×, giving a final efficiency of about 24%. Although thin low aspect-ratio metasurfaces (Huygens metasurfaces) are of interest because they are more amenable to large-scale fabrication, they have not found widespread adoption due to their very limited efficiencies and angular responses62. Our ability to accurately model scatterer-scatterer effects in metasurface inverse design may allow the discovery of more practical Huygens metasurfaces65,66. Combining this multi-GPU gradient computation with the multi-GPU forward simulation opens the door to gradient-based optimization over the many degrees of freedom afforded by arbitrarily large metasurfaces. In particular, our method allows optimizing both the shape and position of the scatterers composing the large-area metasurface — optimizing the scatterer positions is very difficult for any inverse-design approach that relies on a periodic assumption.
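
Schematically, the optimization loop amounts to gradient ascent on the scatterer radii and positions, with the distributed forward and adjoint computations supplied as callables. This is a simplified sketch; the callables, step size, and the plain ascent update are assumptions, with only the iteration count and radius bounds taken from the library and optimization described above.

```python
import numpy as np

def optimize_metalens(radii, positions, figure_of_merit, gradient,
                      radius_bounds=(0.175, 0.280),  # library radius range in microns
                      step=1e-3, iterations=35):
    """Gradient ascent on both radii and positions.  `figure_of_merit` wraps the
    distributed forward simulation plus focal-spot power evaluation, and `gradient`
    wraps the distributed adjoint computation; both are provided by the caller."""
    history = []
    for _ in range(iterations):
        history.append(figure_of_merit(radii, positions))
        d_radii, d_positions = gradient(radii, positions)
        radii = np.clip(radii + step * d_radii, *radius_bounds)  # keep radii within the library
        positions = positions + step * d_positions
    return radii, positions, history
```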

Fig. 4: Distributed gradient-based optimization improvement of a metalens design.
figure 4

a Lens efficiency versus optimization iteration, where lens efficiency is defined as the ratio of the power within a circle of radius 3 × FWHM in the focal plane to the power incident on the metasurface. The initial metasurface is a 30 μm × 30 μm metalens with focal-length 20 μm designed from the low aspect-ratio scatterer library used in Fig. 3b using the traditional periodic-approximation metasurface design approach. The metalens is designed and optimized for x-polarized light only. In 35 optimization iterations, the metalens efficiency is almost doubled. The inset shows the intensity of the X-component of the electric field in the focal plane before optimization (left) and after optimization (right). b Schematic of the cylindrical metasurface scatterers after optimization. c Histograms of the distance between the final scatterer positions and the initial scatterer positions (left) and the absolute radius difference between the final scatterer cylinders and the initial scatterer cylinders (right). As can be seen in these histograms, both the scatterer positions and radii change as a result of the optimization.

Discussion

We have demonstrated a scalable distribution method to accurately simulate arbitrarily large-area metasurfaces. Our method uses the Nyquist sampling theorem to distribute the computation in parallel across multiple GPU nodes, on each of which a T-matrix formulation is used to efficiently simulate a subregion. We show a roughly \(1/N_{\mathrm{GPU}}\) scaling of the total simulation time and demonstrate that our method accurately accounts for all scatterer interactions. Finally, we demonstrate the ability to apply our distribution method to the computation of the gradient with respect to all design parameters. Our distributed simulation method provides a solution to the long-standing problem of simulating large-area metasurfaces and opens the door to gradient-based optimization of the full metasurface, taking advantage of all the design degrees of freedom.

Methods

Our low-overhead distribution strategy uses Nyquist sampling of the incident field to split the simulation into subregion simulations, each of which can be performed in parallel. We use RabbitMQ (https://www.rabbitmq.com) to create a queue of the metasurface subregion simulations and to manage the distribution of these simulations to the available GPU compute nodes in a fault-tolerant manner. On each GPU compute node, we run our T-matrix method code, implemented in C++ using CUDA for the single-node GPU parallelization of the incident-field, matrix-vector product, and electric- and magnetic-field computations. For our GPU compute nodes, we used Google Cloud V100 GPUs for the timing benchmarks and the 1 mm × 1 mm metasurface simulation in Fig. 2, and T4 GPUs for the distributed inverse design in Fig. 4. We interface our distributed metasurface solver with the photonic optimization framework SPINS to perform the inverse design.
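
A minimal sketch of such a work queue is shown below, assuming the pika Python client for RabbitMQ; the queue name, message contents, and the `run_t_matrix_subregion` stub are illustrative stand-ins rather than the interface used in this work.

```python
import json
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters(host="localhost"))
channel = connection.channel()
channel.queue_declare(queue="subregion_jobs", durable=True)

# Producer: enqueue one persistent message per subregion simulation
jobs = [{"subregion_index": i, "padding_um": 6.5} for i in range(2601)]
for job in jobs:
    channel.basic_publish(exchange="", routing_key="subregion_jobs", body=json.dumps(job),
                          properties=pika.BasicProperties(delivery_mode=2))

def run_t_matrix_subregion(job):
    """Placeholder for the single-node CUDA T-matrix solve of one subregion."""
    pass

# Worker loop (one per GPU node): acknowledge only after a job finishes, so an
# unacknowledged job is re-queued if a node fails mid-simulation (fault tolerance)
def on_message(ch, method, properties, body):
    run_t_matrix_subregion(json.loads(body))
    ch.basic_ack(delivery_tag=method.delivery_tag)

channel.basic_qos(prefetch_count=1)   # hand each node one job at a time
channel.basic_consume(queue="subregion_jobs", on_message_callback=on_message)
channel.start_consuming()
```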