Stable high-order quadrature rules with equidistant points

合集下载

Calculation of Gauss Quadrature Rules

Calculation of Gauss Quadrature Rules
Calculation of Gauss Quadrature Rules
by Gene Golub and John Welsch
Presented by Mark Gates
University of Illinois at Urbana-Champaign
March 12, 2007
Mark Gates (UIUC)
Calculation of Gauss Quadrature Rules
March 12, 2007
7 / 16
Quadrature nodes
(From previous slide) xp(x) = T p(x) + (1/an )pn (x)en Quadrature nodes are roots xi of pn (x) xi p(xi ) = T p(xi ) + 0 for i = 1, . . . , n
1/2
=
1/2
1 2 1 4
1/2
1 =√ , 2 = 1 2 for i > 1
1/2
=
Mark Gates (UIUC)
Calculation of Gauss Quadrature Rules
March 12, 2007
11 / 16
Example
Yields tridiagonal matrix 0 √ T = 1/ 2 0 with eigenvalues and eigenvectors 0.57735 Q = −0.70711 0.40825 0.57735 0 −0.8165 0.57735 0.70711 0.40825 √ 1/ 2 0 0 1/2 1 /2 0

Fronts propagating with curvature dependent speed Algorithms Based on Hamilton-Jacobi Formulations

Fronts propagating with curvature dependent speed Algorithms Based on Hamilton-Jacobi Formulations

reaching out into the unburnt gas somehow propagate slower than do concave regions which are hot gases surrounding a small unburnt pocket. At the same time, particles along the flame front undergo an increase in volume as they burn, creating a jump in velocity across the flame front. This discontinuity in the velocity field creates vorticity along the burning flame, which can be related to the local curvature, and this new vorticity field contributes to the advection of the propagating flame. Thus, there are at least two distinct ways in which the speed of the moving flame depends on the local curvature. Typically, there have been two types of numerical algorithms employed in the solution of such problems. The first parameterizes the moving front by some variable and discretizes this parameterization into a set of marker points [39]. The positions of the marker points are updated in time according to approximations to the equations of motion. Such techniques can be extremely accurate in the attempt to follow the motions of small perturbations. However, for large, complex motion, several problems soon occur. First, marker particles come together in regions where the curvature of the propagating front builds, causing numerical instability unless a regridding technique is employed. The regridding mechanism usually contains a error term which resembles diffusion and dominates the real effects of curvature under analysis. Secondly, such methods suffer from topological problems; when two regions "burn" together to form a single one, ad-hoc techniques to eliminate parts of the boundary are required to make the algorithm work. Other algorithms commonly employed fall under the category of "volume of fluid " techniques, which, rather than track the boundary of the propagating front, track the motion of the interior region. An example of this type of algorithm is SLIC [26]. In these algorithms, the interior is discretized, usually by employing a grid on the domain and assigning to each cell a "volume fraction" corresponding to the amount of interior fluid currently located in that cell. An advantage of such techniques is that no new computational elements are required as the calculation progresses (unlike the parameterization methods), and complicated topological boundaries are easily handled, see [4,32]. Unfortunately, it is difficult to calculate the curvature of the front from such a representa-

Thermo Scientific MK.4 ESD和Latch-Up测试系统中文名说明书

Thermo Scientific MK.4 ESD和Latch-Up测试系统中文名说明书

The Thermo Scientific MK.4 ESD and Latch-Up Test System is a complete,robust and feature-filled turn-key instrumentation test package, which performs automatic and manual HBM, MM, and Latch-Up tests on devices with pin counts up to 2304. It features the highest speed of test execution, lowest zap interval, and extensive parallelism that enables concurrent zapping with interleaved trace test capability to global and company driven quality standards.• Rapid-relay-based operations—up to 2304 channels• Solid state matrix topology for rapid, easy-to-use testing operations • Latch-Up stimulus and device biasing • High voltage power source chassis with patented HV isolation enables excellent pulse source performance • Advanced device preconditioning with six separate vector drive levels • Massive parallelism drives remarkable test and throughput speeds• Addresses global testing demands for devices that are smaller, faster and smarterThermo ScientificMK.4 ESD and Latch-up Test SystemIndustry standard, ESD and Latch-Up test system for producers ofmultifunction high pin-count devices Thirty years in the making! IC structure designers and QA program managers in manufacturing and test house facilities worldwide have embraced the Thermo Scientific™ MK.4, a versatile, powerful, and flexible, high yield test system. Easily upgradeable, the MK.4 ESD and Latch-Up Test System is fully capable of taking your test operations through ever-evolving regulatory and quality standards.Solid-State Matrix TopologyThe advanced rapid relay-based (modular matrix) hardware of the MK.4 system is at least ten times faster than mechanically driven ESD testers. The switching matrix, while providing consistent ESD paths, also allows any pin to be grounded, floated,vectored or connected to any of the installedV/I supplies. Furthermore, advancedalgorithms ensure accurate switching of HV, in support of pulse source technology, per recent JEDEC/ESDA trailing pulse standards.Advanced Controller and CommunicationsA powerful, extraordinarily fast embedded VME controller drives the highest Speed- of-Test execution available. Data transfer between the embedded controller and the tester’s PC server, is handled through TCP/IP communication protocols, minimizing data transfer time. The tester’s PC server can be accessed through internal networks, as well as through the internet allowing remote access to the system to determine the systems status or to gather result information.Product SpecificationsLatch-Up Stimulus and Device Biasing The MK.4 can be equipped with up to eight 100 V four-quadrant Voltage and Current (V/I) power supplies. Each V/I supply has a wide dynamic range enabling it to force and measure very low voltage at high current levels from 100 mV/10 A to 100 V/1 A. The system’s power supply matrix can deliver up to a total of 18A of current, which is distributed between the installed supplies. These supplies are able to provide a fast and versatile means of making DC parametric and leakage measurements as well as providing latch-up pulses, while offering total control and protection of the DUT.Advanced Device PreconditioningThe MK.4 system provides the most advanced device preconditioning capability available. The DUT can be vectored with complex vector patterns, providing excellent control over the device. Each pin can be driven using one of the 6 different vector supplies. The patterns can be up to 256k deep, running at clock speeds of up to 10 MHz. Device conditioning is easily verified, using the read back compare capability available on every pin.Thermo Scientific MK.4 Scimitar™Software Makes Programming Easy, while Providing Unsurpassed Programming FlexibilityThe MK.4 Windows®-based Scimitar operating software empowers users with the flexibility to easily set-up tests based on industry standards or company driven requirements.Device test plans can be created by importing existing text based device files, on the testers PC server or off-line from a satellite PC containing the application. The software also provides the capabilities to import test plans and device files from previous Thermo Scientific test systems.Test vectors from your functional testers can also be imported into the application. And of course, the vector application allows manual creation and debug of vector files.Device test plans and results are stored in an XML data base, providing unsurpassed results handling, sorting and data mining capabilities.Parallelism Drives Remarkable Test Throughput SpeedsThe MK.4 software enables ESD testing of up to twelve devices at one time using the multisite pulse source design.Embedded VME power supplies eliminate any communication delays that would be seen by using stand alone supplies. The embedded parametric (curve tracing) supply also provides fast, accurate curve tracing data to help you analyze your devices performance.The systems curve tracer can also be used as a failure analysis tool by allowing the comparison of stored, known good results, versus results from a new test sample or samples.Ready for Today’s Component Reliability Demands and Anticipating Those to Come ESD and Latch-Up testing of electronic and electrical goods can be very expensive aspects of the design and manufacturing process. This is especially true as market demands for products that are smaller, faster and smarter become the standard rather than the exception. The Thermo Scientific MK.4 leverages the technology and know- how gained over three decades of test system experience, as well as our in-depth participation and contributions to global regulatory bodies governing these changes, enabling today’s products to meet both global and industry-driven quality standards.The real key to our customers’ success is in anticipating what’s next. And to ensure that our customers possess the ability to evolve quickly to meet all change factors with efficiency and cost effectiveness.As such, the strategically-designed, field upgradeable architecture of the MK.4 system ensures a substantial return on investment over a very considerable test system lifecycle, as well as better short- and long-term qualityand ESD and Latch-Up test economies.Custom fixtures include universal package adaptors to enable the industry’s lowest cost-in-service high pin count device fixturing yetdevised. (2304-pin, Universal 1-mm pitch BGA package adaptor shown.)100W V/I Performance Thermo Scientific MK.4: eight-V/I configuration. Powerful V/Is can deliver a total of 800 W to the DUT, enabling complex testing of all advanced high power processors on your product roadmap.Solid state matrix topology for rapid, easy-to-use testing operations. Design ensures waveform integrity and reproducibility.General SpecificationsHuman Body Model (HBM) per ESDA/JEDEC JS-001-2014, MIL-STD 883E, and AEC Q100-002 25 V to 8 kV in steps of 1 V Test to multiple industry standards in one integrated system; no changing or alignment of pulse sources.Wizard-like prompts on multi-step user actions MachineModel (MM) per ESDA STM5.2, JEDEC/JESD22-A115, andAEC Q100-003, 25 V to 1.5 kV in steps of 1 VIntegrated pulse sources allow fast multi-site test execution.Latch-up testing per JEDEC/JESD 78 test pin and AECQ100-004Includes preconditioning, state read-back and full control of each.Rapid Relay-based operations at least 10 times faster thanrobotic-driven testersSuper fast test speeds.Test devices up to 2304 pins Systems available configured as 1152, 1728 or 2304 pins.Waveform network: Two, 12 site HBM (100 pF/1500Ω)and MM (200 pF/0Ω) pulse sources address up to 12devices simultaneouslyPatented design ensures waveform compliance for generations to come.Multiple device selection When multiple devices are present; graphical display indicates the devices selectedfor test; progress indicator displays the current device under test (DUT), along withtest status information.Unsurpassed software architecture Flexible programming, easy to use automated test setups, TCP/IP communication. Enables use of device set-up information Increased efficiency and accuracy from other test equipment, as well as deviceinformation import.Event trigger output Manages setup analysis with customized scope trigger capabilities.High voltage power supply chassis Modular chassis with patented HV isolation enables excellent pulse sourceperformance.Power supply sequencing Provides additional flexibility to meet more demanding test needs of integratedsystem-on-chip (SOC) flexibility.Manages ancillary test equipment through Plug-n feature allows the user to control external devices, such as scopes or heatstreams or other devices the Scimitar Plug-ins feature as required for automatedtesting.Pin drivers for use during Latch-Up testing Vector input/export capability from standard tester platforms and parametricmeasurements.256k vectors per pin with read-back Full real-time bandwidth behind each of the matrix pins.Six independent vector voltage levels Test complex I/O and Multi-Core products with ease.Up to 10MHz vector rate (programmable) Quickly and accurately set the device into the desired state for testing from an internalclock.Comprehensive engineering vector debug. Debug difficult part vectoring setups with flexibility.Up to eight separate V/I supplies (1 stimulus and 7 bias supplies) capability through the V/I matrix High accuracy DUT power, curve tracing, and Latch-up stimulus available; design also provides high current.Low resolution/high accuracy parametric measurements, using an embedded Keithley PSU With the optional Keithley PSU feature (replaces one V/I, nA measurements are achievable, allowing supply bus resistance measurement analysis to be performed.Multiple self-test diagnostic routines Ensures system integrity throughout the entire relay matrix, right up to the test socket Test reports: pre-stress, pre-fail (ESD) and post-fail data,as well as full curve trace and specific data pointmeasurementsData can be exported for statistical evaluation & presentation.Individual pin parametrics Allows the user to define V/I levels, compliance ranges, and curve trace parametersfor each pin individually.Enhanced data set features Report all data gathered for off-line reduction and analysis; core test data is readilyavailable; all data is stored in an easy-to-manipulate standard XML file structure. Interlocked safety cover Ensures no user access during test. All potentially lethal voltages are automaticallyterminated when cover is opened. Safety cover window can be easily modified toaccept 3rd party thermal heads.Dimensions60 cm (23.5 in) W x 99 cm (39 in) D x 127 cm (50 in) H© 2016 Thermo Fisher Scientific Inc. All rights reserved. Windows is a registered trademark of Microsoft Corporation in the United States and/or other countries. All other trademarks are the property of Thermo Fisher Scientific and its subsidiaries. Results may vary under different operating conditions. Specifications, terms and pricing are subject to change. Not all products are available in all countries. Please consult your local sales representative for details.Africa-Other +27 11 570 1840 Australia +61 2 8844 9500 Austria +43 1 333 50 34 0 Belgium +32 53 73 42 41 Canada +1 800 530 8447 China +86 10 8419 3588 Denmark +45 70 23 62 60 Europe-Other +43 1 333 50 34 0Finland /Norway/Sweden+46 8 556 468 00France +33 1 60 92 48 00Germany +49 6103 408 1014India +91 22 6742 9434Italy +39 02 950 591Japan +81 45 453 9100Latin America +1 608 276 5659Middle East +43 1 333 50 34 0Netherlands +31 76 579 55 55South Africa +27 11 570 1840Spain +34 914 845 965Switzerland +41 61 716 77 00UK +44 1442 233555USA +1 800 532 4752Thermo Fisher Scientific,San Jose, CA USA is ISO Certified. CTS.05102016Product SpecificationsScimitar Software FeaturesSummary Panel with easy navigation among device componentsWizard-like prompts on multi-step user actionsControl of external devices through the use of Scimitar’s user programmable Plug-in capabilities, in addition to the Event Trigger Outputs, which provide TTL control signals for external devices, such as power supplies or for triggering oscilloscopesFlexible parametric tests that are defined and placed at an arbitrary position within the executable test plan.Comprehensive results viewer that provides:• ESD and Static Latch-up data viewing capabilities• Curves viewer with zooming capabilities and the ability to add user comments• Data filtering on the following criteria – failed pins, failed results, final stress levels• A complete set or subset of results using user defined parameters• Sorting in ascending or descending order by various column criteriaTree-like logical view of the tests and test plans.Flexible data storage that provides the ability for the end-user to query the dataSeamless support of existing ZapMaster, MK.2, MK.4, and Paragon test plansCurve tracing with curve-to-curve and relative spot-to-spot comparisonOff-line curve analyzing, including third-party generated waveformsCanned JESD78A test (static latch-up only) that can be defined automaticallyPause/Resume test capabilitiesIntermediate results viewingAutomated waveform capture capability and analysis using the embedded EvaluWave software feature。

A high-flux high-order harmonic source

A high-flux high-order harmonic source

A high-flux high-order harmonic sourceP. Rudawski, C. M. Heyl, F. Brizuela, J. Schwenke, A. Persson, E. Mansten, R. Rakowski, L. Rading, F. Campi, B. Kim, P. Johnsson, and A. L’HuillierCitation: Review of Scientific Instruments 84, 073103 (2013); doi: 10.1063/1.4812266View online: /10.1063/1.4812266View Table of Contents: /content/aip/journal/rsi/84/7?ver=pdfcovPublished by the AIP PublishingAdvertisement:REVIEW OF SCIENTIFIC INSTRUMENTS84,073103(2013)A high-flux high-order harmonic sourceP.Rudawski,1,a)C.M.Heyl,1F.Brizuela,1J.Schwenke,1A.Persson,1E.Mansten,2R.Rakowski,1L.Rading,1F.Campi,1B.Kim,1P.Johnsson,1and A.L’Huillier11Department of Physics,Lund University,P.O.Box118,SE-22100Lund,Sweden2MAX-lab,Lunds Universitet,P.O.Box118,SE-22100Lund,Sweden(Received12March2013;accepted11June2013;published online9July2013)We develop and implement an experimental strategy for the generation of high-energy high-order harmonics(HHG)in gases for studies of nonlinear processes in the soft x-ray region.We generate high-order harmonics by focusing a high energy Ti:Sapphire laser into a gas cellfilled with argon or neon.The energy per pulse is optimized by an automated control of the multiple parameters that influence the generation process.This optimization procedure allows us to obtain energies per pulse and harmonic order as high as200nJ in argon and20nJ in neon,with good spatial properties, using a loose focusing geometry(f#≈400)and a20mm long medium.We also theoretically ex-amine the macroscopic conditions for absorption-limited conversion efficiency and optimization of the HHG pulse energy for high-energy laser systems.©2013Author(s).All article content,except where otherwise noted,is licensed under a Creative Commons Attribution3.0Unported License.[/10.1063/1.4812266]I.INTRODUCTIONHigh-order harmonics generated by the nonlinear inter-action of an intense ultrashort laser pulse with atoms ormolecules are now used in manyfields of physics.The in-terest in the generated radiation results from unique featureslike tunability over the extreme ultraviolet(XUV)and softx-ray(SXR)spectral regions(reaching several keV1,2),ex-cellent beam quality,3and ultrashort pulse duration downto the attosecond range.4High-order harmonic generation(HHG)sources are well established in many research areassuch as attosecond science5or femtosecond spectroscopy6and have become interesting for high-resolution imaging,7,8free-electron-laser seeding,9and nonlinear optics in the XUVrange.10,11Most applications of HHG sources benefit from har-monic pulses with high pulse energy.This requirement isdifficult to achieve due to the low conversion efficiency ofthe generation process.Since the discovery of the HHG pro-cess over two decades ago,12,13its conversion efficiency hasbeen progressively improved by optimizing the macroscopicphase-matching conditions and the microscopic single atomresponse.High-order harmonic generation has been carriedout in different conditions,such as high-pressure jets,14gascells,15semi-infinite media,and capillaries.16Phase-matchingoptimization using loosely focused(possibly self-guided)fundamentalfields has led to conversion efficiencies of∼10−7in neon,15∼10−5in argon,17and slightly below10−4in xenon.18,19By modifying the generationfield,e.g.,by com-bining the fundamentalfield with one or more of its harmon-ics,the microscopic single atom response has been controlledon the subcycle level leading to enhanced HHG signals and/orgeneration of even-order harmonics.20–22In this article,we describe a high-flux HHG sourceoperating in the photon energy range up to100eV.The HHG a)Electronic mail:piotr.rudawski@fysik.lth.se setup is designed to work in a loose focusing geometry(up to5m focal length)and is driven by a high energy femtosecondlaser system delivering up to100mJ per pulse.The optimiza-tion of the signal is performed using an automated scan ofthe main parameters that contribute to phase-matching(e.g.,driving pulse intensity,gas pressure,etc.).Using this tech-nique we have obtained a total energy per pulse in argon of amicrojoule and a few hundred nJ in neon,in a geometry withan f-number f#=f/D≈400and f#≈133,respectively, and a20mm long gas cell.Beam profiles were measuredusing an XUV-camera and the coherence properties wereestimated in a Young’s double-slit experiment.The article isorganized as follows.Section II presents theoretical con-siderations for HHG under loose focusing.The HHG setuptogether with the methods for characterization and opti-mization are described in Sec.III.Results obtained with thehigh-energy,ultrashort laser system at the Lund Laser Centreare presented in Sec.IV.Section V presents a summary ofthe work and a discussion about scaling to extremely longfocal lengths.II.MODEL FOR LOOSE FOCUSING HHGHigh-order harmonic generation with high conversionefficiency requires optimization of both the microscopicand macroscopic properties of the process.The microscopicresponse is well described by a semi-classical three-stepmodel.23,24In every half-cycle of the driving wave,electronscan tunnel through the distorted atomic potential barrier,be-ing then accelerated in the intense laserfield.Depending onthe release time into the continuum,the electrons may returnto the parent ion and recombine,emitting an XUV photon.The trajectories of these electrons can be divided into twogroups called short and long,depending on the excursion timein the continuum.HHG requires laser intensities in the rangeof1014W/cm2–1015W/cm2depending on the selected gas.0034-6748/2013/84(7)/073103/7©Author(s)201384,073103-1Macroscopically,the total HHG signal is a coherent sumof the photons emitted from different atoms in the medium.For a given harmonic order q,constructive addition occursalong the propagation direction over the so-called coherencelength L coh=π/ k.Here, k=qk1−k q is the wave-vector mismatch along the propagation direction between the gener-atedfield and the laser-induced polarization at frequency qω.In order to maximize the coherence length,the wave-vectormismatch must be minimized.In a non-guiding focus geom-etry this can be done through the interplay between the foursources of wave-vector mismatch,k= k g<0+ k n>0+ k p<0+ k d<0for z<0>0for z>0.(1)The negative contribution k g originates from theGaussian beam phase gradient along the propagation direc-tion(z). k n and k p describe the neutral and free-electrondispersion which have opposite sign and are proportional tothe gas pressure.To explicitly outline this linear dependence,we write k n,p=p∂( k n,p)/∂p,where the partial derivative is now pressure independent in the following. k d is thegradient of the so-called dipole phase which is proportionalto the intensity gradient and is small for the short trajectoriesbut large for the long ones.25Under our experimental conditions,the short trajectoriesdominate the HHG process.If only these trajectories are con-sidered in Eq.(1),the dipole phase contribution can be ne-glected and the wave-vector mismatch can be minimized bycanceling the plasma dispersion and Gaussian beam phasegradient with the neutral dispersion.For afixed generation ge-ometry,the degree of ionization in the medium determines thepressure,p match,for which the system is phase matched.26,27For each harmonic order,p match is defined asp match=−k g∂( k n)∂p+∂( k p)∂p.(2)For a given medium,harmonic order,and focal length,the only variable parameter is the free-electron contributionwhich is proportional to the degree of ionization(∂ k p/∂p ∝r ion),and consequently can be adjusted by changing the laser intensity.The equation requires the intensity to be lowenough so that the contribution due to neutral dispersion dom-inates over the free-electron dispersion.This defines a maxi-mum ionization degree(r maxion ),typically a few percent,abovewhich phase-matched generation is not possible.Figure1shows the variation of p match in argon as a func-tion of the degree of ionization for three harmonic orders and two different focusing geometries.p match tends towards infin-ity when r ion reaches r maxion .At low degree of ionization,thephase-matching pressure varies little both with pressure and with harmonic order.Considering that the dipole response is highest at the highest intensity,one could assume that the most efficient generation is possible at high pressures andat intensities that support an ionization degree around r maxion .High intensities,however,lead to steep gradients of r ion in the longitudinal and radial directions within the generation volume,confining phase-matched generation to a small vol-ume and leading to transient phase-matching.27In spite ofa FIG.1.Phase matching pressure in Ar as a function of ionization degree for different harmonic orders,q,and different focus geometries f#=100,blue, and f#=400,red.The central wavelength is800nm and the generation cell is placed at the focus of the fundamental beam.higher single atom response at high intensity,those effects can reduce the overall efficiency.An optimum ionization de-gree should assure phase-matched HHG over a broad band-width and a large volume.The ionization level should be such that the phase-matching pressure is approximately constant for a broad range of high-order harmonics,potentially leading to short and intense attosecond pulses.This phase-matching bandwidth increases with decreasing ionization degree yet at the same time the single atom response as well as the con-version efficiency decrease.As a rule of thumb,the optimumvalue for the ionization degree can be taken as∼r maxion/2for the highest harmonic in the considered HHG bandwidth.Un-der the conditions of Figure1,this corresponds to∼2%ion-ization and a laser intensity of∼1.1×1014W/cm2.When the coherence length,L coh,is maximized,the har-monic emission is limited by re-absorption in the generation gas.The absorption length,L abs,is defined byL abs(p)=kTpσion,(3)where k is the Boltzmann constant,T the temperature, andσion the ionization cross-section.Following the ar-gumentation of Constant et al.,28the harmonic yield is then maximized when the medium length,L med,is at least three times the absorption length.This allows to define an optimum medium length under phase-matched conditions,L optmed=3L abs(p match).For example for the21st harmonic in Ar,and f#=400,T=300K,σion=2×10−21m2,p match ≈5mbar,and consequently L opt med should be chosen to be at least12mm.For high-energy laser systems,an increase of the absorption-limited HHG intensity can be achieved by scal-ing up the f#,i.e.,by increasing the focal length for a certain initial beam diameter.The conversion efficiency can be held constant when changing the focal length if the laser pulse en-ergy,the gas pressure,and the medium length are scaled ing Gaussian optics and Eqs.(2)and(3),we derive the following scaling relations:E f(laser energy)∝f2 in order to keep the same intensity at focus,p match∝1/f2since k g∝1/f2,and L med∝f2.This ensures constant conversionFIG.2.Scaling of the phase matching pressure and the required laser pulse energy with focal length(or f#)for different ionization levels in argon.The corresponding minimum laser pulse energy required is shown in red.For the simulations,the following parameters were used:beam diameter before fo-cusing:D=10mm,gas cell position:at the focus,central wavelength of 800nm,harmonic order q=21.The required pulse energy was calculated assuming a peak intensity of1.5×1014W/cm2and a pulse length of45fs. efficiency,independent of the focusing geometry,27and the harmonic energy E h∝f2.Figure2illustrates these scaling relations in the case of argon,with the following parameters:800nm wavelength,a 45fs pulse duration,an intensity of1.5×1014W/cm2at fo-cus,and an initial beam diameter of10mm.A laser pulse energy of10mJ requires a focal length of approximately5m and a generation pressure of a few mbar to efficiently generate harmonics.III.HIGH-ORDER HARMONIC EXPERIMENTAL SETUP Our HHG setup consists of three sections:generation,di-agnostics,and application(see Figure3).The sections are connected by vacuum tubes with a diameterφ=40mm. The generation section is mounted on stiffly connected op-tical tables.The diagnostics section together with the applica-tion chamber are mounted on a rail system.This allows us to adjust the distance between the vacuum chambers depending on the focusing geometry in order to avoid damage of optical elements placed after the generation by the fundamental laser field.It also provides vibration isolation and high stability.High-order harmonics are generated by loosely focusing a high energy laser beam into a noble gas.The fundamen-tal laser beam is apertured down by a variable diameter iris (I),typically between9and30mm and focused by a lens (L).Control of the beam size allows for re-adjustments of the focusing geometry(f#)as well as laser energy and in-tensity distribution at focus.Thus it allows us to optimize phase-matching in a simple way.Directly after the focusing optics,the beam enters the generation chamber.The entrance UV fused silica window is mounted at a small angle to avoid back propagation of the reflected light to the laser system. The beam propagates inside a100mm diameter vacuum tube and is folded by mirrors(M)mounted on small breadboards placed in6-way crosses.Alternatively,the laser beam can be focused by a mirror at near-normal incidence placed in one of the vacuum crosses.The focused beam interacts with the noble gas confined in a cell(PGC).The cylindrical cell has a diameter of typically0.5mm and a length between3mm and20mm.The gas is released at the repetition rate of the laser by a valve29driven by a piezo-electric actuator and syn-chronized with the laser pulse.The opening and closing times are optimized for maximum harmonic signal.Simulating the gas distribution in the cell,we found a small pressure gradient from the middle of the cell,where the gas is injected,towards the ends of the cell,where the pressure abruptly drops.The cell is mounted on an XY motorized stage.Additionally,two motorized actuators control the tilt of the cell with respect to the incoming beam.In order to optimize the position of the cell relative to the laser focus the gas cell is additionally placed on a6cm long-range translation stage moving along the propagation direction(Z).The generation chamber is designed to work simultane-ously with up to two gas cells.The cells can be mounted in parallel or in series.The parallel configuration allows for the generation of two independent harmonic beams30 while the serial configuration can be used for the enhancement of the HHG process using low-order harmonics generated in thefirst cell.22In both configurations,the generated harmonic beam propagates collinearly with the fundamental radiation in vacuum(10−6mbar)to the diagnostics chamber.Elimination FIG.3.HHG setup in the4m focusing configuration;L-focusing lens,I-iris,M-folding mirrors,PGC-pulsed gas cell,F-aluminumfilters,RM-rotating mirror,XS-XUV spectrometer,VS-VUV spectrometer,and XCCD-XUV CCD camera.FIG.4.High-order harmonic spectra in argon(a)and neon(b)gas.The pulse energy per individual harmonics,shown as dots,was obtained by combining total energy measurements with the spectral response from the XUV spectrometer.of the fundamental is achieved by using200nm thick alu-minumfilters(F).Thefilters are mounted on a manual trans-lation stage placed at the entrance of the diagnostics chamber, and controlled from the outside of the vacuum chamber.The alignment of the setup is based on the beam position at the entrance iris and a reference point inside the diagnostics chamber.The precise alignment of the gas cell with respect to the laser beam is done by motorized control of the cell’s four axes(XY and two tilts).The reference point and gas cell are monitored by cameras equipped with variable focal length objectives.At the center of the diagnostics chamber,a gold-coated flat mirror(RM)mounted on a rotation stage is used to send the XUV beam to the different instruments or,when the mir-ror is removed,towards the application chamber(Fig.3). The HHG spectra are measured by aflat-field grating spec-trometer(XS,Jobin-Yvon PGM-PGS200).The spectrome-ter detects spectrally-resolved far-field spatial profiles of in-dividual high-order harmonics in the XUV spectral range. Low-order harmonics are detected using a vacuum ultravio-let monochromator(VS,McPherson234/302).The vacuum ultraviolet spectrometer is equipped with an MCP detector coated with CsI allowing HHG diagnostics in a range from50 to250nm.Additionally,spatial profiles and energy measure-ments are carried out using a back-illuminated XUV-CCD (Andor iKon-L)camera(XCCD).To attenuate the HHG beam for these measurements we use one or two200nm thick alu-minumfilters.IV.RESULTSThis section presents measurements of high-order har-monics generated in argon and neon.The driving laser sys-tem is a high-power Ti:Sapphire chirped-pulse-amplification-based laser system delivering45fs pulses with up to100mJ energy at10Hz repetition rate.Before compression,the laser beam is spatiallyfiltered with a conical pinhole mounted in a vacuum chamber.The pinhole waist,approximately500μm, is placed in a focal plane of a1.7m focal length lens.The laser beam diameter is30mm at the entrance to the har-monic setup.The laser beam position and angle are actively stabilized.Figure4presents typical integrated harmonic spectra for (a)argon and(b)neon,recorded by the XUV spectrometer. The driving laser beam,with20mJ energy in case of argon and24mJ in case of neon,was focused by a4m lens in a 20mm long cell.The HHG cut-off energy is45eV(29th harmonic)in argon whereas in neon it reaches91.5eV (59th harmonic).Under these conditions the total measured harmonic energy per laser shot is1.15μJ for argon and 0.23μJ for neon.These values correspond to conversion ef-ficiencies of5×10−5for argon and8×10−6for neon.Due to the high sensitivity of the XUV camera to the infrared radi-ation,the harmonic beam energy is measured within the alu-minumfilter’s transmission window,i.e.,between14eV and 71eV,corresponding to harmonic orders between11and45. The measurement procedure,similar to the one described by Erny et al.,31is based on XUV-CCD recorded background-subtracted images.The images are integrated to obtain the total number of counts.The total number of photons is es-timated based on a calibration curve from the manufacturer. The individual harmonic energy is obtained by multiplying the total HHG beam energy with the relative intensity of each harmonic measured by the spectrometer.The spectrum is cor-rected for the folding mirror reflection(based on data from Henke et al.32),the grating efficiency,and the measured alu-minumfilter transmission.The estimated pulse energy per harmonic is shown as dots in Figure4.The most promi-nent harmonic,both in argon and neon is the21st harmonic (32.5eV).Its energy is250nJ in argon and30nJ in neon. These values are comparable to previous results obtained by Takahashi et al.15,17Tofind the optimum high-order harmonic energies an au-tomated optimization procedure was carried out.This proce-dure is briefly summarized here.The important parameters to control are:fundamental beam energy and diameter(before focusing),gas pressure,and gas cell position relative to the laser focus.The energy of the fundamental beam is varied by an attenuator consisting of a half-wave plate mounted on a motorized rotation stage and a polarizer.We use a motorizedFIG.5.Intensity of the 21st harmonic generated in argon as a function of driving laser energy and generation gas pressure.The measurements were carried out for a gas cell placed at the laser focus for three iris sizes:(a)φ=22mm,(b)φ=24mm,and (c)φ=32mm.The values of a harmonic intensity between the measured points,shown as black dots,were interpolated.variable iris to change the diameter of the fundamental beam before focusing.The distance between the center of the cell and the fundamental beam waist is varied by moving the cell.Finally,the gas pressure in the cell is adjusted by controlling a voltage applied to the cell nozzle’s piezoelectric disks.We record a set of harmonic spectra while varying these four pa-rameters in an automated way.Either the total HHG energy or the energy of a single harmonic can be optimized.Our op-timization procedure allows us to routinely obtain HHG ener-gies at the level of several hundred nJ in argon and a few tens of nJ in neon.An example of the automated optimization is presented in Figure 5,where we investigated the dependence of the in-tensity of the 21st harmonic generated in argon as a func-tion of the laser energy and gas pressure for three iris diam-eters.The signal is normalized to the maximum obtained for 21st harmonic.The recorded data show that for increasing iris size (decreasing f #),the required laser energy decreases and the phase matching pressure increases in agreement with our model prediction (see Fig.1).Similar optimization in neon shows,as expected,a higher p match .The optimum iris diam-eter corresponds to the longest Rayleigh range (the highest f #)for which the phase-matching conditions can be achieved,while keeping a high enough intensity at focus.It assures the highest HHG beam energy as is shown in Sec.II .Figure 6(a)shows the spatial profile of high-order har-monics generated in argon and transmitted through an alu-minum filter.The corresponding orders are between 11and 45.The back-panel shows that the intensity distribution is al-most perfectly Gaussian.Similar high quality Gaussian beams were generated in neon.The high spatial quality of the gen-erated beams is due partly to the spatial quality of the driving beam,and partly to optimized phase-matching along the prop-agation axis.In our conditions,IR and XUV beams distortion due to nonlinear and plasma effects are negligible.The generated beams divergence carries information about the contribution from the electronic trajectories.The divergence of the “short trajectory”harmonic beam is usu-ally much smaller than for the “long trajectory”harmonics.For the 21st harmonic generated in argon,the divergence of the beam resulting from the long trajectory is 14times higher than that from the short trajectory.36The different divergence is a consequence of a larger accumulated phase on the long trajectories.The analysis of the harmonic beam divergence shows that the main contribution to HHG in our conditions comes from the short trajectories.The contribution from the long trajectories is visible on the analyzed CCD images as a weak background.30To estimate the spatial coherence of the HH beam,we performed a double-slit experiment.The degree of coherence of the HHG beam can be estimated from the fringe contrast in the diffraction pattern.33,34The slits used in this experiment had a width of 40μm,a slit separation of 400μm,and were located 1.5m from the source.Figure 6(b)shows a cross-section of the double-slit diffraction pattern obtained with a single shot exposure.The experimental data were fittedwithFIG.6.(a)Spatial profile of the harmonic beam generated in Ar by focusing fundamental radiation with 2m focal length lens into a 10mm long cell,recorded with an x-ray CCD camera.The back-panel shows the cross-section of the beam (gray,dotted line),and a fitted intensity distribution (blue,dashed line),(b)Diffraction pattern created in a double-slit experiment,experimental data (blue,solid line),and fitted intensity distribution (red,dashed line).a theoretical intensity function formed by the sum of diffrac-tion patterns of the different harmonic wavelengths within thetransmission window of thefilter.The bestfit was found witha degree of coherence of0.8,in good agreement with previousmeasurements.35V.SUMMARY AND OUTLOOKWe have developed a high-energy HHG setup,working ina loose focusing geometry,generating a total energy per laserpulse of a microjoule in argon and a few hundred nJ in neon.The source is designed for future studies of nonlinear pro-cesses in the XUV spectral range.The high harmonic pulseenergies together with their high spatial coherence allow usto reach high peak intensities.For example,an intensity of2×1014W/cm2per harmonic pulse could be reached by fo-cusing the HHG beam generated in argon using a broadbandgrazing-incident mirror,assuming a3μm focal spot size,20fs duration,and30%transmission after reflection andfilteringby an Alfilter.Our theoretical analysis of phase-matching in theabsorption-limited case provides a simple guide for scalingHHG properties to high laser energies.For example we esti-mate that with E f=1J,f=50m,p=0.01mbar,and L med =6m,harmonic pulses with energy as high as70μJ could be reached.Further increase in energy could be achieved by mod-ifying the single atom response,e.g.,using a double-cellscheme.22Our current beam line includes the option to drivethe HHG process with two cells or to use an interferometricsetup in order to combine the fundamental with itself,its sec-ond or third harmonic(ω/ω,ω/2ω,andω/3ω),thus providinga large range of options for modifying the drivingfield.Our experimental results combined with the above con-siderations show that HHG has the potential to provide in-tense ultrashort pulses reaching the intensity levels requiredfor nonlinear experiments in the XUV spectral range. ACKNOWLEDGMENTSThis research was supported by the Marie Curie programATTOFEL(ITN),the European Research Council(ALMA),the Joint Research Programme ALADIN of Laserlab-EuropeII,the Swedish Research Council,the Swedish Foundationfor Strategic Research,and the Knut and Alice WallenbergFoundation.1C.Spielmann,N.H.Burnett,S.Sartania,R.Koppitsch,M.Schnürer,C. Kan,M.Lenzner,P.Wobrauschek,and F.Krausz,“Generation of coherent x-rays in the water window using5-femtosecond laser pulses”Science278, 661(1997).2T.Popmintchev,M.-C.Chen,D.Popmintchev,P.Arpin,S.Brown,S.Al-isauskas,G.Andriukaitis,T.Balciunas,O.D.Mücke,A.Pugzlys,A.Bal-tuska,B.Shim,S.E.Schrauth,A.Gaeta,C.Hernández-García,L.Plaja, A.Becker,A.Jaron-Becker,M.M.Murnane,and H.C.Kapteyn,“Bright coherent ultrahigh harmonics in the keV x-ray regime from mid-infrared femtosecond lasers,”Science336,1287–1291(2012).3Y.Tamaki,J.Itatani,M.Obara,and K.Midorikawa,“Highly coherent soft x-ray generation by macroscopic phase matching of high-order harmon-ics,”Jpn.J.Appl.Phys.40,L1154–L1156(2001).4E.Goulielmakis,M.Schultze,M.Hofstetter,V.S.Yakovlev,J.Gagnon,M. Uiberacker,A.L.Aquila,E.M.Gullikson,D.T.Attwood,R.Kienberger,F.Krausz,and U.Kleineberg,“Single-cycle nonlinear optics,”Science320, 1614(2008).5F.Krausz and M.Ivanov,“Attosecond physics,”Rev.Mod.Phys.81,163–234(2009).6S.L.Sorensen,O.Bjorneholm,I.Hjelte,T.Kihlgren,G.Ohrwall,S. Sundin,S.Svensson,S.Buil,D.Descamps,and A.L’Huillier,“Femtosec-ond pump-probe photoelectron spectroscopy of predissociative states in acetylen,”J.Chem.Phys.112,8038(2000).7R.L.Sandberg,C.Song,P.W.Wachulak,D.A.Raymondson,A.Paul,B. Amirbekian,E.Lee,A.E.Sakdinawat,-O-V orakiat,M.C.Marconi, C.S.Menoni,M.M.Murnane,J.J.Rocca,H.C.Kapteyn,and J.Miao,“High numerical aperture tabletop soft x-ray diffraction microscopy with 70-nm resolution,”Proc.Natl.Acad.Sci.U.S.A.105,24–27(2008).8J.Schwenke,E.Lorek,R.Rakowski,X.He,A.Kvennefors,A.Mikkelsen, P.Rudawski,C.M.Heyl,I.Maximov,S.-G.Pettersson,A.Persson,and A. L’Huillier,“Digital in-line holography on amplitude and phase objects pre-pared with electron beam lithography,”J.Microsc.247,196–201(2012). mbert,T.Hara,D.Garzella,T.Tanikawa,bat,B.Carre,H. Kitamura,T.Shintake,M.Bougeard,S.Inoue,Y.Tanaka,P.Salieres,H. Merdji,O.Chubar,O.Gobert,K.Tahara,and M.-E.Couprie,“Injection of harmonics generated in gas in a free-electron laser providing intense and coherent extreme-ultraviolet light,”Nat.Phys.4,296–300(2008).10E.P.Benis,D.Charalambidis,T.N.Kitsopoulos,G.D.Tsakiris,and P. Tzallas,“Two-photon double ionization of rare gases by a superposition of harmonics,”Phys.Rev.A74,051402(R)(2006).11K.Ishikawa and K.Midorikawa,“Two-photon ionization of He+as a non-linear optical effect in the soft-x-ray region,”Phys.Rev.A65,043405 (2002).12M.Ferray,A.L’Huillier,X.F.Li,G.Mainfray,and C.Manus,“Multiple-harmonic conversion of1064nm radiation in rare gases,”J.Phys.B21, L31(1988).13A.McPherson,G.Gibson,H.Jara,U.Johann,T.S.Luk,I.A.McIn-tyre,K.Boyer,and C.K.Rhodes,“Studies of multiphoton production of vacuum-ultraviolet radiation in the rare gases,”J.Opt.Soc.Am.B4,595 (1987).14T.Brabec and F.Krausz,“Intense few-cycle laserfields:Frontiers of non-linear optics,”Rev.Mod.Phys.72,545(2000).15E.J.Takahashi,Y.Nabekawa,and K.Midorikawa,“Low-divergence co-herent soft x-ray source at13nm by high-order harmonics,”Appl.Phys. Lett.84,4–6(2004).16T.Popmintchev,M.C.Chen,P.Arpin,M.M.Murnane,and H.C.Kapteyn,“The attosecond nonlinear optics of bright coherent x-ray generation,”Nat. Photonics4,822–832(2010).17E.Takahashi,Y.Nabekawa,T.Otsuka,M.Obara,and K.Midorikawa,“Generation of highly coherent submicrojoule soft x rays by high-order harmonics,”Phys.Rev.A66,021802(R)(2002).18E.Takahashi,Y.Nabekawa,and K.Midorikawa,“Generation of10-μJ co-herent extreme-ultraviolet light by use of high-order harmonics,”Opt.Lett. 27,1920(2002).19J.-F.Hergott,M.Kovacev,H.Merdji,C.Hubert,Y.Mairesse,E.Jean, P.Breger,P.Agostini,B.Carré,and P.Salières,“Extreme-ultraviolet high-order harmonic pulses in the microjoule range,”Phys.Rev.A66, 021801(R)(2002).20J.Mauritsson,P.Johnsson,E.Gustafsson,A.L’Huillier,K.J.Schafer,and M.B.Gaarde,“Attosecond pulse trains generated using two color laser fields,”Phys.Rev.Lett.97,013001(2006).21D.Shafir,H.Soifer,B.D.Bruner,M.Dagan,Y.Mairesse,S.Patchkovskii, M.Yu.Ivanov,O.Smirnova,and N.Dudovich,“Resolving the time when an electron exits a tunneling barrier,”Nature(London)485,343–346 (2012).22F.Brizuela, C.M.Heyl,P.Rudawski, D.Kroon,L.Rading,J.M. Dahlström,J.Mauritsson,P.Johnsson,C.L.Arnold,and A.L’Huillier,“Efficient high-order harmonic generation boosted by below-threshold har-monics,”Sci.Rep.3,1410(2013).23P.B.Corkum,“Plasma perspective on strong-field multiphoton ionization,”Phys.Rev.Lett.71,1994(1993).24K.J.Schafer,B.Yang,L.F.DiMauro,and K.C.Kulander,“Above thresh-old ionization beyond the high harmonic cutoff,”Phys.Rev.Lett.70,1599 (1993).25P.Salières,A.L’Huillier,and M.Lewenstein,“Coherence control of high-order harmonics,”Phys.Rev.Lett.74,3776(1995).26S.Kazamias,D.Douillet,F.Weihe,C.Valentin,A.Rousse,S.Sebban,G. Grillon,F.Augé,D.Hulin,and P.Balcou,“Global optimization of high harmonic generation,”Phys.Rev.Lett.90,193901(2003).。

C4_旋转对称光子晶体平板中的对称保护连续谱束缚态

C4_旋转对称光子晶体平板中的对称保护连续谱束缚态

第40卷第3期Vol.40㊀No.3重庆工商大学学报(自然科学版)J Chongqing Technol &Business Univ(Nat Sci Ed)2023年6月Jun.2023C4旋转对称光子晶体平板中的对称保护连续谱束缚态张铭洋重庆工商大学数学与统计学院,重庆400067摘㊀要:在光子晶体平板中,连续谱束缚态关于C2和C6旋转对称的依赖性已经在数值上进行了广泛研究,但是缺少严格的理论分析过程,此外还缺少对C4旋转对称的研究,鉴于此,构建了系统分析连续谱束缚态关于所有旋转对称的依赖性的理论,并且重点研究了C4旋转对称的情况;首先,通过分析具有旋转对称的结构中麦克斯韦方程组特征解的性质,将连续谱束缚态的存在性问题转变为旋转矩阵的特征值是否与一个简单代数方程的解相同的问题;其次,给出了C4旋转对称的结构中连续谱束缚态存在时所对应的条件;然后,证明了破坏C4旋转对称保持C2旋转对称时,连续谱束缚态依然存在;最后,利用有限元软件FreeFEM 进行了大量的数值验证;上述理论可适用于所有旋转对称的情况,深入揭示了旋转对称对连续谱束缚态存在的重要性,深入揭示了高阶旋转对称性与低阶旋转对称性之间的依赖关系,为连续谱束缚态的实际应用提供了理论指导㊂关键词:光子晶体;旋转对称;连续谱束缚态中图分类号:O436㊀㊀文献标识码:A㊀㊀doi:10.16055/j.issn.1672-058X.2023.0003.09㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀㊀收稿日期:2022-05-13㊀修回日期:2022-06-20㊀文章编号:1672-058X(2023)03-0064-07基金项目:重庆市自然科学基金面上项目(CSTC2019JCYJ -MSXMX0717).作者简介:张铭洋(1997 ),女,重庆忠县人,硕士研究生,从事光子晶体㊁麦克斯韦方程组数值计算研究.引用格式:张铭洋.C4旋转对称光子晶体平板中的对称保护连续谱束缚态[J].重庆工商大学学报(自然科学版),2023,40(3):64 70.ZHANG Mingyang.Symmetry-protected bound states in the continuum in C4rotationally symmetric photonic crystal plates J .Journal of Chongqing Technology and Business University Natural Science Edition 2023 40 3 64 70.Symmetry-protected Bound States in the Continuum in C 4Rotationally Symmetric Photonic Crystal Plates ZHANG MingyangSchool of Mathematics and Statistics Chongqing Technology and Business University Chongqing 400067 ChinaAbstract The dependence of bound states in the continuum BICs on C2and C6rotational symmetry in photonic crystalplates has been extensively studied numerically.However a rigorous theoretical analysis process is lacking and there is alack of studies on C4rotational symmetries.In view of this a theory of systematic analysis of the dependence of BICs on all rotational symmetries was constructed and the case of C4rotational symmetry was mainly studied.Firstly by analyzingthe characteristic solutions of Maxwell s equations with rotationally symmetric structure the problem of the existence of BICs was transformed into the question of whether the eigenvalue of the rotation matrix was the same as the solution of asimple algebraic equation.Secondly the conditions for the existence of BICs in C4rotationally symmetric structures weregiven.Then it was proved that the BICs still existed when C4rotation symmetry was destroyed and C2rotatory symmetrywas maintained.Finally the finite element software FreeFEM was used to do a lot of numerical verifications.The abovetheory can be applied to all cases of rotational symmetries revealing the importance of rotational symmetry for the existenceof BICs.The dependence between high-order and low-order rotational symmetries was revealed providing theoretical guidance for applying BICs.Keywords photonic crystal rotational symmetry bound states in the continuum第3期张铭洋,等:C4旋转对称光子晶体平板中的对称保护连续谱束缚态1㊀引㊀言光学连续谱束缚态(bound states in the continuum, BIC)是指位于连续谱中的导模,其不能与辐射场耦合,没有能量辐射,被完美地束缚在结构中[1-3]㊂数学上,光学连续谱束缚态是指开结构中麦克斯韦方程组的一类频率位于连续谱内的平方可积特征解㊂通常,导模(即平方可积特征解)的特征频率位于连续谱外㊂1929年冯诺依曼等[4]从数学模型上发现在一些特殊的结构中存在特征频率位于连续谱内的导模㊂直到1985年文献[5]才构造出具有连续谱束缚态的真实物理系统㊂2008年,文献[3]研究了光子晶体结构中的连续谱束缚态㊂此后,连续谱束缚态受到广泛关注,与之有关的研究快速发展㊂目前,连续谱束缚态的概念和研究已推广到水波㊁声波等其他波动现象[1]㊂连续谱束缚态可看成为品质因子为无穷大的共振,只存在于若干离散的频率上㊂连续谱束缚态由共振模式所包围㊂通过扰动波矢,可在连续谱束缚态附近找到任意大小品质因子的共振模式[6]㊂此性质使得连续谱束缚态在光学㊁光子学等领域都拥有广阔的应用前景㊂目前,连续谱束缚态已在波导㊁光栅㊁光子晶体及超材料等结构中被广泛研究[7],光子晶体中的连续谱束缚态现在已经被用于传感器,激光器和滤波器的设计当中[8-10]㊂通常,共振模式的品质因子与波矢之差的平方成反比㊂文献[6]证明了存在特殊的连续谱束缚态使得附近共振模式的品质因子与波矢之差的四次方和六次方成反比,并给出了两类特殊连续谱束缚态的条件㊂从实际应用角度来讲,在这些特殊的连续谱束缚态附近更容易构造出高品质因子的共振模式㊂连续谱束缚态可以大致分为两类:对称保护的连续谱束缚态[11-15]和非对称保护的连续谱束缚态[2,16-19]㊂对称保护的连续谱束缚态(Symmetry Protected Bound states in the continuum,SPBIC)的机理是:在对称结构中,布洛赫模的对称性与结构中辐射场的对称性不相容,从而与辐射场不耦合,变成一个连续谱束缚态[3]㊂而非对称保护连续谱束缚态的存在机理是:共振模式的辐射场之间发生干涉相消现象,造成没有辐射,成为连续谱束缚态[3]㊂连续谱束缚态的存在性与结构的对称性具有密切联系㊂早期研究结果都是在对称结构中研究连续谱束缚态,学术界一度认为连续谱束缚态只存在于对称结构中㊂目前数学上还没有非对称保护连续谱束缚态的存在性理论㊂非对称保护连续谱束缚态关于结构对称性的依赖关系非常复杂㊂文献[20-23]从数值和实验上演示了破坏二维结构的C2旋转对称性后,连续谱束缚态演化为共振模式㊂这间接说明了结构的对称性对非对称保护连续谱束缚态的存在性具有重要影响㊂但是破坏对称性后连续谱束缚态是否一定会演化为共振模式并没有明确的结论㊂最近,文献[24-26]证明了只要引入足够多的结构扰动参数,连续谱束缚态可以连续存在于非对称的结构中,且对于不同类型的连续谱束缚态,所需要引入的最小参数的数量是不同的㊂上述结论表明,非对称保护连续束缚态可以存在于非对称结构中,只要结构的自由参数足够多㊂对称保护连续谱束缚态只存在于对称结构中㊂光子晶体平板可具有四类旋转对称性:C2㊁C3㊁C4和C6旋转对称性,即分别旋转180㊁120㊁90和60度后结构不变㊂文献[11-12]首先从数学理论上证明了在具有C2旋转对称的二维介质结构中对称保护连续谱束缚态的存在性,在非对称结构中一定不存在对称保护连续谱束缚态㊂研究对称保护连续谱束缚态对上述四种不同类型对称性的连续依赖性具有十分重要的意义㊂文献[21]研究了C6旋转对称性对具有拓扑电荷为q=-2的对称保护连续谱束缚态存在性的影响㊂通过数值计算发现破坏C6对称保持C2对称,对称保护连续谱束缚态依然存在,但是变成拓扑电荷q=-1;破坏C6对称保持C3对称,对称连续谱束缚态演化为共振模式,而且会产生两个非对称保护连续谱束缚态㊂上述研究结果给出了一种产生非对称保护连续谱束缚态的方法㊂目前,对于拓扑电荷为q=-1或q=1的对称保护连续谱束缚态关于结构对称性的依赖关系没有进行系统讨论,缺乏严格系统的依赖性理论㊂研究C4旋转对称结构中对称保护连续谱束缚态关于对称性的依赖关系㊂建立了严格数学理论证明破坏C4对称保持C2对称,对称保护连续谱依然存在㊂并利用有限元软件FreeFEM进行数值验证㊂相比于以前的研究,研究既有严格的数学理论,又有数值验证㊂研究成果具有一般性,可推广到分析对称保护连续谱束缚态关于C6对称性的依赖性,有利于深入理解连续谱束缚态关于对称性的依赖关系,为其实际应用提供理论指导㊂2㊀连续谱束缚态考虑一个在x与y方向为周期,在z方向上厚度有限的光子晶体平板㊂光子晶体平板通过在平板上构造正方形空气柱晶格所构成㊂设平板厚度为2D,晶格常数为L,平板的介电常数为ε1,空气的介电常数为ε0= 1㊂记整个结构的介电常数为ε(r),其中r=(x,y,z),则z>D时有ε(r)=1,且ε(r)满足ε(r)=ε(x+mL,y+nL,z)(1)其中,m与n为任意整数㊂设光子晶体平板是无磁性㊁各向同性的,由麦克斯56重庆工商大学学报(自然科学版)第40卷韦方程组可知,具有时间依赖e -iwt 的时谐波的电场E 满足如下的控制方程:∇ˑ∇ˑE -k 2εE =0∇㊃(εE )=0其中k =ωc为真空中的波速,ω为角频率,c 为真空中的光速㊂光子晶体平板中的布洛赫模(即麦克斯韦方程组的特征解)可写成:E (r )=Φ(r )e ik ㊃r其中,k =(α,β,0)为布洛赫波矢,实数α与β分别为x 与y 方向的布洛赫波速,Φ(r )满足周期条件式(1)㊂由于在z >D 时,结构是均匀的,由傅立叶展开式与分离变量法可知,满足向外辐射条件的布洛赫模可以展开为[18]E (r )=ð+ɕm ,n =-ɕd ʃm ,neik ʃm ,n㊃r ʃz >D (2)其中,常数向量d ʃm ,n满足d ʃm ,n ㊃k ʃm ,n =0,k ʃm ,n=(αm ,βn ,ʃγm ,n ),αm =α+2m πL ,βn =β+2n πL,γm ,n =k 2-α2m -β2n ㊂若结构是无耗散的,即ε(r )为非负实函数,则布洛赫模可以分为三类:导模㊁共振以及连续谱束缚态㊂(1)若k 为实数,则布洛赫模为导模㊂可以证明k 为实数等价于E (r )满足lim z ңɕE (r )=0,即没有能量辐射,没有能量损失㊂当波速和布洛赫波矢满足条件0<k <α2+β2时,导模关于α与β连续存在㊂在上述条件下,γm ,n 的虚部都大于0,所以展开式(2)中每一个平面波都在无穷远出衰退到0,即E (r )自动满足lim z ңɕE (r )=0,从而是一个导模㊂(2)若k 为复数,则布洛赫模为共振模式㊂共振模式的波速k 满足[Re(k )]2-[lm(k )]2>α2+β2㊂由于共振的波速k 为复数且满足向外辐射条件,共振满足条件lim z ңɕE (r )=ɕ,即在空间上是无限增大,但随时间指数衰退㊂共振波速k (或共振频率ω)的虚部小于零,即lm(k )<0,它表示共振随着时间衰退的速度㊂共振的品质因子Q 定义为Q =-12Re(k )lm(k ),表示共振模式的振幅衰退到原来的e -1时所需要的振荡周期㊂共振模式关于α与β也是连续存在的㊂(3)若k 为实数且满足k >α2+β2,则布洛赫模是一个连续谱束缚态㊂连续谱束缚态可以看成是一个Q 因子为无穷大的共振,只在离散的(α,β)点上存在,在连续谱束缚态的附近,通过调整α与β可以获得任意大小Q 因子的共振㊂由于k 为实数等价于条件limz ңɕE (r )=0,在展开式(2)中,若lm(γm ,n )ȡ0,则e ikm ,n㊃r可向z ңɕ辐射能量,(m ,n )是对应一个开放的辐射通道㊂若lm(γm ,n )<0,则eik ʃn ,m ㊃r在z ңɕ时衰退到零,对应一个关闭的辐射通道㊂若Re(k )>α2+β2,则至少有lm(γ0,0)ȡ0,即(0,0)处辐射通道是开放的㊂记Z 0表示所有开放的辐射通道,即Z 0=(m ,n )lm(γm ,n )ȡ0{},则条件lim z ңɕE (r )=0等价于d ʃm ,n=0,∀(m ,n )ɪZ 0(3)式(3)是布洛赫模的一个附加条件,在一般情况下,连续谱束缚态不容易存在㊂3 对称保护连续谱束缚态当光子晶体平板具有旋转对称性时,可能存在对称保护连续谱束缚态㊂下面给出具有C n 旋转对称的光子晶体平板中对称保护连续谱束缚态的定义,并分析其关于对称性的依赖关系㊂利用旋转对称性下布洛赫模的性质,将连续谱束缚态的存在性问题转变为旋转矩阵的特征值是否与一个简单代数方程的解相同的问题;其次,给出了C4旋转对称的结构中连续谱束缚态存在时所对应的条件;然后,证明了破坏C4旋转对称保持C2旋转对称时,连续谱束缚态依然存在;具有C n 旋转对称性结构的介电函数ε(r )满足条件ε(r )=ε(T -1r )其中,T =cos φ-sin φ0sin φcos φ0001éëêêêùûúúú表示旋转矩阵,φ=2πn ,只考虑n =2与n =4的情况㊂其理论可推广到n =3与n =6的情况㊂C n 旋转对称光子晶体平板中的布洛赫模具有以下性质[24]:若E (r )=Φ(r )e ik ㊃r 是一个对应于波矢k =(α,β,0)和波速k 的布洛赫模,则TE (T -1r )是一个对应于波矢Tk 和波速k 的布洛赫模㊂特别地,取α=β=0,即k =(0,0,0),有Tk =k ,此时E (r )与TE (T -1r )是对应同一个波矢与波速的两个布洛赫模㊂若特征值问题是非退化的,则E (r )与TE (T -1r )线性相关,即存在常数τ使得:TE (T -1r )=τE (r )(4)由于任何结构旋转n 次2πn角度(即2π)后都不变,有T n=I ,其中I 表示单位算子㊂所以有τn=1,即τ=e i 2πn j,j =0,1,2, ,n -1㊂更具体地,当n =2时,τ=ʃ1;当n =4时,τ=ʃ1,ʃi ㊂注意到τ的取值对应于C n 点群的不可约表示的特征㊂设布洛赫模的波矢为k =(0,0,0)且频率满足0<Re(k )ε0<2πL,即只有(0,0)处辐射通道是开放的,则当n =2时,对应于τ=1以及当n =4时,对66第3期张铭洋,等:C4旋转对称光子晶体平板中的对称保护连续谱束缚态应于τ=ʃ1的布洛赫模一定是连续谱束缚态,称为对称保护连续谱束缚态㊂在上述条件下,布洛赫模是一个连续谱束缚态的充要条件是d ʃ0,0=0㊂下面证明当n =2时,τ=1以及当n =4时,τ=ʃ1,有d ʃ0,0=0㊂将展开式(2)代入条件式(4)得:Td ʃ0,0=τd ʃ0,0(5)令,d ʃ0,0=d ʃx ,d ʃy ,d ʃz []T ,T ^=cos φ-sin φsin φcos φéëêêùûúú,d ʃʅ=d ʃx ,d ʃy []T 表示d ʃ0,0的x 与y 分量所构成的向量㊂由d ʃ0,0㊃k ʃ0,0=0且α=β=0,知d ʃz k =0,即d ʃz =0㊂所以d ʃ0,0=0等价于d ʃʅ=0㊂由式(5)得:T ^d ʃʅ=τd ʃʅ若τ不是T ^的特征值,则必有d ʃʅ=0㊂当n =2时,T ^=-100-1éëêêùûúú,T ^只有一个特征值-1㊂所以当τ=1时,有d ʃʅ=0㊂当n =4时,T ^=0-110éëêêùûúú,此时T ^的特征值为ʃi ㊂所以当τ=ʃ1时有d ʃʅ=0㊂由上面的证明过程可知,条件d ʃ0,0=0是由C n 对称性所保证的㊂在具有C4旋转对称的结构中,对称保护连续谱束缚态对应的τ=1或-1㊂注意到无论是τ=1还是-1,都有τ2=1,即这些连续谱束缚态也同时由C2旋转对称所保护㊂有以下结论:具有C4旋转对称结构中的对称保护连续谱束缚态都是由C2旋转对称所保护的,即破坏C4旋转对称,保持C2旋转对称,这些连续谱束缚态依然存在㊂4㊀拓扑电荷连续谱束缚态对应于动量空间中辐射场的漩涡,因此其存在性与拓扑性质有关㊂前面提到了布洛赫模是一个连续谱束缚态的充要条件是d ʃ0,0=0,通过d ʃ0,0的x 与y 分量可以计算得到辐射场的极化角㊂极化椭圆的长轴与y 轴的夹角称之为极化角度,记为θ㊂θ可以看成是α与β的函数,即θ=θ(α,β)㊂在αβ平面上,任意给定一条曲线Γ,让(α,β)沿着Γ绕一圈重新定义θ,使其为连续函数㊂拓扑电荷的定义为q =12πɥΓd θ=12πɥΓ∇θ㊃ n d s拓扑电荷q 表示αβ上的一点绕Γ走一圈后,极化角度旋转了q 圈,q 是一个整数㊂若Γ所围区域内无圆极化与连续谱束缚态,则q =0;若Γ所围区域内有且仅有一个连续谱束缚态则q =ʃ1,ʃ2,ʃ3, ;若Γ所围区域内只有一个圆极化,则q =ʃ12㊂需要注意的是圆极化和连续谱束缚态是αβ平面中的一个极化奇点㊂5㊀数值实验由于辐射边界条件下的特征值问题定义在无穷区间上,无法用数值方法来计算㊂所以在实际计算连续谱束缚态的时候,可以用完美匹配层的方法来将无穷区域截断为有限区域㊂用完美匹配层截断后的特征值问题是原特征值问题的一个近似,它们之间的误差关于完美匹配层的参数σ∗㊁H 2-H 1(即完美匹配层的厚度)指数衰退到零㊂所以只需要选择合适的σ∗与H 2-H 1,便可以得到足够精确的特征解,即可以计算得到连续谱束缚态的频率㊂相对于拟周期边界条件,在有限元方法中周期边界条件更容易实现㊂用有限元方法求解偏微分方程最重要的是弄清楚解空间和变分形式㊂在用有限元求解时,变分问题被近似为下列代数方程的特征值问题:A Φ=k 2B Φ其中,A 与B 为矩阵㊂考虑如图1所示的具有正方形晶格空气柱的光子晶体平板,其俯视图如图2所示㊂平板是由空气所包围的㊂平板的厚度为2D =0.5L ,介电常数为ε1=4,空气中的介电常数ε0=1㊂空气柱体横截面参数分别为w =0.2L ,a =w2㊂若h 1=h 2,则结构具有C4旋转对称性㊂若h 1ʂh 2,则结构只有C2旋转对称性㊂为了验证前面得到的理论,用完美匹配层[27]的方法将无穷区域上的特征值问题转化为有限区域上的特征值问题,并用有限元[28]的方法求解㊂数值计算时,需要用完美匹配层方法将z 方向截断为-H 2,H 2[],如图3所示㊂取完美匹配层的厚度为H 2-H 1=L ,σ∗=18ˑm +1β0(H 2-H 1)[29],β0=k 20ε0-α2-γ2,m =3㊂其中H 1-D 表示完美匹配层的远近㊂采用基于FreeFEM 软件的有限元方法来数值求解特征值问题,以计算对称保护连续谱束缚态的频率㊂计算时在平板的每个边界的离散点个数取N =10,PML 层的离散点个数也取N =10㊂考虑如此复杂结构的原因是为了避免其他对称性(例如镜面反射对称)对结果的影响㊂ε=ε0ε=ε0z =D z =-Dzxy图1㊀光子晶体平板结构图Fig.1㊀Structure diagram of photonic crystal plates76重庆工商大学学报(自然科学版)第40卷h 2h 1h 1h 2L Lαωωxy图2㊀光子晶体平板结构的俯视图Fig.2㊀Top view of the photonic crystal flat plate structurez=H2z=H1z=D z=-Dz=-H1z=-H2ε=εε=εzxy 图3㊀PML截断后的计算区域Fig.3㊀Computation region after PML truncation 若取h1=h2=0.15L,这时结构具有C4旋转对称性㊂通过数值计算,可以找到5个TM-Like模式下(即E z是z变量的奇函数)的对称保护连续谱束缚态,其频率如表1的第2列所示㊂图4(a) 图8(a)分别是SPBIC1-SPBIC5在具有C4旋转对称的结构中log10Q 关于α与β的值㊂可以通过观察得到当(α,β)ң(0, 0)时,Q的值趋近于无穷大㊂图4(c) 图8(c)分别是SPBIC1到SPBIC5在C4旋转对称结构中的磁场z分量H z在z=0时的场图㊂从下面的场图可以观察得到, SPBIC1与SPBIC5对应于τ=1,其他3个对称保护连续谱束缚态对应于τ=-1㊂表1的最后一列表示为对称保护连续谱束缚态的拓扑电荷㊂若保持h1=0.15L,令h2=0.1L,参数扰动后结构的C4旋转对称性被破坏,但保持了C2旋转对称性㊂通过数值计算表明,SPBIC1-SPBIC5在扰动后的结构中依然存在,其频率如表1的第三列所示,可以发现两种结构下连续谱束缚态的频率近乎相等㊂图4(b) 图8(b)分别是SPBIC1到SPBIC5在具有C2旋转对称的结构中log10Q关于α与β的值㊂可以通过观察得到当(α,β)ң(0,0)时,Q的值趋近于无穷大,并且可以发现两种结构下,log10Q关于α与β的值很相近㊂图4(d) 图8(d)分别代表的是扰动后SPBIC1-SPBIC5在C2旋转对称结构中的磁场z分量H z在z=0时的场图㊂从下面的场图可以观察得到,SPBIC1与SPBIC5仍然对应于τ=1,其他3个对称保护连续谱束缚态也依旧对应于τ=-1㊂通过对比,可以发现两种结构下的场图几乎一模一样,并且可以发现结构扰动不改变对称保护连续谱束缚态的拓扑电荷㊂表1㊀C4与C2旋转对称结构中对称保护连续谱束缚态的频率ωL2πc的值Table1㊀Value ofωL2πc the frequency of symmetrically protected bound states in the continuum in the rotationallysymmetric structure of C4and C2C4C2q SPBIC10.61590.6101+1 SPBIC20.63670.6282-1 SPBIC30.85690.8485-1 SPBIC40.93830.9338-1 SPBIC50.95140.9492+10.05-0.05-0.0500.05βL/(2π)Q f a c t o rαL/(2π)8640.05-0.05-0.0500.05αL/(2π)864βL/(2π)Q f a c t o r㊀㊀(a)(b )0.5-0.5-0.500.5y/LR e/H zx/L0.5-0.5-0.500.5R e/H zx/L㊀㊀(c)(d)图4㊀SPBIC1的Q因子图和场图Fig.4㊀Q factor diagram and field diagram of SPBIC10.05-0.05-0.0500.05βL/(2π)Q f a c t o rαL/(2π)8765430.05-0.05-0.0500.05αL/(2π)βL/(2π)Q f a c t o r876543㊀㊀(a)(b)86第3期张铭洋,等:C4旋转对称光子晶体平板中的对称保护连续谱束缚态0.50-0.5-0.50.5y /LR e /H zx /L0.5-0.5-0.50.5y /LR e /H zx /L㊀㊀(c )(d )图5㊀SPBIC2的Q 因子图和场图Fig.5㊀Q factor diagram and field diagram of SPBIC20.010-0.01-0.010.01βL /(2π)Q f a c t o rαL /(2π)8765430.010-0.01-0.0100.01αL /(2π)βL /(2π)Q f a c t o r76543㊀㊀(a )(b )0.5-0.5-0.50.5y /LR e /H zx /L0.5-0.5-0.50.5y /LR e /H zx /L㊀㊀(c )(d )图6㊀SPBIC3的Q 因子图和场图Fig.6㊀Q factor diagram and field diagram of SPBIC30.010-0.01-0.010.01βL /(2π)Q f a c t o rαL /(2π)76540.010-0.01-0.0100.01αL /(2π)βL /(2π)Q f a c t o r7654㊀㊀(a )(b )0.5-0.5-0.50.5y /LR e /H zx /L0.5-0.5-0.50.5y /LR e /H zx /L㊀㊀(c )(d )图7㊀SPBIC4的Q 因子图和场图Fig.7㊀Q factor diagram and field diagram of SPBIC40.020-0.02-0.0200.02βL /(2π)Q f a c t o rαL /(2π)8640.020-0.02-0.0200.02αL /(2π)βL /(2π)Q f a c t o r6543㊀㊀(a )(b )0.5-0.5-0.50.5y /LR e /H zx /L0.5-0.5-0.500.5y /LR e /H zx /L㊀㊀(c )(d )图8㊀SPBIC5的Q 因子图和场图Fig.8㊀Q factor diagram and field diagram of SPBIC5经过数值计算,从扰动前后不同结构下对称保护连续谱束缚态的频率以及对比分析它们的Q 因子图和场图可以观察得到具有C4旋转对称结构的光子晶体平板中的对称保护连续谱束缚态都是由C2旋转对称性所保护的㊂即若破坏C4旋转对称但保持C2旋转对称,原有的对称保护连续谱束缚态依然存在㊂进一步反映了C4旋转对称与C2旋转对称之间的依赖关系㊂6㊀结束语构建了系统分析连续谱束缚态关于旋转对称性的依赖理论,并且重点研究了C4旋转对称的情况,分别从理论和数值两个方面证明了具有C4旋转对称光子晶体平板中的对称保护连续谱束缚态都是由C2旋转对称性所保护的㊂即破坏C4旋转对称但是保持C2旋转对称性,原对称保护连续谱束缚态依然存在㊂虽然只考虑了C4旋转对称光子晶体平板中的对称保护连续谱束缚态,但提出的理论和数值分析方法都可以用于研究具有C6旋转对称的光子晶体平板,不过由于此结构同时具有C2与C3旋转对称性,对称保护连续谱束缚态与对称性的依赖关系可能会更加复杂㊂提出的理论分析方法也可以适用于所有旋转对称的情况㊂由于是从麦克斯韦方程组出发,没有引入模型近似,并且分析过程根据严格㊂研究结果有利于深入理解对称保护连续谱束缚态的性质,为其理论分析和实际应用提供指导㊂96重庆工商大学学报(自然科学版)第40卷参考文献References1 ㊀HSU C W ZHEN B STONE A D et al.Bound states in thecontinuum J .Nature Reviews Materials 2016 1 9 1 13.2 ㊀HSU C W ZHEN B LEE J et al.Observation of trappedlight within the radiation continuum J .Nature 2013 4997457 188 191.3 ㊀MARINICA D C BORISOV A G SHABANOV S V.Boundstates in the continuum in photonics J .Physical Review Letters 2008 100 18 1 4.4 ㊀NEUMANN J WIGNER E P.Über merkwürdige diskreteEigenwerte M .Berlin Heidelberg Springer 1993.5 ㊀FRIEDRICH H WINTGEN D.Interfering resonances andbound states in the continuum J .Physical Review A 198532 6 3231 3239.6 ㊀YUAN L LU Y Y.Bound states in the continuum on periodicstructures surrounded by strong resonances J .Physical Review A 2018 97 4 1 8.7 ㊀AZZAM S I KILDISHEV A V.Photonic bound states in thecontinuum from basics to applications J .Advanced Optical Materials 2020 9 1 1469 1477.8 ㊀KODIGALA A LEPETIT T GU Q et sing action fromphotonic bound states in continuum J .Nature 2017 5417636 196 199.9 ㊀JIN J YIN X NI L et al.Topologically enabled ultrahigh-Qguided resonances robust to out-of-plane scattering J .Nature 2019 574 7779 501 504.10 HAN S CONG L SRIVASTAVA Y K et al.All-dielectricactive terahertz photonics driven by bound states in the continuum J .Advanced Materials 2019 31 37 1 28.11 BONNET-BENDHIA A S STARLING F.Guided waves byelectromagnetic gratings and non-uniqueness examples for the diffraction problem J .Mathematical Methods in the Applied Sciences 1994 17 5 305 338.12 VENAKIDES S SHIPMAN S P.Resonance and bound statesin photonic crystal slabs J .SIAM Journal on Applied Mathematics 2003 64 1 322 342.13 SHIPMAN S VOLKOV D.Guided modes in periodic slabsexistence and nonexistence J .SIAM Journal on Applied Mathematics 2007 67 3 687 713.14 LEE J ZHEN B CHUA S L et al.Observation anddifferentiation of unique high-Q optical resonances near zero wave vector in macroscopic photonic crystal slabs J .Physical Review Letters 2012 109 6 1 5.15 HU Z LU Y Y.Standing waves on two-dimensional periodicdielectric waveguides J .Journal of Optics 2015 176065601 065608.16 YANG Y PENG C LIANG Y et al.Analytical perspective for bound states in the continuum in photonic crystal slabs J . Physical Review Letters 2014 113 3 1 5.17 YUAN L LU Y Y.Propagating bloch modes above the lightline on a periodic array of cylinders J .Journal of Physics B Atomic Molecular and Optical Physics 2017 505 1 5.18 KANG M ZHANG S XIAO M et al.Merging bound states in the continuum at off-high symmetry points J .Physical Review Letters 2021 126 11 1 7.19 YUAN L LU Y Y.Conditional robustness of propagating bound states in the continuum in structures with two-dimensional periodicity J .Physical Review A 2021 1034 1 10.20 OVERVIG A C MALEK S C CARTER M J et al.Selection rules for quasibound states in the continuum J .Physical Review B 2020 102 3 1 30.21 YODA T NOTOMI M.Generation and annihilation of topologically protected bound states in the continuum and circularly polarized states by symmetry breaking J .Physical Review Letters 2020 125 5 1 12.22 LI S ZHOU C LIU T et al.Symmetry-protected bound states in the continuum supported by all-dielectric metasurfaces J . Physical Review A 2019 100 6 1 6.23 LI L LI Y ZHU Y et al.Rotational symmetry of photonic bound states in the continuum J .Scientific Reports 2020 101 1 8.24 SAKODA K.Optical properties of photonic crystals M . Berlin Springer Science&Business Media 2004.25 YUAN L LU Y Y.Parametric dependence of bound states in the continuum on periodic structures J .Physical Review A 2020 102 3 1 9.26 YUAN L LUO X LU Y Y.Parametric dependence of bound states in the continuum in periodic structures vectorial cases J .Physical Review A 2021 104 2 1 11.27 LU Y Y.Minimizing the discrete reflectivity of perfectly matched layers J .IEEE Photonics Technology Letters 2006 18 3 487 489.28 LI Y J JIN J M.Fast full-wave analysis of large-scale three-dimensional photonic crystal devices J .JOSA B 2007 249 2406 2415.29 ZHEN B HSU C W LU L et al.Topological nature of optical bound states in the continuum J .Physical Review Letters 2014 113 25 1 20.责任编辑:田㊀静07。

以规则为基础加强全球治理的英语

以规则为基础加强全球治理的英语

以规则为基础加强全球治理的英语A rule-based strengthened global governance aims to answer accurately and provide explanations on how it can be used.Here are 26 bilingual examples:1. We need to establish a rule-based framework for global governance.我们需要建立一个以规则为基础的全球治理框架。

2. Rule-based decision-making ensures fairness and transparency.基于规则的决策确保了公平和透明。

3. Global governance should be strengthened through the establishment of international rules.通过建立国际规则,应加强全球治理。

4. International organizations need to adhere to rule-based principles.国际组织需要遵循规则为基础的原则。

5. The rule-based approach promotes accountability and responsibility.基于规则的方法促进了问责和责任。

6. A rule-based system helps to mitigate conflicts and disputes.基于规则的体系有助于缓解冲突和争端。

7. Rule-based global governance reduces the risk of arbitrary decisions.规则为基础的全球治理减少了随意决策的风险。

8. A rule-based international order enhances cooperation and collaboration.基于规则的国际秩序增强了合作和协作。

A theory of weak bisimulation for core cml

A theory of weak bisimulation for core cml

J.Functional Programming1(1):1–000,January1993c1993Cambridge University Press1A Theory of Weak Bisimulation for Core CMLWILLIAM FERREIRA†Computing LaboratoryUniversity of CambridgeMATTHEW HENNESSY AND ALAN JEFFREY‡School of Cognitive and Computing SciencesUniversity of Sussex1IntroductionThere have been various attempts to extend standard programming languages with con-current or distributed features,(Giacalone et al.,1989;Holmstr¨o m,1983;Nikhil,1990). Concurrent ML(CML)(Reppy,1991a;Reppy,1992;Panangaden&Reppy,1996)is a practical and elegant example.The language Standard ML is extended with two new type constructors,one for generating communication channels,and the other for delayed com-putations,and a new function for spawning concurrent threads of computation.Thus the language has all the functional and higher-order features of ML,but in addition pro-grams also have the ability to communicate with each other by transmitting values along communication channels.In(Reppy,1992),a reduction style operational semantics is given for a subset of CML calledλcv,which may be viewed as a concurrent version of the call-by-valueλ-calculus of(Plotkin,1975).Reppy’s semantics gives reduction rules for whole programs,not for program fragments.It is not compositional,in that the semantics of a program is not defined in terms of the semantics of its subterms.Reppy’s semantics is designed to prove properties about programs(for example type safety),and not about program fragments(for example equational reasoning).In this paper we construct a compositional operational semantics in terms of a labelled †William Ferreira was funded by a CASE studentship from British Telecom.‡This work is carried out in the context of EC BRA7166CONCUR2.2W.Ferreira,M.Hennessy and A.S.A.Jeffreytransition system,for a core subset of CML which we callµCML.This semantics not only describes the evaluation steps of programs,as in(Reppy,1992),but also their communi-cation potentials in terms of their ability to input and output values along communication channels.This semantics extends the semantics of higher-order processes(Thomsen,1995) with types andfirst-class functions.We then proceed to demonstrate the usefulness of this semantics by using it to define a version of weak bisimulation,(Milner,1989),suitable forµCML.We prove that,modulo the usual problems associated with the choice operator of CCS,our chosen equivalence is preserved by allµCML contexts and therefore may be used as the basis for reasoning about CML programs.In this paper we do not investigate in detail the resulting theory but confine ourselves to pointing out some of its salient features;for example standard identities one would expect of a call-by-valueλ-calculus are given and we also show that certain algebraic laws common to process algebras,(Milner,1989),hold.We now explain in more detail the contents of the remainder of the paper.In Section2we describeµCML,a monomorphically typed core subset of CML,which nonetheless includes base types for channel names,booleans and integers,and type con-structors for pairs,functions,and delayed computations which are known as events.µCML also includes a selection of the constructs and constants for manipulating event types,such as and for constructing basic events for sending and receiving values, for combining delayed computations,for selecting between delayed compu-tations,and a function for launching new concurrent threads of computation within a program.The major omission is thatµCML has no facility for generating new channel names.However we believe that this can be remedied by using techniques common to the π-calculus,(Milner,1991;Milner et al.,1992;Sangiorgi,1992).In the remainder of this section we present the operational semantics ofµCML in terms of a labelled transition system.In order to describe all possible states which can arise dur-ing the computation of a well-typedµCML program we need to extend the language.This extension is twofold.Thefirst consists in adding the constants of event type used by Reppy in(Reppy,1992)to defineλcv,i.e.constants to denote certain delayed computations.This extended language,which we callµCML cv,essentially coincides with theλcv,the lan-guage used in(Reppy,1992),except for the omissions cited above.However to obtain a compositional semantics we make further extensions toµCML cv.We add a parallel oper-ator,commonly used in process algebras,which allows us to use programs in place of the multisets of programs of(Reppy,1992).Thefinal addition is more subtle;we include inµCML cv expressions which correspond to the ed versions of Reppy’s constants for representing delayed computations.Thus the labelled transition system uses as states programs from a language which we call µCML.This language is a superset ofµCML cv,which is our version of Reppy’sλcv, which in turn is a superset ofµCML,our mini-version of CML.The following diagramA Theory of Weak Bisimulation for Core CML3indicates the relationships between these languages:µCMLλcvCMLIn Section3we discuss semantic equivalences defined on the labelled transition of Sec-tion2.We demonstrate the inadequacies of the obvious adaptations of strong and weak bisimulation equivalence,(Milner,1989),and then consider adaptations of higher-order and irreflexive bisimulations from(Thomsen,1995).Finally we suggest a new variation called hereditary bisimulation equivalence which overcomes some of the problems en-countered with using higher-order and irreflexive bisimulations.In Section4we show that hereditary bisimulation is preserved by allµCML contexts.This is an application of the proof method originally suggested in(Howe,1989)but the proof is further complicated by the fact that hereditary bisimulations are defined in terms of pairs of relations satisfying mutually dependent properties.In Section5we briefly discuss the resulting algebraic theory ofµCML expressions.This paper is intended only to lay the foundations of this theory and so here we simply indicate that our theory extends both that of call-by-valueλ-calculus(Plotkin,1975)and process algebras(Milner,1989).In Section6we show that,up to weak bisimulation equivalence,our semantics coincides with the reduction semantics forλcv presented in(Reppy,1992).This technical result ap-plies only to the common sub-language,namelyµCML cv.In Section7we briefly consider other approaches to the semantics of CML and related languages and we end with some suggestions for further work.2The LanguageIn this section we introduce our languageµCML,a subset of Concurrent ML(Reppy, 1991a;Reppy,1992;Panangaden&Reppy,1996).We describe the syntax,including a typing system,and an operational semantics in terms of a labelled transition system. Unfortunately,there is not enough space in this paper to provide an introduction to pro-gramming in CML:see(Panangaden&Reppy,1996)for a discussion of the design and philosophy of CML.The type expressions for our language are given by:A::A A A A A AThus we have three base types,,and;the latter two are simply examples of useful base types and one could easily include more.These types are closed under four con-structors:pairing,function space,and the less common and type constructors.4W.Ferreira,M.Hennessy and A.S.A.JeffreyOur language may be viewed as a typedλ-calculus augmented with the type constructors A for communication channels sending and receiving data of type A,and A for constructing delayed computations of type A.Let Chan A be a type-indexed family of disjoint sets of channel names,ranged over by k, and let Var denote a set of variables ranged over by x,y and z.The expressions ofµCML are given by the following abstract syntax:e f g Exp::v ce e e e e e x e e eev w Val::x y e x k01c Const::The main syntactic category is that of Exp which look very much like the set of expressions for an applied call-by-value version of theλ-calculus.There are the usual pairing,let-binding and branching constructors,and two forms of application:the application of one expression to another,ee,the application of a constant to an expression,ce.There is also a syntactic category of value expressions Val,used in giving a semantics to call-by-value functions and communicate-by-value channels.They are restricted in form: either a variable,a recursively defined function,x y e,or a predefined literal value for the base types.We will use some syntax sugar,writing y e for x y e when x does not occur in e,and e;f for x e f when x does not occur in f. Finally there are a small collection of constant functions.These consist of a representa-tive sample of constants for manipulating objects of base type,,which could easily be extended,the projection functions and,together with the set of constants for manipulating delayed computations taken directly from(Reppy,1992):and,for constructing delayed computations which can send and receive values,,for constructing alternatives between delayed computations,,for spawning new computational threads,,for launching delayed computations,,for combining delayed computations,,for a delayed computation which always deadlocks,and,for a delayed computation which immediately terminates with a value. Note that there is no method for generating channel names other than using the predefined set of names Chan A.There are two constructs in the language which bind occurrences of variables,xe1e2where free occurrences of x in e2are bound and x y e where free oc-currences of both x and y in e are bound.We will not dwell on the precise definitions of free and bound variables but simply use f v e to denote the set of variables which have free occurrences in e.If f v e/0then e is said to be a closed expression,which we sometimes refer to as a program.We also use the standard notation of e v x to denote the substitution of the value v for all free occurrences of x in e where bound names may be changed in order to avoid the capture of free variables in v.(Since we are modelling aA Theory of Weak Bisimulation for Core CML5:A B A:A A:A B B:A A::A A A::::A A B B:A A:A:A AFigure1a.Type rules forµCML constant functionsx yΓx:A y:BΓ:Γ:Γx y e:A BΓe:AΓf:BΓe:AΓe f:BΓe:AΓx:A f:BΓe f g:A6W.Ferreira,M.Hennessy and A.S.A.Jeffreythis reduction semantics are of the form:CτCwhere C C are configurations which combine a closed expression with a run-time envi-ronment necessary for its evaluation,andτis Milner’s notation for a silent action.However this semantics is not compositional as the reductions of an expression can not be deduced directly from the reductions of it constituent components.Here we give a compositional operational semantics with four kinds of judgements:eτe,representing a one step evaluation or reduction,e v e,representing the production of the value v,with a side effect e,e k?x e,representing the potential to input a value x along the channel k,ande k!v e,representing the output of the value v along the channel k.These are formally defined in Figure2,but wefirst give an informal overview.In order to define these relations we introduce extra syntactic constructs.These are introduced as required in the overview but are summarized in Figure3.The rules for one step evaluation or reduction have much in common with those for a standard call-by-valueλ-calculus.But in addition a closed expression e of type A should evaluate to a value of type A and it is this production of values which is the subject of the second kind of judgement.HoweverµCML expressions can spawn subprocesses before returning a value,so we have to allow expressions to continue evaluation even after they have returned a result.For example in the expression:0a;aone possible reduction is(whereτindicates a sequence ofτ-reductions):0a;aτa?11a!0where the process returns the value1before outputting0.For this reason we need a reduc-tion e v e rather than the more usual termination e v.The following diagram illustrates all of the possible transitions from this expression:0a;aτa!0τa?vva!0vA Theory of Weak Bisimulation for Core CML7 judgements of the operational semantics apply to these configurations.The second,more common in work on process algebras,(Bergstra&Klop,1985;Milner,1989),extends the syntax of the language being interpreted to encompass configurations.We choose the latter approach and one extra construct we add to the language is a parallel operator,e f.This has the same operational rules as in CCS,allowing reduction of both processes:eαee fαe fand communication between the processes:e k!v ef k?x fe fτe v x fThe assymetry is introduced by termination(a feature missing from CCS).A CML process has a main thread of control,and only the main thread can return a value.By convention, we write the main thread on the right,so the rule is:f v feαeSecondly,e may have spawned some concurrent processes before returning a function,and these should carry on evaluation,so we use the silent rule for constant application:e v e8W.Ferreira,M.Hennessy and A.S.A.JeffreyThe well-typedness of the operational semantics will ensure that v is a function of the appropriate type,.With this method of representing newly created computation threads more of the rules corresponding toβ-reduction in a call-by-valueλ-calculus may now be given.To evaluate an application expression e f,first e is evaluated to a value of functional form and then the evaluation of f is initiated.This is represented by the rules:eαee fτe yf g(In fact we use a slightly more complicated version of the latter rule as functions are al-lowed to be recursive.)Continuing with the evaluation of e f,we now evaluate f to a value which is then substituted into g for y.This is represented by the two rules:fτfx f gτf g v xThe evaluation of the application expression c f is similar;f is evaluated to a value and then the constant c is applied to the resulting value.This is represented by the two rulesfτfc fτfδc vHere,borrowing the notation of(Reppy,1992),we use the functionδto represent the effect of applying the constant c to the value v.This effect depends on the constant in question and we have already seen one instance of this rule,for the constant,which result from the fact thatδv v.The definition ofδfor all constants in the language is given in Figure2f.For the constants associated with the base types this is self-explanatory; the others will be explained below as the constant in question is considered.Note that because of the introduction of into the language we can treat all constants uniformly, unlike(Reppy,1992)where and have to considered in a special manner.In order to implement the standard left-to-right evaluation of pairs of expressions we introduce a new value v w representing a pair which has been fully evaluated.Then to evaluate e f:first allow e to evaluate:eαee fτe xf v xThese value pairs may then be used by being applied to functions of type A B.For example the following inferences result from the definition of the functionδfor the constants and:e v w eeτe m nIt remains to explain how delayed computations,i.e.programs of type A,are han-dled.It is important to realise that expressions of type A represent potential rather than actual computations and this potential can only be activated by an application of theA Theory of Weak Bisimulation for Core CML9eαee fαe feαee f gαe f geαee fαef fαfe f v e fFigure2a.Operational semantics:static rules ge1αege1ge2αegeαeceτeδc v e ee f gτe ge v ee fτe yfg v xv x y g e v ee fτef v xe k?x ef k!v fv vΛk?k?x x10W.Ferreira,M.Hennessy and A.S.A.Jeffreye f g Exp::v ce e e e e e x e e eev w Val::x y e x k01c Const::Figure3a.Syntax ofµCMLv w Val::v v gege GExp::v!v v?ge v ge geΛA vFigure3b.Syntax ofµCML cve f g Exp::ge e eFigure3c.Syntax ofµCMLconstant,of type A A.Thus for example the expression k is of type A and represents a delayed computation which has the potential to receive a value of type A along the channel k.The expression k can actually receive such a value v along channel k,or more accurately can evaluate to such a value,provided some other computation thread can send the value along channel k.The semantics of is handled by introducing a new constructor for values.For certain kinds of expressions ge of type A,which we call guarded expressions,let ge be a value of type A;this represents a delayed computation which when launched initiates a new computation thread which evaluates the expression ge.Then the expression ge reduces in one step to the expression ge.More generally the evaluation of the expressione proceeds as follows:First evaluate e until it can produce a value:eτeeτe geNote that here,as always,the production of a value may have as a side-effect the generation of a new computation thread e and this is launched concurrently with the delayed compu-tation ge.Also both of these rules are instances of more general rules already considered. Thefirst is obtained from the rule for the evaluation of applications of the form ce and the second by definingδge to be ge.The precise syntax for guarded expressions will emerge by considering what types of values of the form e can result from the evaluation of expressions of type from the basic languageµCML.The constant is of type A A and thereforethe evaluation of the expression e proceeds byfirst evaluating e to a value of type A until it returns a value k,and then returning a delayed computation consisting of an event which can receive any value of type A on the channel k.To represent this event we extend the syntax further by letting k?be a guarded expression for any k and A,with the associated rule:e k eeτe k!vIt is these two new expressions k?and k!v which perform communication between compu-tation threads.Formally k!v is of type and we have the axiom:k?k?x xTherefore in general input moves are of the form e k?x f where e:B and x:A f:B. Communication can now be modelled as in CCS by the simultaneous occurrence of input and output actions:e k?x ef k!v feτeΛobtained,once more,by definingδto beΛ.The constant is of type A A B B.The evaluation of e proceeds in the standard way by evaluating e until it produces a value,which must be of the form ge v,where ge is a guarded expression of type A and v has type A B.Then the evaluation of e continues by the construction of the new delayed computation ge v.Bearing in mind the fact that the production of values can generate new computation threads,this is formally represented by the inference rule:e ge v ege vαveThe construct,of type A A,evaluates its argument to a value v,and thenreturns a trivial a delayed computation;this computation,when activated,immediately evaluates to the value v.In order to represent these trivial computations we introduce a new constructor for guarded expressions,A and the semantics of is then captured by the rule:e v eA vτvThe choice construct e is a choice between delayed computations as has the type A A A.To interpret it we introduce a new choice constructor ge1ge2where ge1and ge2are guarded expressions of the same type.Then e pro-ceeds by evaluating e until it can produce a value,which must be of the form ge1ge2, and the evaluation continues by constructing the delayed computation ge1ge2.This is represented by the rule:e ge1ge2ege2αege1ge2αeΓv:AΓw:BΓge:AΓv:AΓw:AΓv?:AΓge:AΓv:A BΓge1ge2:AΓA v:AΓe:AΓf:BΓτ:A Γv:AΓk?x:AΓw:Bof the form e k ?xf where f may be an open expression we need to consider relations over open expressions.Let an open type-indexed relation R be a family of relations R ΓA such that if e R ΓA f then Γe :A and Γf :A .We will often elide the subscripts from relations,for example writing e R f for e R ΓA f when context makes the type obvious.Let a closed type-indexed relation R be an open type-indexed relation where Γis everywhere the empty context,and can therefore be elided.For any closed type-indexed relation R ,let its open extension R be defined as:e R x :A Bf iff e v x R B f v x for allv :AA closed type-indexed relation R is structure preserving iff:if v R A w and A is a base type then v w ,if v 1v 2R A 1A 2w 1w 2then v i R A i w i ,if ge 1R A ge 2then ge 1R A ge 2,andif v R A B v then for all w :A we have vw R B v w .With this notation we can now define strong bisimulations over µCML expressions.A closed type-indexed relation R is a first-order strong simulation iff it is structure-preserving and the following diagram can be completed:e 1R e 2e 1R e 2ase 1lRe 2lsince the definition of strong bisimulation demands that the actions performed by expres-sions match up to syntactic identity.This counter-example can also be reproduced using only µCML contexts:kx121kx21since the left hand side can perform the move:kx12τk !x12but this can only be matched by the right hand side up to strong bisimulation:kx21τk !x21In fact,it is easy to verify that the only first-order strong bisimulation which is a congruence for µCML is the identity relation.To find a satisfactory treatment of bisimulation for µCML,we need to look to higher-order bisimulation ,where the structure of the labels is accounted for.To this end,given a closed type-indexed relation R ,define its extension to labels R l as:v R l A wk !v R l A k !wkChan BThen R is a higher-order strong simulation iff it is structure-preserving and the followingdiagram can be completed:e 1R e 2e 1R e 2aswhere l 1R l l 2e 1l 1Re 2l 2lotherwise.Then R is a first-order weak simulation iff it is structure-preserving and the following diagram can be completed:e 1R e 2e 1R e 2ase 1lRe 2ˆl Let1be the largest first-order weak bisimulation.Proposition 3.31is an equivalence.ProofSimilar to the proof of Proposition 3.1.Unfortunately,1is not a congruence,for the same reason as 1,and so we can attempt the same modification.R is a higher-order weak simulation iff it is structure-preserving and the following diagram can be completed:e 1R e 2e 1R e 2aswhere l 1R ll 2e 1l 1Re 2ˆl 2Lethbe the largest higher-order weak bisimulation.Proposition 3.4h is an equivalence.ProofSimilar to the proof of Proposition 3.1.However,h is still not a congruence,for the usual reason that weak bisimulation equiva-lence is not a congruence for CCS summation.Recall from (Milner,1989)that in CCS 0τ0but a 00a 0τ0.We can duplicate this counter-example in µCML since the CCS operator corresponds to the µCML operator and 0corresponds to Λ.However may only be applied to guarded expressions and therefore we need a guarded expressionwhich behaves like τ0;the required expression is A Λ.Thus:ΛhA Λsince the right hand side has only one reduction:A ΛτΛτΛbut:Λk !0hA Λk !0because the only reduction of Λk !0is Λk !0k !0ΛΛand:A Λk !0τΛτΛThis counter-example can also be replicated using the restricted syntax of µCML.We have:hsince the left hand side has only one reduction:ΛΛand the right hand side can match this with:A ΛΛand we have seen:ΛhA ΛHowever:k 0hk 0since the left hand side has only one reduction:k 0τΛk !0whereas the right hand side has the reduction:k 0τA Λk !0A first attempt to rectify this is to adapt Milner’s observational equivalence for µCML,and to define h as the smallest symmetric relation such that the following diagram can be completed:e 1he 2e 1he 2aswhere l 1h ll 2e 1l 1he 2l 2Proposition 3.5h is an equivalence.ProofSimilar to the proof of Proposition 3.1.This attempt fails,however,since it only looks at the first move of a process,and not at thefirst moves of any processes in its transitions.Thus,the above µCML counter-example for h being a congruence also applies to h ;i.e.hbut:k 0hk 0This failure was first noted in (Thomsen,1995)for CHOCS.Thomsen’s solution to this problem is to require that τ-moves can always be matched by at least one τ-move,which produces his definition of an irreflexive simulation as a structure-preserving relation where the following diagram can be completed:e 1R e 2e 1R e 2aswhere l 1R l l 2e 1l 1Re 2l 2Letibe the largest irreflexive bisimulation.Proposition 3.6iis a congruence.ProofThe proof that i is an equivalence is similar to the proof of Proposition 3.1.The proof that it is a congruence is similar to the proof of Theorem 4.7in the next section.However this relation is rather too strong for many purposes,for example 12i111since the right hand side can perform more τ-moves than the left hand side.This is similar to the problem in CHOCS where a τP i a P .In order to find an appropriate definition of bisimulation for µCML,we observe that µCML only allows to be used on guarded expressions ,and not on arbitrary expressions.We can thus ignore the initial τ-moves of all expressions except for guarded expressions.For this reason,we have to provide two equivalences:one on terms where we are not interested in initial τ-moves,and one on terms where we are.A pair of closed type-indexed relations R R n R s form a hereditary simulation (we call R n an insensitive simulation and R s a sensitive simulation )iff R s is structure-preserving and we can complete the following diagrams:e 1R ne 2e 1R ne 2aswhere l 1R sll 2e 1l 1R ne 2ˆl 2and:e 1R se 2e 1R se 2aswhere l 1R s l l 2e 1l 1R ne 2l 2Let n sbe the largest hereditary bisimulation.Note that we require R s to be structure-preserving because it is used to compare the labels in transitions,which may contain ab-stractions or guarded events.In the operational semantics of µCML expressions,guarded expressions can only appear in labels,and not as the residuals of transitions.This explains why in the definition of n labels are compared with respect to the sensitive relation s whereas the insensitive relation is used for the residuals.For example,if ge 1n s ge 2then we have:xge 1nxge 2since once either side is applied to an argument,their first action will be a τ-step.On the other hand:ge 1nge 2sinceis precisely the construct which allows us to embed ge 1and ge 2in acontext.Theorem 3.7s is a congruence for µCML ,andnis a congruence for µCML.ProofThe proof that s and n are equivalences is similar to the proof of Proposition 3.1.The proof that they form congruences is the subject of the next section.Proposition 3.8The equivalences on µCML have the following strict inclusions:1shh111x1xk k i h k12s i1111n s x1xh n1h h x1xwhere:x x(Note that this settles an open question(Thomsen,1995)as to whether i is the largestcongruence contained in h.)It is the operator which differentiates between the two equivalences n and h.Howeverin order to demonstrate the difference we need to be able to apply to guarded expressionswhich can spontaneously evolve,i.e.performτ-moves.The onlyµCML constructor for guarded expressions which allows this is A,and in turn occurrences of this can only begenerated by theµCML constructor.Therefore:Proposition3.9For the subset ofµCML without and A,n is the same as h,and s is the same as h.ProofFrom Proposition3.8n h.For the subset ofµCML without and A,define R s as:v w v h w ge1ge2ge1h ge2v1w v2w v1h v2Then since no event without A can perform aτ-move,and since the only initial moves ofv i w areβ-reductions,we can show that h R s forms an hereditary bisimulation,and so h n.From this it is routine to show that s h.Unfortunately we have not been able to show that n is the largestµCML congruence con-tained in weak higher-order bisimulation equivalence.However we do have the following characterisation:Theorem3.10n is the largest higher-order weak bisimulation which respectsµCML contexts.ProofBy definition,n is a higher-order weak bisimulation,and we have shown that it respectsµCML contexts.All that remains is to show that it is the largest such.Let R be a higher-order weak bisimulation which respectsµCML contexts.Then define: R n R v1w e2v1R v2v2wτe2e1v2w v1R v2v1wτe1R s v w v R w ge1ge2ge1R ge2v1w v2w v1R v2We will now show that R n R s forms an hereditary simulation,from which we can de-duce R R n n.。

INVERSE ORDER RULE FOR WEIGHTED GENERALIZED INVERSE #

INVERSE ORDER RULE FOR WEIGHTED GENERALIZED INVERSE #

INVERSE ORDER RULE FOR WEIGHTED GENERALIZEDINVERSE ∗WENYU SUN †AND YIMIN WEI ‡SIAM J.M ATRIX A NAL.A PPL .c 1998Society for Industrial and Applied Mathematics Vol.19,No.3,pp.772–775,July 1998011Abstract.The weighted generalized inverses have several important applications in researching the singular matrices,regularization methods for ill-posed problems,optimization problems,and statistics problems.In this paper we establish some sufficient and necessary conditions for inverse order rule of weighted generalized inverse.Key words.generalized inverse,weighted generalized inverse,inverse order rule,matrix com-putationAMS subject classifications.15A09,65F10PII.S08954798963054411.Introduction.The generalized inverse is an important tool for researching the singular matrix problems,ill-posed problems,optimization problems,and statis-tics problems.The inverse order rule for generalized inverse plays an important role in the theoretic research and numerical computations in the above areas (see [2],[4],[5],[6],[8],[11],[12],[13],[14],[15]).The purpose of this paper is to establish the inverse order rule of the weighted generalized inverse,which is met in our research of the weighted trust region approach for optimization problems [3],[7],[9],[10].In addition,the inverse order rule for weighted generalized inverse is also applied to the generalized least squares problem and the weighted perturbation theory of the singular matrix.In general,the inverse order rule does not always hold.If A ∈C m ×n ,B ∈C n ×l ,the sufficient and necessary condition for inverse order rule is as follows.Lemma 1.1.The following conditions are equivalent:1.(AB )+=B +A +;2.R (A ∗AB )⊂R (B )and R (BB ∗A ∗)⊂R (A ∗);3.R (A ∗ABB ∗)=R (BB ∗A ∗A ).(See [1],[2],[5],[6].)In this paper we generalize the above results to the case of the weighted generalized inverse.2.Main results.Notation.For convenience,we list some notation as follows:C m ×n ,C m ×n r :m ×n matrix set and m ×n matrix set with rank r ,respectively;R (·),N (·):range and null space,respectively;A ∗:conjugate transpose matrix of A ;A #:weighted conjugate transpose matrix of A ;A +:Moore–Penrose inverse of A ;A +M,N :weighted Moore–Penrose inverse of A ;∗Received by the editors June 19,1996;accepted for publication (in revised form)by G.P.Styan August 29,1997;published electronically April 2,1998.This work was supported by CNPq of Brazil and the National Natural Science Foundation of China./journals/simax/19-3/30544.html †Department of Mathematics,Nanjing Normal University,Nanjing 210097,People’s Republic of China (sun@mat.ufpr.br).‡Department of Mathematics,Fudan University,Shanghai 200433,People’s Republic of China (ymwei@).772INVERSE ORDER RULE FOR WEIGHTED GENERALIZED INVERSE773 P L,M:a projector with range L and null space M.Definition2.1.Let A∈C m×n.Also,let M and N be m×m and n×n positive definite Hermite matrices,respectively.Then there is a unique matrix G∈C n×m such thatAGA=A,GAG=G,(MAG)∗=MAG,(NGA)∗=NGA,(1)where G is called weighted Moore–Penrose generalized inverse and written as G=A+MN .Definition2.2.Let M and N be m×m and n×n positive definite matrices, respectively.Given A∈C m×n,the weighted conjugate transpose matrix A#of A is defined asA#=N−1A∗M.(2)Obviously,A#satisfies the following properties:if A,A1∈C m×n,B∈C n×l, then(A+A1)#=A#+A#1,(AB)#=B#A#,(A#)#=A,(A#)∗=(A∗)#.(3)Before giving the properties of weighted generalized inverse,we state one lemma which is proved in[2]and[5].Lemma2.3.A+MN satisfies the following properties:1.AA+MN=P R(A),M−1N(A∗)=P R(A),N(A#),A+MN A=P N−1R(A∗),N(A)=P R(A#),N(A);2.A+MN=N−12(M12AN−12)+M12.In the following,we establish some sufficient and necessary conditions for the inverse order rule of the weighted generalized inverse.Here we employ a brief proof instead of the original proof due to a referee’s suggestion.Theorem2.4.Let A∈C m×n,B∈C n×l.Also,let M,N,L be m×m,n×n, and l×l positive definite Hermite matrices,respectively.Then(AB)+ML=B+NL A+MN(4)if and only ifR(A#AB)⊂R(B)and R(BB#A#)⊂R(A#).(5)Proof.In view of Lemma2.3,we haveB+NL A+MN=(AB)+ML(6)if and only ifL−12(N12BL−12)+N12N−12(M12AN−12)+M12=L−12(M12ABL−12)+M12, or,equivalently,if and only if˜B+˜A+=(˜A˜B)+,(7)where˜A:=M12AN−12and˜B:=N12BL−12.774WENYU SUN AND YIMIN WEILemma1.1tells us that(7)holds if and only ifR(˜A∗˜A˜B)⊂R(˜B)and R(˜B˜B∗˜A∗)⊂R(˜A∗).(8)That(8)is equivalent toR(A#AB)⊂R(B)and R(BB#A#)⊂R(A#)(9)follows easily from the definition of(·)#and the fact that M,N,and L are positive definite matrices.This completes the proof.Corollary2.5.Let A∈C m×n,B∈C n×l.Also,let M,N,L be m×m,n×n, and l×l positive definite Hermite matrices,respectively.Then(AB)+ML=B+NL A+MNif and only ifA+MN ABB#A#=BB#A#and BB+NL A#AB=A#AB.(10)Proof.It is directly obtained from(5)and Lemma2.3.Theorem2.6.Let A∈C m×n,B∈C n×l.Also,let M,N,L be m×m,n×n, and l×l positive definite Hermite matrices.Then(AB)+ML=B+NL A+MN(11)if and only ifR(A#ABB#)=R(BB#A#A).(12)Proof.Similar to the proof of Theorem2.4,we can also directly obtain this result from Lemmas1.1and2.3.Acknowledgments.The authors would like to thank the referees and the as-sociate editor George P.H.Styan for their helpful suggestions.The authors would especially like to express their gratitude to Dr.Hans Joachim Werner for providing a brief proof for their main result instead of the original version of the proof and two important references[13],[14]which improved the paper greatly.REFERENCES[1] E.Arghiriade,Remarques sur l’inverse g´e n´e ralize de produit de matrices,Sci.Fis.Nat.Natur.,42(1967),pp.621–625.[2] A.Ben-Israel and T.N.E.Greville,Generalized Inverses:Theory and Applications,JohnWiley,New York,1974.[3]S.Di and W.Sun,Trust region method for conic model to solve unconstrained optimization,Optimization Methods and Software,6(1996),pp.237–263.[4]G.H.Golub and C.F.Van Loan,Matrix Computations,The Johns Hopkins UniversityPress,Baltimore,MD,1983.[5]X.He and W.Sun,Introduction to Generalized Inverses of Matrices,Jiangsu Sci.&Tech.Pub-lishing House,Nanjing,China,1991.[6] C.R.Rao and S.K.Mitra,Generalized Inverse of Matrices and Its Applications,JohnWiley,New York,1971.[7]R.J.B.Sampaio,W.Sun,and J.Yuan,On trust region algorithm for nonsmooth optimiza-tion,Applied put.,85(1997),pp.109–116.[8]W.Sun,Cramer rules for weighted systems,Nanjing Daxue Xuebao Shuxue Bannian Kan,3(1986),pp.117–121.INVERSE ORDER RULE FOR WEIGHTED GENERALIZED INVERSE775 [9]W.Sun,J.Yuan,and Y.Yuan,Trust region method of conic model for linearly constrainedoptimization,J.Optim.Theory Appl.,submitted.[10]W.Sun and Y.Yuan,Trust Region Method of Conic Model for Nonlinearly ConstrainedOptimization,p.,submitted.[11]W.Sun and Y.Yuan,Optimization Theory and Methods,Science Press,Beijing,1996.[12]G.Wang,Perturbation theory for weighted generalized inverse,put.Math.,1(1987),pp.48–60.[13]H.J.Werner,G-inverse of matrix products,in Data Analysis and Statistical Inference,S.Schach and G.Trenkler,eds.,Eul Verlag,Bergisch-Gladbach,1992,pp.531–546.[14]H.J.Werner,When is B−A−a generalized inverse of AB?,Linear Algebra Appl.,210(1994),pp.255–263.[15]Z.Z.Yu and Z.L.Yu,Weighted generalized inverse and rank-deficient variance of network,J.Wuhan Surveying and Drawing College,1(1985).。

  1. 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
  2. 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
  3. 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。

Stable high-order quadrature rules withequidistant pointsDaan HuybrechsReport TW528,Sep2008Katholieke Universiteit LeuvenDepartment of Computer ScienceCelestijnenlaan200A–B-3001Heverlee(Belgium)Stable high-order quadrature rules withequidistant pointsDaan HuybrechsReport TW528,Sep2008Department of Computer Science,K.U.LeuvenAbstractNewton-Cotes quadrature rules are based on polynomial interpo-lation in a set of equidistant points.They are very useful in applica-tions where sampled function values are only available on a regulargrid.Yet,these rules rapidly become unstable for high orders.Inthis paper we review two techniques to construct stable high-orderquadrature rules using equidistant quadrature points.The stabilityfollows from the fact that all coefficients are positive.This resultcan be achieved by allowing the number of quadrature points to belarger than the polynomial order of accuracy.The computed ap-proximations then implicitly correspond to the integral of a leastsquares approximation of the integrand.We show how the under-lying discrete least squares approximation can be optimised for thepurpose of numerical integration.Keywords:numerical integration,least squares approximation,discrete or-thogonal polynomialsAMS(MOS)Classification:Primary:41A55,Secondary:65F20,42C05Stable high-order quadrature rules withequidistant pointsDaan Huybrechs∗†AbstractNewton-Cotes quadrature rules are based on polynomial interpola-tion in a set of equidistant points.They are very useful in applicationswhere sampled function values are only available on a regular grid.Yet,these rules rapidly become unstable for high orders.In this paper we re-view two techniques to construct stable high-order quadrature rules usingequidistant quadrature points.The stability follows from the fact thatall coefficients are positive.This result can be achieved by allowing thenumber of quadrature points to be larger than the polynomial order ofaccuracy.The computed approximations then implicitly correspond tothe integral of a least squares approximation of the integrand.We showhow the underlying discrete least squares approximation can be optimisedfor the purpose of numerical integration.1IntroductionA recurring problem in computational science is the approximate evaluation of the integralI[f]:= b a w(x)f(x)d x≈Q[f]:=N j=1w j f(x j)(1)by a quadrature Q[f]rule with N points x i and weights w i.There is a rich body of literature on numerical integration,we refer the reader to the volumes[14,6,7] and the references therein for a broad overview.A case of great practical importance is the case where f is known only in equidistant points,i.e.,in the pointsx j=a+(j−1)b−aN−1,j=1,...,N.(2)Popular quadrature rules in this setting are the(composite)trapezoidal rule and Simpson rules.These are low-order variants of the family of Newton-Cotes rules,which for the set of N points(2)are exact for all polynomials up to degree N−1.We say that the rule has order N.Newton-Cotes rules are easy to apply and easy to implement for a variety of weight functions w(x),but the low-order ∗Department of Computer Science,Katholieke Universiteit Leuven,Belgium (daan.huybrechs@cs.kuleuven.be)†The author is a Postdoctoral Fellow of the Research Foundation-Flanders(FWO).1rules converge slowly and the high-order rules are numerically unstable.We will discuss these properties in more detail in §2.Stability in numerical integration follows from having positive weights.It is well-known that quadrature rules with positive weights converge for all con-tinuous functions f on [a,b ].A fundamental theorem in numerical integration,due originally to Tchakaloff[15](see also [14,4]),states that for any functional of the form (1)with a positive weight function w (x )>0,a quadrature rule of order d exists with d points and with positive weights.When searching for a rule with positive weights,Tchakaloff’s result essentially supplies an upper bound on the required number of points.The theorem is more general than stated here and holds for a variety of basis functions and for higher dimensions.For the case of polynomials and univariate integrals,the problem of determin-ing such rules is completely resolved by the existence of Guassian quadrature or Clenshaw-Curtis quadrature.These rules however do not have equidistant points.The problem in higher dimensions is much more challenging [2].It turns out that rules with positive weights can be constructed on equidis-tant grids by letting the number of quadrature points N be larger than the order d of the rule.It was shown by Davis that if a rule of order d with positive weights exists on N equidistant points,then another rule of order d exists with only d nonzero weights on the same grid [4].This rule thus achieves Tchakaloff’s upper bound.It was shown later by Wilson that the minimal number of equidistant points N scales as N ∼Cd 2as d →∞[18].The constant C is fairly modest,we illustrate numerically in this paper that C ≈0.07for the case w (x )=1.For example,a quadrature rule of order 20requires only N =33.Conversely,when samples are given on an equidistant grid with N points,a quadrature rule of order d ≈10.07√N with positive weights may be used to integrate the function.A first technique to construct quadrature rules with positive coefficients is by solving a least squares problem.The connection between quadrature rules and discrete least squares problems was also examined by Wilson [17].It ap-pears that this connection has not been further explored in a systematic manner since the publication in 1970of [17,18].On the other hand it has long been,and still is,quite common to construct a (discrete)least squares approximation of a (sampled)function.This approximation can then be integrated exactly.Both approaches are obviously related:in the polynomial case,the order of the quadrature rule corresponds to the polynomial degree of the least squares approximation.We intend to show in this paper that the interpretation as a quadrature rule does have some advantages.First,the requirement that the weights should be positive yields a natural and generally applicable stopping cri-terion for discrete least squares approximations.Increasing the degree beyond a certain value may yield numerical instability for certain functions.Second,the connection with least squares problems supplies a numerically stable and effi-cient way to construct Newton-Cotes quadrature rules and general interpolatory quadrature rules on arbitrarily spaced points.We note that such rules are most often constructed by solving an ill-conditioned system with a Vandermonde-like matrix.However,explicit expressions for the weights can easily be derived in terms of discrete orthogonal polynomials.Third,discrete least squares approx-imations for the sole purpose of numerical integration benefit from optimized choices of a weighted discrete inner product.The weight factors themselves are related to numerical integration.The described algorithm to construct least squares quadrature rules is fast2and stable and rules of very high order can in applications be computed on the fly.The algorithm as presented in§4was suggested in the context of quadra-ture rules in[17],but the method for discrete least squares problems using recurrences for orthogonal polynomials dates back to Forsythe[8].A second technique to construct quadrature rules with positive coefficients is by careful selection of the points in interpolatory quadrature.The construc-tion of a quadrature rule with only d positive weights,corresponding to d out of N possible equidistant points,can be achieved by solving a least squares problem subject to linear inequality constraints.A convergent algorithm for this particular type of problem was proposed in[12],called the NNLS algorithm (nonnegative least squares).The result is a class of interpolatory quadrature rules with guaranteed numerical stability and convergence for increasing order.We continue the paper in§2with an illustration of the difficulties of using Newton-Cotes quadrature.We describe least squares quadrature rules in§3,a stable implementation based on the method of Forsythe in§4and nonnegative least squares methods in§5.We end the paper with numerical results in§6.We would like to stress the fact that most of the theory in§2is present already in the papers[17,18].In this paper,we supplement a self-contained description of this theory with pointers to and descriptions of applicable existing algorithms and with extensive numerical examples in the later sections.2Newton-Cotes quadrature2.1PreliminariesIn the following,we will characterize the quadrature ruleQ[f]:=Nj=1w j f(x j),(3)with the vector of quadrature points x=[x j]T∈R N and the weight vector w=[w j]T∈R N by[x,w].We are interested in a rule that is exact on a finite-dimensional function spaceV:=span{φi}d−1i=0.(4) For a given vector x,interpolatory quadrature rules in general require N=d quadrature points to satisfy the d exactness conditionsQ[φi]=I[φi],i=0,...,d−1.(5) The corresponding weights can be found by solving the linear systemS w=B,(6) where the matrix S∈R d×d consists of function evaluations ofφi,S i,j=φi(x j),(7) and the right hand side vector B∈R d contains the momentsB i=I[φi].(8)3Figure1:The quantityκ(w)for Newton-Cotes quadrature as a function of the order d.High-order rules rapidly become very unstable.The quadrature rule[x,w]constructed in this way interpolates the function f in the points x by a linear combination of the basis functions and integrates the result exactly,hence the name interpolatory quadrature.A sufficient condition for the unique existence of the rule is that the matrix S in(6)is non-singular. This implies that the basis functions can interpolate any function in the points x.A Chebyshev set of functions on[a,b],which can interpolate any function on any d points in[a,b],is therefore suitable to construct general quadrature rules.Of those,polynomial basis functions are most popular.We denote by P n the space of all polynomials up to degree n.2.2Definition of Newton-Cotes quadratureA specific choice of points x is an equidistant set.Consider without loss of generality the interval[a,b]=[−1,1]and define the equidistant points1x NC j =−1+2jN−1,j=0,...,N−1.(9)The interpolatory quadrature rule on x NC that is exact for all polynomials in P d−1for w(x)=1is called a Newton-Cotes rule.The two-point rule is the trapezoidal rule,with two constant weights and order2.The three point rule is Simpson’s rule,which has order4.2.3Convergence of Newton-Cotes quadratureEquispaced points appear in a variety of applications and Newton-Cotes quadra-ture therefore has great practical signifie of Newton-Cotes rules for high order is not recommended however,due to the numerical instability of such rules.Quadrature rules may be unstable if the weights are large and differ in sign.A common measure for the stability of quadrature rules isκ(w)=Ni=1|w i|.(10)1Note that for numerical stability it is always a good idea to transform[a,b]into[−1,1].4(a)f(x )=11+x 2(b)f (x )=11+8x 2Figure 2:The absolute error of Newton-Cotes quadrature as a function of the order d for two example functions with poles near the real axis.Both examples show divergence due to numerical instability.If all weights are positive,we have κ(w )= N i =1w i =I [1].In all other casesκ(w )>I [1].The relevance of the quantity κ(w )lies in the following basic bound on the worst case error.Assume that ˜f is a user-supplied function such that |˜f(x )−f (x )|≤ǫ.Then, N i =1w i ˜f (x i )−N i =1w i f (x i ) ≤N i =1|w i (˜f (x i )−f (x i ))|≤N i =1|w i |ǫ=ǫκ(w ).This implies that,for example,round-offerrors due to inexact arithmetic may be multiplied by the factor κ(w ).It is well-known that Newton-Cotes rules of high order (higher than 9for w (x )=1)have weights with mixed sign and round-offerror may therefore affect convergence.Worse still,even in exact arithmetic Newton-Cotes rules of high order may not converge (for d →∞)to the value of the integral [13].This is due to the failure to converge of the underlying polynomial interpolation in equidistant points.A sufficient condition for convergence is that f is analytic in an ellipse centered at (a +b )/2,with a major axis of length 108(b −a )along the x -axis and a minor axis of length 68(b −a )into the complex plane [3].We illustrate these properties in Figures 1and 2.Fig.1displays κ(w )for Newton-Cotes quadrature as a function of the order d .The quantity increases exponentially starting from d =9.Fig.2shows the error of Newton-Cotes quadrature applied to the integrands f (x )=11+x 2and f (x )=11+8x 2.In the former case (left panel),the process initially converges but then it diverges due to the numerical instability of the quadrature rules.In the latter case (right panel),the process does not converge because the function is not analytic in a sufficiently large region (the poles at ±i 2√2are too close to the real axis).After a while,divergence occurs again due to numerical instability.These examples were computed in Matlab in double precision.53Least-squares quadratureWe set out to construct stable high-order quadrature rules on equidistant point sets.In order to achieve this goal,we introduce redundancy by allowing the number of quadrature points N to be greater than the order d of the quadrature rule.Wefind the corresponding weights by solving a least-squares problem. 3.1Formulation as a least-squares problemFor N≥d,the counterpart of the linear system of equations(6)is given byA w=B,(11) where A∈R d×N is a rectangular matrix with elementsA i,j=φi(x j),i=1,...,d,j=1,...,N.(12) The right hand side vectorB is the same as before,with elements given by(8).The existence of a solution to(11)follows from the existence of interpolatory quadrature rules.Lemma3.1.If{φi}d i=1is a Chebyshev set and N≥d then system(11)has at least one solution.Proof.One can construct an interpolatory quadrature rule of order d using d distinct arbitrary points by solving(6).The matrix S is non-singular because {φi}d i=1is a Chebyshev set.One can construct such a rule for any subset of d points from x.Extending that rule with zero weights for the remaining points yields a solution to(11).It follows that the matrix A has rank d.The set W of all solutions to(11)is a linear space with dimension N−d[11].From this space,we choose the least squares solution w∗,i.e.,the one that minimizes w 2:w∗=arg minw∈W w 2.(13) The least squares solution w∗to(11)exists and is unique[11].An explicit expression is found by solving the systemAA T u=B,(14) and settingw∗=A T u.(15) Note that AA T∈R d×d is a square matrix.The set W consists of the sum of w∗and any vector in the kernel of A,i.e.,W:={w∗+v|Av=0}.(16) In the following,we consider the case of quadrature rules that are exact for polynomials in P d−1.There is considerable freedom in the choice of the basis functionsφi.For interpolatory quadrature rules,this freedom is often used to reduce the condition number of matrix S in(6),for example by using Chebyshev polynomials.The least squares problem suggests a useful alternative.63.2Discrete orthogonal polynomialsLet us introduce the discrete scalar product associated with the vector x,u(f,g)=Nj=1f(x j)g(x j)(17)and the corresponding normf u:= u(f,f).(18) The scalar product u(·,·)is positive definite on the space of polynomials up to degree N−1.Note that the polynomial p N of degree N that vanishes at x j satisfies u(p N,p N)=0.It follows that we can define afinite sequence of discrete orthogonal polynomials.The n-th orthogonal polynomial p n satisfiesp n(x)∈P n and∀q∈P n−1:u(p n,q)=0.(19) These polynomials are only defined up to a scaling factor.We define the nor-malized sequence of polynomials q n byq n(x)=1p n u p n(x),n=0,1,...,N−1.(20)3.3Characterizing the least squares solutionRecall from(14)that the least squares problem is solved in terms of the matrixC:=AA T.(21) One can verify that the elements of C are given explicitly byC m,n=Nj=1φm(x j)φn(x j)=u(φm,φn),m,n=1,2,...,d.(22)It follows immediately that if the basis functions are orthogonal with respect to the discrete scalar product u(·,·),then the matrix C is diagonal.If moreover the basis functions are orthonormal,the matrix C becomes the identity matrix I d×d.In that case,the solution of(14)is given by u=B and we havew∗=A T B.(23) The weights vector becomes a linear combination of the basis functions,evalu-ated in the quadrature points.LetΦi=[q i(x j)]N j=1,then we obtain the following explicit expression for the weights:w∗=d−1i=0 b a w(x)q i(x)d x Φi.(24)This expression holds for N≥d and for any arbitrary set of points x.It is therefore valid for all interpolatory quadrature rules,where N=d,including rules with nonequispaced points.7Expression(24)is very informative.We may derive for example the asymp-totic behaviour of the weights forfixed d as N→∞.Assume that the points x are equispaced on[a,b]=[−1,1].Then the discrete inner product converges to the continuous L2-inner product,lim N→∞2Nu(f,g)=limN→∞2Nf(x j)g(x j)= 1−1f(x)g(x)d x,(25)because the expression2N u(f,g)becomes a Riemann sum.It follows that thediscrete orthogonal polynomials q i(x)converge to the classical Legendre polyno-mials L i(x),that are orthonormal on[−1,1]with respect to the constant weight function.We have for exampleq0(x)=1√N=q0(1)L0(x)and q1(x)→√3√N x=q1(1)L1(x).The normalizing factors in these expression appear because the classical Legen-dre polynomials satisfy L i(1)=1.The elements of B,i.e.,the moments(8),are now given explicitly byB i= 1−1w(x)q i(x)d x→q i(1) 1−1w(x)L i(x)d x.(26)These moments converge to(a scaling of)the coefficients of the expansion of w(x)in Legendre polynomials.In the particular case where w(x)=1,we have 1−1L0(x)d x=2,and 1−1L i(x)d x=0,i=2,3,....Only thefirst moment is nonzero.From(24)it follows that the least squares weights converge to B1times thefirst row of A T.This is a constant value,w∗→2N,N→∞.(27) This shows among other things that,forfixed d and sufficiently large N,the weights will become positive.The convergence is rather slow however,due to the slow convergence of the limit in(25).Up to a scaling,the discrete inner product is essentially an order1quadrature rule for the continuous inner product.The convergence can be increased by creating inner products that converge faster. By altering the inner product we can also study the case of general weight functions.3.4More general inner productsConsider the inner productv(f,g)=Nj=1r j f(x j)g(x j),(28)with values r j>0.The associated norm is f v= v(f,f).We mayfind the weight vector w∗that minimizes w V by solving the least squares problemAR˜w=B,(29)8where R∈R N×N is a diagonal matrix with entries R jj=√r j.The least squares solution of(29)is found as˜w∗=(AR)T˜uwhere˜u is the unique solution ofAR(AR)T˜u=B.We obtain w∗as(see,for example,[11])w∗=R˜w∗.If the basis functions are orthonormal polynomials with respect to v,then the matrix ARR T A T is diagonal and˜u=B.Wefind in this case that the weights are given explicitly byw∗=(AR)T B,(30)orw∗=d−1i=0 b a w(x)q i(x)d x Φi·r,where byΦi·r we mean the product pare this to(24).We will now choose the coefficients r i such that the discrete orthogonal polynomials converge to the polynomials that are orthogonal with respect to the weight function w(x).For this,it is sufficient thatlim N→∞v(f,g)=limN→∞Nj=1r j f(x j)g(x j)= b a w(x)f(x)g(x)d x.This is possible if w(x)≥0is a positive weight function.Any quadrature scheme with positive coefficients suits the pattern.We may for example use a composite trapezoidal rule that incorporates the weight function w(x).Thefirst orthonormal discrete polynomial is given in general byq0(x)=1 N j=1r j.It follows that thefirst moment is given byB1= b a w(x)q0(x)d x=1 N j=1r j b a w(x)d x→ Nj=1r j,N→∞,because the sum of r j converges to b a w(x)d x by construction.We have that B i→0,i=1,2,...,d−1.From(30)it follows that the weights in this case converge to B1times thefirst row of(AR)T,and wefind thatw∗→r.(31) The least squares quadrature corresponding to the inner product simply con-verges to the vector r=[r j]T,forfixed d and in the limit N→∞.Since r j>0,the weights of the least squares quadrature rule will become positive for sufficiently large N.93.5High-order correctionsFrom the results in the previous section§3.4,we see that the least squares quadrature rules can be interpreted as high-order corrections to low-order com-posite quadrature schemes.Assume that[x,r[N]]is a quadrature scheme foreach N,with positive weights r[N]j >0and such that the quadrature approx-imation converges to I[f]for N→∞.Compute the least squares quadrature rule[x,w∗]that minimizes w v.This involves constructing polynomials that are orthogonal with respect to a weighted l2inner product(28),with r[N]jas weights.Then[x,w∗]has order d,regardless of the order of[x,r[N]],but the weights w∗converge to r[N].Thus the difference w∗−r[N]becomes small.The rate of convergence depends on the order of[x,r[N]].High-order corrections are well-known for the trapezoidal rule,based on the Euler-Maclaurin formula(see,for example[1]and references therein).These are sometimes called Gregory rules.The focus lies on constructing corrections near the endpoints only.This compares favorably to the current setting,where the corrections are spread over all weights,including those in the interior of the integration interval.On the other hand,the current construction guarantees positive weights,arbitrarily high order,equispaced points and fast construction.4An efficient construction algorithmA stable algorithm to compute both interpolatory and least squares quadrature rules,on equidistant and arbitrary point sets,hinges on efficient methods to work with orthogonal polynomials.In this section we recall the use of the three-term recurrence formula to obtain a fast and stable method.This approach was first explored by Forsythe in[8].4.1Construction of the orthogonal polynomialsThe polynomials p n that are orthogonal with respect to the discrete inner prod-uct v(·,·),as defined by(28),satisfy the well-known three term recurrence relation[10]p n(x)=λn(x−αn)p n−1(x)−λnβn p n−2(x),n=2,3,...,N−1.(32) with the coefficientsαn andβn given byαn=v(xp n−1,p n−1)v(p n−1,p n−1)(33)andβn=v(xp n−1,p n−2)v(p n−2,p n−2).(34)Theλn is a scaling parameter that can be used to normalize according to(20). Alternatively,one can chooseλn=1,which results in a monic polynomial.The three-term recurrence relation is a useful and efficient way to construct the discrete orthogonal polynomial sequence.The recursion is started by settingp−1(x)≡0and p0(x)=1.10The recurrence relation is also a numerically stable way to evaluate these poly-nomials in any required point x.It takes O(N)operations to evaluateαn andβn for each value of n.Since n ranges from1to d−1in our application,theorthogonal polynomials can be constructed in O(Nd)operations.If the recur-rence coefficients are stored,the orthogonal polynomials can subsequently beevaluated in M points in O(Md)operations.4.2Computation of the momentsThe right hand side of the linear system(29)requires the computation of themoments b a w(x)q i(x)d x,i=0,...,d−1,(35) where q i(x)are the normalized discrete orthogonal polynomials.The compu-tation of moments is a recurring problem in most methods for constructingquadrature rules.As such,it has been thoroughly investigated and describedin the literature.We refer the reader to[9]for an account of the theory andmethods.In our numerical implementation,we have evaluated these moments simplyby using known Gaussian quadrature rules for the weight function w(x).Thisis a feasible approach for standard weight functions.We used a Gaussian ruleof order d with⌊d2⌋points,which leads to⌊d2⌋evaluations per integrand.Each integrand evaluation requires O(d)computations using the recursive schemefrom§4.1.This results in O(d2)computational complexity.4.3Computational complexityAn efficient algorithm for constructing our least squares quadrature rules canbe summarized as follows.1.Construct the sequence of discrete orthogonal polynomials p n(x),n=0,...,d−1,by computingαn andβn for n=1,...,d−1.This steprequires O(Nd)operations.2.Normalize the sequence by computing the norms p n u for n=0,...,d−1.This step also takes O(Nd)operations.pute the moments(35)of the orthogonal polynomials q n.The com-putational complexity depends on the method that is used,but it is alwaysindependent of N.Our implementation requires O(d2)operations.4.Evaluate the weights by w∗=A T R T B.This is another O(Nd)step. Overall,the algorithm works inO(Nd)+O(d2)(36) steps.For afixed d,this method is only linear in N.For interpolatory rules, where d=N,the method requires O(d2)operations.115Nonnegative least squaresThe least squares rules are not optimal from the point of view of Tchakaloff’s upper bound on the number of points required to obtain positive weights.The minimization of a(weighted)l2-norm has the effect of spreading out the weights over all quadrature points–none of the weights are zero.It is known however that a rule with only d nonzero weights should exist on an equidistant grid with N points,provided that N is sufficiently large[4].Such rules can be found by solving a least squares problem subject to inequality constraints,for which standard algorithms are readily available.5.1The NNLS formulationA least squares problem with inequality constraints demanding a nonnegative solution is called a NNLS(nonnegative least squares)problem in[12].A NNLS problem is defined as follows.Minimize A x−b subject to x j≥0,j=1,...,N.(37) In our setting,we are interested in the minimization of A w−b for the standard l2inner product and in the minimization of AR˜w−b for weighted inner products.Both casesfit the standard NNLS problem.Note that if˜w j≥0,then w j=r j˜w j≥0.Problem(37)always has a solution.For an underdetermined matrix A, however,there may be infinitely many solutions.In that case,a solution is sought that maximizes sparsity of x,i.e.,one wishes tofind the vector x with as much zero entries as possible.5.2Application to quadratureIn our setting,the solution of(37)always corresponds to a quadrature rule with positive weights.It is not necessarily the case however that A w−b =0, because a positive rule with the required exactness conditions may not exist.If N is sufficiently large,such that a quadrature rule with positive weights exists that satisfies A w=b,then the solution to the NNLS problem will also be a quadrature rule with positive weights that satisfies A w=b.Moreover,this rule has maximal sparsity.From theoretical considerations,we then know that the rule only has d nonzero entries.Note that this rule is not necessarily unique.If0< A w−b <ǫ,then the quadrature rule is not exact.The rule may have any number of nonzero weights,but none of the weights are negative.The rule is not necessarily exact for any polynomial.However,ifǫ≈0,then it is reasonable to expect the quadrature approximation to have small error.An algorithm to solve problem(37)exists that always converges.We refer the reader to[12,Ch.23]for a detailed description.It is an iterative algorithm, and convergence is proved by showing that the residual A x−b decreases in each iteration and that because of this the number of iterations isfinite.In our numerical experiments,we used a Matlab implementation of the NNLS algorithm(lsqnonneg).12。

相关文档
最新文档