About: "CAE":

Computer Experiments in Fluid Dynamics

by Francis H. Harlow and Jacob E. Fromm

Lab. v. Scheepsbouwkunde
Technische Hogeschool Delft
MARCH 1965
REPRINT FROM: SCIENTIFIC AMERICAN

The fundamental behavior of fluids has traditionally been studied in tanks and wind tunnels. The capacities of the modern computer make it possible to do subtler experiments on the computer alone.

The natural philosophers of ancient Greece liked to do experiments in their heads. Centuries later Galileo developed the "thought" experiment into a fruitful method of inquiry and in our own time the method appealed strongly to such men as Albert Einstein and Enrico Fermi. Now the arrival of the modern electronic computer has made the method immensely more powerful and versatile. The computer makes it possible to simulate nature with numerical models and to investigate it in ways that have never been practicable before. Physical processes of enormous complexity are being examined minutely and with considerable realism. New hypotheses are being proved true or false. In physics, engineering, economics and even anthropology the computer has become a revolutionary tool.

One of the great attractions of experiment by computer is that it can avoid some of the uncertainties of measurement. Moreover, it provides a technique that can be classed as both theoretical and experimental. It is theoretical because it deals with abstract (that is, mathematical) statements of how things relate to one another. It is experimental because the computer is given only data specifying the initial state of a system and a set of rules for calculating its state at some time in the future. The computer worker has no more idea how this future state will unfold than has the traditional worker who conducts a comparable experiment in an actual laboratory.

To demonstrate the power of computer experiments we have chosen a single example involving the dynamic behavior of fluids. The particular experiment is a study of the flow of air past a rectangular rod.

At first thought the use of a computer for calculating this flow may seem to be a needlessly roundabout procedure. Would it not be simpler and more enlightening to put the rod in a wind tunnel and observe how air containing filaments of smoke flows around it? Actually it would not. For many of the questions to be investigated the physical experiment would be more complicated and costly, and it would not provide as much information as the experiment by computer.

For an example one can point to the problem of redesigning the Tacoma Narrows Bridge after it had been shaken to pieces by wind-induced vibrations soon after it was built. For the rebuilding of the bridge many elaborate models were made and tested again and again before a safe design was finally developed. Without doubt much of the cost and time spent on the problem could have been saved by computer calculations if the computers and appropriate numerical techniques had then been available. Experiments with numerical models can show the interaction of winds and a bridge in detail and produce answers in far less time than it takes to prepare a physical experiment.

The Soviet physicist A. A. Dorodnitsyn has remarked about such problems that computer calculation "can give a solution that is not only more rapid and cheaper but also more accurate" than the physical experiment itself.

Experimentation by computer also allows the investigation of many phenomena that are either inaccessible to direct study or involve factors that cannot be measured accurately. In the flow problem that we shall discuss, for example, it is difficult to measure directly in a wind tunnel the temperature distribution in the complicated downstream wake. Computer experiments, however, can yield a reliable description of the temperature distribution.

Another benefit of a computer experiment is that it usually affords far better control of the experimental conditions than is possible in a physical experiment. In wind tunnel studies, for instance, the experimenter must modify his interpretations to include the consideration of such effects as those due to the compressibility of the working fluid, variations in fluid viscosity and uncertainties in flow velocity. In a computer experiment such properties often can be excluded or included at will. Moreover, the computer program can isolate crucial features for examination, can eliminate irrelevant factors and can often assess the experimental uncertainties.

Finally, and most importantly, experiments by computer provide a test of the applicability of theory to the complicated phenomena under investigation. Do the equations of fluid dynamics really represent the correct theoretical description when applied to phenomena as complicated, say, as the oscillatory flow that develops in the wake of a retreating rectangular rod? For such problems the mathematician would like to obtain what he calls an analytical solution—the kind of exact solution that can be obtained by the processes of mathematical analysis. For problems in fluid dynamics, however, the necessary mathematical techniques for obtaining the complete solution have not yet been developed. The detailed results provided by a computer can actually help in the development of analytical solutions to the basic equations of fluid dynamics. Usually in the mathematical model of a complex problem some of the factors can only be approximated, and obtaining a realistic solution depends on finding out which features are crucial for a reasonable representation. With the help of computer experiments one tries to discover workable approximations that will simplify the mathematics needed to solve complicated problems—in this case a problem in oscillatory fluid flow.

The reader will find the "computer wind tunnel" experiment easier to follow if we consider briefly how a fluid behaves when it flows around a fixed object such as a rectangular rod.

At low speed the airflow is smooth and steady, a condition described as laminar flow. At a certain critical speed, which depends on the size of the rod, the laminar flow breaks down. For a rod one inch in height the critical speed in air is about one inch per second; the smaller the rod, the higher the speed at which turbulence begins. If the fluid is more viscous than air, laminar flow is more easily maintained and the critical speed for turbulence becomes higher.
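
The article never uses the term, but the tradeoff it describes among size, speed, and viscosity is captured by the Reynolds number (an aside added here, not part of the original text). Taking the kinematic viscosity of air as roughly 1.5 x 10^-5 m^2/s, the one-inch rod at one inch per second gives

    \mathrm{Re} = \frac{UL}{\nu} = \frac{(0.0254\ \mathrm{m/s})\,(0.0254\ \mathrm{m})}{1.5 \times 10^{-5}\ \mathrm{m^2/s}} \approx 43,

which is about the few-tens value at which bluff-body wakes first begin to shed vortices. A fixed critical Reynolds number accounts for both observations above: halving the rod roughly doubles the critical speed, and a more viscous fluid raises it.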

Above the critical speed the airstream breaks up into vortices that are similar to the small whirlpools seen when a cup of coffee is stirred. These vortices are shed alternately from the top and bottom of the object placed in the airstream. This oscillating wake was first extensively studied by the aerodynamicist Theodor von Kármán and is known as a "von Kármán vortex street."

The oscillating wake sends out pulses that react back on the object itself. The vibration so produced is responsible for the sound made by a golf club swung rapidly through the air and for the whine of a ship's rigging in the wind. It was resonant vibration produced by the wind that caused the Tacoma Narrows Bridge to break and fall into the bay.

As the air speed increases, the vortices in the vortex street become more and more ragged and eventually break up into tiny eddies whose motion is almost entirely random. At this stage fully developed turbulence has been reached.

The known patterns of air motion past an object, then, give us certain definite phenomena to look for in the computer experiments. If the computer reproduces a vortex street and, at a later stage, turbulence, it will show that the theoretical understanding of fluid dynamics is accurate and therefore can be relied on to predict what will happen when a fluid flows past objects of various shapes and at various speeds.

To set up the calculational experiment we must first translate the physical situation into the language of numbers for the computer. For bookkeeping purposes the experimental area in the computer wind tunnel is divided into many square cells, which form the basic computing mesh. A typical mesh requires at least 49 cells in the direction of horizontal flow and 24 cells in the vertical dimension, for a total of 1,176 cells. Each cell must contain two numbers representing the components of average air velocity in two directions, together with other numbers representing such variable quantities as "vorticity," "stream function" and, if heat flow is desired, temperature as well. Finally, the computer must be supplied with a set of operating instructions, or "code," that spells out in detail exactly how the computer must manipulate every number in every cell in order to calculate how the flow configuration will change from instant to instant. It can require billions of mathematical operations and anywhere from a few minutes to a few hours of computing time to carry out the calculations needed to represent the flow of air for a time interval of several minutes. In our studies we have used either an IBM 704 or the somewhat faster machine, also built by the International Business Machines Corporation, known as Stretch.
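
As a present-day sketch of this bookkeeping (the authors wrote machine code for the IBM 704 and Stretch; the NumPy layout and variable names below are purely illustrative), the per-cell quantities might be stored like this:

    import numpy as np

    # 49 cells along the flow and 24 vertically: 49 * 24 = 1,176 cells in all.
    NX, NY = 49, 24

    u    = np.zeros((NX, NY))   # horizontal component of the average air velocity in each cell
    v    = np.zeros((NX, NY))   # vertical component of the average air velocity in each cell
    vort = np.zeros((NX, NY))   # "vorticity" carried by each cell
    psi  = np.zeros((NX, NY))   # "stream function" carried by each cell
    temp = np.zeros((NX, NY))   # temperature, carried only if heat flow is being studied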

The actual development of a successful code is a time-consuming process and is carried out in three steps. The first involves the testing of detailed numerical methods and is strewn with pitfalls. It is no trick, for example, to invent methods that develop numerical instability: the computer results rapidly run askew and lead to complete nonsense. Like physical experiments, computer experiments are also subject to interference by gremlins. Just as the vibration of a motor may produce extraneous turbulence in a wind tunnel, so the numerical approximations fed into a computer may lead to equally unwanted "truncation turbulence."

The second step is to prepare a full-scale code. For our problem in fluid dynamics this required many months, most of them consumed by "debugging," or ferreting out errors in the step-by-step instructions. Such a code is written with sufficient generality so that it can be used to solve a wide variety of roughly similar problems. Thus a good code can be used for years and will often be a source of inspiration for workers in other laboratories.

The third step is to formulate the code in terms of a specific problem. In our oscillating-wake study an important part of the formulation was to determine the "initial" and "boundary" conditions. The initial condition describes the state of the air at the moment the experiment begins.

[Figure caption: turbulence progressively increases (four steps from left to right). In this experiment computational particles are introduced through a single cell (horizontal streaks), as though squirting a jet of colored water into a clear tank. The jet of air is unstable and soon breaks into expanding, irregular vortices like those exhibited by a plume of smoke. Similar but far more complex experiments can be used to test theories about aircraft jet engine noise suppression.]

We could have assumed, for example, that the air was at rest, corresponding to the condition in a real wind tunnel before the fan is turned on. We found that it was simpler, however, to start with the fluid behaving as if it were flowing past the rod in a simple laminar manner without viscosity.

The boundary conditions refer to what is happening at the edges of the computational mesh. Our decision was to have the top and bottom edges represent the walls of the wind tunnel and to have the left edge represent an air input of uniform flow. The right edge gave us more trouble, but we finally arranged for the fluid to flow out and back into the computing region in a way that created a minimum of mathematical disturbance.

The computing process itself can be compared to the making of a motion picture. Starting with the initial conditions prescribed for each of the 1,176 cells in "frame" No. 1, the computer follows the coded instructions to determine the conditions in each cell a brief instant of time later, thereby producing frame No. 2 of the film. Each successive frame is similarly generated on the basis of numerical data computed for the preceding frame. The fastest computer available to us, Stretch, can generate about 10 frames a minute. When the calculation has proceeded far enough, the results are gathered up for study.
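
In modern terms this frame-by-frame procedure is an explicit time-stepping loop. The sketch below shows only the loop structure; the actual cell-update rule used by the authors is not given in the article, so it appears here as a placeholder:

    def advance_one_frame(cells, dt):
        """Placeholder for the coded instructions that carry every cell
        forward by one brief instant of time."""
        new_cells = cells.copy()
        # ... finite-difference update of velocity, vorticity, stream function ...
        return new_cells

    def run_experiment(initial_cells, dt, n_frames):
        frames = [initial_cells]               # "frame" No. 1: the initial conditions
        for _ in range(n_frames - 1):          # each frame is computed from the one before it
            frames.append(advance_one_frame(frames[-1], dt))
        return frames                          # gathered up for study afterward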

The computer's results can be presented in any of several different forms. One form of print-out consists of all the numbers describing the flow in each frame. Usually this form of print-out is restricted to samplings taken at selected intervals, because the complete data for every one of the hundreds or thousands of cycles in an experiment would be far too much for an analyst to digest, to say nothing of storing the reams of paper. Sometimes the computer is programmed to print certain calculations that supply particular points of information, such as the amount of air drag caused by the obstacle at specific wind speeds. The most useful and popular type of print-out, however, is the actual plotting of the flow in pictorial form.

The computer itself can generate plots of the flow configurations and put them on film by means of a microfilm recorder. Several selected frames from such recordings, exactly as they came from the computer, are among the illustrations on this page and preceding pages of this article. The sequence of all the frames of an experiment, combined in a film strip and run through a motion picture projector, gives a very vivid picture of the development of vortices and other features as a fluid flows around an obstacle.

From the numbers describing the flow in each cell of the computing mesh, the computer generates streamlines that show both the direction and the speed of flow throughout the space. The speed is indicated by the spacing between the streamlines: where the lines are close together the flow is fast; where they are farther apart the flow is slower. The computer can show the streamline patterns in either of two ways: as if a camera were photographing a stream of air flowing past it or as if the camera were moving along with the stream. The latter view shows the pattern of vortices in clear detail.
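
The link between streamline spacing and speed comes from the stream function, psi, that the computer stores in each cell. For two-dimensional incompressible flow (a standard relation, not spelled out in the article),

    u = \frac{\partial \psi}{\partial y}, \qquad  v = -\frac{\partial \psi}{\partial x},

so the local speed equals the magnitude of the gradient of psi. Streamlines are curves of constant psi drawn at equal increments; where they crowd together the gradient, and hence the speed, is large.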

The computer can even simulate the motion of markers often used to make flow visible, such as filaments of smoke in air or of dye in water. In the computer the markers consist of "computational particles." At certain desired points in the computation these particles are thrown in (the magic of the computer allows their creation anywhere at will) and thereafter they are carried along wherever the flow of air goes. Their paths of motion produce lines called streak lines. The streak lines generated by the computer give a remarkably faithful impression of the behavior of smoke or dye filaments. Perhaps the most striking of these computer constructions is the configuration of streak lines emerging from a jet: it looks like a filament of cigarette smoke.
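
A minimal sketch of how such computational particles can be advected to trace streak lines (the nearest-cell velocity lookup and the simple one-step time advance below are illustrative assumptions, not the authors' method):

    import numpy as np

    def advect_markers(xp, yp, u, v, dx, dy, dt):
        """Move marker particles with the velocity of the cell each one occupies."""
        i = np.clip((xp / dx).astype(int), 0, u.shape[0] - 1)
        j = np.clip((yp / dy).astype(int), 0, u.shape[1] - 1)
        return xp + dt * u[i, j], yp + dt * v[i, j]

    # Releasing a few particles from the same cell at every step and plotting all of
    # their positions traces out a streak line, the numerical analog of a smoke filament.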

Usually the computer is programmed to furnish several different configuration plots, showing features of the flow from various points of view. These are by no means merely an interesting album of pictures. They show the qualitative features of the development of the flow and provide precise quantitative information about the flow at every point. In many cases the computer reveals important details that would be extremely difficult to obtain from physical experiments.

The example we have described of a computer technique for investigating fluid flow is only one of many successful efforts that have been made to carry out complex experiments by computer. Other workers have used computers to tell in detail what is going on inside a nuclear reactor and to assess in an instant the progress of a rocket soaring into space. Tomorrow the computer may give accurate forecasts of the weather, of the future of the economy and of the state of man's health.


n5321 | August 9, 2025, 21:01

The Finite Element Method in China


On the upper reaches of the Yellow River, the great river cuts through deep gorges, its waters roaring like an angry lion. After more than two thousand li it makes an abrupt ninety-degree turn and heads west, a scene of unmatched grandeur.

At that bend stands China's first hydropower station in the million-kilowatt class, the Liujiaxia Hydropower Station, whose main structure was the first concrete dam over one hundred meters high built in China. Little known, however, is that behind its completion stood a group of Chinese computational mathematicians. Facing an urgent national need, they worked day and night on the design and construction of the dam and helped the station finish on schedule.

In the 1950s and 1960s, computational mathematicians at the Chinese Academy of Sciences, with Feng Kang (elected an academician of the Chinese Academy of Sciences in 1980) as their leading figure, drew inspiration from the dam computations and, independently of the West, established the mathematical theory of the finite element method, opening a new field in the methods and theory of scientific and engineering computation.

Today the finite element method is one of the core technologies of industrial software for research, development, and design. Bridges, tunnels and dams, aircraft and ships, mobile phone chips: the creation of one complex artifact after another depends on its support.

Now a group of young people at the Academy of Mathematics and Systems Science of the Chinese Academy of Sciences (hereafter "the Academy of Mathematics") are standing on the shoulders of those predecessors, working to build the computational kernel of a new generation of basic industrial software and setting out on the road to new industrialization.

1 The seed: Coming to the aid of Liujiaxia

In 1958, in Gansu Province, on the upper reaches of the Yellow River.

Construction began on the million-kilowatt Liujiaxia Hydropower Station, a major national project whose main structure was a large concrete dam more than one hundred meters high. It was the first time China had designed, engineered, and built a hydropower project of this scale on its own. If it succeeded, the surging Yellow River would "pause" here to generate electricity, irrigate farmland, and control floods for the benefit of the local people.

The difficulty of the project exceeded all expectations, and both the design and the construction methods differed from anything done before. Work on the dam progressed slowly and was even suspended for a time in 1961.

The computational work on the dam was first taken up by Huang Hongci and others in Group 2 of the Third Office of the Institute of Computing Technology of the Chinese Academy of Sciences, which had been split off from the Academy's original Institute of Mathematics in 1956 (it is the predecessor of today's Institute of Computational Mathematics and Scientific/Engineering Computing at the Academy of Mathematics, hereafter "the Third Office"), with Feng Kang providing technical guidance. The programs written by Huang Hongci and his colleagues were of very high quality and laid a solid foundation for the dam computations.

In February 1963, just after the Lunar New Year and still in the depth of winter, Zhu Zhaojun, deputy head of the Liujiaxia dam design group, arrived in Beijing, travel-worn, to ask the researchers for help with the computation of stresses in the dam. After careful consideration, Feng Kang and the other leaders of the Third Office assigned the task to Cui Junzhi. Cui was then 25 years old, one year out of Northwestern Polytechnical University, and shared an office with Huang Hongci, Shi Zhongci, and others.

Solving an engineering problem with computational mathematics usually takes four steps: build a mathematical model, design an algorithm, implement it in a program, and run it on the machine. For a long time the researchers were stuck at the computational method.

"The engineers knew the trial-load method used in practice; my task was to solve the 40th-order system of linear equations derived from it and thereby verify the correctness of the method," recalled Cui Junzhi, who was elected an academician of the Chinese Academy of Engineering in 1995. After enormous effort and repeated checking he found that the coefficient matrix of the linear system produced by that method was nearly singular and hard to solve numerically. He also tried other approaches, including the stress-function program developed by Huang Hongci, but none of them gave a stress field that satisfied the engineers.

One main cause of the difficulty was the lack of computing power. Cui recalls that China's two earliest computers, the "103" and the "104", had only recently come into being. He had to solve discretized equations with more than 1,000 unknowns on the "104", a machine with a 39-bit word length, 4K capacity, a speed of ten thousand operations per second, and a core memory of only 2,048 full-length words.

The Liujiaxia engineers had placed their last hopes in the researchers of the Chinese Academy of Sciences, pleading with them to "find a way, whatever it takes."

"When you cannot meet the user's requirements, it is time to strike out on a new path. We began searching for a new method," Cui said.

Feng Kang (left) with Cui Junzhi.

2 The turning point: Joining forces to get through the crisis

In the early 1960s, in response to the call to "march toward science," the Chinese Academy of Sciences proposed the research model of "letting tasks drive disciplines": taking major tasks of national need as the goal, carrying out systematic research, and solving the scientific problems arising in national development.

With this responsibility on its shoulders, the Third Office's principal task became computing the stress field of the dam accurately enough to satisfy the engineers.

The turning point came from a paper and a book recommended by Feng Kang.

Cui still remembers that in one lecture Feng Kang brought up the 1947 paper by Prager and Synge in the American journal Quarterly of Applied Mathematics, which showed how to write a differential equation in variational form and derive difference schemes from the variational principle, an idea that shed light on the dam computation. "Mr. Feng was in effect sowing the seeds of the finite element method; that lecture opened the curtain on the Third Office's systematic research," Cui said.
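
The idea Feng Kang was pointing to can be stated compactly with a standard model problem (an illustration added here, not the dam equations themselves): for -\Delta u = f on a domain \Omega with u = 0 on the boundary, the solution is the function that minimizes the energy functional

    J(v) = \int_\Omega \left( \tfrac{1}{2}\,|\nabla v|^2 - f\,v \right) dx.

Restricting the minimization to simple piecewise-polynomial trial functions on a mesh turns the differential equation into a finite system of algebraic equations, which is exactly the path from "variational form" to "difference scheme" described above.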

At the same time, at Feng Kang's urging a seventh research group, the theory group, was set up with Huang Hongci as its head. Feng recommended to Huang the book Finite-Difference Methods for Partial Differential Equations by Forsythe and Wasow, which concentrates on numerical methods for the three classes of partial differential equations and gave Huang important inspiration.

Under Feng Kang's planning, the "dam computation" group was divided into three teams, which attacked the problem from the angles of the variational method, the integral conservation method, and removing the dam foundation from the model.

Cui was on the "integral conservation" team, whose approach started from the equilibrium equations and substituted the stress-strain relations into the Lamé equations. After a period of complex and arduous computation they obtained their first batch of results just before the Spring Festival of 1964. Having verified the correctness of the scheme, the solution method, and the program, Cui soon computed a stress field for the Liujiaxia dam. Checked by Zhu Zhaojun and others, the stresses were essentially in balance and the results fairly satisfactory, but the errors near the dam heel and dam toe were still large and the accuracy of the stress field was not yet sufficient.

Cui then threw himself into writing China's first finite element program for plane elasticity problems, and with it computed a stress field that satisfied the designers.

Writing programs in those days was extremely difficult. The "104" had no operating system, no compiler, and no system software for data or process management; every program, including data input and the printing of results, had to be written directly in machine instructions as strings of code.

Within a memory of only 2,048 storage cells, Cui had to solve an algebraic system of roughly 1,000 unknowns. About 500 cells had to be set aside for the program itself, leaving roughly 1,500 cells in which to carry out the solution. This demanded meticulous budgeting and a carefully designed iterative algorithm.
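
To see how tight this was (an arithmetic aside, not from the original article): a dense coefficient matrix for a system of order n of about 1,000 would by itself require n^2, about 1,000,000 words of storage, several hundred times the roughly 1,500 cells available, so the matrix could never be held explicitly and the iteration had to work from compactly stored or regenerated coefficients.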

After debugging, error hunting, revising, and checking day and night, Cui used his self-written finite element program to compute, for the Liujiaxia design group, the stress fields of more than ten design schemes under their respective load cases.

In mid-May 1964 a briefing meeting on the Liujiaxia computation task was held. Zhu Zhaojun and his colleagues, Huang Hongci, and others attended; Cui reported on the completion of the computational work, and Zhu praised it highly.

In October 1966 news of success came down from the plateau: the Yellow River, which had surged forward like ten thousand galloping horses, was cut off by the towering dam and "locked" within the gorge, calm and gentle. From then on the Liujiaxia Hydropower Station has been a shining "pearl of the plateau."

After the dam was successfully built, the central government sent an open telegram commending the researchers for their outstanding contributions to the Liujiaxia project.

"Mr. Feng always took the long view, guiding the development of computational mathematics and its applications," Cui said.

October 1978: Feng Kang (center) with Huang Hongci and Zhang Guanquan, a researcher at the Academy of Mathematics.

3 A first: Independent of the West

Feng Kang was all along the de facto academic director and guide of the computational mathematics team. Because of him, the "seed" of the finite element method took root in the soil of the Liujiaxia dam and grew into a world-class academic achievement.

What exactly is a finite element? Feng Kang once described it vividly: "divide the whole into parts, straighten the curved, master the complex with the simple, turn the difficult into the easy."

The finite element method can be thought of as a special kind of "jigsaw puzzle": to solve a large, complex problem, say the structural analysis of a large building, one first breaks it into many small pieces; these "puzzle pieces" are the "finite elements." Each element is then analyzed in turn and its own equations are set up; finally the pieces are assembled into a system of equations that is solved, settling the original problem.
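
To make the "jigsaw" concrete, here is a minimal sketch in Python of the three steps for the simplest model problem, a one-dimensional bar governed by -u'' = f with both ends fixed, split into linear elements. It illustrates the idea only; it is not Feng Kang's or Cui Junzhi's formulation:

    import numpy as np

    def assemble_and_solve(n_elements, length=1.0, load=1.0):
        """Split the bar into elements, build each element's equations,
        assemble them into one global system, and solve it."""
        n_nodes = n_elements + 1
        h = length / n_elements
        K = np.zeros((n_nodes, n_nodes))      # global stiffness matrix
        f = np.zeros(n_nodes)                 # global load vector

        ke = (1.0 / h) * np.array([[1.0, -1.0], [-1.0, 1.0]])   # one "puzzle piece"
        fe = load * h / 2.0 * np.array([1.0, 1.0])
        for e in range(n_elements):           # add each piece's contribution
            nodes = [e, e + 1]
            K[np.ix_(nodes, nodes)] += ke
            f[nodes] += fe

        u = np.zeros(n_nodes)                 # fixed ends: u = 0 at both boundaries
        interior = np.arange(1, n_nodes - 1)
        u[interior] = np.linalg.solve(K[np.ix_(interior, interior)], f[interior])
        return u

    print(assemble_and_solve(4))   # more pieces (a finer mesh) give a more accurate answer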

In fact, the ancient Chinese story of Cao Chong weighing the elephant and the mathematician Liu Hui's computation of the circumference of a circle by inscribed polygons are concrete embodiments of the finite element idea.

In 1943 the famous mathematician Courant published the world's first paper containing the idea of finite elements, but because computers did not yet exist the article attracted little of the attention it deserved.

More than 20 years later, with the rapid growth of aviation, complex structural analysis problems placed higher demands on computational methods, and the advent of the computer made large-scale finite element computations feasible. Boeing engineers Turner and Topp, civil engineering professor Clough, and aeronautical engineering professor Martin jointly published, in an aeronautical journal, a paper that used finite element techniques to compute the strength of aircraft wings. It is generally regarded as the starting point of the finite element method in the engineering community.

The term "finite element method" was first formally introduced by Clough in 1960 at an American Society of Civil Engineers conference on computing. His paper treating plane elasticity problems extended the range of application beyond aircraft to civil engineering.

But China was then cut off from the West, and Chinese mathematicians had little access to the frontier of finite element development.

"In the West the finite element method was proposed as an analysis method for structural engineering; in China it grew out of mathematics. China and the West developed the finite element method independently, along different paths," Cui said.

In 1965 Feng Kang published the paper "Difference Schemes Based on the Variational Principle" in Applied Mathematics and Computational Mathematics, a family of numerical algorithms for solving partial differential equations, and the now-celebrated finite element method.

In fact Huang Hongci, later a professor at Hong Kong Baptist University, had published China's first paper containing finite element ideas in 1963. But as he has said on many occasions: "My paper already contained the most important principle in the proof; Mr. Feng, however, reached the most general conclusions under the broadest conditions, which is possible only on a deep mathematical foundation, and it is therefore pioneering at a higher level. The West obtained similar results only after 1969." "If the finite element method had not been treated the way the mathematicians treated it, its applications would have been greatly restricted, and it could not have attained the breadth of theoretical and practical attention it enjoys today."

"Difference Schemes Based on the Variational Principle" is both Feng Kang's enduring masterpiece and the mark of Chinese scholars' creation of finite element theory independently of the West.

After the reform and opening up, Feng Kang's paper was translated into English and became known to the world. Lax, former science adviser to the President of the United States and director of the Courant Institute of Mathematical Sciences at New York University, noted in a memorial article that Feng Kang "created the theory of the finite element method independently of the developments in applied mathematics in the Western countries... making important contributions both to the implementation of the method and to the establishment of its theoretical foundation."

The famous mathematician Shing-Tung Yau observed in 1998: "There are three achievements in modern Chinese mathematics that surpass the West or stand shoulder to shoulder with it: Shiing-Shen Chern's characteristic classes, Hua Luogeng's several complex variables, and Feng Kang's finite element computation." Internationally known mathematicians such as Babuška and Lions also spoke highly of the work in their writings.

Similar assessments were soon echoed by many international peers. This enduring work was like a torch in the dark, guiding and warming that group of young people then in their thirties.

The Third Office, on Zhongguancun South Street in Beijing, soon grew lively, with a steady stream of letters and visitors. To give a comprehensive introduction to the finite element method, Feng Kang, Cui Junzhi, and others ran a training course attended by nearly 300 people, many of them well-known scholars. Cui recalls that the course had a great influence and did much to promote the spread and application of the finite element method in China.

In 1982 Feng Kang, Huang Hongci, Wang Jinxian, and Cui Junzhi received the second-class State Natural Science Award for creating the finite element method independently of the West.

Feng Kang (first from right) at an academic lecture.

4 Carrying the torch: Breaking through on the industrial software kernel

Sixty years have now passed since the computational mathematics team of the Chinese Academy of Sciences completed the Liujiaxia computation. That history is often recalled by computational mathematicians, who keep asking: why was it Feng Kang who created the finite element method independently of the West, and how can the spirit of the older generation of scientists be carried forward?

Cui Junzhi explains Feng Kang's success this way: "He had deep cross-disciplinary knowledge and a way of thinking that integrated many disciplines. He could not only grasp the physical models of scientific and engineering problems and see feasible paths to their solution, but also take a unified view of the substance of different scientific and engineering problems and express them in highly abstract, rigorous mathematical form. Add to that his total devotion to research and a determination never to give up short of the goal, and he produced internationally pioneering results one after another."

Feng Kang died in 1993. Although today's young researchers never had the chance to hear him in person, they have long been familiar with the story of Feng Kang and the Liujiaxia computation and draw strong inspiration from it.

With the rapid development of computers, software based on the finite element method has become the core of computer-aided engineering (CAE), the principal class of software supporting the development of modern engineering projects and equipment. CAE software can computationally analyze the function, performance, safety, and reliability of engineering systems and equipment, and can simulate future systems to confirm their usability and reliability.

CAE, together with computer-aided design (CAD) and computer-aided manufacturing (CAM), is collectively abbreviated CAX. It is the cornerstone of modern industry, the soul of intelligent manufacturing, and a fundamental strategic support for China's becoming a manufacturing power.

In July 2023, with the support of a Strategic Priority Research Program of the Chinese Academy of Sciences, the Academy of Mathematics established a Basic Software Research Center to attack the key scientific problems in the computational methods and mathematical theory of integrated CAX industrial software.

Young researchers at the Academy of Mathematics, including Cui Tao and Jia Xiaohong, have formed the "Feng Kang CAX Youth Task Force," taking up the mantle of their predecessors to build the mathematical kernel of a new generation of CAX basic industrial software with independent Chinese intellectual property rights.

The Basic Software Research Center gives major responsibility to young people whose average age is under 40, and they are judged not by how many papers they publish but by whether they deliver "solid, usable" algorithms and software.

This is not an easy road. But as the new generation sees it, "though the road is long, walking will get us there; though the task is hard, doing will see it done."

The finite element method pioneered under Feng Kang's leadership is now breeding new pioneering, leading results and bursting with fresh, limitless possibilities.

January 2024: release of the integrated CAX algorithm development and verification platform. Photo courtesy of the Academy of Mathematics.


n5321 | July 31, 2025, 23:53

Thirty years of development and application of CFD at Boeing Commercial Airplanes,

The 2004 paper reviewing 30 years of CFD application at Boeing.

From 1973 to 2004 Boeing built hundreds of billions of dollars' worth of airplanes. Throughout those 30 years, Boeing's engineers needed tools that could accurately predict and confirm the flight characteristics of an aircraft. Before 1973 those tools consisted of analytic approximations, wind tunnel tests, and flight tests; over the following three decades, CFD joined them.

This short piece describes the acquisition, development, and application of CFD at Boeing in Seattle.

Introduction

In 1973, Boeing Commercial Airplanes ran an estimated 100 to 200 CFD analyses. In 2002 the number exceeded 20,000, and those 20,000 cases involved far more complex situations. Why? The reasons:

  1. CFD is now acknowledged to provide substantial value and has created a paradigm shift in the vehicle design, analysis, and support processes;

  2. the CFD effort at Boeing was led by a strong and capable visionary, Dr. Paul Rubbert, who recruited and was supported by many talented managers and technical people;

  3. the CFD effort was well diversified, involving algorithm research, code development, application and validation studies, process improvement, and user support;

  4. Boeing developed a broad product line, supported by many innovative and demanding project engineers;

  5. computing power and affordability improved by three to four orders of magnitude;

  6. numerous pioneers in academia and government continued to make algorithmic breakthroughs;

  7. there were funding managers at Boeing and in the Government who were not averse to taking risks.

The role and value of CFD

The engineer's goal: predict and confirm flight characteristics. The traditional means: analytic approximation, wind tunnel tests, flight tests. The new means is CFD: simulation and analysis by numerical algorithms. The value of CFD is that a small number of inexpensive simulations yields the "understanding" needed to complete a design. Specifically, CFD can be used in an "inverse design" or optimization mode, predicting the geometry changes needed to optimize certain flow characteristics or a payoff function (for example, drag). It can also analyze and extrapolate experimental data (typically from testing a reduced-scale model in a wind tunnel) to obtain accurate characteristics of the full-scale airplane, and it can help engineers trace design failures to their source.

Effective use of CFD is a key ingredient in the successful design of modern commercial aircraft.

Effective use of CFD is a key ingredient in Boeing's successful airplane design.

Intelligent, extensive, and careful use of CFD is a major strategy in Boeing's product development. Experience to date at Boeing Commercial Airplanes has shown that CFD has had its greatest effect in the aerodynamic design of the high-speed cruise configuration of a transport aircraft.

Experience shows that CFD has played a crucial role in Boeing's airplane design. Over the past 20 years, using CFD in airplane development has saved Boeing tens of millions of dollars. That sounds like a lot, but it is only a small fraction of the value CFD has created for the company; the larger part is the added value CFD gives to the airplane itself. Value to the airline customer is what sells airplanes!

Value is added to the airplane product by achieving design solutions that are otherwise unreachable during the fast-paced development of a new airplane. Value is added by shortening the design development process. Time to market is critical in the commercial world, particularly when starting after a competitor has committed a similar product to market. Very important in the commercial world is getting it right the first time. No prototypes are built. From first flight to revenue service is frequently less than one year! Any deficiencies discovered during flight test must be rectified sufficiently for government certification and acceptance by the airline customer based on a schedule set years before. Any delays in meeting this schedule may result in substantial penalties and jeopardize future market success. The added value to the airplane product will produce increased sales and may even open up completely new markets. The result is more profit to both the buyer and seller (who does not have to discount the product as much to make the sale). All this translates into greater market share.

See above for the full discussion of this commercial value.

The CFD development and application process

In industry, CFD has no value of its own. The only way CFD can deliver value is for it to affect the product. CFD must become an integral part of the engineering processes for product design, manufacture, and support; it must get into the hands of the engineers who execute these processes. That is the ideal.

The CFD developers and ‘‘expert’’ users can certainly contribute, but are only a part of the engineering process.

Getting CFD into "production" use is not trivial; it is frequently a multiyear process.

The CFD development process falls into five distinct phases:

  1. Phase I produces enabling-technology algorithms that provide a basic means of solving a given problem.

  2. Phase II is the initial exploration, validation, and demonstration of the new computational technology (the demo stage). Its main outputs are demonstrator codes, useful for computational experiments and demonstrations, combined with a vision of what is really needed.

  3. Phase III supplies the substance of that vision. It usually entails a generalization or other modification of the Phase II codes (perhaps a complete rewrite), coupled with front- and back-end interfaces, to produce user-friendly, well-understood, and maintainable software. Users have yet to gain enough confidence to make important, standalone decisions based on the code. That takes time, exposure, and experience.

  4. Phase IV involves "applications research": design engineers, managers, and code developers work together to learn how the new capability will enter into and change the aerodynamic design process. This is where the software takes hold in practice.

  5. Phase V is the mature capability. A code usually takes considerable time to reach Phase V maturity.

Forrester T. Johnson *, Edward N. Tinoco, N. Jong Yu

Received 1 June 2004; accepted 18 June 2004. Available online 26 February 2005.

Abstract

Over the last 30 years, Boeing has developed, manufactured, sold, and supported hundreds of billions of dollars worth of commercial airplanes. During this period, it has been absolutely essential that Boeing aerodynamicists have access to tools that accurately predict and confirm vehicle flight characteristics. Thirty years ago, these tools consisted almost entirely of analytic approximation methods, wind tunnel tests, and flight tests. With the development of increasingly powerful computers, numerical simulations of various approximations to the Navier–Stokes equations began supplementing these tools. Collectively, these numerical simulation methods became known as Computational Fluid Dynamics (CFD). This paper describes the chronology and issues related to the acquisition, development, and use of CFD at Boeing Commercial Airplanes in Seattle. In particular, it describes the evolution of CFD from a curiosity to a full partner with established tools in the design of cost-effective and high-performing commercial transports.


Contents

  1. Introduction
  2. The role and value of CFD
  3. The CFD development and application process
  4. Chronology of CFD capability and use
     4.1. Linear potential flow
       4.1.1. First generation methods––early codes
       4.1.2. First generation methods––TA230
       4.1.3. Second generation linear potential flow method––PANAIR/A502
     4.2. Full potential/coupled boundary layer methods
       4.2.1. A488/A411
       4.2.2. TRANAIR
       4.2.3. BLWF
     4.3. Euler/coupled boundary layer methods
     4.4. Navier–Stokes methods
       4.4.1. Structure grid codes––Zeus TLNS3D/CFL3D, OVERFLOW
       4.4.2. Unstructured grid codes––Fluent, NSU2D/3D, CFD++
       4.4.3. Other Navier–Stokes codes
       4.4.4. Next generation Navier–Stokes codes
     4.5. Design and optimization methods
       4.5.1. A555, A619 inverse design codes
       4.5.2. TRANAIR optimization
  5. Conclusions
  References

  1. Introduction

In 1973, an estimated 100–200 computer runs simulating flows about vehicles were made at Boeing Commercial Airplanes, Seattle. In 2002, more than 20,000 CFD cases were run to completion. Moreover, these cases involved physics and geometries of far greater complexity. Many factors were responsible for such a dramatic increase: (1) CFD is now acknowledged to provide substantial value and has created a paradigm shift in the vehicle design, analysis, and support processes; (2) the CFD effort at Boeing was led by a strong and capable visionary, Dr. Paul Rubbert, who recruited and was supported by the services of a number of talented managers and technical people; (3) this CFD effort was well diversified, involving algorithm research, code development, application and validation studies, process improvement, and user support; (4) Boeing developed a broad line of products, supported by a number of innovative and demanding project engineers; (5) computing power and affordability improved by three to four orders of magnitude; (6) numerous pioneers in academia and the Government continued to make algorithmic breakthroughs; and (7) there were funding managers in Boeing and the Government who were not averse to taking risks.

It would be impossible to adequately address all these factors in this short paper. Consequently, we will concentrate on issues that were central to the efforts of the authors, who have been members of the CFD Development and Applications groups at Boeing, Seattle for more than 30 years. In Section 2, we describe the role and value of CFD as it has evolved over the last 30 years and as it may possibly evolve in the future. In Section 3, we describe the CFD development and application processes. In Section 4, we lay out a brief history of the codes and methods that were most heavily used at Boeing, Seattle, as well as some of the issues that lay behind their development. In Section 5, we draw some brief conclusions.

Finally, we note that CFD has had a long and distinguished history in many other parts of the Boeing Enterprise. That history would best be related by those intimately involved.

  2. The role and value of CFD

The application of CFD today has revolutionized the process of aerodynamic design. CFD has joined the wind tunnel and flight test as primary tools of the trade [1–4]. Each has its strengths and limitations. Because of the tremendous cost involved in flight testing, modern aircraft development must focus instead on the use of CFD and the wind tunnel. The wind tunnel has the advantage of dealing with a ‘‘real’’ fluid and can produce global data over a far greater range of the flight envelope than can CFD. It is best suited for validation and database building within acceptable limits of a development program's cost and schedule. Historically, CFD has been considered unsuited for such a task. However, the wind tunnel typically does not produce data at flight Reynolds number, is subject to significant wall and mounting system corrections, and is not well suited to provide flow details. The strength of CFD is its ability to inexpensively produce a small number of simulations leading to understanding necessary for design. Of great utility in this connection is the fact that CFD can be used in an ‘‘inverse design’’ or optimization mode, predicting the necessary geometry shape changes to optimize certain flow characteristics or a payoff function (e.g., drag). Beyond this, CFD is heavily used to provide corrections for the extrapolation of data acquired experimentally (typically from testing a reduced scale model of the vehicle in a wind tunnel) to conditions that characterize the full-scale flight vehicle. Finally, CFD is used to provide understanding and insight as to the source of undesirable flight characteristics, whether they are observed in subscale model testing or in the full-scale configuration.

Effective use of CFD is a key ingredient in the successful design of modern commercial aircraft. The combined pressures of market competitiveness, dedication to the highest of safety standards, and desire to remain a profitable business enterprise all contribute to make intelligent, extensive, and careful use of CFD a major strategy for product development at Boeing.

Experience to date at Boeing Commercial Airplanes has shown that CFD has had its greatest effect in the aerodynamic design of the high-speed cruise configuration of a transport aircraft. The advances in computing technology over the years have allowed CFD methods to affect the solution of problems of greater and greater relevance to aircraft design, as illustrated in Figs. 1 and 2. Use of these methods allowed a more thorough aerodynamic design earlier in the development process, permitting greater concentration on operational and safety-related features.

The 777, being a new design, allowed designers substantial freedom to exploit the advances in CFD and aerodynamics. High-speed cruise wing design and propulsion/airframe integration consumed the bulk of the CFD applications. Many other features of the aircraft design were influenced by CFD. For example, CFD was instrumental in design of the fuselage. Once the body diameter was settled, CFD was used to design the cab. No further changes were necessary as a result of wind tunnel testing. In fact, the need for wind tunnel testing in future cab design was eliminated. Here, CFD augmented wind tunnel testing for aft body and wing/body fairing shape design. In a similar fashion, CFD augmented wind tunnel testing for the design of the flap support fairings. The wind tunnel was used to assess the resulting drag characteristics. CFD was used to identify prime locations for static source, sideslip ports, and angle-of-attack vanes for the air data system. CFD was used for design of the environmental control system (ECS) inlet and exhaust ports and to plan an unusual wind tunnel evaluation of the inlet. The cabin (pressurization) outflow valves were positioned with CFD. Although still in its infancy with respect to high-lift design, CFD did provide insight to high-lift concepts and was used to assess planform effects. The bulk of the high-lift design work, however, was done in the wind tunnel [5]. Another collaboration between the wind tunnel and CFD involved the use of CFD to determine and refine the corrections applied to the experimental data due to the presence of the wind tunnel walls and model mounting system.

The Next Generation 737-700/600/800/900 (illustrated in Fig. 2), being a derivative of earlier 737s, presented a much more constrained design problem. Again the bulk of the CFD focused on cruise wing design and engine/airframe integration. Although the wing was new, its design was still constrained by the existing wing-body intersection and by the need to maintain manual control of the ailerons in case of a complete hydraulic failure. As with the 777, CFD was used in conjunction with the wind tunnel in the design of the wing-body fairing, modifications to the aft body, and design of the flap track fairings and the high-lift system.

Boeing Commercial Airplanes has leveraged academia- and NASA-developed CFD technology, some developed under contract by Boeing Commercial Airplanes, into engineering tools used in new airplane development. As a result of the use of these CFD tools, the number of wings designed and wind tunnel tested for high-speed cruise lines definition during an airplane development program has steadily decreased (Fig. 3). In recent years, the number of wings designed and tested is more a function of changing requirements during the development program and the need to support more extensive aerodynamic/structural trade studies during development. These advances in developing and using CFD tools for commercial airplane development have saved Boeing tens of millions of dollars over the past 20 years. However, as significant as these savings are, they are only a small fraction of the value CFD delivered to the company.

A much greater value of CFD in the commercial arena is the added value of the product (the airplane) due to the use of CFD. Value to the airline customer is what sells airplanes! Value is added to the airplane product by achieving design solutions that are otherwise unreachable during the fast-paced development of a new airplane. Value is added by shortening the design development process. Time to market is critical in the commercial world, particularly when starting after a competitor has committed a similar product to market. Very important in the commercial world is getting it right the first time. No prototypes are built. From first flight to revenue service is frequently less than one year! Any deficiencies discovered during flight test must be rectified sufficiently for government certification and acceptance by the airline customer based on a schedule set years before. Any delays in meeting this schedule may result in substantial penalties and jeopardize future market success. The added value to the airplane product will produce increased sales and may even open up completely new markets. The result is more profit to both the buyer and seller (who does not have to discount the product as much to make the sale). All this translates into greater market share.

CFD will continue to see an ever-increasing role in the aircraft development process as long as it continues to add value to the product from the customer's point of view. CFD has improved the quality of aerodynamic design, but has not yet had much effect on the rest of the overall airplane development process, as illustrated in Fig. 4. CFD is now becoming more interdisciplinary, helping provide closer ties between aerodynamics, structures, propulsion, and flight controls. This will be the key to more concurrent engineering, in which various disciplines will be able to work more in parallel rather than in the sequential manner as is today's practice. The savings due to reduced development flow time can be enormous!

To be able to use CFD in these multidisciplinary roles, considerable progress in algorithm and hardware technology is still necessary. Flight conditions of interest are frequently characterized by large regions of separated flows. For example, such flows are encountered on transports at low speed with deployed high-lift devices, at their structural design load conditions, or when transports are subjected to in-flight upsets that expose them to speed and/or angle of attack conditions outside the envelope of normal flight conditions. Such flows can only be simulated using the Navier–Stokes equations. Routine use of CFD based on Navier–Stokes formulations will require further improvements in turbulence models, algorithm, and hardware performance. Improvements in geometry and grid generation to handle complexity such as high-lift slats and flaps, deployed spoilers, deflected control surfaces, and so on, will also be necessary. However, improvements in CFD alone will not be enough. The process of aircraft development, itself, will have to change to take advantage of the new CFD capabilities.

  3. The CFD development and application process

In industry, CFD has no value of its own. The only way CFD can deliver value is for it to affect the product. To affect the product, it must become an integral part of the engineering process for the design, manufacture, and support of the product. Otherwise, CFD is just an add-on; it may have some value but its effect is limited. To make CFD an integral part of the Product Development and Support engineering processes, it must get into the hands of the engineers who execute these processes. This is the only way the volume of analysis/design runs necessary to affect the product can be made. Moreover, it is in the Product Development and Support organizations that ownership of the CFD/engineering processes resides, and it is these processes that management relies on when investing billions of dollars in a new airplane development. The CFD developers and ‘‘expert’’ users can certainly contribute, but are only a part of the engineering process.

Getting CFD into ‘‘production’’ use is not trivial––it is frequently a multiyear process. There are five distinct phases in the CFD development process. These are illustrated in Fig. 5.

Phase I produces enabling technology algorithms that provide a basic means for solving a given problem. Phase II, which overlaps Phase I, constitutes the initial attempts to explore, validate, and demonstrate a new computational technology. There are some limited pioneering applications at this stage, but the emerging technology is not yet at a state that will produce significant payoff or impact because the technology is still subject to surprise. Hence, managers and design engineers are unwilling at this point to make important, standalone design decisions based on computed results. Such decisions by users do not happen until well into Phase IV.

Many of the code developments end in the middle of Phase II with a contractor report or scientific paper that proclaims, ‘‘Gee whiz, look what can be done.’’ For many codes, this is a good and natural transfer point for industry to assume responsibility for further development, because most of what must occur beyond this point will be unique to the particular needs of each individual industry organization. Of course, this implies that corporate managers must have the wisdom to understand what they must support to turn such a code into a mature and effective capability that will live up to the ‘‘Gee whiz’’ expectations. That requires the time and investment associated with Phases III and IV.

The main outputs of Phase II are demonstrator codes (useful for computational experiments and demonstrations) combined with a vision of what is really needed. Phase III is aimed at supplying the substance of that vision and usually entails a generalization or other modification of Phase II codes (perhaps complete rewrites) combined with a coupling of front- and back-end interfaces to produce user-friendly, well-understood, and maintainable software. Most commercially available (COTS) codes have reached this stage of development. But even at this stage, their contribution or effect on the corporate bottom line is still minimal because engineers and managers don't yet understand how the existence of this new tool will change the engineering process and what it will be used for. They have yet to gain enough confidence to make important, standalone decisions based on the code. That takes time, exposure, and experience.

In the fourth phase, the payoff or effect of a code grows rapidly. Phase IV entails ‘‘applications research,’’ where design engineers, management, and code developers work together to learn how this new capability will enter into and change the aerodynamic design process. The applications research endeavor requires people with broad backgrounds who can ask the right questions of the algorithm researchers, and code developers who can intelligently question experimental data when test-theory comparisons don't agree. Both must also be good physicists, for it is not unusual to find that the shortcomings lie neither in the experiment nor in the quality of the computations, but in the fact that the theoretical model assumed in the computations was not an adequate description of the real physics. Needs for code refinements that were not anticipated invariably surface during this phase, and these refinements often require more algorithm research, additional geometry preprocessors, and so on. Over time, the requests for additions or refinements diminish until the code settles down to occupy its proper niche in the toolbox, and design engineers and managers have learned the capabilities, limitations, and proper applications of this now-mature code. Without the investments in Phase IV, the enormous payoff of having a mature capability in Phase V will not happen. An attempt to bypass Phase IV by taking a code developed by algorithm researchers and placing it directly in the hands of design engineers, who may not understand the underlying theoretical models, algorithms, and possible numerical idiosyncrasies, usually results in a prolonged period of frustration and unreliability that leads to abandonment of the code.

Product Development engineers must be able to focus on engineering processes and have little time for manipulating the CFD ‘‘process’’ (i.e., codes must be very user oriented). Stable, packaged software solutions enable and promote consistent processes. These not only put CFD into the hands of the Product Development/Product Support engineers but also allow the ‘‘expert’’ user to get fast results with reduced variation. Integrated packaged software solutions combine various components to go from ‘‘lofts to plots’’ in the time scale consistent with a fast-paced engineering program. These packages include scripted packages for ‘‘standard’’ configurations, geometry and grid/paneling generation components, flow solvers, and postprocessing components for analyzing the results. These are all placed under some form of software version control to maintain consistency.

A key component of CFD and most engineering processes is geometry. CAD systems, such as CATIA, dominate most geometry engineering needs. However, these systems are designed for component design and definition and are not well suited to CFD use. A key component of many Boeing Commercial Airplanes CFD processes is AGPS––Aero Grid and Paneling System [6]. AGPS is a geometry software tool implemented as a programming language with an interactive graphical user interface. It can be dynamically configured to create a tailored geometry environment for specific tasks. AGPS is used to create, manipulate, interrogate, or visualize geometry of any type. Since its first release in 1983, AGPS has been applied with great success within The Boeing Company to a wide variety of engineering analysis tasks, such as CFD and structural analysis, in addition to other geometry-related tasks.

Computing resources consisting of high-end computing and graphics workstations must also be integrated. Seamless mass data storage must be available to store the vast amount of information that will be generated during the engineering application. These resources require dedicated computing system administration. The software control and computing system administration are necessary to free the engineers to focus their work on the engineering processes and not be consumed by the ‘‘computing’’ process.

Close customer involvement and acceptance is absolutely essential to deriving value from CFD. Customers are responsible for implementing the engineering process that will use CFD. They own the process, they determine what CFD, if any, they will depend on to carry out their assigned tasks. They are being graded on the engineering tasks they accomplish not on which CFD codes they use. Their use and trust of CFD is based on a long-term relationship between supplier and user. This relationship has engaged the customer early on in demonstrations of a new code or new application of an existing code. Validation is an on-going process, first of cases of interest to the customer, and then of the customer's ability to implement the new tool. Frequently, parallel applications are undertaken in which the customer continues with the existing tools while the supplier/developer duplicates the process with the new tool. This is especially the case when the new tool may enable the development of an entirely new process for executing the engineering task.

The long-term relationship with the customer is essential from another point of view. Until recently, project engineers, without exception, initially rejected every new CFD development that later became the primary CFD analysis and design tool in Boeing Commercial Airplanes Product Development and Product Support organizations. Every new or proposed CFD capability was initially viewed as too difficult to use, too costly to run, not able to produce timely results, not needed, and so on. ‘‘Just fix what we already have,’’ the customer would tell the developers. The customers had a point. Not until the new CFD technology had been integrated with the customer's preprocessing/postprocessing tools and computing system, validated to the customer's program, guaranteed of long-term support, and committed to continuous development and enhancement would the new technology be useful to them.

This made it difficult for the developers to propose new Phase I, II and III efforts. In particular, the initiation and continual defense of Phase I efforts demanded clear and unwavering vision. True vision invariably requires a fundamental understanding of both needs and means. As customers generally did not have the specialized algorithmic knowledge underlying CFD numerics, it was incumbent on the developers to acquire a thorough understanding of customer needs and concerns. The developers learned they could not just throw a new CFD tool over the fence and expect the customer to use it no matter how good it might be. The customer was interested in getting an engineering job done and not in the CFD tool itself! The process of thoroughly understanding customer issues took many years, and early Phase I, II, and III efforts were mostly ‘‘technology push’’ efforts, which had to be funded by NASA or other Government agencies. As these efforts progressed to Phase IV and V, and the developers established a track record for producing useful capabilities, the situation gradually changed.

Each success allowed the developers a little more leeway. Often they spotted ‘‘niche’’ needs that could be satisfied by the introduction of their new technology. It was felt that when the users were satisfied with the usability and utility of the technology in these areas they would then be willing to consider whether or not replacing their old tools in other areas might offer distinct advantages. Once the users accepted a new capability, they often became very innovative and applied the codes in unanticipated ways, perpetually keeping the developers and validation experts in an anxious state. Most of the new applications were, in fact, legitimate, and the developers had to run fast to understand the implications involved as well as to try and anticipate future application directions. As time went on, code developers, application experts, and project engineers began understanding each other's functions and issues, and a certain amount of trust developed. Gradually, CFD became a ‘‘pull’’ rather than ‘‘push’’ technology. This transformation was greatly facilitated by the rotation of top engineers between these functions.

Today in Boeing Commercial Airplanes, more than 20,000 CFD runs a year are made to support product development and the various existing product lines. More than 90% of these runs are done by production engineers outside the research group. The CFD methods in use provide timely results in hours or days, not weeks or months. Sufficient experience with the methods has given management confidence in their results. This means that solutions are believable without further comparison of known results with experiment, that the CFD methods contain enough of the right physics and resolve the important physical and geometric length scales, that the numerics of the method are accurate and reliable, and that the CFD tools are already in place––for there is no time to develop and validate new methods. Most of all, management is convinced that the use of CFD makes economic sense. A look at the history of CFD at Boeing Commercial Airplanes will show how we got to this level of use.

  4. Chronology of CFD capability and use

CFD today covers a wide range of capabilities in terms of flow physics and geometric complexity. The most general mathematical description of the flow physics relevant to a commercial transport is provided by the Navier–Stokes equations. These equations state the laws of conservation of mass, momentum, and energy of a fluid in thermodynamic equilibrium. Unfortunately, direct solutions to these equations for practical aircraft configurations at typical flight conditions are well beyond the capabilities of today's computers. Such flows include chaotic, turbulent motions over a very wide range of length scales. Computations for the simulations of all scales of turbulence would require solving for on the order of 10¹⁸ degrees of freedom!

Fortunately, solutions to simplified (and more tractable) forms of these equations are still of great engineering value. Turbulent flows may be simulated by the Reynolds equations, in which statistical averages are used to describe details of the turbulence. Closure requires the development of turbulence models, which tend to be adequate for the particular and rather restrictive classes of flow for which empirical correlations are available, but which may not be currently capable of reliably predicting behavior of the more complex flows that are generally of interest to the aerodynamicist. Use of turbulence models leads to various forms of what are called the Reynolds-averaged Navier–Stokes equations.

For many aerodynamic design applications, the flow equations are further simplified to make them more amenable to solution. Neglecting viscosity leads to the Euler equations for the conservation of mass, momentum, and energy of an inviscid fluid. Fortunately, under many flight conditions the effects of viscosity are small and can be ignored or simulated by the addition of the boundary layer equations, a much simplified form of the Reynolds-averaged Navier–Stokes equations.

The introduction of a velocity potential reduces the need to solve five nonlinear partial differential equations (that make up the Euler equations) to the solution of a single nonlinear partial differential equation known as the full potential equation. However, the potential approximation assumes an inviscid, irrotational, isentropic (constant entropy) flow. Potential solutions can adequately simulate shock waves as long as they are weak, which is the normal case for commercial transport configurations.

Further simplifications eliminate all the nonlinear terms in the potential equation, resulting in the Prandtl–Glauert equation for linear compressible flows, or the Laplace equation for incompressible flows. The use of these equations is formally justified when the vehicle is relatively slender or thin and produces only small disturbances from freestream flow.
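
In compact form, the hierarchy sketched in the last few paragraphs runs roughly as follows (standard textbook statements consistent with the paper's description, not equations reproduced from it). The full potential equation is

    \nabla \cdot (\rho\,\nabla\Phi) = 0, \qquad
    \frac{\rho}{\rho_\infty} = \left[ 1 + \frac{\gamma - 1}{2}\,M_\infty^2 \left( 1 - \frac{|\nabla\Phi|^2}{U_\infty^2} \right) \right]^{\frac{1}{\gamma - 1}};

linearizing about the freestream gives the Prandtl–Glauert equation

    (1 - M_\infty^2)\,\phi_{xx} + \phi_{yy} + \phi_{zz} = 0,

and setting M_\infty = 0 reduces it to the Laplace equation \phi_{xx} + \phi_{yy} + \phi_{zz} = 0 for incompressible flow.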

In the following sections, we describe the CFD capability most heavily used at Boeing Commercial Airplanes in Seattle over the last 30 years. For the purposes of a rough chronological summary, we can say the following. Before 1973, the main codes employed by project engineers involved linearized supersonic flows with linearized representations of the geometry or else 2D incompressible flows. From 1973 to 1983, panel methods, which could model complex geometries in the presence of linear subsonic and supersonic flows, took center stage. The nonlinear potential flow/coupled boundary layer codes achieved their prime from 1983 to 1993. Their Euler counterparts came into use later in that timeframe. From 1993 to 2003, Reynolds averaged Navier–Stokes codes began to be used with increasing frequency. Clearly, much of the development and demonstration work leading to the widespread use of these codes occurred from five to 10 years earlier than these dates. It is important to note that a considerable length of time is often required for a code to achieve the Phase V level of maturity. It is also important to realize that once a code achieves this level of maturity and is in use and accepted by the user community, it tends to remain in use, even though improved capability at the Phase III or IV level may be available.

The Boeing panel code, A502, remains in some use today, even though its underlying technology was developed almost 30 years ago. The full potential code TRANAIR still receives widespread and heavy use.

4.1. Linear potential flow

4.1.1. First generation methods––early codes

The flow physics described by the early linear methods were greatly simplified compared to the ‘‘real’’ flow. Similarly, the geometric fidelity of the actual configuration also had to be greatly simplified for the computational analysis to fit within the speed and size constraints of the computers of that era. In spite of such seemingly hopeless limitations, these early CFD methods were successfully applied during the supersonic transport development programs of the late 1960s––the Anglo-French Concorde and the United States/Boeing SST. The need for computational help in the aerodynamic development of these aircraft stemmed from two factors. First, there was the relative lack of experience in designing supersonic cruise aircraft (the first supersonic flight had occurred only 15 years earlier). Second, there is great sensitivity of supersonic wave drag to details of the aircraft design. Thus, the challenge of developing a viable low-drag design through empirical ‘‘cut and try’’ demanded whatever computational help was available. The opportunity to use simplified computational methods resulted because the design requirements for low supersonic wave drag led to thin, slender vehicles that minimized ‘‘perturbing’’ the airflow. These characteristics were consistent with the limitations of the linearized supersonic theory embedded in the early CFD codes. These codes included TA80 [7], a Supersonic Area Rule Code based on slender body theory; TA139/201 [8], a Mach Box Code based on linearized supersonic theory; and TA176/217 [9], a Wing-Body Code based on linear potential flow theory with linearized geometry representations. These codes ran on IBM7094 machines. The good agreement with test data predicted by these linear theory methods for a drag polar of the Boeing SST model 733-290 is shown in Fig. 6. This was a linear theory optimized design of the configuration that allowed Boeing to win the SST design development Government contract. The resulting supersonic transport designs ended up looking as they did, in part, because the early CFD codes could not handle more geometrically complex configurations.

The linear aerodynamics of the Wing-Body Code was later combined with linear structural and dynamic analysis methods in the FLEXSTAB [10] system for the evaluation of static and dynamic stability, trim state, inertial and aerodynamic loading, and elastic deformations of aircraft configurations at supersonic and subsonic speeds. This system was composed of a group of 14 individual computer programs that could be linked by tape or disk data transfer. The system was designed to operate on CDC-6000 and -7000 series computers and on the IBM 360/370 computers. A very successful early application of FLEXSTAB was the aeroelastic analysis of the Lockheed YF-12A as part of the NASA Flight Loads program. Thirty-two flight test conditions ranging from Mach 0.80 to 3.0 and involving hot or cold structures and different fuel loading conditions were analyzed at several load factors [11].

4.1.2. First generation methods––TA230

By 1973, 3D subsonic panel methods were beginning to affect the design and analysis of aircraft configurations at Boeing. Subsonic panel methods had their origins with the introduction of the Douglas Neumann program in 1962 [12]. This program was spectacularly successful for its time in solving the 3D incompressible linear potential flow (Laplace) equation about complex configurations using solid wall (Neumann) boundary conditions. The numerical method represented the boundary by constant strength source panels with the strengths determined by an influence coefficient equation set relating the velocities induced by the source panels to the boundary conditions. The lack of provision for doublet panels limited the class of solutions to those without potential jumps and hence without lift. One of the first computer programs for attacking arbitrary potential flow problems with Neumann boundary conditions [13,14] combined the source panel scheme of the Douglas Neumann program with variations of the vortex lattice technique [15]. This program became known as the Boeing TA230 program. A very useful feature of this program was the ability to handle, in a logical fashion, any well-posed Neumann boundary value problem. From its inception, the method employed a building block approach wherein the influence coefficient equation set for a complex problem was constructed by simply assembling networks appropriate to the boundary value problem. A network was viewed as a paneled surface segment on which a source or doublet distribution was defined, accompanied by a properly posed set of Neumann boundary conditions. The surface segment could be oriented arbitrarily in space and the boundary conditions could be exact or linearized. Several doublet network types with differing singularity degrees of freedom were available to simulate a variety of physical phenomena producing discontinuities in potential. Compressibility effects were handled through scaling. These features combined to allow the analysis of configurations having thin or thick wings, bodies, nacelles, empennage, flaps, wakes, efflux tubes, barriers, free surfaces, interior ducts, fans, and so on.
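
To make the influence-coefficient idea concrete, here is a minimal, self-contained Python sketch of a 2D constant-strength source panel system with Neumann (flow-tangency) boundary conditions. It is only an illustration of the kind of linear system such panel codes assemble, not the TA230 algorithm itself: the off-diagonal influences are approximated by point sources at panel midpoints, the lifting (doublet/vortex-lattice) part is omitted, and all names are hypothetical.

import numpy as np

def solve_source_panels(n_panels=60, v_inf=(1.0, 0.0)):
    theta = np.linspace(0.0, 2.0 * np.pi, n_panels + 1)
    xy = np.c_[np.cos(theta), np.sin(theta)]             # panel end points on a unit circle
    mid = 0.5 * (xy[:-1] + xy[1:])                        # control points at panel midpoints
    tangent = xy[1:] - xy[:-1]
    length = np.linalg.norm(tangent, axis=1)
    normal = np.c_[tangent[:, 1], -tangent[:, 0]] / length[:, None]   # outward unit normals

    A = np.zeros((n_panels, n_panels))
    for i in range(n_panels):
        for j in range(n_panels):
            if i == j:
                A[i, j] = 0.5                             # self-induced normal velocity of a source panel
            else:
                r = mid[i] - mid[j]
                # off-diagonal term: panel j approximated by a point source at its midpoint
                A[i, j] = length[j] * (r @ normal[i]) / (2.0 * np.pi * (r @ r))
    b = -(np.asarray(v_inf) @ normal.T)                   # cancel the free-stream normal velocity
    sigma = np.linalg.solve(A, b)                         # influence coefficient equation set
    return sigma

sigma = solve_source_panels()
print("sum of source strengths (about zero for a closed body):", sigma.sum())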

By 1973, Boeing had acquired a CDC 6600 for scientific computing, which allowed the TA230 program to solve problems involving hundreds of panels. This was sufficient to model full configurations with the fidelity necessary to understand component interactions.

One of the most impressive early uses of the TA230 code was in the initial design phase of the B747 Space Shuttle Carrier Aircraft (SCA). The purpose of the initial design phase was to define the modifications needed to accomplish the following missions: to ferry the Space Shuttle Orbiter; to air-launch the Orbiter; and to ferry the external fuel tank. To keep the cost of the program to a minimum, CFD was extensively used to investigate the Orbiter attitude during the ferry mission, the Orbiter trajectory and attitude during the launch test, and the external tank location and attitude during the ferry mission. At the conclusion of the design phase, the final configurations selected were tested in the wind tunnel to verify predictions. A typical example of a paneling scheme of the B747 with the Space Shuttle Orbiter is depicted in Fig. 7. In this example, the Orbiter incidence angle was 8 deg with respect to the B747 reference plane. The predicted lift coefficient, CL, as a function of wing angle of attack for this configuration is shown in Fig. 8. The agreement between the analyses and wind tunnel data shown in this figure is excellent.

TA230 was used with TA378 [16], a 3D Vortex Lattice Method with design/optimization capability, to develop winglets for a KC-135 aircraft. Wind tunnel tests confirmed a 7–8% reduction in airplane drag due to the installation of these winglets [17].

Another early CFD success was the improvement of the understanding of the interference drag of a pylon-mounted engine nacelle under the wing. The existence of unwanted interference drag had been revealed by wind tunnel testing, but the physical mechanism of the interference was still unknown. To avoid the interference drag, it is common practice to move the engine away from the wing. The resulting additional weight and drag due to the longer engine strut must be balanced against the potential interference drag if the engine is moved closer to the wing. CFD studies with TA230, along with specialized wind tunnel testing in the mid-1970s, provided the necessary insight into the flow mechanism responsible for the interference. This understanding led to the development of design guidelines that allowed closer coupling of the nacelle to the wing [18]. The Boeing 757, 767, 777, 737-300/400/500 series, Next Generation 737/600/700/800/900 series, and the KC-135R are all examples of aircraft where very closely coupled nacelle installations were achieved without incurring a significant drag penalty.

4.1.3. Second generation linear potential flow method––PANAIR/A502

The success of the TA230 code in modeling complete vehicle configurations and component interactions created a strong demand among Boeing aerodynamicists for CFD analyses and was undoubtedly the key factor that initiated the paradigm shift toward acceptance of CFD as an equal partner to the wind tunnel and flight test in the analysis and design of commercial aircraft. However, the paradigm shift was slowed by the fact that the code had to be run by experts possessing specialized knowledge, some of which was totally unrelated to aerodynamics. In fact, it often took weeks, requiring the expertise of an engineer having months or years of experience with the method, to set up and run a complex configuration. To some extent this was unavoidable; to correctly model a complex flow for which no previous user experience was available, the engineer had to understand the mathematical properties and limitations of potential flow. Nevertheless, once the boundary value problem was formulated, the user still had to contend with certain numerical idiosyncrasies and inefficiencies that required adherence to stringent paneling rules, frequently incompatible with the complex geometrical contours and rapidly changing aerodynamic length scales of the vehicle under analysis. Such difficulties were directly related to the use of flat panels with constant source and doublet strengths. Methods employing these features were quite sensitive to panel layout. Numerical problems arose when panel shapes and sizes varied, and fine paneling in regions of rapid flow variations often forced fine paneling elsewhere. In addition, excessive numbers of panels were often required since numerical accuracy was strongly affected by local curvature and singularity strength gradient. These problems placed severe limitations on the development of automatic panelers and other complementary aids aimed at relieving the user of the large amount of handwork and judgments associated with producing accurate numerical solutions.

Consequently, a method was developed under contract to NASA to enhance practical usability by improving upon the flat, constant singularity strength panels employed in the construction of networks [19]. This method featured the use of curved panels and higher order distributions of singularities. Source and doublet strengths were defined by least square fits of linear and quadratic splines to discrete values located at specific points on the networks. Higher order influence coefficients were obtained using recursion relations with the standard low order coefficients as initial conditions. Boundary conditions were enforced at the same or other discrete locations depending on their type. Virtually any boundary condition that made sense mathematically was provided for. In particular, the incorporation of Dirichlet boundary conditions not only offered the opportunity to design surface segments to achieve desired pressure distributions, but also clarified the nature of the boundary value problem associated with modeling viscous wakes and propulsion effects. Robin boundary conditions provided for the modeling of slotted walls, which allowed for direct comparisons of CFD results with wind tunnel data. These features were incorporated in the NASA code known as PANAIR and the Boeing code known as A502. The latter code was generalized to treat supersonic flows [20], free vortex flows [21], and time harmonic flows [22]. In the supersonic case, upwinding was achieved by forward weighting the least square singularity spline fits.
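
As a rough illustration of the higher order singularity representation (and only that; the actual PANAIR/A502 spline construction is far more elaborate), the following Python fragment fits a local quadratic in two surface coordinates to discrete doublet values at neighboring points by least squares. The nine-point neighborhood, the sampled field, and all names are hypothetical.

import numpy as np

def local_quadratic_fit(points, values):
    """Least-squares fit of mu(x, y) ~ c0 + c1*x + c2*y + c3*x^2 + c4*x*y + c5*y^2."""
    x, y = points[:, 0], points[:, 1]
    basis = np.c_[np.ones_like(x), x, y, x * x, x * y, y * y]
    coeffs, *_ = np.linalg.lstsq(basis, values, rcond=None)
    return coeffs

# example: nine neighboring points with doublet values sampled from a smooth field
pts = np.array([[dx, dy] for dx in (-1.0, 0.0, 1.0) for dy in (-1.0, 0.0, 1.0)])
mu = 2.0 + 0.5 * pts[:, 0] - 0.3 * pts[:, 1] + 0.1 * pts[:, 0] ** 2
print(local_quadratic_fit(pts, mu))   # recovers [2.0, 0.5, -0.3, 0.1, 0.0, 0.0]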

The numerics incorporated into A502 solved a number of usability issues. Fig. 9 clearly demonstrates the relative insensitivity and stability of computed results to paneling. This insensitivity encouraged project users to apply the code and trust results. In addition, the boundary condition flexibility allowed users to experiment with various types of modeling, leading to a wide variety of applications never entirely envisioned by the developers.

The versatility of A502 paid off when a ‘‘surprise’’ was encountered during the precertification flight testing of the then-new 737-300. The aircraft was not demonstrating the preflight wind tunnel based prediction of take-off lift/drag ratio. A fix was needed quickly to meet certification and delivery schedules. Specialized flight testing was undertaken to find the cause and to fix the performance shortfall. A CFD study was immediately undertaken to enhance understanding and provide guidance to the flight program. Eighteen complete configuration analyses were carried out over a period of three months. These included different flap settings, wind tunnel and flight wing twist, flow through and powered nacelle simulations, free air and wind tunnel walls, ground effect, seal and slotted flaps, and other geometric variations [23]. These solutions explained and clarified the limitations of previous low-speed wind tunnel test techniques and provided guidance in recovering the performance shortfall through ‘‘tuning’’ of the flap settings during the flight testing. The aircraft was certified and delivered on schedule. A comparison of the computed L/D predictions with flight data is shown in Fig. 10.

A502 studies have been used to support other flight programs on a time-critical basis. In particular, the code was used to support engine/airframe installation studies in the early 1980s [24], to evaluate wind tunnel tare and interference effects, and to provide Mach blockage corrections for testing large models. In addition, the code was used for the design of the wingtip pod for the Navy E6-A, a version of the Boeing 707. No wind tunnel testing was done before flight. The FAA has accepted A502 analysis for certification of certain aircraft features that were shown to have minimal change from previously accepted standards. Finally, A502 was used to develop a skin waviness criterion and measurement technique that led to the virtual elimination of failed altimeter split testing during the first flight of every B747-400 aircraft coming off the production line. Initially, one of every three aircraft was failing this test, requiring several days of downtime to fix the problem. The A502-based procedure could identify excessive skin waviness before first flight and led to manufacturing improvements to eliminate the root cause of the problem.

A502 is still used today to provide quick estimates for preliminary design studies. A relatively new feature of the code takes advantage of available linear sensitivities to predict a large number of perturbations to stability and control characteristics and stability derivatives, including control surface sensitivities. Virtual control surface deflections and rotary dynamic derivatives are modeled through surface panel transpiration. Stability derivatives, such as the lift curve slope or directional stability, are calculated automatically. A typical application may involve 20 subcases submitted in a single run, with solutions available in an hour or so. Within the limitations of the code, all major stability and control derivatives can be generated in a single run (at a single Mach). The method is typically used to calculate increments between similar configurations. The code was recently used to calculate stability and control increments between a known baseline and a new configuration. A total of 2400 characteristics were computed for eight configurations by one engineer in a two-day period!
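
The ‘‘many subcases in a single run’’ workflow can be illustrated by a small, hypothetical Python sketch: the influence matrix is factored once, and each virtual control-surface deflection is represented by a crude transpiration increment to the normal-flow boundary condition, i.e., just another right-hand side against the same factorization. This is a schematic of the idea, not the A502 implementation; the transpiration model w ~ V_inf*delta, the stand-in matrices, and all names are assumptions.

import numpy as np
from scipy.linalg import lu_factor, lu_solve

def deflection_sensitivity(A, b_base, flap_panels, v_inf_mag=1.0, delta=np.radians(1.0)):
    lu_piv = lu_factor(A)                          # factor the influence matrix once
    b_pert = b_base.copy()
    b_pert[flap_panels] += v_inf_mag * delta       # crude transpiration increment, w ~ V_inf * delta
    sigma_base = lu_solve(lu_piv, b_base)          # baseline subcase
    sigma_pert = lu_solve(lu_piv, b_pert)          # perturbed subcase reuses the factorization
    return (sigma_pert - sigma_base) / delta       # sensitivity of panel strengths to deflection

# toy usage with a random, diagonally dominant stand-in influence matrix
rng = np.random.default_rng(2)
A = np.eye(40) + 0.01 * rng.standard_normal((40, 40))
dsigma = deflection_sensitivity(A, rng.standard_normal(40), flap_panels=np.arange(30, 40))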

4.2. Full potential/coupled boundary layer methods

4.2.1. A488/A411

Since Murman and Cole [25] introduced a numerical solution method for the transonic small disturbance equation in the early 1970s, computational fluid dynamics method development for nonlinear flows has progressed rapidly. Jameson and Caughey [26] formulated a fully conservative, rotated finite volume scheme to solve the full potential equation––the well-known FLO27/28 codes. The Boeing Company acquired the codes and invested a significant amount of effort to advance the capability from Phase II to Phase V. Convergence reliability and solution accuracy were enhanced. To allow transonic analyses over complex transport configurations, a numerical grid generation method based on Thompson's elliptic grid generation approach [27] was developed [28] and tested extensively for wing or nacelle alone, wing-body, and wing-body-strut-nacelle configurations. The potential flow solvers FLO27/28 coupled with the 3D finite difference boundary layer code A411 [29] and the 3D grid generation code formed the major elements of the Boeing transonic flow analysis system, A488––the most heavily used analysis code at Boeing from the late 1970s to the early 1990s. The production version of the A488 system, illustrated in Fig. 11, included a number of preprocessing and postprocessing programs that could handle the complete analysis process automatically for specific configuration topologies––a truly usable code for design engineers. This integrated package combined the various software components to go from ‘‘lofts to plots’’ in a time scale consistent with a fast-paced engineering program––overnight!

Fig. 12 shows a comparison of A488 results obtained by project engineers with wing pressure distributions measured in flight on a 737-300. The computational model consisted of the wing, body, strut, and nacelle. The wing definition included the estimated aeroelastic twist for the condition flown. Although the character of the pressure distribution on the wing changes dramatically across the span, the computational results agree reasonably well with the measured data.

The Boeing Propulsion organization also employed a full potential/coupled boundary layer code called P582. It was developed at Boeing and used a rectangular grid [30] and multigrid acceleration scheme [31]. P582 was used extensively for engine inlet simulation and design in the late 1970s and 1980s and is still used in the Propulsion organization for various nacelle inlet simulations.

4.2.2. TRANAIR

By 1983, complex configurations were routinely being analyzed by project engineers using panel methods. Surface geometry generation tools were maturing, and users took for granted the ability to add, move, or delete components at will; readily change boundary condition types; and obtain numerically accurate solutions at reasonable cost in a day or two. On the other hand, the nonlinear potential flow codes required expert users and considerable flow time to obtain converged and accurate results on new and nonstandard configurations. Often, geometrical simplifications had to be made, jeopardizing the validity of conclusions regarding component interactions. Clearly, the nonlinear nature of the flow was responsible for numerous difficulties. The development of shocks in the flowfield prolonged convergence, especially if the shocks were strong and prematurely set in the wrong location. Moreover, weak and double shocks were often not captured accurately, if at all. Boundary layer coupling contributed problems as well, especially as separation was approached. Often, the boundary layer displacement effect had to be fixed after a certain number of iterations, leading to questionable results. Experts became very good at circumventing many of these problems; however, the one problem that could not readily be overcome was the necessity to generate a volume grid to capture nonlinear effects.

Even today, volume grid generation is one of the main barriers to routine use of nonlinear codes. Often the creation of a suitable grid about a new complex configuration can take weeks, if not months. In the early 1980s, the situation was far worse, and suitable grids were readily available only for standard and relatively simple configurations. Because of the enormous promise demonstrated by existing nonlinear methods, the panel method developers at Boeing were awarded a contract from NASA to investigate alternatives to surface fitted grid generation. In the next few paragraphs, we describe some of the technical issues that arose during this contract. They are of interest to this paper in that they followed directly from a ‘‘needs and usability’’ starting point rather than the usual ‘‘technology discovery’’ starting point. To a large extent, this has characterized the CFD development efforts at Boeing.

The developers started with a rather naïve approach, i.e., take an A502 paneling, with which the project users were already familiar, and embed it in a uniform rectangular grid to capture nonlinear effects (Fig. 13). This approach logically led to a sequence of subproblems that had to be addressed in turn [32]. First, one could hardly afford to extend a uniform grid into the far field to ensure proper far field influence. However, if the flow was assumed to be linear outside a compact region enclosing the configuration, one could use linear methods to obtain the far field influence. A discrete Green's function for the Prandtl–Glauert equation was constructed, which incorporated the effect of downstream sources and sinks resulting from wakes. This Green's function was applied using FFTs and the doubling algorithm of Hockney [33], a standard technique in astrophysics. The net effect was the same as if the uniform grid extended all the way to infinity, the only approximation being the assumption of linearity outside a compact box. As a byproduct of this solution, the user no longer had to estimate a suitable far field stretching ratio.
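
The far-field treatment can be illustrated with a small Python sketch of Hockney's doubling idea: a free-space (aperiodic) convolution of compact sources with a Green's function, evaluated with FFTs on a grid doubled in each direction. Here the 2D Laplace kernel log(r)/(2*pi) stands in for the Prandtl–Glauert Green's function actually used, the singular self term is simply zeroed, and all names are hypothetical.

import numpy as np

def free_space_potential(source, h):
    """source[i, j] holds the integrated strength in cell (i, j) of an n-by-n grid with spacing h."""
    n = source.shape[0]
    m = 2 * n                                            # doubled grid removes periodic images
    idx = np.minimum(np.arange(m), m - np.arange(m))     # wrapped cell separations 0..n
    dx, dy = np.meshgrid(idx * h, idx * h, indexing="ij")
    r = np.hypot(dx, dy)
    kernel = np.where(r > 0, np.log(np.where(r > 0, r, 1.0)) / (2 * np.pi), 0.0)
    src = np.zeros((m, m))
    src[:n, :n] = source                                 # sources confined to the physical box
    phi = np.fft.irfft2(np.fft.rfft2(kernel) * np.fft.rfft2(src), s=(m, m))
    return phi[:n, :n]                                   # keep the physical part of the domain

phi = free_space_potential(np.pad([[1.0]], 31), h=0.1)   # a unit source centered in a 63x63 box
print(phi[51, 31], np.log(2.0) / (2 * np.pi))            # potential a distance 2.0 away vs. exact kernel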

The next problem that had to be addressed was how to handle the intersections of the grid with the paneling and how to apply boundary conditions. The developers decided to use a finite element approach based on the Bateman variational principle [34]. Upwinding was achieved by factoring the density at the centroid of the elements out of the stiffness integrals and then biasing it in an upwind direction. The elements intersecting the paneled boundary were assumed to have linear basis functions regardless of their shapes. Stiffness matrix integrals were then evaluated over the subset of the elements exposed to the flowfield. The integration was performed recursively using volume and then surface integration by parts. Additional surface integrals were added to impose the same variety of boundary conditions as available in A502.

The main problem with a uniform rectangular grid is its inability to capture local length scales of the geometry and flow. Consequently, grid refinement was an absolutely necessary feature of the approach. However, it was felt that solution adaptive grid refinement was necessary in any event to ensure accuracy, especially if the code was to be used by project engineers without the aid of the developers. The refinement mechanism was relatively straightforward: just divide each rectangular grid box into eight similar boxes (Fig. 14) and keep track of the refinement hierarchy using an efficient oct-tree data structure.
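
A minimal Python sketch of the bookkeeping just described: each box is either a leaf or owns eight children, and the hierarchy is a plain oct-tree. The error indicator is deliberately left as a user-supplied function; the one in the usage lines is a toy stand-in.

class OctreeBox:
    def __init__(self, center, half_width, level=0):
        self.center = center            # (x, y, z) of the box center
        self.half_width = half_width
        self.level = level
        self.children = None            # None for a leaf

    def refine(self, error_indicator, tol, max_level):
        """Recursively split boxes whose error indicator exceeds tol."""
        if self.level >= max_level or error_indicator(self) <= tol:
            return
        h = 0.5 * self.half_width
        cx, cy, cz = self.center
        self.children = [
            OctreeBox((cx + sx * h, cy + sy * h, cz + sz * h), h, self.level + 1)
            for sx in (-1, 1) for sy in (-1, 1) for sz in (-1, 1)
        ]
        for child in self.children:
            child.refine(error_indicator, tol, max_level)

    def leaves(self):
        if self.children is None:
            return [self]
        return [leaf for child in self.children for leaf in child.leaves()]

# toy usage: refine wherever a stand-in error indicator (here the box size) exceeds 0.1
root = OctreeBox((0.0, 0.0, 0.0), 1.0)
root.refine(lambda box: box.half_width, tol=0.1, max_level=4)
print(len(root.leaves()))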

Development of a suitable error indicator was another matter, however. Mathematical theory certainly offered guidance here, but a surprising amount of engineering knowledge had to be injected into the process. A typical ‘‘gotcha’’ with a pure mathematical approach was the tendency of the refinement algorithm to capture the precise details of a wing tip vortex all the way from the trailing edge to the end of a wind tunnel diffuser.

The existence of refined grids complicated the design of a solution algorithm. Multigrid methods were somewhat of a natural here, but the developers were partial to direct solvers, as they had turned out to be so flexible for the panel codes, especially when it came to implementing unusual boundary conditions and coupling boundary layer equations and unknowns. They adopted a damped Newton method approach, with the Jacobian solved using a preconditioned GMRES iterative algorithm. A sparse direct solver was used as a preconditioner. Even with nested dissection ordering, the cost and storage for a complete factorization was prohibitive, hence they settled on the use of an incomplete factorization employing a dynamic drop tolerance approach, whereby small fill-in elements were dropped as they were formed. The method was surprisingly efficient and robust. As a rule, decomposition of the Jacobian resulted in fill-in factors of less than two and constituted less than 10% of the total run cost, even for grids having more than a million nodes.
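
The solver strategy can be sketched in a few lines of Python using SciPy's sparse tools: a damped Newton iteration whose linear step is solved by GMRES, preconditioned with an incomplete factorization that drops small fill-in via a drop tolerance. This is a schematic of the approach, not the TRANAIR solver; the toy residual, the damping rule, and the tolerances are assumptions.

import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def damped_newton(residual, jacobian, x0, n_iter=20, damping=0.5, drop_tol=1e-4):
    x = x0.copy()
    for _ in range(n_iter):
        F = residual(x)
        if np.linalg.norm(F) < 1e-10:
            break
        J = sp.csc_matrix(jacobian(x))
        ilu = spla.spilu(J, drop_tol=drop_tol)                 # incomplete factorization, small fill-in dropped
        M = spla.LinearOperator(J.shape, matvec=ilu.solve)     # used as a preconditioner
        dx, info = spla.gmres(J, -F, M=M)
        x += (damping if info == 0 else 0.1) * dx              # damp the Newton update
    return x

# toy usage: solve F(x) = x^3 - 2 = 0 component-wise on a small "grid"
sol = damped_newton(lambda x: x**3 - 2.0,
                    lambda x: sp.diags(3.0 * x**2),
                    x0=np.ones(50), damping=1.0)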

Early versions of TRANAIR used the A411 boundary layer code in an indirectly coupled mode in much the same manner as A488. However, the desired convergence reliability was never achieved, and the shock boundary layer interaction model was occasionally suspect. About this time, Drela [35] developed an exceedingly accurate 2D integral boundary layer that he directly coupled with his 2D Euler solver. With Drela's help, the TRANAIR development team modified this boundary layer to incorporate sweep and taper effects and integrated it into the code. In this connection, the use of a direct solver was invaluable. The resultant code turned out to be very accurate for transport configurations and agreement with experiment was considered by project users to be quite remarkable.

As TRANAIR received increasing use, a number of enhancements were added. To model powered effects, regions of non-freestream but constant total temperature and pressure were simulated along with appropriate shear layer effects [36]. Far field drag calculations were added, which later led to the ability to perform aerodynamic optimization. Time harmonic capability was created for stability and control calculations. Aeroelastic effects were simulated by adding structural unknowns and equations to the system [37]. Here again the use of a sparse solver was invaluable.

Without question, the development of the TRANAIR code strongly benefited from the work and experiences of CFD pioneers such as Murman [25], Jameson [26], Hafez [38], Cebeci [39], McLean [29], Drela [35], and others. Nevertheless, about 10 major and 30 minor algorithms had to be developed or adapted. A few were quite far from the mainstream CFD efforts of the time and required considerable effort. It took almost five years of research and development before a truly useful result could be produced (1989). The TRANAIR code ultimately evolved into the Boeing workhorse aerodynamic code of the 1990s and up to the current time for analyzing flows about complex configurations. TRANAIR was heavily used in the design of the 777, the 737NG, and all subsequent modifications and derivatives to the Boeing Commercial Airplanes fleet. Since 1989, it has been run to completion more than 70,000 times on an enormously wide variety of configurations, some of which were not even vehicles. It has had about 90 users in Boeing. An older version of the code was used by NASA, the Air Force, the Navy, and General Aviation. In 2002, TRANAIR was run to completion at Boeing more than 15,000 times, which is considerable use for a complex geometry CFD code. If we had to choose one single technical feature of TRANAIR that was responsible for such widespread use, we would choose solution adaptive grid refinement. In retrospect, while this feature was intended to improve accuracy, its main benefit was to greatly relieve the user of the burdensome and labor-intensive task of generating a volume grid.

Even with substantially simplified gridding requirements, preparing the inputs for a general geometry CFD code and processing the outputs are still formidable tasks. An essential enabler for TRANAIR has been the development of a packaged process for inputting ‘‘standard’’ configurations. By ‘‘standard,’’ we mean those configuration types that have been scripted in the various components that make up the process. Configurations not included in the ‘‘standard’’ can still be analyzed but will not benefit from the same degree of automation. This package, illustrated in Fig. 15, is compatible with and takes advantage of common Boeing Commercial Airplanes processes for geometry and postprocessing. At the center of this process is the TRANAIR flow solver. AGPS scripts have been developed to automate the paneling of ‘‘standard’’ configurations from AGPS lofts. AGPS scripts have also been developed to generate the input deck for the TRANAIR solver. These inputs define the flight conditions, solution adaptive gridding strategy, and the boundary layer inputs for ‘‘standard’’ configurations. A UNIX script is available to generate the various job control files to execute the solver on several types of computers. The TRANAIR solver generates several files for restarts of the solver and output processor, output files for various aerodynamic parameters, and a file for flowfield parameters. A special-purpose code, compatible with the unique TRANAIR grid structure, is available to view the flowfield properties. The package enables setting up and submitting for solution a ‘‘standard’’ configuration from AGPS lofts in one or two hours. Complete solutions from ‘‘lofts to plots’’ are frequently available in less than 12 h. ‘‘Standard’’ configurations include, for example, four-engine 747-like aircraft with underwing struts and nacelles and vertical and horizontal stabilizers, with boundary layer modeling on both wing and body.

During the aerodynamic design of the Boeing 777 in the early 1990s, the risk of significant interference drag due to the exhaust from the large engines was revealed through TRANAIR analysis. Neither the earlier linear-based CFD methods nor conventional wind tunnel testing techniques, which did not simulate the exhaust, would have detected this potential problem. Only a very expensive powered-nacelle testing technique could assess these interference effects. Three different manufacturers' engines were being considered for the new aircraft. Using the powered testing technique to develop the engine installations would have added considerable expense. Moreover, such a wind tunnel based development would have required unacceptable design flow time. Nonlinear transonic TRANAIR analysis by the product development engineers made it practical to address these installation problems, including the effects of the engine exhaust flows, in a timely manner. Had these problems gone undetected until late in the aircraft's development, when the powered testing is usually done, any fixes would have been extremely expensive to implement.

Fig. 16 shows a comparison of TRANAIR results with test data from a similar configuration. TRANAIR's ability to provide insight into design changes allowed a close ‘‘Working Together’’ relationship between the various Boeing engineering disciplines and the engine manufacturers. It is noteworthy that the exhaust systems of all three engine models are very similar in design, a feature found only on the 777. Key to the success of this application was the ability to model enough of the relevant physics and to provide solutions quickly enough to support the development schedule. The effect of CFD on the project was to provide information facilitating a closer working relationship between design groups. This enabled detecting problems early in the development process, when fixing or avoiding them was least expensive.

TRANAIR continues to see extensive use as the primary tool for transonic aerodynamic evaluation and design of commercial aircraft configurations. It is well suited for analysis in the attached and mildly separated flow portion of the flight envelope. For conditions with strong viscous interactions, one must resort to using the Navier–Stokes equations.

4.2.3. BLWF

The BLWF code was developed by researchers at the Central Aerohydrodynamic Institute (TsAGI) and enhanced under contract with the Boeing Technology Research Center in Moscow, CIS [40]. It saw its first use at Boeing in 1994. The BLWF technology was very similar to the technology of the A488 system that had been developed internally at Boeing. However, it differed from A488 in that it had been designed and tuned for workstations and, later, PC computing systems, instead of the large vector supercomputers that had been the main computational modeling tool within Boeing Commercial Airplanes. The tool was very responsive, providing solutions within minutes, rather than hours. The rapidity of response, along with the significant cost-of-use reduction from hosting on less expensive hardware systems, changed the nature of use of the modeling tool. New applications, such as Reynolds number corrections for wing loads, have become feasible with such a tool. This application requires solutions for about a dozen Mach numbers over a range of angles of attack (five to 10). Use of BLWF allows a database of hundreds of solutions to be generated in a matter of a few hours, rather than days or weeks. The code has also been used extensively in the preliminary design stage of aircraft definition. At this point in the airplane development cycle, there are typically a large number of significant changes in the aircraft definition, along with a need to understand the behavior of the configuration over a large range of conditions. BLWF allows more realistic modeling of the flight characteristics than other Preliminary Design methods and also provides an ability to obtain the information rapidly, allowing more effective cycling of the preliminary design through the evolution of an aircraft.

4.3. Euler/coupled boundary layer methods

The use of full potential/boundary layer coupling codes reaches its limit in predicting airplane performance at off-design conditions, where significant shock-induced flow separations or vortex flows generated from sharp edges of the configuration occur in the flowfield. The boundary layer approximation breaks down, and the irrotational/isentropic flow assumption is not a good approximation for such flow conditions. Moreover, wake locations must be estimated a priori, preventing the accurate analysis of flows where vortex interactions are an important feature.

Algorithm research in the early 1980s focused on solution of the Euler equations––the governing equations for inviscid fluid flows. The Boeing version of an Euler/boundary layer coupling code––A588––is based on FLO57 [41] coupled with the same boundary layer code A411 used in A488. The code also introduced a capability for simulating engine inlet and exhaust flows with various total pressures and total temperatures, as well as propfan engine power effects through the use of an actuator disk concept. A588 was the main analysis tool for isolated nacelle development studies until very recently. It provided accurate predictions of nacelle fan cowl pressure distributions, as well as fan cowl drag rise. The multiblock 3D Euler code was used extensively for the simulation of the propfan engine on the Boeing 7J7 program during the mid-1980s, as shown in Fig. 17. A key application was the evaluation of propfan engine installation effects on tail stability characteristics––including simulations that could not be accomplished in the wind tunnel.

Another Euler/integral boundary layer coupling code––A585, based on Drela and Giles [42], was developed in the mid-1980s for 2D airfoil analysis and design. This code has been used extensively for advanced airfoil technology development, an essential capability for airplane product development engineers.

4.4. Navier–Stokes methods

The limitation of full potential or Euler/boundary layer coupling codes to flow regimes without significant flow separation led to the development and application of solution methods for the Navier–Stokes equations, which are valid over the whole flight regime of most commercial airplanes. Finite difference schemes [43] or finite volume schemes with either artificial numerical dissipation [44] or Roe's upwind scheme [45] were developed and tested extensively during the late 1980s and early 1990s. At the same time, development of turbulence models for attached and separated flow simulations progressed rapidly. The simple zero-equation Baldwin/Lomax model [46] was used extensively during the early stage of Navier–Stokes code applications. Later on, the Baldwin/Barth one-equation model [47], the Spalart/Allmaras one-equation model [48], and Menter's shear-stress transport k–ω model [49] became available and were used for a wide range of flight conditions, including massively separated flows.

4.4.1. Structured grid codes––Zeus TLNS3D/CFL3D, OVERFLOW

Navier–Stokes technology using structured grids was well developed by the early 1990s and is available to the industry. However, most existing structured grid Navier–Stokes codes require the users to provide high-quality 3D grids to resolve detailed viscous flows near configuration surfaces and viscous wake regions. The task of grid generation––both surface grid and field grid––has become one of the essential elements, as well as the bottleneck, in using Navier–Stokes technology for complex configuration/complex flow analysis. In addition, most Navier–Stokes solvers have not been thoroughly checked out and validated for numerical accuracy, convergence reliability, and application limitations. Boeing has acquired several Navier–Stokes codes from NASA, as well as from other research organizations, and has devoted a great deal of effort to testing the codes and validating numerical results with available wind tunnel and flight data. In addition, to make the codes usable tools for engineering design, Boeing CFD developers have rewritten a 3D grid generation code using an advancing front approach [50], so that precise control of grid quality, such as grid spacing, stretching ratio, and grid orthogonality near configuration surfaces, can be achieved. This is an important requirement for accurate resolution of viscous flow regions for all existing Navier–Stokes solvers. Two structured grid generation approaches are currently in use (i.e., the matched/patched multiblock grid approach and the overset or overlap grid approach). The former approach subdivides the flowfield into a number of topologically simple regions, such that in each region a high-quality grid can be generated. This is a rather time-consuming and tedious process for complex configuration analysis. However, once this ‘‘blocking’’ process is done for one configuration, a similar configuration can be done easily through the use of script or command files. The TLNS3D/CFL3D based Zeus Navier–Stokes analysis system [51] developed and used at Boeing for Loads and Stability and Control applications belongs to this structured, multiblock grid approach. The Zeus analysis system inherited the process developed in the A488 system, which packaged many user-friendly preprocessing programs that handled geometry and flow condition input as well as postprocessing programs that printed and plotted wing sectional data and airplane force and moment data. This has allowed the design engineers to reduce their input to just geometry lofts and flight conditions and to obtain the solution within a few hours or overnight, depending on the size of the problem and the availability of computing resources. The Zeus system is illustrated in Fig. 18.

Some recent applications of using the Zeus Navier–Stokes analysis system include the prediction of Reynolds number effects on tail effectiveness, shown in Fig. 19. CFD results captured the effect of Reynolds number on horizontal tail boundary layer health and on tail effectiveness quite well.

Another application is the simulation of vortex generators on a complete airplane configuration [52] as shown in Fig. 20. The effects of vortex generators on airplane pitch characteristics are shown. Again, the results compare reasonably well with flight data with respect to predicting airplane pitch characteristics, even at relatively high angles of attack where the flow is massively separated. The CFD solution also provides flowfield details that illustrate the flow physics behind how vortex generators work to improve high-speed handling characteristics, a very useful tool for design engineers in selecting and placing vortex generators on lifting surfaces.

The second structured grid Navier–Stokes method uses the overset grid approach, whereby the flowfield grid is generated for each component of the configuration independently. Each grid overlaps with one or more of the other grids, and communication between the grids is achieved through numerical interpolation in the overlap regions. The advantage of this approach is that each component of the configuration is relatively simple, and a high-quality local grid can be easily generated. However, one pays the price of performing complex 3D interpolation with some risk of degrading overall numerical accuracy. The OVERFLOW code [43] used at Boeing for high-speed and high-lift configuration analysis belongs to this overset/overlap structured grid approach. Fig. 21 shows the overset grids and OVERFLOW solution of a complex high-lift system, including all high-lift components of the airplane [53]. Results agree well with experimental data for low to moderate angles of attack. At high angles of attack, there are complex flow separations in the flap and slat gap regions, which could not be simulated adequately with the current one- or two-equation turbulence models. Improvements in turbulence models for separated flow simulation, as well as Navier–Stokes solver accuracy and robustness, are essential for a reliable prediction of airplane high-lift performance, as well as airplane pitch characteristics.
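
The grid-to-grid communication step can be illustrated by the following small Python function, which shows only the interpolation kernel: a fringe point receives its value by trilinear interpolation from the eight corners of its donor cell in the overlapping grid. Donor search, hole cutting, and the conservation issues handled by real overset codes are omitted, and the names are hypothetical.

import numpy as np

def trilinear_interpolate(donor_corner_values, xi, eta, zeta):
    """Interpolate inside a hexahedral donor cell; xi, eta, zeta are local coordinates in [0, 1]."""
    c = donor_corner_values                  # shape (2, 2, 2), indexed [i, j, k]
    w_x = np.array([1.0 - xi, xi])
    w_y = np.array([1.0 - eta, eta])
    w_z = np.array([1.0 - zeta, zeta])
    return np.einsum("i,j,k,ijk->", w_x, w_y, w_z, c)

corners = np.arange(8, dtype=float).reshape(2, 2, 2)
print(trilinear_interpolate(corners, 0.5, 0.5, 0.5))   # 3.5, the cell-center average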

Another important element for successful use of Navier–Stokes technology in airplane design and analysis is the availability of high-performance computing. All Navier–Stokes codes require large memory and many CPU hours to resolve viscous flows over an airplane configuration. The rapid development of parallel computing hardware and software, as well as of PC clusters with large numbers of CPUs, has made the use of Navier–Stokes technology in practical airplane design and analysis a reality. The analysis of an airplane configuration with 16 vortex generators on each side of the wing uses a grid of approximately 25 million points. Using 56 CPUs on an SGI Origin 2000 machine, the CFD solution for each flight condition can be obtained within 11 h of flow time.

4.4.2. Unstructured grid codes––Fluent, NSU2D/3D, CFD++

The structured grid Navier–Stokes codes make highly efficient use of computer memory and processing power due to the well-ordered data structure used in the solution algorithm. However, they suffer from two major drawbacks: the lack of flexibility in handling complex geometry and the difficulty of implementing solution adaptive gridding. These capabilities, namely, complex geometry handling and solution adaptivity, are essential for accurate and reliable predictions of airplane design and off-design performance. Consequently, it is less common and often more difficult to use CFD to analyze geometrically complex parts of the airplane, such as high-lift systems (flaps and slats), engine compartments, auxiliary power units, and so on. Paradoxically, the success of CFD in designing major components has eliminated many of the experiments that previously provided a ‘‘piggyback’’ opportunity to test these complicated devices. Consequently, there is an increased need to compute airflows around and through systems that are distinguished by very complex geometry and flow patterns. In the last decade, there has been impressive progress in unstructured grid Navier–Stokes code developments [54–57]. Boeing Commercial Airplanes has explored and used Fluent, the more recent unstructured grid Navier–Stokes codes NSU2D/NSU3D of Mavriplis [54], and CFD++ of Chakravarthy [57] for 2D and 3D high-lift analysis with success.

A recent application of unstructured grid technology involved the use of Fluent V5 [58] to investigate the behavior of the efflux from engine thrust reversers [59]. A typical commercial airplane deploys its thrust reversers briefly after touchdown. A piece of engine cowling translates aft and blocker doors drop down, directing the engine airflow into a honeycomb structure called a cascade. The cascade directs the flow forward, which acts to slow the aircraft and decrease lift for more effective braking. There are some critical design considerations in properly directing the reversed flow. The reverser is used precisely at the time when high-lift devices, wing leading and trailing edge flaps and slats, are fully deployed. Consequently, the plumes of hot exhaust must be directed so as not to impinge on these devices. In addition, the plumes should not hit the fuselage or other parts of the aircraft. Moreover, reingestion (in which the reversed plume reenters the engine inlet), engine ingestion of debris blown up from the runway, and plume envelopment of the vertical tail (which affects directional control) must be avoided. To avoid these effects, it is important for designers to know exactly where the exhaust plumes go.

The Tetra module of grid generation software from ICEM CFD Engineering [60] has been used to obtain fully unstructured meshes. Starting from a new airplane geometry (with cleaned up lofts), these meshes can be created in a day or two. The grid generation software contains a replay capability so that minor changes to the geometry can be remeshed quickly. Because the entire CFD analysis cycle can be completed in about three days, designers can use this tool repeatedly as a way to optimize the design. In this way, it is possible to map the performance of the reverser against the power setting of the reversed engine fan and the airplane forward speed. Tests that involve geometry changes, such as the repositioning of the cascades or the nacelle relative to the wing or variation of the cascade angles, can be accomplished with minimal remeshing and analysis. Wind tunnel testing and expense are reduced, but the key benefits are really time and risk mitigation. If a need to change the design should become apparent after the tooling was built and the aircraft was in test, the delay in entry into service and the expense of retooling would be unacceptable. The grid and engine reverser efflux particle traces from one of these cases are illustrated in Fig. 22. Fluent is in widespread use at Boeing for other geometrically complex problems, such as cooling flows in engine compartments and dispersion of fire suppression chemicals.

4.4.3. Other Navier–Stokes codes

The Propulsion Analysis group at Boeing Commercial Airplanes has long acquired, supported, and used a number of other Navier–Stokes codes. The present authors are not qualified to describe this activity; however, we do wish to mention some of the codes involved. These include the Boeing-named Mach3 code based on the implicit predictor–corrector methodology of MacCormack [61], the PARC code [62] of NASA Lewis, the WIND code [63], and BCFD [64], which is scheduled to be the platform for an Enterprise common Navier–Stokes code. These codes have been used for nacelle inlet analysis and design and for nacelle fan and core cowl nozzle performance studies [64,65].

4.4.4. Next generation Navier–Stokes codes

The successful application of Navier–Stokes codes during the last 10 years has raised expectations among Boeing engineers that CFD can become a routine tool for the loads analysis, stability and control analysis, and high-lift design processes. In fact, there is considerable speculation that it may be possible to populate databases involving tens of thousands of cases with results from Navier–Stokes CFD codes, if dramatic improvements in computing affordability continue over the next five years. For the first time, the affordability per Navier–Stokes data point may rival that of a wind tunnel generated data point. Of course, project engineers use CFD and wind tunnel data in a complementary fashion so that cost is not a competitive issue here. Before Navier–Stokes codes can be routinely used to populate databases, however, accuracy, reliability, efficiency, and usability issues need to be addressed. Gaps in data, inconsistent data, and long acquisition times seriously degrade the utility of a database. Even with current user aids, the application of Navier–Stokes codes to new configurations generally requires the services of an expert user. The generation of a ‘‘good grid’’ is still somewhat of an art and often quite labor intensive. Although everyone realizes that a ‘‘good grid’’ is necessary for accuracy and even convergence, there is no precise definition of what constitutes a ‘‘good grid’’. In fact, the definition would probably vary from code to code and is certainly case dependent. Usability problems are reflected in the fact that although Navier–Stokes codes are now considered capable of generating more accurate results, they are used far less frequently than TRANAIR at Boeing Commercial Airplanes.

Much of the current effort to improve the usability of our Navier–Stokes codes would have to be termed evolutionary. As is always the case with evolutionary improvements, it is necessary to determine whether or not incremental improvements are approaching a horizontal asymptote, while implementation costs are mounting. Boeing is currently involved in an effort to reevaluate the current technology and explore alternatives, much the same as was done 20 years ago in the case of potential flow. The project is called General Geometry Navier–Stokes Solver (GGNS).

From our TRANAIR experience, it seems rather evident that solution adaptive grids must be an essential feature for reliability and usability. This is especially true when computing flows at off-design conditions where our understanding of the flow physics is limited, making it difficult to generate ‘‘good grids’’. However, these grids must now be anisotropic and, more than likely, quite irregular. This places a huge burden on improving discretization fidelity, as current discretization algorithms do not seem to do well with irregular spacings and cell shapes. Higher order elements are certainly desirable for efficiency's sake and for capturing latent features. However, stabilization and limiter technologies need to be advanced to handle such elements. Current solvers are relatively weak, and convergence is often incomplete, especially when turbulent transport equations are involved. Some of these issues are addressed in detail elsewhere [66]. It should be noted that our reevaluation and development work here is a joint effort between the CFD developers at Boeing and their colleagues at the Boeing Technical Research Center in Moscow. We also note there are related efforts going on elsewhere. We mention in particular the FAAST project at NASA Langley.

4.5. Design and optimization methods

4.5.1. A555, A619 inverse design codes

Most existing CFD codes are analysis tools (i.e., given a configuration, the codes predict the aerodynamic characteristics of the configuration). In airplane design, one would also like tools that provide design capability (i.e., given desired aerodynamic characteristics, the codes generate realistic geometry). The design method used by Henne [67], which prescribes wing surface pressures and employs an iterative method to find the corresponding geometry, was one of the very first inverse design methods used in the airplane industry. Boeing Commercial Airplanes developed a similar method for wing design using the A555 code [68], illustrated in Fig. 23. This code was used extensively on the 7J7, 777, and 737NG programs. The code borrowed heavily from the A488 system to ensure usability in the fast-paced airplane development environment. On the Boeing 777 program, CFD contributed to a high degree of confidence in performance with only a three-cycle wing development program. Significantly fewer wing designs were tested for the 777 than for the earlier 757 and 767 programs. Without the ‘‘inverse design’’ CFD capability of A555, the resulting final wing design would have had to be 21% thinner. Such a wing would not have been manufacturable due to skin gages being too thick for the automatic riveting machines in the factory, and it would have had less fuel volume. Conversely, if the wing could meet the skin gage and fuel volume requirements, the cruise Mach number would have had to be significantly lower. In either case, the airplane would not have achieved customer satisfaction. The effect of CFD wing design in this case was an airplane that has dominated sales in its class since being offered to the airlines.

More recently, Campbell [69] introduced a constrained, direct, iterative, surface curvature method (CDISC) for wing design. The method has been incorporated into both the structured grid single-block Navier–Stokes code A619 [70], and the overset grid code OVERFLOW/OVERDISC at Boeing. Both codes are in use for configuration design in the product development organization.

4.5.2. TRANAIR optimization

Because of boundary condition generality, and in particular the use of transpiration to simulate surface movement, the TRANAIR code could have easily been substituted into the existing Boeing standard inverse aerodynamic design process, A555. However, the process itself had a number of issues. First and foremost was the difficulty of finding ‘‘good’’ pressure distributions for highly 3D flows. Such pressure distributions needed to result in acceptable off-design performance as well as low cruise drag. Although many rules of thumb were developed through the years, only a few highly experienced aerodynamicists could create acceptable distributions on a routine basis. Second, it was never clear whether the resultant designs were in fact optimal, a question of some importance in a highly competitive environment. Third, multidisciplinary constraints often had to be imposed after the fact, leading to a highly iterative and time-consuming process as well as potentially suboptimal designs.

A serendipitous result of the decision to use a powerful sparse solver to converge the TRANAIR analysis cases was the ability to rapidly generate solution sensitivities. In a sense, each sensitivity represented just another right hand side for the already decomposed analysis Jacobian matrix to solve. In addition, the adaptive grid capability allowed accurate tracking of changes in critical flow features predicted by these sensitivities. Formally, it was an easy matter to feed the sensitivities into an optimization driver such as NPSOL [71] and systematize the design process as illustrated in Fig. 24. However, optimization codes have been notorious for promising spectacular results and then falling flat because of overly simplistic mathematical realizations of the problems. Aerodynamic design requires understanding of very complicated geometric, flow and interdisciplinary constraints. These constraints are rather nebulous and often exist only in the minds of the designers. An initial optimization capability using TRANAIR was available in 1992 [72], but it took several more years before project users were willing to trust their design processes to optimization [73]. A wide variety of payoff functions and constraints were built into TRANAIR, but the one component of a payoff function that users were really interested in was, of course, drag. Consequently, a great deal of effort was invested in numerical work to improve TRANAIR's drag calculations. Careful studies in the mid-1990s [74] then validated the ability of TRANAIR to compute accurate drag increments for subsonic transports.
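
The ‘‘another right hand side’’ observation can be made concrete with a short Python sketch: with the analysis Jacobian dR/dq factored once, the sensitivity of the flow state to each geometry degree of freedom costs only one extra back-substitution, and the assembled drag gradient can then be handed to an SQP driver such as NPSOL (not shown here). This is a generic implicit-function sensitivity calculation with stand-in matrices, not the TRANAIR code itself.

import numpy as np
from scipy.linalg import lu_factor, lu_solve

def drag_gradient(dR_dq, dR_dD, dCD_dq, dCD_dD):
    """Total derivative dCD/dD of drag with respect to design variables, Jacobian factored once."""
    lu_piv = lu_factor(dR_dq)
    dq_dD = -lu_solve(lu_piv, dR_dD)        # one solve per design variable (columns of dR_dD)
    return dCD_dD + dCD_dq @ dq_dD          # chain rule through the implicit flow solution

# toy usage with random stand-in matrices: 100 flow unknowns, 6 design variables
rng = np.random.default_rng(0)
n, m = 100, 6
A = np.eye(n) + 0.01 * rng.standard_normal((n, n))
grad = drag_gradient(A, rng.standard_normal((n, m)),
                     rng.standard_normal(n), rng.standard_normal(m))
print(grad.shape)                            # (6,): one drag sensitivity per design variable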

At the same time, a multipoint optimization capability was introduced, since it was well understood that drag minimization at a single flight condition was somewhat ill-posed and often led to unacceptable off design characteristics. Moreover, users desired capability for simultaneously optimizing slightly different configurations having major portions of their geometries in common. By 1997, TRANAIR optimization had replaced inverse design as the preferred aerodynamic design process for flight conditions where full potential/boundary layer modeling is applicable. At the current time, the code can handle as many as 600 geometry degrees of freedom and 45,000 nonlinear inequalities. These inequalities represent the pointwise application of roughly 25 different types of flow and geometry constraints. The code has seen extensive use in the design of a large variety of configurations covering the Mach range from transonic to Mach 2.4. This has contributed (in several cases critically) to detailed development studies for a number of vehicles, some of which are illustrated in Fig. 25.

TRANAIR design/optimization applications that have affected a product include the payload fairing on the Sea Launch rocket, nacelle fan cowl for the GE90-115B engine, and the process used to determine ‘‘Reduced Vertical Separation Minimums’’ compliance for new and in-service aircraft.

5. Conclusions

During the last 30 years at Boeing Commercial Airplanes, Seattle, CFD has evolved into a highly valued tool for the design, analysis, and support of cost-effective and high-performing commercial transports. The application of CFD today has revolutionized the process of aerodynamic design, and CFD has joined the wind tunnel and flight test as a critical tool of the trade. This did not have to be the case; CFD could have easily remained a somewhat interesting tool with modest value in the hands of an expert as a means to assess problems arising from time to time. As the reader can gather from the previous sections, there are many reasons that this did not happen. The one we would like to emphasize in this Conclusion section is the fact that Boeing recognized the leverage in getting CFD into the hands of the project engineers and was willing to do all the things necessary to make it happen.




Advanced methodology for uncertainty propagation in computer experiments with large number of inputs


Bertrand Iooss (a,b,*) and Amandine Marrel (c)

a. EDF R&D, Département PRISME, 6 Quai Watier, 78401 Chatou, France
b. Institut de Mathématiques de Toulouse, 31062 Toulouse, France
c. CEA, DEN, DER, SESI, LEMS, 13108 Saint-Paul-lez-Durance, France

* Email: bertrand.iooss@edf.fr

December 24, 2018

Abstract

In the framework of the estimation of safety margins in nuclear accident analysis, a quantitative assessment of the uncertainties tainting the results of computer simulations is essential. Accurate uncertainty propagation (estimation of high probabilities or quantiles) and quantitative sensitivity analysis may call for several thousand code simulations. Complex computer codes, such as those used in thermal-hydraulic accident scenario simulations, are often too cpu-time expensive to be directly used to perform these studies. A solution consists in replacing the computer model by a cpu-inexpensive mathematical function, called a metamodel, built from a reduced number of code simulations. However, in the case of high-dimensional experiments (with typically several tens of inputs), the metamodel building process remains difficult. To overcome this limitation, we propose a methodology which combines several advanced statistical tools: an initial space-filling design, screening to identify the non-influential inputs, and Gaussian process (Gp) metamodel building with the group of influential inputs as explanatory variables. The residual effect of the group of non-influential inputs is captured by another Gp metamodel. The resulting joint Gp metamodel is then used to accurately estimate Sobol' sensitivity indices and high quantiles (here the 95%-quantile). The efficiency of the methodology in dealing with a large number of inputs and reducing the calculation budget is illustrated on a thermal-hydraulic test case simulating a Loss Of Coolant Accident scenario in a Pressurized Water Reactor with the CATHARE2 code. A predictive Gp metamodel is built with only a few hundred code simulations and allows the calculation of the Sobol' sensitivity indices. This Gp also provides a more accurate estimation of the 95%-quantile and its associated confidence interval than the empirical approach, at equal calculation budget. Moreover, on this test case, the joint Gp approach outperforms the simple Gp.

Keywords | Gaussian process, Metamodel, Quantile, Screening, Sobol’ indices


I. INTRODUCTION

Best-estimate computer codes are increasingly used to estimate safety margins in nuclear accident management analysis instead of conservative procedures [1, 2]. In this context, it is essential to evaluate the accuracy of the numerical model results, whose uncertainties come mainly from the lack of knowledge of the underlying physics and of the model input parameters. The so-called Best Estimate Plus Uncertainty (BEPU) methods were therefore developed and introduced in safety analyses, especially for thermal-hydraulic issues, and even more specifically for the large-break loss-of-coolant accident [3, 4, 5]. Their main principles rely on a probabilistic modeling of the model input uncertainties, on Monte Carlo sampling for running the thermal-hydraulic computer code on sets of inputs, and on the application of statistical tools (based, for example, on order statistics such as Wilks’ formula) to infer high quantiles of the output variables of interest [6, 7, 8, 9, 10].

Not restricted to the nuclear engineering domain, the BEPU approach is more widely known as the uncertainty analysis framework [11]. Quantitative assessment of the uncertainties tainting the results of computer simulations is indeed a major topic of interest in both industrial and scientific communities. One of the key issues in such studies is to get information about the output when the numerical simulations are expensive to run. One often faces CPU-time-consuming numerical models, and in such cases uncertainty propagation, sensitivity analysis, optimization and system robustness analysis become difficult tasks. To circumvent this problem, a widely accepted method consists in replacing CPU-time expensive computer models by CPU-inexpensive mathematical functions (called metamodels) based, e.g., on polynomials, neural networks, or Gaussian processes [12]. The metamodel is built from a set of computer code simulations and must be as representative as possible of the code in the variation domain of its uncertain parameters, with good prediction capabilities. The use of metamodels has been extensively applied to engineering problems, as it provides a multi-purpose tool [13]: once estimated, the metamodel can be used to perform global sensitivity analysis, as well as uncertainty propagation, optimization, or calibration studies. In BEPU-type analyses, several works [14, 15, 16] have introduced the use of metamodels and shown how this technique can help estimate quantiles or probabilities of failure in thermal-hydraulic calculations.

However, the metamodeling technique is known to be relevant when the simulated phenomena are related to a small number of random input variables (see [17] for example). In the case of high-dimensional numerical experiments (with typically several tens of inputs), and depending on the complexity of the underlying numerical model, the metamodel building process remains difficult, or even impracticable. For example, the Gaussian process (Gp) model [18], which has shown strong capabilities to solve practical problems, has some limitations when dealing with high-dimensional problems. The main difficulty lies in the estimation of the Gp hyperparameters. Manipulating predefined or well-adapted Gp kernels (as in [19, 20]) is one current research direction, while several authors propose to couple the estimation procedure with variable selection techniques [21, 22, 23].

In this paper, following the latter approach, we propose a rigorous and robust method for building a Gp metamodel with a high-dimensional vector of inputs. Concerning the practical use of this metamodel, our final goal is twofold: to be able to interpret the relationships between the model inputs and outputs through a quantitative sensitivity analysis, and to have a reliable method for estimating high quantiles which does not require additional runs of the computer code.

In what follows, the system under study is generically denoted

Y = g(X1, ..., Xd)    (1)

where g(·) is the numerical model (also called the computer code), whose output Y and input parameters X1, ..., Xd belong to some measurable spaces Y and X1, ..., Xd respectively. X = (X1, ..., Xd) is the input vector, and we suppose that X = X1 × ... × Xd ⊂ R^d and Y ⊂ R. For a given value of the vector of inputs x = (x1, ..., xd) ∈ R^d, a simulation run of the code yields an observed value y = g(x).

To meet our aforementioned objectives, we propose a sequential methodology which combines several relevant statistical techniques. Our approach consists in four steps:

  1. Step 1: Design of experiments. Knowing the variation domain of the input variables, a design of n numerical experiments is first performed and yields n model output values. The obtained sample of inputs/outputs constitutes the learning sample. The goal here is to explore the domain of uncertain parameters X as efficiently as possible, with a reasonable simulation budget, and to get as much information as possible about the behavior of the simulator output Y. For this, we use a space-filling design (SFD) of experiments, providing a full coverage of the high-dimensional input space [12, 23].

  2. Step 2: Preliminary screening. From the learning sample, a screening technique is performed in order to identify the Primary Influential Inputs (PII) on the model output variability and rank them by decreasing influence. To achieve this, we use dependence measures with associated statistical tests. These measures have several advantages: they can be directly estimated on a SFD, the sensitivity indices that they provide are quantitatively interpretable, and their efficiency for screening purposes has recently been illustrated by [24, 25, 26].

  3. Step 3: Building and validation of joint metamodel. From the learning sample, a metamodel is built to fit the simulator output Y. For this, we propose to use a joint Gp metamodel [27], by considering only the PII as the explanatory inputs while the other inputs (screened as non significantly influential) are considered as a global stochastic (i.e. unknown) input. Moreover, we use a sequential process to build the joint Gp where the ranked PII are successively included as explanatory inputs in the metamodel (ranking from Step 2). At each iteration, a first Gp model [22], only depending on the current set of explanatory inputs, is built to approximate the mean component. The residual effect of the other inputs is captured using a second Gp model, also function of the explanatory inputs, which approximates the variance component. The accuracy and prediction capabilities of the joint metamodel are controlled on a test sample or by cross-validation.

  4. Step 4: Use of the metamodel for sensitivity analysis and uncertainty propagation. A quantitative sensitivity analysis (Step 4A) and an uncertainty propagation (Step 4B) are performed using the joint metamodel instead of the computer model, leading to a large gain of computation time [28, 27]. In this work, we are particularly interested in estimating variance-based sensitivity indices (namely Sobol’ indices) and the 95%-quantile of model output.

For ease of reading and understanding of the methodology, Figure 1 gives a general workflow of the articulation of the main steps. For each step, the dedicated sections, the main notations that will be properly introduced later and the key equations are also referenced, in order to provide a guideline for the reader. The paper is organized as follows. In the following section, the nuclear test case which constitutes the guiding application of the paper is described. It consists in a thermal-hydraulic calculation case which simulates an accidental scenario in a nuclear Pressurized Water Reactor. Then, each step of the above statistical methodology is detailed in a dedicated section and illustrated at the same time on the use case. A last section concludes the work.

Fig. 1. General workflow of the statistical methodology.

II. THERMAL-HYDRAULIC TEST CASE

Our use case consists in thermal-hydraulic computer experiments, typically used in support of regulatory work and of nuclear power plant design and operation. Indeed, some safety analyses consider the so-called "Loss Of Coolant Accident", which takes into account a double-ended guillotine break with a piping rupture of a specific size. The test case under study is a simplified one, as regards both the physical phenomena and the dimensions of the system, with respect to a realistic modeling of the reactor. The numerical model is based on the CATHARE2 code (V2.5 3mod3.1), which simulates the time evolution of physical quantities during a thermal-hydraulic transient. It models a test carried out on the Japanese mock-up "Large Scale Test Facility" (LSTF) in the framework of the OECD/ROSA-2 project, which is representative of an Intermediate Break Loss Of Coolant Accident (IBLOCA) [29]. This mock-up represents a reduced-scale Westinghouse PWR (1/1 ratio in height and 1/48 in volume), with two loops instead of the four loops of the actual reactor and an electrically powered heating core (10 MWe), see Figure 2. It operates at the same pressure and temperature values as the reference PWR. The simulated accidental transient is an IBLOCA with a break on the cold leg and no safety injection on the broken leg. The test under study reproduces a PWR 17% (of cold leg cross-sectional area) cold leg IBLOCA transient with total failure of the auxiliary feedwater, single failure of the diesel generators, and three systems available only in the intact loop (high pressure injection, accumulator and low pressure injection).

CATHARE2 is used to simulate this integral effect test (see [30] for the full details of the modeling). During an IBLOCA, the reactor coolant system minimum mass inventory and the Peak Cladding Temperature (PCT) are reached shortly after the beginning of the accumulators’ injection. Figure 3 shows the CATHARE2 prediction and the experimental values of the maximal cladding temperature (also called the maximal heater rod temperature) obtained during the test. The conclusion of [30], which also presents other results, is that the CATHARE2 modeling of the LSTF makes it possible to reproduce the global trends of the different physical phenomena during the transient of the experimental test. In our study, the main output variable of interest is a single scalar, the PCT during the accident transient (see, as an example, the peak in Figure 3). This quantity is derived from the physical outputs provided by the CATHARE2 code.

The input parameters of the CATHARE2 code correspond to various system parameters such as boundary conditions, critical flow rates, interfacial friction coefficients, condensation coefficients, heat transfer coefficients, etc. In our study, only uncertainties related to physical parameters are considered; no uncertainty on scenario variables (the initial state of the reactor before the transient) is taken into account. All the uncertain physical models identified in an IBLOCA transient of a nuclear power plant are assumed to apply to the LSTF, except the phenomena related to fuel behavior, because of the absence of fuel in the LSTF. A physical model uncertainty consists in an additive or multiplicative coefficient associated with a physical model. Finally, d = 27 scalar input variables of the CATHARE2 code are considered uncertain and statistically independent of each other. They are then defined by their marginal probability density functions (uniform, log-uniform, normal or log-normal). Table I gives more details about these uncertain inputs and their probability density functions (pdf). The nature of these uncertainties is epistemic, since they come from a lack of knowledge of the true value of these parameters.

TABLE I: List of the 27 uncertain input parameters and associated physical models in CATHARE2 code.

Type of inputs                                    | Input | pdf(a) | Physical models
Heat transfer in the core                         | X1    | N      | Departure from nucleate boiling
                                                  | X2    | U      | Minimum film stable temperature
                                                  | X3    | LN     | HTC(b) for steam convection
                                                  | X4    | LN     | Wall-fluid HTC
                                                  | X5    | N      | HTC for film boiling
Heat transfer in the steam generators (SG) U-tube | X6    | LU     | HTC forced wall-steam convection
                                                  | X7    | N      | Liquid-interface HTC for film condensation
Wall-steam friction in core                       | X8    | LU     |
Interfacial friction                              | X9    | LN     | SG outlet plena and crossover legs together
                                                  | X10   | LN     | Hot legs (horizontal part)
                                                  | X11   | LN     | Bend of the hot legs
                                                  | X12   | LN     | SG inlet plena
                                                  | X13   | LN     | Downcomer
                                                  | X14   | LN     | Core
                                                  | X15   | LN     | Upper plenum
                                                  | X16   | LN     | Lower plenum
                                                  | X17   | LN     | Upper head
Condensation                                      | X18   | LN     | Downcomer
                                                  | X19   | U      | Cold leg (intact)
                                                  | X20   | U      | Cold leg (broken)
                                                  | X27   | U      | Jet
Break flow                                        | X21   | LN     | Flashing (undersaturated)
                                                  | X22   | N      | Wall-liquid friction (undersaturated)
                                                  | X23   | N      | Flashing delay (undersaturated)
                                                  | X24   | LN     | Flashing (saturated)
                                                  | X25   | N      | Wall-liquid friction (saturated)
                                                  | X26   | LN     | Global interfacial friction (saturated)

a. U, LU, N and LN respectively stand for Uniform, Log-Uniform, Normal and Log-Normal probability distributions. b. Heat Transfer Coefficient.

Our objective with this use case is to provide a good metamodel for sensitivity analysis, uncertainty propagation and, more generally, safety studies. Indeed, the CPU-time cost of one simulation is too high to directly perform all the statistical analyses which are required in a safety study and for which many simulations are needed. To overcome this limitation, an accurate metamodel, built from a reduced number of direct code calculations, will make it possible to develop a more complete and robust safety demonstration.

III. STEP 1: DESIGN OF EXPERIMENTS

This initial sampling step consists in defining a design of n experiments for the inputs and performing the corresponding runs with the numerical model g. The obtained sample of inputs/outputs will constitute the learning sample on which the metamodel will then be fitted. The objective is therefore to investigate, most efficiently and with few simulations, the whole variation domain of the uncertain parameters in order to build a predictive metamodel which approximates the output Y as accurately as possible.

For this, we propose to use a space-filling design (SFD) of n experiments, this kind of design providing a full coverage of the high-dimensional input space [12]. Among SFD types, a Latin Hypercube Sample (LHS, [31]) with optimal space-filling and good projection properties [23] is well adapted. In particular, [12, 32] have shown the importance of ensuring good low-order sub-projection properties. Maximum projection designs [33] or low centered L2-discrepancy LHS [34] are then particularly well suited.

Mathematically, the experimental design corresponds to a n-size sample {x(1), ..., x(n)} which is performed on the model (or code) g. This yields n model output values denoted {y(1), ..., y(n)} with y(i) = g(x(i)). The obtained learning sample is denoted (Xs, Ys) with Xs = [x(1)T, ..., x(n)T]T and Ys = {y(1), ..., y(n)}T. Then, the goal is to build an approximating metamodel of g from the n-sample (Xs, Ys).

The number n of simulations is a compromise between the CPU time required for each simulation and the number of input parameters. For uncertainty propagation and metamodel-building purpose, some rules of thumb propose to choose n at least as large as 10 times the dimension d of the input vector [35, 22].
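As a rough illustration of this step, the sketch below draws an optimized LHS in the unit hypercube and maps each column to its marginal by inverse transform. It assumes SciPy's qmc module (scipy >= 1.8); the marginal distributions shown are hypothetical placeholders, not the actual CATHARE2 input laws of Table I.

```python
import numpy as np
from scipy.stats import qmc, norm, uniform

d, n = 27, 500                      # input dimension and simulation budget (n >= 10 d)

# Space-filling LHS in [0, 1)^d; "random-cd" lowers the centered L2 discrepancy.
sampler = qmc.LatinHypercube(d=d, optimization="random-cd", seed=0)
u = sampler.random(n)

# Map each column to its marginal by inverse transform (illustrative marginals only).
x = np.empty_like(u)
x[:, 0] = norm.ppf(u[:, 0], loc=1.0, scale=0.1)        # e.g. a normally distributed input
x[:, 1:] = uniform.ppf(u[:, 1:], loc=0.5, scale=1.0)   # e.g. uniform inputs on [0.5, 1.5]

# Each row of x would then be run through the simulator g to build the learning sample.
```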

To build the metamodel for the IBLOCA test case, n = 500 CATHARE2 simulations are performed following a space-filling LHS built in dimension d = 27. The histogram of the obtained values for the output of interest, namely the PCT, is given by Figure 4 (temperature is in °C). A kernel density estimator [36] of the data is also added on the plot to provide an estimator of the probability density function. A bimodality seems to be present in the histogram. It underlines the existence of bifurcation or threshold effects in the code, probably caused by a phenomenon of counter current flow limitation between the bend of hot legs and the steam generator inlet plena.

Fig. 4. Histogram of the PCT from the learning sample of n = 500 simulations.

Remark III.1 Note that the input values are sampled following their prior distributions defined on their variation ranges. Indeed, as we are not guaranteed to be able to build a sufficiently accurate metamodel, we prefer to sample the inputs according to their probability distributions in order to have at least a probabilized sample of the uncertain output, on which statistical characteristics can be estimated. Moreover, as explained in the next section, dependence measures can be directly estimated on this sample, providing first usable sensitivity analysis results.

Finally, the bimodality that has been observed on the PCT distribution strengthens the use of advanced sensitivity indices (i.e. more general than linear ones or variance-based ones) in our subsequent analysis.

IV. STEP 2: PRELIMINARY SCREENING BASED ON DEPENDENCE MEASURE

From the learning sample, a screening technique is performed in order to identify the Primary Influential Inputs (PII) on the variability of the model output. It has recently been shown that screening based on dependence measures [24, 25] or on derivative-based global sensitivity measures [37, 38] provides very efficient methods which can be directly applied on a SFD. Moreover, beyond the screening task, these sensitivity indices can be quantitatively interpreted and used to order the PII by decreasing influence, paving the way for a sequential building of the metamodel. In the considered IBLOCA test case, the adjoint model is not available and the derivatives of the model output are therefore not computed, because of their cost. The screening step is thus based only on dependence measures, more precisely on Hilbert-Schmidt independence criterion (HSIC) indices, directly estimated from the learning sample.

The dependence measures for screening purpose have been proposed by [24] and [25]. These sensitivity indices are not the classical ones based on the decomposition of output variance (see [39] for a global review). They consider higher order information about the output behavior in order to provide more detailed information. Among them, the Hilbert-Schmidt independence criterion (HSIC) introduced by [40] builds upon kernel-based approaches for detecting dependence, and more particularly on cross-covariance operators in reproducing kernel Hilbert spaces, see Appendix A for mathematical details.

From the estimated R²HSIC [24], independence tests can be performed for screening purposes. The objective is to separate the inputs into two subgroups, the significant ones and the non-significant ones. For a given input Xk, statistical HSIC-based tests aim at testing the null hypothesis "H0(k): Xk and Y are independent" against its alternative "H1(k): Xk and Y are dependent". The significance level(c) of these tests is hereinafter denoted α. Several HSIC-based statistical tests are available: asymptotic versions based on an approximation with a Gamma law (for large sample sizes), spectral extensions, and permutation-based versions for the non-asymptotic case (small sample sizes). All these tests are described and compared in [25]; guidance on their use for screening purposes is also proposed there. (c) The significance level of a statistical hypothesis test is the probability of rejecting the null hypothesis H0 when it is true.

So, at the end of this HSIC-based screening step, the inputs are clustered into two subgroups, PII and non-influential inputs, and the PII are ordered by decreasing R²HSIC. This order will be used for the sequential metamodel building in step 3.
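As a rough illustration of this screening step, the sketch below estimates HSIC with Gaussian kernels (median-distance bandwidth heuristic), normalizes it into an R²HSIC-type index, and runs a permutation-based independence test. It is a minimal sketch of the general technique under these assumptions, not the exact estimators or test versions used in the paper.

```python
import numpy as np

def gaussian_gram(z):
    """Gaussian (RBF) Gram matrix with the median-distance bandwidth heuristic."""
    z = np.asarray(z, dtype=float).reshape(len(z), -1)
    d2 = np.sum((z[:, None, :] - z[None, :, :]) ** 2, axis=-1)
    bw2 = np.median(d2[d2 > 0])                      # squared bandwidth
    return np.exp(-d2 / bw2)

def hsic(x, y):
    """Biased (V-statistic) estimator of HSIC between two samples."""
    n = len(y)
    K, L = gaussian_gram(x), gaussian_gram(y)
    H = np.eye(n) - np.ones((n, n)) / n
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def r2_hsic(x, y):
    """Normalized index used to rank the inputs by decreasing influence."""
    return hsic(x, y) / np.sqrt(hsic(x, x) * hsic(y, y))

def perm_test_pvalue(x, y, n_perm=500, seed=0):
    """Permutation p-value for H0: 'x and y are independent' (reject if small)."""
    rng = np.random.default_rng(seed)
    obs = hsic(x, y)
    perm = np.array([hsic(x, rng.permutation(y)) for _ in range(n_perm)])
    return np.mean(perm >= obs)

# Screening loop (X: learning inputs of shape (n, d), Y: outputs of shape (n,)):
# pii = [k for k in range(X.shape[1]) if perm_test_pvalue(X[:, k], Y) < 0.1]
```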

On the IBLOCA test case, from the learning sample of n = 500 simulations, the R²HSIC dependence measures are estimated and bootstrap independence tests with α = 0.1 are performed. The independence hypothesis H0 is rejected for eleven inputs, which are designated as the PII. These inputs are given in Table II with their estimated R²HSIC. Ordering them by decreasing R²HSIC reveals:

  • the large influence of the interfacial friction coefficient in the horizontal part of the hot legs (X10),

  • followed by the minimum stable film temperature in the core X2, the interfacial friction coefficient in the SG inlet plena X12 and the wall to liquid friction (in under-saturated break flow conditions) in the break line X22,

  • followed by seven parameters with a lower influence: the interfacial friction coefficients in the upper plenum X15, the downcomer X13, the core X14 and the SG outlet plena and crossover legs together X9, the heat transfer coefficient in the core for film boiling X5, the interfacial friction coefficient of the saturated break flow X26 and the condensation coefficient in the jet during the injection X27.

These results clearly underline the predominant influence of the uncertainties on the various interfacial friction coefficients.

TABLE II: HSIC-based sensitivity indices R²HSIC for the PII (Primary Influential Inputs), identified by independence test (Step 2).

PII    | X10  | X2   | X12  | X22  | X15  | X13  | X9   | X5   | X14  | X26  | X27
R²HSIC | 0.39 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01

From the learning sample, scatterplots of the PCT with respect to some well-chosen inputs (the three most influential ones: X2, X10 and X12) are displayed in Figure 5. A local regression using weighted linear least squares and a first-degree polynomial model (a moving-average-type filter) is added on each scatterplot to extract a possible tendency. One can observe that larger values of the interfacial friction coefficient in the horizontal part of the hot legs (X10) lead to larger values of the PCT. This can be explained by the increased amount of liquid carried by the vapor into the horizontal part of the hot legs, which reduces the return of liquid water from the rising part of the SG U-tubes to the core (through the hot legs and the upper plenum). Since the amount of liquid water available for core cooling is reduced, a higher PCT is observed.

In addition, we notice a threshold effect concerning this input: beyond a value of 2, the water non-return effect seems to have been reached, and X10 no longer appears to be influential. We also note that the minimum stable film temperature in the core (X2) shows a trend: the more it increases, the lower the PCT. This is explained by the fact that in the film boiling regime in the core (i.e. when the rods are isolated from the liquid by a film of vapor), X2 represents (with a decrease in heat flux) the temperature from which the thermal transfer returns to the nucleate boiling regime. Thus, the larger X2, the faster the re-wetting of the rods, the faster the cladding temperature excursion is stopped, and thus the lower the PCT.

Fig. 5. Scatterplots with local polynomial regression of PCT according to several inputs, from the learning sample of n = 500 simulations.
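For readers who wish to reproduce this kind of exploratory plot, a minimal sketch is given below, assuming matplotlib and the lowess smoother of statsmodels as a stand-in for the local weighted linear regression; the variable names in the usage comment are illustrative.

```python
import matplotlib.pyplot as plt
from statsmodels.nonparametric.smoothers_lowess import lowess

def scatter_with_trend(x, y, xlabel):
    """Scatterplot of the PCT against one input with a local linear trend line."""
    plt.scatter(x, y, s=10, alpha=0.5)
    trend = lowess(y, x, frac=0.4)          # returns (x, fitted) pairs sorted by x
    plt.plot(trend[:, 0], trend[:, 1], color="red")
    plt.xlabel(xlabel)
    plt.ylabel("PCT (°C)")
    plt.show()

# e.g. scatter_with_trend(X[:, 9], Y, "X10 (interfacial friction, hot legs)")
```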

The same kind of physical analysis can be made for the other PII by looking at their individual scatterplots. Finally, it is important to note that the estimated HSIC and the results of the significance tests are relatively stable when the learning sample size varies from n = 300 to n = 500. Only two or three selected variables with a very low HSIC (R²HSIC around 0.01) can differ. This confirms the robustness, with respect to the sample size, of the estimated HSIC and of the results of the associated significance tests, and emphasizes their relevance for qualitative ranking and screening purposes.

In the next steps, only the eleven PII are considered as explanatory variables in the joint metamodel and are successively included in the building process. The other sixteen variables are grouped into a so-called uncontrollable parameter.

V. STEP 3: JOINT GP METAMODEL WITH SEQUENTIAL BUILDING PROCESS

Among all the metamodel-based solutions (polynomials, splines, neural networks, etc.), we focus our attention on Gaussian process (Gp) regression, which extends the kriging principles of geostatistics to computer experiments by considering the correlation between two responses of a computer code as a function of the distance between the input variables. The Gp-based metamodel presents some real advantages compared to other metamodels: exact interpolation property, simple analytical formulation of the predictor, availability of the mean squared error of the predictions, and proven capabilities for the modeling of numerical simulators (see [41], [18] or [22]). The reader can refer to [42] for a detailed review of Gp metamodels.

However, for its application to complex industrial problems, a robust implementation methodology is required. Indeed, it often implies the estimation of several hyperparameters involved in the covariance function of the Gp (e.g. in the usual case of an anisotropic stationary covariance function). Difficulties can therefore arise in the parameter estimation procedure (instability, large number of hyperparameters, see [22] for example). To tackle this issue, we propose a progressive estimation procedure based on the result of the previous screening step and using a joint Gp approach [27]. The interest of the previous screening step becomes twofold. First, the input space on which each component of the joint Gp is built can be reduced to the PII space (only the PII are explanatory inputs of the Gp). Secondly, the joint Gp is built with a sequential process where the ranked PII are successively included as explanatory inputs in the metamodel. These two uses of the screening results are expected to make the building of the joint Gp significantly easier and more efficient.

V.A. Sequential Gp-building process based on successive inclusion of PII as explanatory variables

At the end of the screening step, the PII are ordered by decreasing influence (decreasing R²HSIC). They are successively included as explanatory inputs in the Gp metamodel while the other inputs (the remaining PII and the other non-PII inputs) are joined in a single macro-parameter which is considered as an uncontrollable parameter (i.e. a stochastic parameter, notion detailed in section V.B). Thus, at the j-th iteration, a joint Gp metamodel is built with, as explanatory inputs, the j first ordered PII. The definition and building procedure of a joint Gp is fully described in [27] and summarized in the next subsection.

However, building a Gp or a joint Gp involves performing a numerical optimization in order to estimate all the parameters of the metamodel (covariance hyperparameters and variance parameter). As anisotropic (stationary) covariances are usually considered in computer experiments, the number of hyperparameters increases linearly with the number of inputs. In order to improve the robustness of the optimization process and deal with a large number of inputs, the estimated hyperparameters obtained at the (j−1)-th iteration are used as starting points for the optimization algorithm at the j-th iteration. This procedure is repeated until all the PII have been included. Note that this sequential estimation process is directly adapted from the one proposed by [22].
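The sketch below illustrates this sequential inclusion with warm-started hyperparameters, assuming scikit-learn's Gp implementation (constant kernel for the trend scale, anisotropic Matérn covariance, homoscedastic nugget through a white kernel). It is only a simplified sketch of the idea, not the paper's actual algorithm; the function and argument names are illustrative.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import ConstantKernel, Matern, WhiteKernel

def sequential_gp(X, y, pii_order):
    """Add the ranked PII one at a time; warm-start each fit with the
    length scales estimated at the previous iteration."""
    length_scales = np.array([])
    gp = None
    for j in range(1, len(pii_order) + 1):
        cols = pii_order[:j]
        length_scales = np.append(length_scales, 1.0)       # initial value for the new input
        kernel = (ConstantKernel() * Matern(length_scale=length_scales, nu=2.5)
                  + WhiteKernel())                           # homoscedastic nugget
        gp = GaussianProcessRegressor(kernel=kernel, n_restarts_optimizer=2,
                                      random_state=0).fit(X[:, cols], y)
        # The fitted Matérn length scales become the starting point of the next step.
        length_scales = np.atleast_1d(gp.kernel_.k1.k2.length_scale).copy()
    return gp
```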

V.B. Joint Gp metamodeling

We propose to use joint Gp metamodeling to handle the group of non-PII inputs and capture their residual effect. In the framework of stochastic computer codes, [28] proposed to model the mean and dispersion of the code output by two interlinked Generalized Linear Models (GLM), called a "joint GLM". This approach has been extended by [27] to several nonparametric models, and the best results on several tests were obtained with two interlinked Gp models, called a "joint Gp". In this case, the stochastic input is considered as an uncontrollable parameter denoted Xε (i.e. governed by a seed variable).

We extend this approach to a group of non-explanatory variables. More precisely, the input variables X = (X1, ..., Xd) are divided into two subgroups: the explanatory ones, denoted Xexp, and the others, denoted Xε. The output is thus defined by Y = g(Xexp, Xε) and the metamodeling process now focuses on fitting the random variable Y|Xexp(d). Under this hypothesis, the joint metamodeling approach consists in building two metamodels, one for the mean Ym and another for the dispersion component Yd:

Ym(Xexp) = E(Y | Xexp)    (2)

Yd(Xexp) = Var(Y | Xexp) = E[(Y − Ym(Xexp))² | Xexp]    (3)

where E[Z] is the usual notation for the expected (i.e. mean) value of a random variable Z. (d) Y|Xexp (i.e. Y knowing Xexp) is a random variable, as its value depends on the uncontrollable random variable Xε.

To fit these mean and dispersion components, we use the methodology proposed by [27]. In summary, it consists of the following steps. First, an initial Gp, denoted Gpm,1, is estimated for the mean component with a homoscedastic nugget effect(e). A nugget effect is required to relax the interpolation property of the Gp metamodel. Indeed, this property, which would yield zero residuals for the whole learning sample, is no longer desirable since the output Y|Xexp is stochastic. Then, a second Gp, denoted Gpv,1, is built for the dispersion component, here also with a homoscedastic nugget effect. Gpv,1 is fitted on the squared residuals from the predictor of Gpm,1, and its predictor is considered as an estimator of the dispersion component. The predictor of Gpv,1 provides an estimation of the dispersion at each point, which is taken as the value of the heteroscedastic nugget effect. The homoscedastic hypothesis is then removed and a new Gp, denoted Gpm,2, is fitted on the data with the estimated heteroscedastic nugget. This nugget, as a function of Xexp, accounts for a potential interaction between Xexp and the uncontrollable parameter Xε. Finally, the Gp on the dispersion component is updated from Gpm,2 following the same methodology as for Gpv,1. (e) Borrowed from geostatistics, where it refers to an unexpected nugget of gold found in a mining process, a constant nugget effect assumes an additive white noise effect whose variance constitutes the nugget parameter. Most often, this variance is assumed to be constant, independent of the inputs (here Xexp), and the nugget effect is called homoscedastic. When this variance depends on the value of the inputs (i.e. is a function of Xexp), the nugget effect is called heteroscedastic.

Remark V.1 Note that some parametric choices are made for all the Gp metamodels: a constant trend and a Matérn stationary anisotropic covariance are chosen. All the hyperparameters (covariance parameters) and the nugget effect (when the homoscedastic hypothesis is made) are estimated by a maximum likelihood optimization process.
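A minimal sketch of this mean/dispersion construction is given below, again assuming scikit-learn: the heteroscedastic nugget predicted by the dispersion Gp is passed to the mean Gp through its per-point noise term (the alpha argument). The real procedure of [27] involves more care in how the residuals and the nugget are estimated; this is only meant to make the scheme concrete.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import ConstantKernel, Matern, WhiteKernel

def fit_joint_gp(X_pii, y):
    """Two interlinked Gps on the PII: mean component and dispersion component."""
    ker = ConstantKernel() * Matern(length_scale=np.ones(X_pii.shape[1]), nu=2.5) + WhiteKernel()

    # Gp_m1: mean component with a homoscedastic nugget.
    gp_m1 = GaussianProcessRegressor(kernel=ker, random_state=0).fit(X_pii, y)

    # Gp_v1: dispersion component, fitted on the squared residuals of Gp_m1.
    gp_v1 = GaussianProcessRegressor(kernel=ker, random_state=0).fit(
        X_pii, (y - gp_m1.predict(X_pii)) ** 2)

    # Gp_m2: mean component refitted with the predicted heteroscedastic nugget.
    nugget = np.maximum(gp_v1.predict(X_pii), 1e-8)
    ker_m2 = ConstantKernel() * Matern(length_scale=np.ones(X_pii.shape[1]), nu=2.5)
    gp_m2 = GaussianProcessRegressor(kernel=ker_m2, alpha=nugget,
                                     random_state=0).fit(X_pii, y)

    # Gp_v2: dispersion component updated from the residuals of Gp_m2.
    gp_v2 = GaussianProcessRegressor(kernel=ker, random_state=0).fit(
        X_pii, (y - gp_m2.predict(X_pii)) ** 2)
    return gp_m2, gp_v2
```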

V.C. Assessment of metamodel accuracy

To evaluate the accuracy of a metamodel, we use the predictivity coefficient Q²:

Q² = 1 − [ Σ_i (y(i) − ŷ(i))² ] / [ Σ_i (y(i) − ȳ)² ]    (4)

where (x(i)) is a test sample, (y(i)) are the corresponding observed outputs, (ŷ(i)) are the metamodel predictions and ȳ is the mean of the observed outputs. Q² corresponds to the coefficient of determination in prediction and can be computed on a test sample independent of the learning sample or by cross-validation on the learning sample. The closer Q² is to one, the better the accuracy of the metamodel. In the framework of joint Gp modeling, the Q² criterion is used to assess the accuracy of the mean part of the joint Gp, namely Gpm,•, whether Gpm,• is a homoscedastic (Gpm,1) or a heteroscedastic Gp (Gpm,2). This quantitative information can be supplemented with a plot of predicted against observed values (ŷ(i) with respect to y(i)) or a quantile-quantile plot.
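For concreteness, a direct computation of equation (4) on a test sample can be written in a few lines (it coincides with the usual coefficient of determination in prediction); the names in the usage comment are illustrative.

```python
import numpy as np

def q2(y_obs, y_pred):
    """Predictivity coefficient Q2 of equation (4): 1 - SS_residual / SS_total."""
    y_obs, y_pred = np.asarray(y_obs, float), np.asarray(y_pred, float)
    return 1.0 - np.sum((y_obs - y_pred) ** 2) / np.sum((y_obs - y_obs.mean()) ** 2)

# e.g. q2(y_test, gp_m2.predict(X_test[:, pii]))
```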

To evaluate the quality of the dispersion part of a joint metamodel (denoted Gpv,•), we use the graphical tool introduced in [27] to assess the accuracy of the confidence intervals predicted by a Gp metamodel. For a given Gp metamodel, it consists in evaluating the proportion of observations that lie within the theoretical α-confidence intervals built with the mean squared error of the Gp predictions (the whole Gp structure is used, not only the conditional mean). These proportions (i.e. the observed confidence levels) can then be visualized against the theoretical confidence levels α, for different values of α.

V.D. Application on IBLOCA test case

The joint Gp metamodel is built from the learning sample of n = 500 simulations: the eleven PII identified at the end of the screening step are considered as the explanatory variables, while the sixteen others are treated as the uncontrollable parameter. The Gps on the mean and dispersion components are built using the sequential building process described in Section V.A, where the PII, ordered by decreasing R²HSIC, are successively included in the Gp. The Q² coefficient of the mean component Gpm is computed by cross-validation at each iteration of the sequential building process. The results, given in Table III, show an increasing predictivity until it stabilizes around 0.87, which illustrates the robustness of the Gp building process. The first four PII make the major contribution, yielding a Q² around 0.8; the four following ones yield minor improvements (an increase of 0.02 on average for each input), while the last three PII do not improve the Gp predictivity. Note that, in this application, these results remain unchanged whether we consider the homoscedastic Gp (Gpm,1) or the heteroscedastic Gp (Gpm,2), the heteroscedastic nugget effect not significantly improving the Gp predictor for the mean component. Thus, only 13% of the output variability remains unexplained by Gpm; this includes both the inaccuracy of Gpm (the part of Ym not fitted by the Gp) and the total effect of the uncontrollable parameter, i.e. the group of non-selected inputs.

For this exercise, 600 additional CATHARE2 simulations, distinct from the learning sample, are also available. As they are not used to build the Gp metamodel, this set of simulations constitutes a test sample. The Q² computed on this test sample is Q² = 0.90 for both Gpm,1 and Gpm,2, which is consistent with the cross-validation estimates.

TABLE III: Evolution of the Gpm metamodel predictivity during the sequential building process, for each new additional PII.

PII | X10  | X2   | X12  | X22  | X15  | X13  | X9   | X5   | X14  | X26  | X27
Q²  | 0.60 | 0.64 | 0.70 | 0.79 | 0.81 | 0.83 | 0.85 | 0.85 | 0.87 | 0.87 | 0.87

Now, to assess the quality of the dispersion part of the joint Gp, the predicted confidence intervals are compared with the theoretical ones (cf. Section V.C). Figure 6 gives the results obtained by Gpm,1 and Gpm,2 on the learning sample (by cross-validation) and on the test sample, since one is available here. It clearly illustrates both the interest of considering a heteroscedastic nugget effect and the efficiency of using a joint Gp model to fit and predict this nugget. It can be seen that the joint Gp yields the most accurate confidence intervals in prediction, especially for the test sample. Indeed, all its points are close to the theoretical y = x line (a deviation is only observed on the learning sample for the highest α), while the homoscedastic Gp tends to give too wide confidence intervals. Thus, in this case, the heteroscedasticity hypothesis is justified and, consequently, the proposed joint Gp model is clearly more competitive than the simple one.

Fig. 6. Proportion of observations that lie within the α-confidence interval predicted by the Gp, according to the theoretical α. Top: results for homoscedastic Gp (Gpm,1) on the learning sample (left) and on the test sample (right). Bottom: results for heteroscedastic Gp (Gpm,2) on the learning sample (left) and on the test sample (right).

VI. STEP 4A: VARIANCE-BASED SENSITIVITY ANALYSIS

Sensitivity analysis methods make it possible to answer the question "How do the variations of the input parameters contribute, qualitatively or quantitatively, to the variation of the output?" [43]. These tools can detect non-significant input parameters in a screening context, determine the most significant ones, measure their respective contributions to the output, or identify an interaction between several inputs which strongly impacts the model output. From this, engineers can guide the characterization of the model and reduce the output uncertainty: for instance, they can calibrate the most influential inputs and fix the non-influential ones to nominal values. Many surveys on sensitivity analysis exist in the literature, such as [44], [45] or [46]. Sensitivity analysis can be divided into two sub-domains: local sensitivity analysis and global sensitivity analysis. The first studies the effects of small input perturbations around nominal values on the model output [47], while the second considers the impact of the input uncertainty on the output over the whole variation domain of the uncertain inputs [43]. We focus here on one of the most widely used global sensitivity indices, namely the Sobol' indices, which are based on the decomposition of the output variance.

A classical approach in global sensitivity analysis is to compute the first-order and total Sobol' indices, which are based on the output variance decomposition [48, 49]; see Appendix B for mathematical details. Sobol' indices are widely used in global sensitivity analysis because they are easy to interpret and directly usable in a dimension reduction approach. However, their estimation (based on Monte Carlo methods for example) requires a large number of model evaluations, which is intractable for CPU-time expensive computer codes. A classical alternative consists in using a metamodel to compute these indices. Note that a connection can be made between the estimation error of the Sobol' indices when using a metamodel and the predictivity coefficient Q² of the metamodel. Indeed, when Q² is estimated on a probabilized sample of the inputs (in other words, when it is computed under the probability distribution of the inputs), it provides an estimation of the part of the variance unexplained by the metamodel. This should be kept in mind when interpreting Sobol' indices estimated with a metamodel.

VI.A. Sobol’ indices with a joint Gp metamodel

In the case where a joint Gp metamodel is used to take into account an uncontrollable input Xε, [50] and [27] have shown how to deduce Sobol' indices from this joint metamodel; see Appendix C for mathematical details.

Therefore, from a joint Gp, it is only possible to estimate the Sobol' indices of any subset of Xexp (equation (12)) and the total Sobol' index of Xε (equation (13)). The latter is interpreted as the total sensitivity index of the uncontrollable process. The individual index of Xε and any interaction index involving Xε are not directly reachable from the joint Gp; their contributions within the total index SεT cannot be distinguished. This constitutes a limitation of the approach. However, the potential interactions between Xε and the inputs of Xexp can be pointed out by considering all the first-order and total effects of all the other parameters. The sensitivity analysis of Yd can also be a relevant indicator: if a subset Xu of Xexp is not influential on Yd, we can deduce that Suε is equal to zero. Note that, in practice, Var(Y), which appears in both equations (12) and (13), can be estimated directly from the learning sample (empirical estimator) or from the fitted joint Gp, using equation (9).
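As an illustration of how such indices can be estimated once a cheap predictor is available, the sketch below implements a standard pick-and-freeze Monte Carlo scheme (a Saltelli-type estimator for first-order indices and a Jansen-type estimator for total indices) using the mean component of the metamodel. The paper's own estimators and the joint-Gp corrections of Appendix C may differ in detail; the function names here are illustrative.

```python
import numpy as np

def sobol_pick_and_freeze(predict, sample_inputs, n=10_000):
    """First-order and total Sobol' indices using a metamodel predictor.

    predict(X)       : vectorized predictor (e.g. the mean-component Gp), X of shape (n, d)
    sample_inputs(n) : draws n points from the joint input distribution, shape (n, d)
    """
    A, B = sample_inputs(n), sample_inputs(n)
    yA, yB = predict(A), predict(B)
    var_y = np.var(np.concatenate([yA, yB]))
    d = A.shape[1]
    first, total = np.empty(d), np.empty(d)
    for k in range(d):
        ABk = A.copy()
        ABk[:, k] = B[:, k]                                  # freeze all inputs but the k-th
        yABk = predict(ABk)
        first[k] = np.mean(yB * (yABk - yA)) / var_y         # Saltelli-type estimator
        total[k] = 0.5 * np.mean((yA - yABk) ** 2) / var_y   # Jansen-type estimator
    return first, total
```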

VI.B. Results on IBLOCA test case

From the joint Gp built in Section V.D, the Sobol' indices of the PII are estimated from Gpm using equation (12), Var(Y) being estimated with Gpm and Gpd using equation (9). For this, intensive Monte Carlo methods are used (see e.g. the pick-and-freeze estimator of [51]): tens of thousands of Gp simulations are performed. Recall that Gp predictions are very inexpensive (a few seconds for several thousand simulations), especially with respect to the thermal-hydraulic simulator. The obtained first-order Sobol' indices of the PII are given in Table IV and represent 85% of the total variance of the output. The qualitative results of the HSIC indices are confirmed and refined: X10 remains the major influential input with 59% of explained variance, followed to a lesser extent by X12 and X22, each explaining 8% of the variance. The partial total Sobol' indices involving only the PII and derived from Gpm show that an additional 4% of the variance is due to interactions between X10, X12 and X22. The related second-order Sobol' indices are estimated and the significant ones are given in Table IV. The other PII have a negligible influence. In short, the set of PII explains a total of 89% of the output variance, of which 79% is due to X10, X12 and X22 alone.

TABLE IV: First and second Sobol’ indices of PII (in %), estimated with Gpm of the joint Gp metamodel.

PII                       | X10 | X2 | X12 | X22 | X15 | X13 | X9 | X5 | X14 | X26 | X27
1st-order Sobol' indices  | 59  | 3  | 8   | 8   | 2   | 1   | 2  | 0  | 2   | 0   | 0

Interaction between PII   | X10×X22 | X10×X12
2nd-order Sobol' indices  | 3       | 1

For the physical interpretation, these results confirm those of Section IV, with a rigorous quantification of the inputs' importance: the interfacial friction coefficient in the horizontal part of the hot legs (X10) is the main contributor to the uncertainty of the PCT. Moreover, some results had not been revealed by the HSIC-based screening analysis of Table II. The Sobol' indices now clearly indicate that the interfacial friction coefficient in the SG inlet plena (X12) and the wall-to-liquid friction (in under-saturated break flow conditions) in the break line (X22) are more influential than the minimum stable film temperature in the core (X2). X22 has a significant influence on the PCT because its low values lead to higher break flow rates, resulting in a larger loss of the primary water mass inventory at the break, and thus a more significant core uncovering (hence a higher PCT). For X12, higher values lead to a greater supply (by the vapor) of the liquid possibly stored in the water plena to the rest of the primary loops (hence a lower PCT). Table IV also shows some small interaction effects (possibly antagonistic) between X10 and X12, as well as between X10 and X22. Let us remark that deepening this question (which is outside the scope of this paper) would be possible by plotting, from the Gp metamodel, the conditional expectations of the PCT as a function of the interacting variables.

Now, from Gpd and using equation (13), the total effect of the group of sixteen non-PII inputs (i.e. Xε) is estimated at 9.7%. This includes its effect alone and in interaction with the PII. To further investigate these interactions, the Sobol' indices of Yd are estimated and HSIC-based statistical dependence tests are applied to Yd. They reveal that only X10, X14, X2, X22 and X4 potentially interact with the group of non-PII inputs. At this stage of our analysis, this result cannot be physically interpreted.

VII. STEP 4B: QUANTILE ESTIMATION

As already said in the introduction, once a predictive Gp metamodel has been built, it can be used to perform uncertainty propagation and in particular, estimate probabilities or, as here, quantiles.

VII.A. Gp-based quantile estimation

The most trivial and intuitive approach for estimating a quantile with a Gp metamodel is to apply the quantile definition to the predictor of the metamodel. This direct approach yields a so-called plug-in estimator. More precisely, with a Gp metamodel, this approach consists in using only the predictor of the Gp metamodel (i.e. the expectation conditional on the learning sample) in order to estimate the quantile. As the Gp predictor is a deterministic function of the inputs, this provides a deterministic estimate of the quantile, but no confidence interval is available. Moreover, for high (resp. low) quantiles, this method tends to substantially underestimate (resp. overestimate) the true quantile, because the metamodel predictor is usually constructed by smoothing the computer model output values (see an analysis of this phenomenon in [14]).

To overcome this problem, [52] proposed to take advantage of the probabilistic nature of the Gp metamodel by using its entire structure: not only the mean of the conditional Gp metamodel but also its covariance structure is taken into account. In this full-Gp approach, also called the modular Bayesian approach, the quantile definition is applied to the whole Gp metamodel and leads to a random variable. The expectation of this random variable can then be taken as a quantile estimator. Its variance and, more generally, its whole distribution can then be used as an indicator of the accuracy of the quantile estimate. Confidence intervals can be deduced, which constitutes a great advantage of this full-Gp approach. Moreover, the efficiency of this approach for high quantiles (of the order of 95%) has been illustrated by [53].

In practice, the estimation of a quantile with the full-Gp approach is based on stochastic simulations (conditional on the learning sample) of the Gp metamodel, using the standard method of conditional simulations [54]. We simply recall that, to generate conditional Gaussian simulations on a given domain, a discretization of the domain is first considered; the vector of conditional expectations and the matrix of conditional covariances at the discretized points are then required. Note that [55] uses this approach with Gp conditional simulations for the estimation of Sobol' indices and their associated confidence intervals.

In this paper, from our joint Gp model, we will compare the full-Gp approach applied to either the homoscedastic Gp or the heteroscedastic Gp and will proceed as follows:

  • For the homoscedastic Gp, the standard technique of conditional simulations is applied to Gpm,1 (built to estimate Ym, with a constant nugget effect).

  • For the heteroscedastic Gp, we propose a new technique to simulate the conditional Gp trajectories:

    1. The heteroscedastic Gp built for Ym (namely Gpm,2) provides the conditional expectation vector and a preliminary conditional covariance matrix.

    2. The Gp built for Yd (namely Gpd,2) is predicted and provides the heteroscedastic nugget effect which is added to the diagonal of the conditional covariance matrix of the previous step.

    3. Conditional simulations are then done using the standard method [54].
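A minimal sketch of this full-Gp quantile estimation is given below, assuming scikit-learn-style predictors (a mean-component Gp exposing predict with return_cov=True and a dispersion Gp predicting the heteroscedastic nugget); the function and variable names are illustrative, and the discretization is kept modest because the conditional covariance matrix is dense. For the homoscedastic variant, the nugget-addition line is simply omitted.

```python
import numpy as np

def full_gp_quantile(gp_mean, gp_disp, sample_inputs, level=0.95,
                     n_points=2000, n_traj=200, seed=0):
    """Quantile estimate and 90% CI from conditional simulations of the joint Gp."""
    rng = np.random.default_rng(seed)
    X = sample_inputs(n_points)                                # Monte Carlo input sample
    mean, cov = gp_mean.predict(X, return_cov=True)            # conditional mean and covariance
    cov = cov + np.diag(np.maximum(gp_disp.predict(X), 0.0))   # add the heteroscedastic nugget
    cov = cov + 1e-8 * np.eye(n_points)                        # jitter for numerical stability
    traj = rng.multivariate_normal(mean, cov, size=n_traj)     # conditional trajectories
    q = np.quantile(traj, level, axis=1)                       # one quantile per trajectory
    return q.mean(), np.quantile(q, [0.05, 0.95])              # estimator and 90% bounds
```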

VII.B. Results on IBLOCA test case

In this section, we focus on the estimation of the 95%-quantile of the PCT for the IBLOCA application. From the learning sample of size n = 500 and the joint Gp, we compare here the following approaches to estimate the PCT quantile:

  • The classical empirical quantile estimator, denoted ^q95_emp. A bootstrap method (see for example [56]) makes it possible to obtain in addition a 90%-confidence interval for this empirical quantile.

  • The plug-in estimators from the homoscedastic or the heteroscedastic Gp, denoted ^q95_Gpm,1-pi and ^q95_Gpm,2-pi. No confidence intervals are obtained using this estimation method.

  • The full-Gp estimators from the homoscedastic or the heteroscedastic Gp, denoted ^q95_Gpm,1-full and ^q95_Gpm,2-full respectively. As explained in the previous section, confidence intervals can be deduced with this full-Gp approach.

Table V synthesizes the results obtained for the PCT 95%-quantile with the different approaches listed above. In addition, 90%-confidence intervals are given when they are available. As explained in Section V.D, we also have 600 other CATHARE2 simulations, for a total of 1100 PCT simulations (learning sample plus test sample). A reference value of the 95%-quantile is obtained from this full sample with the classical empirical quantile estimator: ^q95_ref = 742.28 °C.
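For reference, the empirical estimator and its bootstrap confidence interval used as a baseline here can be written in a few lines (a sketch; the exact bootstrap settings of the study are not specified).

```python
import numpy as np

def empirical_quantile_ci(y, level=0.95, conf=0.90, n_boot=2000, seed=0):
    """Empirical quantile with a percentile-bootstrap confidence interval."""
    rng = np.random.default_rng(seed)
    q_hat = np.quantile(y, level)
    boot = np.quantile(rng.choice(y, size=(n_boot, len(y)), replace=True), level, axis=1)
    lo, hi = np.quantile(boot, [(1 - conf) / 2, (1 + conf) / 2])
    return q_hat, (lo, hi)
```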

The empirical estimator based on the learning sample is imprecise but its bootstrap-based confidence interval is consistent with the reference value. As expected, the plug-in approaches provide poor estimations of the quantile, which is here strongly underestimated. The full-Gp approach based on the homoscedastic assumption overestimates the quantile; this is consistent with the analysis of Gp confidence intervals in Figure 6 (too large confidence intervals provided by homoscedastic Gp). Finally, the best result is obtained with the conditional simulation method based on the heteroscedastic Gp metamodel, built with the joint Gp method. This approach yields a more accurate prediction than the usual homoscedastic Gp and outperforms the empirical estimator in terms of confidence interval. Once again, the heteroscedasticity hypothesis is clearly relevant in this case.

TABLE V: Results for the 95%-quantile estimates of the PCT (in °C) with its 90%-confidence interval (CI) when available.

       | Reference(f) | Empirical(g)    | Plug-in(g,h) (homo-Gp) | Plug-in(g,h) (hetero-Gp) | Full-Gp(g,h) (homo-Gp) | Full-Gp(g,h) (hetero-Gp)
Mean   | 742.28       | 746.80          | 736.26                 | 735.83                   | 747.11                 | 741.46
90%-CI | n/a          | [736.7, 747.41] | n/a                    | n/a                      | [742.93, 751.32]       | [738.76, 744.17]

(f) The reference method uses the full sample of size n = 1100. (g) Empirical and Gp-based methods are applied from the learning sample of size n = 500. (h) The plug-in and full-Gp estimators are based on either the homoscedastic or heteroscedastic Gp metamodel.

VIII. CONCLUSION

In the framework of the estimation of safety margins in nuclear accident analysis, it is essential to quantitatively assess the uncertainties tainting the results of best-estimate codes. In this context, this paper has focused on an advanced statistical methodology for Best Estimate Plus Uncertainty (BEPU) analysis, illustrated on a high-dimensional thermal-hydraulic test case simulating an accidental scenario in a Pressurized Water Reactor (IBLOCA test case). Statistical analyses such as the estimation of high quantiles or quantitative sensitivity analysis (e.g., the estimation of Sobol' indices based on variance decomposition) may in practice call for several thousand code simulations. Complex computer codes, such as those used in thermal-hydraulic accident scenario simulations, are often too CPU-time expensive to be used directly for these studies.

To cope with this limitation, we propose a methodology mainly based on a predictive joint Gp metamodel, built with an efficient sequential algorithm. First, an initial screening step based on advanced dependence measures and associated statistical tests enables the identification of a group of significant inputs, allowing a reduction of the dimension. The optimization effort when fitting the metamodel can then be concentrated on the main influential inputs, which increases the robustness of the metamodeling. Moreover, thanks to the joint metamodel approach, the non-selected inputs are not completely removed: the residual uncertainty due to the dimension reduction is integrated into the metamodel, and the global influence of the non-selected inputs is thus controlled. From this joint Gp metamodel, accurate uncertainty propagation and quantitative sensitivity analysis, not feasible with the numerical model because of its computational cost, become accessible. Thus, the uncertainties of the model inputs are propagated through the joint Gp to estimate Sobol' indices, failure probabilities and/or quantiles, without additional code simulations.

Thus, on the IBLOCA application, a predictive Gp metamodel is built with only a few hundred code simulations (500 code simulations for 27 uncertain inputs). From this joint Gp, a quantitative sensitivity analysis based on variance decomposition is performed without any additional code simulation: the Sobol' indices are computed and reveal that the output is mainly explained by four uncertain inputs. One of them, namely the interfacial friction coefficient in the hot legs, is strongly influential, explaining around 60% of the output variance, the three others being of minor influence. The much weaker influence of all the other inputs is also confirmed. Note that a direct and accurate computation of the Sobol' indices with the thermal-hydraulic code would have required tens of thousands of code simulations.

The physical interpretation of the results obtained with the screening step and the variance-based sensitivity analysis step is known to be useful for modelers and engineers. This study has demonstrated it once again, in the particular case of an IBLOCA safety analysis, by revealing some physical effects of the influential inputs on the PCT which cannot be understood without a global statistical approach (e.g. the threshold effect due to the interfacial friction coefficient in the horizontal part of the hot legs). [57] has also underlined the importance of sensitivity analysis in a validation methodology, in order to identify the relevant physical model uncertainties on which safety engineers must focus their efforts. Counter-intuitive effects are also present during an IBLOCA transient, and only a global understanding of the physical phenomena can help. As a perspective of the present work, extending the screening and sensitivity analysis tools to a joint analysis of several relevant output variables of interest (such as the time of PCT, the primary pressure and the core differential pressure) would be essential.

In the IBLOCA test case, we are particularly interested in the estimation of the 95%-quantile of the model output temperature. In nuclear safety, as in other engineering domains, methods for the conservative computation of quantiles [6, 58] have been extensively studied. We have shown in the present work how to use and simulate the joint Gp metamodel to reach the same objective: the uncertainties of the influential inputs are directly and accurately propagated through the mean component of the joint metamodel, while a confidence bound is derived from the dispersion component in order to take into account the residual uncertainty of the other inputs. Results on the IBLOCA test case show that the joint Gp provides a more accurate estimation of the 95%-quantile than the empirical approach, at equal calculation budget. Besides, this estimation is very close to the reference value obtained with 600 additional code simulations. Furthermore, the interest of the heteroscedastic approach in the joint Gp is also emphasized: the estimated quantile and the associated confidence interval are much better than those of the homoscedastic approach.

As a future application of modern statistical methods to IBLOCA safety issues, one should mention the use of Gp metamodels to identify penalizing thermal-hydraulic transients with respect to some particular scenario inputs. As in the present work, strong difficulties are raised by the CPU-time cost of the computer code and the large number of uncertain (and uncontrollable) inputs.

(The acknowledgments, references, and appendices of the original paper are omitted here.)



An Historical Perspective on Boeing’s Influence on Dynamic Structural

The Finite Element Method, Craig-Bampton Reduction, and the Lanczos eigenvalue extraction method formed a foundation.

Of these three methods, Craig-Bampton Reduction and the Lanczos eigenvalue extraction method are far less widely known.

Craig-Bampton Reduction is used to reduce the number of degrees of freedom (DOF) in a finite element model while preserving the essential features of the structure's dynamic behavior. It was first systematically presented by Roy R. Craig Jr. and M. C. C. Bampton in their classic 1968 paper "Coupling of Substructures for Dynamic Analyses." At the time, modal analysis of large structures was computationally very demanding, so complex structures had to be decomposed into smaller substructures that were analyzed separately and then reassembled.
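To make the idea concrete, a small dense-matrix sketch of the Craig-Bampton transformation is given below (boundary DOFs kept physically, interior DOFs condensed onto a handful of fixed-interface modes). It is only an illustrative sketch using NumPy/SciPy; production implementations of course operate on large sparse substructure matrices.

```python
import numpy as np
from scipy.linalg import eigh

def craig_bampton(K, M, boundary_dofs, n_modes):
    """Craig-Bampton reduction of (K, M): keep the boundary DOFs plus
    n_modes fixed-interface normal modes of the interior."""
    n = K.shape[0]
    b = np.asarray(boundary_dofs)
    i = np.setdiff1d(np.arange(n), b)                 # interior DOFs
    Kii, Kib = K[np.ix_(i, i)], K[np.ix_(i, b)]
    Mii = M[np.ix_(i, i)]

    # Static constraint modes: interior response to unit boundary displacements.
    Phi_c = -np.linalg.solve(Kii, Kib)

    # Fixed-interface normal modes (boundary clamped); keep the lowest n_modes.
    _, Phi_n = eigh(Kii, Mii)
    Phi_n = Phi_n[:, :n_modes]

    # Transformation u = T @ [u_b, q]: physical boundary DOFs plus modal coordinates.
    nb = len(b)
    T = np.zeros((n, nb + n_modes))
    T[np.ix_(b, np.arange(nb))] = np.eye(nb)
    T[np.ix_(i, np.arange(nb))] = Phi_c
    T[np.ix_(i, np.arange(nb, nb + n_modes))] = Phi_n

    return T.T @ K @ T, T.T @ M @ T, T                # reduced stiffness, mass, and map
```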

The Lanczos eigenvalue extraction method is an iterative algorithm for extracting the lowest eigenvalues and corresponding eigenvectors of large sparse matrices; it is particularly well suited to modal extraction in structural vibration analysis. Introduced in 1950 by Cornelius Lanczos, a Hungarian-American mathematician, the method projects the original sparse matrix onto a low-dimensional Krylov subspace and solves the eigenvalue problem there, which is far more efficient than solving the original large matrix directly when only the low-frequency modes (for example, the first few) are of interest.
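In practice, Lanczos-type eigensolvers are available off the shelf. For instance, SciPy's ARPACK wrapper (an implicitly restarted Lanczos iteration for symmetric problems) can extract the lowest structural modes of assembled sparse stiffness and mass matrices, as sketched below; the helper name and the assumption that K has been constrained (non-singular) are this sketch's own.

```python
import numpy as np
from scipy.sparse.linalg import eigsh

def lowest_modes(K, M, n_modes=10):
    """Lowest eigenpairs of K phi = lambda M phi via shift-invert Lanczos (ARPACK).
    Assumes K is a constrained (non-singular) sparse stiffness matrix."""
    w2, phi = eigsh(K, k=n_modes, M=M, sigma=0.0, which="LM")   # eigenvalues nearest 0
    order = np.argsort(w2)
    freqs_hz = np.sqrt(np.abs(w2[order])) / (2.0 * np.pi)       # natural frequencies in Hz
    return freqs_hz, phi[:, order]
```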

Today’s automobiles have superb handling and are extremely quiet compared to vehicles 30 years ago. The comfortable environment of a modern automobile is largely due to the industry’s focused efforts on Noise, Vibration, and Harshness (NVH) analysis as a means to improve their product and sales. At the heart of each NVH analysis is a multi-million DOF finite element vibro-acoustic model. The low to mid frequency acoustic and structural responses require that thousands of frequencies and mode shapes be calculated.

We look into the role Boeing played in the inspiration, development, and deployment of these numerical solution methods and summarize how these methods are used both within Boeing and outside of Boeing in the development of multitudes of products.

Today within Boeing, the finite element method is pervasive, with several thousand engineers utilizing FEA on a regular basis.

It turns out that it was Boeing’s need and desire to improve flutter prediction that led to Boeing’s leading role in the development of the finite element method.

Who invented finite elements? In the publication “The Origins of the Finite Element Method” [1], Carlos Felippa states:

“Not just one individual, as this historical sketch will make clear. But if the question is tweaked to: who created the FEM in everyday use? there is no question in the writer’s mind: M. J. (Jon) Turner at Boeing over the period 1950–1962. He generalized and perfected the Direct Stiffness Method, and forcefully got Boeing to commit resources to it while other aerospace companies were mired in the Force Method. During 1952–53 he oversaw the development of the first continuum based finite elements.”

Figure 1: Jon Turner

Jon Turner was the supervisor of the Structural Dynamics Unit at Boeing in Seattle. In the early 1950s, with the growing popularity of jet aircraft and the demand for high-performance military aircraft, delta wing structures presented new modeling and analysis problems. Existing unidirectional models (that is, beam models) did not provide sufficient accuracy. Instead, two-dimensional panel elements of arbitrary geometry were needed.

At this time, Boeing had a summer faculty program whereby faculty members from universities were invited to work at Boeing over the summer. In the summers of 1952-53, Jon Turner invited Ray Clough from the University of California at Berkeley and Harold Martin from the University of Washington to work for him on a method to calculate the vibration properties of the low-aspect-ratio box beam. This collaboration resulted in the seminal paper by Turner, Clough, Martin and Topp in 1956 [2], which summarized a procedure called the Direct Stiffness Method (DSM) and derived a constant-strain triangular element along with a rectangular membrane element. (Topp was a structures engineer at the Boeing Airplane Company, Wichita Division.)

It is apropos to hear this story in the words of Clough. The following passage is from a speech by Clough, transcribed and published in 2004 [3]:

“When I applied for the Boeing Summer Faculty job in June 1952, I was assigned to the Structural Dynamics Unit under the supervision of Mr. M. J. Turner. He was a very competent engineer with a background in applied mathematics, and several years of experience with Boeing. The job that Jon Turner had for me was the analysis of the vibration properties of a fairly large model of a ‘delta’ wing structure that had been fabricated in the Boeing shop. This problem was quite different from the analysis of a typical wing structure which could be done using standard beam theory, and I spent the summer of 1952 trying to formulate a mathematical model of the delta wing representing it as an assemblage of typical 1D beam components. The results I was able to obtain by the end of the summer were very disappointing, and I was quite discouraged when I went to say goodbye to my boss, Jon Turner. But he suggested that I come back in Summer 1953. In this new effort to evaluate the vibration properties of a delta wing model, he suggested I should formulate the mathematical model as an assemblage of 2D plate elements interconnected at their corners. With this suggestion, Jon had essentially defined the concept of the finite element method.

“So I began my work in summer 1953 developing in-plane stiffness matrices for 2D plates with corner connections. I derived these both for rectangular and for triangular plates, but the assembly of triangular plates had great advantages in modeling a delta wing. Moreover, the derivation of the in-plane stiffness of a triangular plate was far simpler than that for a rectangular plate, so very soon I shifted the emphasis of my work to the study of assemblages of triangular plate ‘elements’, as I called them. With an assemblage of such triangular elements, I was able to get rather good agreement between the results of a mathematical model vibration analysis and those measured with the physical model in the laboratory. Of special interest was the fact that the calculated results converged toward those of the physical model as the mesh of the triangular elements in the mathematical model was refined.”

While Jon Turner’s application of DSM was vibration calculations to facilitate flutter and dynamic analysis, Ray Clough realized that DSM could be applied to stress analysis as well. In 1960, Clough penned the famous paper titled “The Finite Element Method in Plane Stress Analysis,” which both adapted the DSM for stress analysis and coined the phrase “finite element” [4].

Besides the work done by those directly affiliated with Boeing, many others contributed to the development and popularization of today’s modern finite element method. In particular, J.H. Argyris, O.C. Zienkiewicz, and E.L. Wilson should be credited with their huge contributions in developing and broadening the scope of the finite element method beyond aerospace applications. References 1, 5 and 6 provide comprehensive historical background on the development and evolution of the finite element method. References 2, 4 and 17 can be considered seminal papers that laid out the foundation of the modern finite element method.

Of significance is that Argyris was a consultant to Boeing [1] in the early 1950s and continued to collaborate with Boeing well into the 1960s [17]. Both Turner and Topp were Boeing engineers, and Clough and Martin were affiliated with Boeing via the summer faculty program. Therefore, it is evident that Boeing both inspired, and was directly involved in, the research and development that led directly to today’s modern finite element method.

Dr. Rodney Dreisbach (Boeing STF, retired 2015) nicely summarized Jon Turner’s significance in the FEA development and deployment within Boeing’s ATLAS program. He wrote about this in the BCA Structures Core “Life@Structures Blog” on November 14, 2013. His closing paragraph reads:

“In guiding the not-so-obvious steps leading up to the creation of the FEA Method, Jon Turner has been recognized as a scientist, an engineer, a mathematician, and an innovator. Furthermore, he was a visionary as exemplified by his continued leadership in addressing more advanced flight vehicles such as advanced composite structures for a Mach 2.7 supersonic cruise arrow-wing configuration in 1976, and his continued support and advocacy of Boeing’s development of the integrated multidisciplinary structural design and analysis system called ATLAS. The ATLAS System was a large-scale finite-element-based computing system for linear and nonlinear, metallic and composite, structural optimization, including the ply stackup of advanced composite structures. The engineering disciplines represented by the System included statics, weights, dynamics, buckling, vibrations, aeroelasticity, flutter, structural optimization, substructuring, acoustics, nonlinear mechanics, and damage tolerance. Its architecture was comprised of separate modules for the various technical disciplines, all of which shared a common data management system. The System also included several advanced matrix equation solvers and eigensolvers, as well as state-of-the-art substructuring techniques. Substructured interactions could be considered as being static, or as dynamic using either a modal synthesis or branch modes approach.”

Of significance in the above description of ATLAS is that it closely describes NASTRAN as well. This is not a coincidence. The roots of both NASTRAN and ATLAS date back to the mid-to-late 1960s. Boeing was the industrial center of finite element analysis and was ahead of the other major aerospace companies in recognizing the superiority of the displacement method and deploying that method within Boeing’s precursors to ATLAS.

In 1964, NASA recognized that the future of structural analysis, particularly for complex aerospace structures, was the finite element method. At this time, NASA created a committee composed of representatives from each NASA center and chaired by Tom Butler (considered by Dr. Richard MacNeal to be the Father of NASTRAN). The committee was commissioned to investigate the state of analysis in the aerospace industry and to find an existing finite element program worth recommending to all NASA centers. The first committee action was to visit the aircraft companies that had done prominent work in finite element analysis. In the end, this committee concluded that no single computer program “incorporated enough of the best state of the finite element art to satisfy the committee’s hopes” and recommended that NASA sponsor development of its own finite element program [18]. This program would be called NASTRAN, which is an acronym for NAsa STRuctural ANalysis.

In July 1965, NASA issued the RFP for NASTRAN. The MacNeal-Schwendler Corporation (MSC) was not recognized as a significant or large enough entity in the finite element world, and so it partnered with Computer Sciences Corporation as the lead in its response to the RFP. Boeing considered the RFP, but in the end did not submit a proposal. Had Boeing participated, according to Dr. MacNeal (co-founder of the MacNeal-Schwendler Corporation), the NASTRAN contract would have certainly gone to Boeing since Boeing was the clear industrial leader in the finite element method.

In the mid-to-late 1990’s, as an employee of MSC, the author brought Dr. MacNeal to Boeing’s Renton engineering facility where Dr. MacNeal spoke to BCA’s team of finite element analysis experts. Dr. MacNeal began his talk by thanking Boeing for not participating in the NASTRAN RFP, and he went on to tell the story of how MSC essentially won the eventual NASTRAN contract due to Boeing’s decision to not participate.

Dr. MacNeal writes that Boeing departed NASA’s NASTRAN Bidders’ Conference after being told that they could not have an exception to NASA’s requirement that all work be done on NASA’s computers [18]. The NASA purchasing agent, Bill Doles, said that an exception could not be granted because NASA had determined that their computers had a lot of excess capacity and it would be uneconomical to pay the contractors for use of their computers. Boeing responded that they would carry the costs of their own computers as overhead and not charge NASA. Bill Doles responded that this was unacceptable since most of Boeing’s work was with the government, and the government would have to pay the overhead anyway. After this exchange, at the next break, the Boeing team abruptly departed the conference.

Nonetheless, Boeing had likely influenced the RFP. The RFP was essentially a collection of what NASA perceived to be the state of the art in FEA, gathered from its studies of the various aerospace FEA codes. The fact that NASTRAN (developed according to the requirements of the RFP) closely parallels ATLAS, both architecturally and capability-wise, may not be pure coincidence, but rather a result of NASA incorporating Boeing’s “state of the finite element art” into the RFP.

III. CRAIG-BAMPTON COMPONENT MODE REDUCTION AND SYNTHESIS

As mentioned in the last section, ATLAS included several “state-of-the-art substructuring techniques.” One of these techniques was Component Mode Reduction. Component Mode Reduction is a technique for reducing a finite element model of a component down to a set of boundary matrices that approximately represent the dynamic characteristics of the component. The accuracy of the approximation is generally improved by increasing the number of component modes retained during the reduction process. The reduced component is generically referred to as a substructure but currently the term superelement, coined by commercial FEA software providers, is more prevalent.

There is a litany of component mode reduction and reduced-order modeling techniques, but one technique stands out due to its widespread usage and deployment in the popular commercial FEA packages (for example, MSC Nastran, NX Nastran, ABAQUS and ANSYS). This technique is the “Craig-Bampton” (C-B) component mode reduction method, and it is applied to a wide variety of dynamic simulations not only in aerospace, where it was conceived, but also in virtually every industry where structural dynamics has a large influence on the product design and performance, especially the automotive industry. [19]

Within Boeing, the C-B technique is central to the Boeing Aeroelastic Process (BAP) that is used for flight loads and flutter analysis. Of significant importance to the flutter community is that the C-B methodology enables rapid frequency variation studies as well as insertion and tailoring of assumed modes. The C-B method is also extensively applied in propulsion dynamics for Windmilling, Fan Blade Out (FBO) loads and Engine Vibration Related Noise (EVRN) analyses.

The EVRN analysis is a coupled vibro-acoustic analysis where C-B reduction is performed on both the airframe and the acoustic fluid model, reduced down to the interface with the engine. Of significance is that this C-B superelement package can be delivered to the engine manufacturers in the form of boundary matrices and output transformation matrices (OTMs), thereby preserving all Boeing IP, while enabling the engine companies to determine how different engine bearing and mount designs affect the interior cabin noise.

C-B reduction with OTMs is also central to Coupled Loads Analysis in Boeing’s space business. Coupled Loads Analysis, in this context, is essentially the dynamic structural analysis of the complete space structure. For example, in the case of a rocket or launch vehicle, there is also the cargo (for example, a satellite). The various components of the launch vehicle and cargo are frequently built by different companies, and neither company can generally have visibility into the other’s finite element models. However, the dynamics of the entire system must be analyzed. This is facilitated by the use of superelements, typically created using C-B reduction and OTMs, similar to what was described for the propulsion EVRN analysis. This process enables all parties to generate the detailed data necessary to analyze and design their structure while preserving any IP, export, and ITAR data requirements.

Outside of Boeing, the Introduction summarized how the automotive industry applies FEA with C-B reduction to NVH dynamic analyses of its vehicles and subsystems. Another class of dynamic analysis performed in the automotive industry, and across virtually every other industry (including aerospace) that analyzes dynamic systems, is Multi-Body Dynamics (MBD) simulation.

MBD is a numerical simulation method in which systems are composed as assemblies of rigid and/or elastic bodies. Connections between the bodies are modeled with kinematic joints or linear/nonlinear springs/bushings/dampers. If inertia (mass) is eliminated, and all bodies are rigid links with kinematic constraints, then the multibody analysis reduces down to a kinematic mechanism analysis. However, when mass is included, the analysis is inherently dynamic.

For the dynamic case with flexible bodies, the challenge is to bring the flexibility of each body into the system simulation in an accurate and efficient manner. The standard methodology used to create the “flex body” is to perform a C-B reduction where the body is reduced down to the interface DOFs that connect the body to its surrounding joints. Additional transformations may be done to put the interface matrices in a form compatible with the formulation of the MBD software system. However, the first step is typically the C-B reduction. All the popular commercial finite element packages have the ability to generate “flex bodies” of components from finite element models of the component and the C-B method is used to create the reduced mass and stiffness matrices that are processed to generate the flexible body. (There are other techniques beyond C-B that can be used to generate flex bodies, particularly when nonlinearities of the component model are needed. However, for the linear cases most prevalent today, the C-B method is pervasive.)

Therefore, at this point, we have seen that Boeing had a role with the inspiration and development of the finite element method, and we have discussed how the C-B reduction technique is prevalent across industries performing dynamic structural analysis. The C-B reduction technique was also one of the “state-of-the-art substructuring techniques” present in Atlas.

The seminal paper on the C-B method was published as “Coupling of Substructures for Dynamic Analyses” in July 1968 in the AIAA Journal [9] by Roy Craig of the University of Texas and Mervyn Bampton, a Boeing Sr. Structures Engineer. Hence the name of the method: “Craig-Bampton.”

Figure 2: Roy Craig

Of note is that [9] describes both the C-B reduction technique and the synthesis of multiple C-B reduced parts to generate an accurate system-level dynamic model of substantially reduced order, enabling both accurate and efficient calculation of the dynamic characteristics of highly coupled structures. This AIAA paper has garnered more than 1100 journal citations since publication, demonstrating the impact of the C-B methodology on subsequent applications and research. Of course, the motivation for this development within Boeing and ATLAS was the application to flutter and coupled dynamic loads analysis of highly redundant space vehicle and airframe structures.

Also of note is that this very same paper was earlier published within Boeing in 1966 as document D6-15509 [10]. (This document is available electronically from library.web.boeing.com.) This document was prepared by R. R. Craig, supervised by M. C. C. Bampton and approved by L.D. Richmond. This work took place when Craig was employed by Boeing as part of the summer faculty program. [19]

Therefore, just as we saw substantial collaboration between Boeing and leading researchers in the development of the finite element method, we see a similar collaboration with Roy Craig in inspiration, development, and deployment of the Craig-Bampton method for Boeing’s dynamic analysis needs. The methodology is most credited to Roy Craig who spent his 40 years at the University of Texas specializing in development of computational and experimental methods of flexible substructures. However, the need by Boeing for efficient and accurate coupled dynamic analysis methods inspired and accelerated the development that became the Craig-Bampton technique and 50 years later, this Craig-Bampton method is omnipresent! [19]

IV. THE LANCZOS METHOD OF EIGENVALUE EXTRACTION

The natural frequencies of a structure may be its most fundamental dynamic characteristic. Dynamicists use the natural frequencies and associated mode shapes to understand the dynamic behavior and interplay of components in a dynamic system. The computation of a structure’s or substructure’s natural frequencies and mode shapes is therefore of fundamental importance to the dynamicist.

From a mathematical perspective, the calculation of natural frequencies and mode shapes is an eigenvalue extraction problem in which the roots (eigenvalues) and associated mode shapes (eigenvectors) are computed from the dynamic equation of motion with the assumption of harmonic motion while neglecting damping and applying no loading.
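In standard notation (not quoted from any reference in this article), with damping and external loads dropped and harmonic motion assumed, the equation of motion reduces to a generalized eigenvalue problem:

```latex
% Undamped free vibration: assume u(t) = \phi \, e^{i\omega t} in M\ddot{u} + Ku = 0
\left(K - \omega^{2} M\right)\phi = 0,
\qquad \lambda = \omega^{2}, \qquad f = \frac{\omega}{2\pi},
```

where the eigenvalues λ give the natural frequencies f and the eigenvectors φ are the mode shapes.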

Eigenvalue/Eigenvector calculation is also a requirement of the C-B reduction method. The C-B method uses the natural frequencies and mode shapes of a component constrained at its interface to generate the dynamic portion of the reduced stiffness, mass and loads matrices. Therefore, a robust and efficient C-B reduction requires a robust and efficient eigenvalue/eigenvector calculation.

The Lanczos eigenvalue extraction method is by far the most prevalent eigenvalue extraction method used today in the popular finite element programs for vibration and buckling modes. Today, the AMLS and ACMS methods are promoted as the state-of-the-art eigenvalue extraction methods for the largest models commonplace in the automotive industry.

While AMLS and ACMS can easily outperform the Lanczos method on large models, they are essentially automated methods of substructuring the mathematical finite element model utilizing enhanced C-B reduction for an accurate approximation of each substructure. When these C-B reduced substructures are assembled, a final system level eigenvalue extraction is performed to compute approximate system level modes.

This complete substructuring, assembly, and solution process is captured in the AMLS and ACMS methods. However, it is typically the Lanczos method with C-B reduction that is utilized to form the reduced approximate system that is then solved to obtain the approximate system frequencies and mode shapes.

The Lanczos method is the bread and butter of dynamicists, whether used directly for computation of natural frequencies and mode shapes, or used indirectly with the AMLS/ACMS and similar methods that are based upon automated component modal synthesis of very large systems.

Prior to the commercial availability of the Lanczos method in the mid-1980s, dynamicists spent a large amount of thought and time in determining how to reduce a model down to a size that could be efficiently solved with their finite element program and still yield an accurate, albeit approximate, solution. This is precisely why the C-B and other dynamic reduction techniques were created. However, the underlying weakness of all of these methods was the lack of an accurate, efficient, and robust eigenvalue extraction method for the reduction process.

From a high level, there were essentially two families of eigenvalue extraction methods from which a dynamicist could choose: 1) Iterative based methods such as Inverse Power and Subspace Iteration, and 2) Tridiagonal methods such as the Householder-QR method. The iterative methods were relatively fast and efficient, but suffered from accuracy issues and struggled with closely spaced and large numbers of roots. The Tridiagonal methods were relatively robust and could accurately solve for all the roots of a system. Unfortunately, they also required enormous amounts of memory and were very inefficient making them impractical for all but the smallest models. The Lanczos method gained instant popularity because it could solve large models both accurately and efficiently, eliminating the tedious reduction process for a large variety of dynamic analyses.

In the 1960’s-1980’s substructuring and component mode reduction were primarily performed to enable computation of a system’s modes when the system could not be solved without reduction on the computers of the time due to memory, disk, and time constraints. After the commercial availability of the Lanczos method, substructuring and component mode reduction were primarily performed for other reasons, such as to enable efficient frequency variation studies (as is the case with BCA’s standard flutter analysis process), or to generate reduced matrix level models of components that can be shared with a third party to assemble into their system.

Only in the last 15 years, with the advent of High Performance Computing (HPC) systems, have the AMLS/ACMS methods brought us back to substructuring as the norm for solving the largest eigenvalue problems, because parallelization and improved performance are more easily enabled using a substructured solution process.

So what does Boeing have to do with the Lanczos method? It is twofold. First, the method was invented by Cornelius Lanczos. He published the method in 1950 while working at the National Bureau of Standards [11, 14]. However, prior to joining the National Bureau of Standards, Lanczos was employed with the Boeing Aircraft Company in Seattle from 1946-49 where he was inspired to study and improve matrix methods and numerical eigenvalue extraction of linear systems. Shortly after leaving Boeing, he completed the formulation of what we now call the Lanczos eigenvalue extraction method [12, 13].

Cornelius Lanczos was a colleague of Albert Einstein, and on December 22, 1945, he penned this passage in a letter to Einstein:

“In the meantime, my unfavorable situation here at the University has changed for the better. I have been in cooperation with Boeing Aircraft in Seattle, Washington for almost two years. Our relationship has developed in such a way that the company offered me a permanent position. It is somewhat paradoxical that I with my scientific interest can always get on as an applied mathematician.” [13]

Figure 3: Cornelius Lanczos at his desk at Boeing Plant 1, Seattle, WA

More insight into Lanczos’ inspiration from his tenure at Boeing is obtained in his recorded interview by the University of Manchester in 1972 [12]. There are several references to his time at Boeing where among other things, he mentions:

“of course this eigenvalue problem interested me a great deal because in Boeing one encountered this eigenvalue problem all the time and the traditional methods, they give you – it was easy enough to get asymptotically the highest eigenvalue, but the question is how do you get all the eigenvalues and eigenvectors of a matrix in such a way that you shouldn’t lose accuracy as you go to the lower eigenvalues… I knew of course from theoretical physics that eigenvalues and eigenvectors, I mean wave mechanics, everything, is eigenvalues and eigenvectors. Only in this case it was numerical, and in Boeing when I was frequently asked to give lectures, one of the lecture topics was matrices and eigenvalues and linear systems so that I was familiar in a theoretical way of the behavior of linear systems, particularly large linear systems.”

After joining the National Bureau of Standards, Lanczos had the opportunity to complete the formulation of his method based upon his experience at Boeing. He applied it on an analog computer available to him, but in the end, he doubted the practicality of his method. In reference 12, he tells this story:

“And I will never forget when I think it was an 8x8 matrix and the eigenvalues varied in something like 10^6. I mean the highest to the lowest, and I expected that the highest eigenvalues would come out to 10 decimal places and then we gradually lose accuracy but actually all the eigenvalues came out to 10 decimal places. I mean this was a tremendous thrill to see that, that we didn’t lose anything, but of course it had to require the careful reorthogonalization process which makes my method practically, let’s say, of less value or perhaps even of no value.”

It is somewhat entertaining that the roots of the de facto standard eigenvalue extraction method for nearly 30 years were thought by its inventor to be “of less value, or perhaps even no value.” Of course, by Lanczos’ own admission, the method was difficult to apply in practice. However, the significance of the Lanczos method in maintaining accuracy was not lost on the mathematical community and over the years, many mathematicians studied the method and searched for numerical methodologies that would make the method practical and of high value. An in-depth historical development of the Lanczos method is beyond the scope of this writing. However, this leads us to Boeing’s second point of influence on the Lanczos method: The development and deployment of the first robust commercially viable implementation of the Lanczos method.

V. BOEING COMPUTER SERVICES AND BCSLIB

The late 1960’s is a significant period for Boeing as well as for finite element analysis, numerical computing, and mainframe computing data centers. At this time, Boeing had just launched the 747 in 1969 and was about to enter the big “Boeing Bust” which saw its employment drop from >100,000 down to under 40,000 by the end of 1971. At the same time, within Boeing, this bust is perhaps responsible for the consolidation of two largely disconnected Boeing math groups: one on the military side and one on the commercial side. In 1970, Boeing Computer Services (BCS) was formed and these two math groups were brought together under the BCS organization [15].

By the 1980s, BCS had a mature data center where time was leased on Boeing computers to run commercial applications like NASTRAN and ANSYS. The expertise of the math group resulted in the establishment of a software group that built and licensed the math library BCSLIB-ext as well as developed the systems and controls software Easy5 (the “-ext” version of BCSLIB was licensed externally; BCSLIB itself was used internally).

During the 1980’s and early 1990’s the BCS math/software team had a major impact on solutions of large linear static and dynamic systems. Notably, they were directly responsible for the first significant robust and efficient Lanczos method deployed in a commercial FEA package. In 1985, The MacNeal-Schwendler Corporation (MSC) released Nastran V65 with Boeing’s Lanczos eigensolver [22] and in the decade following, similar implementations were deployed in most of the other popular finite element packages.

The major players on the Boeing side were John Lewis, Horst Simon, Roger Grimes, and their manager Al Erisman. Louis Komzsik, from MSC also played a major role. Louis recognized the impact the Lanczos method would have if implemented robustly. He convinced MSC to fund Boeing to bring the Lanczos method to fruition in MSC Nastran. Louis was a perfectionist and drove the Boeing team to handle everything that could break so as to make it as bomb-proof as possible.

Figure 4: John Lewis, Horst Simon and Roger Grimes
Figure 5: Louis Komzsik

Taking the Lanczos method from an unstable, impractical methodology to a highly practical, robust and efficient methodology was the result of the work of many researchers and the coalescence of several key breakthroughs. The summary, as provided to the author during an interview with John Lewis in May 2016, is as follows: the Lanczos algorithm in Boeing’s BCSLIB code combined work from five PhD theses with critical industrial support. The key contributions to efficiency are:

  1. Block algorithms (2 Stanford PhD theses – Richard Underwood, John Lewis)

  2. Stability correction only as needed (2 Berkeley PhD theses – David Scott, Horst Simon, part of one of the Stanford theses -- Lewis)

  3. Shifting (Swedish PhD thesis – Thomas Ericsson)

  4. Integrating all of these with a smart algorithm for choosing shifts (BCS – Grimes, Lewis & Simon)

The “creative breakthrough,” according to John Lewis, emerged over a couple of pints with David Scott in a pub in Reading, England, in 1980, where they discussed Ericsson’s work on shifting and came up with a plan to improve upon earlier Lanczos method implementations. However, they could not get funding to implement the plan, so it sat for several years. In John Lewis’s words, Louis Komzsik emerged as the “Guardian Angel” when he brought forward the funding from MSC to implement the plan in MSC Nastran. Louis was Hungarian, as was Lanczos, so he had great faith in his countryman’s idea!

Besides the Lanczos component, the other major thrust of BCSLIB was the sparse direct linear equation solver. This solver provided a substantial performance boost in the solution of large linear systems and played a significant role in Lanczos performance. Within the Lanczos implementation, a series of linear solutions (matrix decompositions) takes place as the algorithm searches for the eigenvalues. Maximum performance depends on minimizing the number of decompositions. This requires algorithms for selecting good trial eigenvalues along with transformations to find both closely spaced roots and widely separated roots efficiently. (What is described here is the “shifting” and “smart algorithm for choosing shifts” mentioned above.)
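In standard shift-invert notation (generic textbook form, not BCSLIB-specific), each shift σ costs one sparse factorization of K − σM, and the Lanczos iteration is applied to the transformed operator, whose eigenvalues magnify the frequencies closest to the shift:

```latex
% Shift-invert spectral transformation driving the Lanczos iteration
(K - \sigma M)^{-1} M \,\phi = \theta \,\phi ,
\qquad \theta = \frac{1}{\omega^{2} - \sigma} ,
\qquad \omega^{2} = \sigma + \frac{1}{\theta} .
```

Eigenvalues near a shift become well separated in θ and converge in a few Lanczos steps, so a smart shift-selection strategy covers the requested frequency range with as few factorizations as possible.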

The significance of the work of the BCS math team cannot be overstated when one recognizes that, 30-35 years after their heroic efforts, the Lanczos method is still prominent in the popular commercial FEA packages. We tend to attribute much of the performance improvement in finite element solutions to computing hardware. However, in the late 1980s, between the Lanczos and sparse solver methods, engineers realized order-of-magnitude gains in solution performance independent of any hardware improvements. These two advances meant that many models that had previously required substantial substructuring and complex dynamic reduction could now be solved directly with the Lanczos method.

Also of significance is that this Boeing team, along with Cray, went on to win the 1989 Society for Industrial and Applied Mathematics (SIAM) Gordon Bell Award. They received the award specifically for achieving record performance with their implementation of a general sparse matrix factorization on an 8-processor Cray Y-MP computer. This sparse matrix solver development was another great effort that found its way into the commercial FEA codes, contributing both to Lanczos efficiency and to the solution efficiency of linear systems of equations.

In closing this section, the overall contribution of Al Erisman should be noted. Erisman managed and directed the math group from 1975 until his retirement in 2001. According to John Lewis, “Al Erisman created the ethos of the Boeing Math Group, which strongly valued academic-industrial collaboration.” Were it not for Erisman, the industrial collaboration between MSC and Boeing may never have taken place.

VI. CONCLUSION

The finite element method was invented roughly 60 years ago. Craig-Bampton reduction was invented roughly 50 years ago, and the modern Lanczos and sparse solver methods were deployed into commercial FEA packages roughly 30 years ago. Virtually every Boeing product created since the 1950s has relied significantly, in whole or in part, on these technologies. The same can be said outside of Boeing, where multitudes of consumer products ranging from toys to automobiles are engineered with significant application of these technologies. In many cases, engineers are utilizing these technologies today within modern GUIs with no idea of the underlying solution methods and algorithms at play. The fact that after multiple decades these technologies persist, albeit often in simpler and automated implementations, is a testament to their significance. Moreover, while Boeing did not solely invent any of these technologies, Boeing’s need to engineer some of the most complex and high-performance structures had a tremendous influence on their development and eventual deployment. We feel and see the effects of these technologies in the products all around us today. As we celebrate Boeing’s Centennial, it is appropriate not only to applaud our predecessors for the impact the products they engineered had on our society, but also to applaud the engineers and mathematicians at Boeing who contributed to solution methods and algorithms that are routinely applied outside of Boeing to the development of the superb products that grace our society today.

It is also fitting to mention that on the same day this conclusion was penned, the author received an assignment to generate reduced dynamic models for the 777-9X folding wing tip. The author will utilize the aeroelastic finite element model along with C-B reduction and Lanczos eigenvalue extraction to form the flexible body representation of the airframe and folding wing tips. These reduced dynamic models will be integrated into the external controls multibody dynamic system model. Therefore, the work of Boeing engineers/mathematicians Turner, Bampton, Lanczos, Lewis, Simon, and Grimes will be applied to engineer perhaps the most iconic feature of Boeing’s next great commercial airplane. However, this is not unusual since as previously mentioned, superb products are being engineered all over the world with exactly these same methods every day!

ACKNOWLEDGMENTS

I would like to acknowledge John Lewis and Roger Grimes for their time spent outlining and explaining the Boeing BCSLIB Lanczos and Sparse Solver development history. I would like to acknowledge Louis Komzsik who provided both technical and historical background on the Lanczos development. I would like to thank both Dr. Rodney Dreisbach and Dr. Kumar Bhatia. Over the past decade, I discussed this piece of Boeing history with both on numerous occasions. Their passion for these simulation technologies and Boeing inspired me to document this piece of Boeing history.


n5321 | July 28, 2025, 21:48

VVUQ for Scientific Computing, Simulation, and CAE

  1. Scientific computing, simulation, and CAE are treated here as the same thing.

    Definition: We will refer to the application of a model to produce a result, often including associated numerical approximation errors, as a simulation. The mathematical models of interest take the form of coupled systems of nonlinear partial differential equations.

    Purpose: to understand and predict behavior and to optimize performance; predicting the behavior of natural and engineered systems.

    Problem: uncertainty enters along the way, so the results are not exact, which is why VVUQ is needed; a fundamental disconnect often exists between simulations and practical applications.

    Approach: information on the magnitude, composition, and sources of uncertainty in simulations is critical in the decision-making process for natural and engineered systems.

    Quantities of interest: system response quantities (SRQs).

    Negative consequences: decision makers will be ill advised, possibly resulting in inadequate safety, reliability, or performance of the system. Consequently, decision makers could unknowingly put at risk their customers, the public, or the environment. Examples: the Columbia shuttle, the Fukushima nuclear plant.

    Positive consequence: certification by analysis (CBA).

    Sources: uncertainty (aleatory and epistemic). In scientific computing there are many sources of uncertainty, including the model inputs, the form of the model, and poorly characterized numerical approximation errors. All of these sources of uncertainty can be classified as either purely aleatory, purely epistemic, or a mixture of the two.

    Examples include manufacturing processes, natural material variability, initial conditions, wear or damaged condition of the system, and the system surroundings. The modeling process itself can introduce large uncertainties due to the assumptions in the model (addressed by model validation) as well as the numerical approximations (addressed by code and solution verification) employed in the simulations.

    Classification of uncertainties by character:

    1. Aleatory (random) uncertainties in model inputs are treated as random variables; the stochastic process can be characterized by a probability distribution.

      1. Aleatory uncertainty is generally characterized by either a probability density function (PDF) or a cumulative distribution function (CDF).

    2. Epistemic uncertainty (also called reducible uncertainty or ignorance uncertainty) is due to a lack of knowledge on the part of the modelers, the analysts conducting the analysis, or the experimentalists involved in validation.

      1. The lack of knowledge can pertain to, for example, modeling of the system of interest or its surroundings, simulation aspects such as numerical solution error and computer round-off error, and lack of experimental data.

      2. If knowledge is added (through experiments, improved numerical approximations, expert opinion, higher fidelity physics modeling, etc.) then the uncertainty can be reduced.

    Sources of uncertainty (model inputs, numerical approximations, and model form uncertainty):

    1. Model inputs

      1. Model inputs include not only parameters used in the model of the system, but also data from the description of the surroundings (see Figure 1). Model input data includes things such as geometry, constitutive model parameters, and initial conditions, and can come from a range of sources including experimental measurement, theory, other supporting simulations, or expert opinion. Data from the surroundings include boundary conditions and system excitation (e.g., mechanical forces or moments acting on the system, force fields such as gravity and electromagnetism).

    2. Numerical approximation

      1. Since differential equation-based models rarely admit exact solutions for practical problems, approximate numerical solutions must be used. The characterization of the numerical approximation errors associated with a simulation is called verification. It includes discretization error, iterative convergence error, round-off error, and also errors due to computer programming (coding) mistakes. (A minimal discretization-error estimate is sketched after this list.)

      2. Figure 2 depicts the propagation of input uncertainties through the model to obtain output uncertainties. The number of individual calculations needed to accurately accomplish the mapping depends on four key factors: (a) the nonlinearity of the partial differential equations, (b) the dependency structure between the uncertain input quantities, (c) the nature of the uncertainties, i.e., whether they are aleatory, epistemic, or mixed uncertainties, and (d) the numerical methods used to compute the mapping.

    3. Model form

      1. The form of the model results from all assumptions, conceptualizations, abstractions, approximations, and mathematical formulations on which the model relies.

      2. The characterization of model form uncertainty is commonly estimated using model validation.

      3. assessment of model accuracy by way of comparison of simulation results with experimental measurements.

        1. (a) assumptions concerning the environment (normal, abnormal, or hostile) to which the system is exposed,

        2. (b) assumptions concerning the particular scenarios the system is operating under, e.g., various types of damage or misuse of the system, and

        3. (c) cases where experimental data are not available on any closely related systems, e.g., data are only available on subsystems,
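As flagged under item 2 above, a standard solution-verification device (generic textbook material, not tied to any particular code) is Richardson extrapolation: run the same case on three systematically refined grids and estimate the observed order of accuracy and the discretization error on the finest grid.

```python
# Minimal sketch of Richardson-extrapolation-based discretization error
# estimation from three grids with constant refinement ratio r (grid 1 finest).
# f1, f2, f3 are the computed values of one SRQ on the three grids.
import math

def discretization_error(f1, f2, f3, r):
    p = math.log(abs(f3 - f2) / abs(f2 - f1)) / math.log(r)   # observed order of accuracy
    f_exact = f1 + (f1 - f2) / (r**p - 1)                     # extrapolated "grid-free" value
    err = abs(f_exact - f1)                                   # error estimate on the fine grid
    return p, f_exact, err

# Example with made-up numbers: an SRQ from three grids refined by a factor of 2.
print(discretization_error(0.3102, 0.3140, 0.3290, r=2.0))
```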

    Result: errors remain.

    Numerical approximation errors (addressed by verification techniques) are due to discretization, iteration, and computer round-off.

    Model form uncertainty is quantified using:

    • (a) model validation procedures, i.e., statistical comparisons of model predictions to available experimental data, and

    • (b) extrapolation of this uncertainty structure to points in the application domain

    Procedure:

    1. (1) the identification of all sources of uncertainty,

      1. The goals of the analysis should be the primary determinant for what is considered as fixed versus what is considered as uncertain.

    2. (2) characterization of model input uncertainties,

      1. (a) assigning a mathematical structure to describe the uncertainty and

      2. (b) determining the numerical values of all of the needed parameters of the structure.

      3. Stated differently, characterizing the uncertainty requires that a mathematical structure be given to the uncertainty and all parameters of the structure be numerically specified such that the structure represents the state of knowledge of every uncertainty considered.

    3. (3) elimination or estimation of code and solution verification errors, (Estimate uncertainty due to numerical approximations )

      1. Recall that verification deals with estimating numerical errors which include discretization error, iterative error, round-off error, and coding mistakes.

    4. (4) propagation of input uncertainties through the model to obtain uncertainties in the SRQs,

      1. to determine the effects on the SRQs.

      2. The simplest approach for propagating aleatory uncertainty through a model is sampling (a minimal Monte Carlo sketch is given after this list).

    5. (5) quantification of model form uncertainty

      1. Model form uncertainty is estimated through the process of model validation.

      2. First, we quantitatively estimate the model form uncertainty at the conditions where experimental data are available using a mathematical operator referred to as a validation metric.

      3. Second, we extrapolate the uncertainty structure expressed by the validation metric to the application conditions of interest.

      4. A validation metric is a mathematical operator that requires two inputs: the experimental measurements of the SRQ of interest and the prediction of the SRQ at the conditions used in the experimental measurements. (A minimal sketch of the area validation metric is given after this list.)

      5. Model extrapolation: Numerous validation experiments would normally be required in order to estimate the area validation metric over the entire space of model input parameters for the application of interest. (In many cases, however, it is not possible to obtain experimental data at the application conditions of interest. ) The general process for determining the model form uncertainty at the conditions of interest (i.e., the prediction location) is as follows.

        1. First, a regression fit of the validation metric is performed in the space of the validation domain.

        2. Next, a statistical analysis is performed to compute the prediction interval at the conditions of interest.

        3. In the past, it has been common practice to either (a) ignore the model form uncertainty in the predictions for the application conditions or (b) calibrate adjustable parameters in the mathematical model so that improved agreement could be obtained with the available experimental data at conditions “V”.

    6. (6) estimation of model form uncertainty due to extrapolation to application conditions of interest. (Determine total uncertainty in the SRQ ) The total uncertainty in the SRQ at the application conditions of interest is computed as follows.
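A minimal sketch of step (4) for purely aleatory inputs, as referenced in the list above (the model and the input distributions are made-up stand-ins for the simulator or its surrogate):

```python
# Minimal Monte Carlo propagation of aleatory input uncertainty to one SRQ.
import numpy as np

rng = np.random.default_rng(42)

def model(E, load, thickness):                 # hypothetical SRQ: a deflection-like quantity
    return load / (E * thickness**3)

n = 100_000
E = rng.normal(200e9, 10e9, n)                              # Young's modulus [Pa]
load = rng.lognormal(mean=np.log(5e3), sigma=0.1, size=n)   # applied load [N]
t = rng.uniform(0.009, 0.011, n)                            # thickness [m]

srq = model(E, load, t)
print("mean =", srq.mean(), " 95th percentile =", np.quantile(srq, 0.95))
```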

    Technique: predictive uncertainty is treated using probability bounds analysis.
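A minimal sketch of the area validation metric mentioned in step (5) (the data below are invented; the metric integrates the absolute difference between the model's CDF of the SRQ and the empirical CDF of the experimental measurements):

```python
# Minimal sketch of the area validation metric d = integral of |F_model(y) - S_n(y)| dy,
# evaluated exactly on the merged grid of model samples and observations.
import numpy as np

def area_metric(model_samples, experiments):
    ys = np.sort(np.unique(np.concatenate([model_samples, experiments])))
    F = np.searchsorted(np.sort(model_samples), ys, side="right") / len(model_samples)
    S = np.searchsorted(np.sort(experiments), ys, side="right") / len(experiments)
    return np.sum(np.abs(F - S)[:-1] * np.diff(ys))   # piecewise-constant integration

model_samples = np.random.default_rng(0).normal(10.0, 1.0, 5000)   # model CDF of the SRQ
experiments = np.array([10.8, 11.2, 9.9, 11.5])                    # sparse measurements
print("area validation metric:", area_metric(model_samples, experiments))
```

A larger area means a larger estimated model form uncertainty at the validation conditions; extrapolating that structure to the application conditions is the regression-plus-prediction-interval step described in the list.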


n5321 | July 28, 2025, 17:01

A Summary of the Problems with CAE

At its core, CAE is still what I had concluded before: y = f(x). What the vendor supplies is a usable f, a representation of y, and a framework for entering x, and the marketing emphasis is on the accuracy of f, a tool that can create value for R&D and production. On the face of it this is an extremely valuable thing, yet in actual use the results are not good enough. For 40 years, CAE has been applied mainly in the aerospace and automotive industries.

Why?

There are too many places where things can go wrong. f itself is the least likely to fail; what fails is the framework for X and the interpretation of Y.

Industries differ in how narrow their requirements are. What does that mean? The probability that X achieves the effect Y, that is, the probability that a given set of design parameters achieves the effect the engineer expects, differs markedly across industries and products.

In any industry that already existed before modern science, one can judge almost by intuition whether X will achieve the effect Y. Their mathematical models are almost all linear and one-dimensional.

For example, when building a house, the thicker the wall, the stronger it is. Weapons, armor, vehicles of all kinds, ships, farm tools: all can be understood at a glance, and can therefore be copied.

But in the industries that emerged only after the rise of modern science, where X and Y are linked by a complex nonlinear relationship, experience and intuition fail, and the probability of getting from X to Y is low.

Edison went through tens of thousands of trials before settling on carbonized bamboo for his filament. Quote: "I have not failed. I've just found 10,000 ways that won't work." The expected probability of being right was about one in ten thousand.

It is the combination of complex nonlinear physics with a low probability that any given combination of design parameters will meet the design target that calls CAE onto the stage.

For example: noise and vibration problems, fluid problems, and electromagnetic problems under AC variable-frequency drives.

From past project experience: an external-rotor motor with PWM control, used in a wide range of applications, had to operate anywhere from 1000 rpm to 5000 rpm. The initial problem was a resonance at 2100 rpm: how do you avoid it? With no theoretical direction, the customer struggled for a long time and simply could not solve it.

A single CAE natural-frequency analysis settled it easily.
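A minimal illustration of that kind of check (a hypothetical 2-DOF stand-in, not the actual motor model): solve the generalized eigenvalue problem for the natural frequencies and compare them with the rotational excitation across the operating speed range.

```python
# Minimal sketch: natural frequencies of a toy 2-DOF model vs. 1x rotational
# excitation over 1000-5000 rpm. All matrix values are hypothetical.
import numpy as np
from scipy.linalg import eigh

K = np.array([[ 4.0e5, -1.5e5],
              [-1.5e5,  2.5e5]])     # stiffness [N/m]
M = np.diag([3.0, 5.0])              # mass [kg]

w2, _ = eigh(K, M)                   # generalized eigenproblem K*phi = w^2 * M*phi
f_nat = np.sqrt(w2) / (2 * np.pi)    # natural frequencies [Hz]

rpm = np.array([1000, 2100, 5000])
f_exc = rpm / 60.0                   # 1x rotational excitation [Hz]
print("natural frequencies [Hz]:", f_nat)
print("excitation at", rpm, "rpm  [Hz]:", f_exc)
```

If an excitation frequency in the operating range lands on (or near) a natural frequency, the design is retuned, by shifting stiffness or mass, until the resonance moves out of the band.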

A new PSC motor, a category the company had never built before, needed an electromagnetic design. The engineer made attempt after attempt, but the performance always fell short of the customer's sample unit; he was close to breaking down before CAE finally solved the problem.

So the obstacle has two parts. First, muggles cannot make sense of it: these are industries you cannot work in without learning some theory. Second, low probability: a design proposed casually will keep running into walls. Only manufacturers facing these obstacles need CAE to enter the stage.

What are the value and the limitations of CAE?

Its essence is to predict performance. Only industries that must build prototypes and iterate by trial and error leave room for it to grow; prototyping is how performance is verified.

Its limitation is the degree of accuracy and reliability it can offer, or, put differently, the places where it will fall short.

So the key question is how to use CAE tools appropriately.

Broadly, the failure modes are inaccuracy in X and misreading of Y; f is actually where there is the least room for error.

What does inaccuracy in X mean?

The framework for X supplied by the vendor is not necessarily complete; X is an array of parameters.

Some of the parameters in X are unknowable, and some are inherently random. X can never be pinned down exactly.

Interpreting Y requires a real theoretical foundation.

This scares off the vast majority of potential users, or, put another way, it further shrinks CAE's market.

Get my own thinking straight; get the problems of CAE straight.


n5321 | July 20, 2025, 21:41

The World's First Hydrogen Bomb Came from Mathematical Modeling (Simulation)

Downloaded a copy of "Principles of Mathematical Modeling: Ideas, Methods, Examples" - no small pedigree. The author, A.A. Samarskii, was an academician of the Soviet Academy of Sciences.

Right at the start it says the second reason for the revival of mathematical modeling was the bomb: "The second was an unprecedented social problem - the Soviet and American national nuclear arms programs, which could not be realized by traditional methods. Mathematical modeling solved that problem: both nuclear explosions and flights of rockets and satellites were previously tested within the computers using mathematical models, and only later were realized in practice."

Look at the author again: Founder of the Institute of Mathematical Modeling, Moscow, Russia - the founder of Russia's center of mathematical modeling, a true heavyweight. An academician of the great Soviet Union, an outstanding Party member, winner of the Lenin Prize, Hero of Socialist Labour, recipient of the Order of Lenin, the State Prize and the Lomonosov Medal; awarded three Orders of Lenin, the Order of the October Revolution, the Order of the Red Banner, the Order of Friendship of Peoples, the Order of Glory First Class and the Order of the Patriotic War, as well as the medals "For Victory over Germany in the Great Patriotic War" and "For the Defence of Moscow", among others. A heavyweight indeed.

Even more impressive: he carried a rifle in World War II, stepped on a land mine, and was gravely wounded in 1941. "On December 12, I was on the scout and got at the land mine. They pulled out more than thirty fragments from me - it was a lot of operations. However, the eight fragments were left in me, the surgeons could not get them out." More than thirty fragments were removed; eight that the surgeons could not reach stayed in his body.

One interview has him recounting the history of the Soviet bomb. The outlet: Literary Gazette, 2001.

On June 10, 1948, Stalin personally signed the decree: A.A. Samarskii and Sakharov, the future father of the Soviet hydrogen bomb, were to be shut away to work on nuclear weapons - at that point they "could only claim the rooms that they have received."

Samarskii's task was to build a mathematical model of the atomic bomb. With no computer, he recruited a group of women to carry out the calculations by hand. Compared with the bomb that was eventually detonated, the computed result differed by 30%. That sounds far off, but it is not: the accuracy was respectable, and the American competitors never got their error below 30%.

Why did they do better than the Americans? In the US the calculations were carried out by physicists, and that is the fundamental difference. Samarskii brought in thirty women solvers to work the equations in parallel and produced the result in two months, the most important achievement of the first year of the atomic bomb work. This takes us to 1953.

Two years later he proposed a more accurate mathematical model and worked on the theory of numerical methods, staying ahead of the Americans. American sources themselves concede that in the mathematical calculation of these complex physical processes the Soviets did the better job.

He began this work in the 1950s, and by the 1980s the problems connected with the bomb had essentially all been worked out.

He also pushes back on calling Sakharov the "father of the hydrogen bomb": a great scientist, yes, but the title is unfair to the many physicists who worked alongside him.

On science and mathematics:

The world is nonlinear and very hard to predict accurately. He has limited patience for undirected basic research: the need should come first and the research should follow, not research first with applications sought afterwards.

Key point: applying mathematical modeling and the informatics tools built on it in engineering requires serious intellectual and organizational effort. Compared with the developed countries, Russia's lag in this area is perhaps more alarming than its lag in basic science.

Western countries are deploying mathematical modeling and computer simulation on a massive scale. It has become routine for automotive companies to buy supercomputers to compute complete vehicle designs, and in particular to analyze crashes. This is a very valuable business, because the "accidents" happen to mathematical models rather than to expensive real vehicles. Companies without the corresponding simulation capability become uncompetitive... The European consortium "Mathematics in Industry" arose in response; its core purpose is the correct and effective application of mathematical modeling in industry and the compilation of the corresponding directory of tasks.

Against this background, the unique mathematical modeling experience accumulated by Russian experts in technologies such as microelectronics, instrumentation, lasers, and thermal processing of materials has hardly been put to use.

Universities must take mathematical modeling seriously, and the state even more so.


On June 10, 1948, the Decision of the Council of Ministers of the USSR №1990-774ss/op "On additional work plan of special research for 1948" was issued and was signed by Stalin and stamped "Top Secret (Special Files)". In the 9th paragraph it refers to the provision in "a matter of urgency" of apartments and rooms to several scientists. At that time, the candidate of geophysical sciences A.A. Samarskii and the candidate of physical and mathematical sciences A.D. Sakharov could only claim the rooms that they have received. But most importantly, according to this resolution, they entered the very narrow circle of people who were entrusted to "perform calculations on "PO" for RDS-1, RDS-2, RDS-3, 4-RDS, RDS-5 with different variants of the equation of state ..."

For the uninitiated, I will explain what the abbreviation "RDS" is hiding an atomic bomb, and the "PO" - was the central part of it made of Pu-239 ...


It is how it would happen, but this time a secret decree of the Central Committee came out on the establishment of a mathematical laboratory for solving problems related to the creation of the atomic bomb. It was some meeting, "on the Higher Levels", where Tikhonov proposed a calculation of the atomic bomb ... By the way, the meeting was attended by Landau, who said that "if this can be done, it will be a scientific feat!" Nevertheless, Tikhonov's offer was accepted, and there appeared a tiny laboratory in which there were only a few mathematicians. And about thirty girls-solvers who had graduated from the geodesic institute.

They were instead of computer?

Yes... And we were given a task: create a "numerical model of atomic bomb".

– It was then when the Special Committee had given for you and for candidate Sakharov a room for each?

– That's right, because I had nowhere to live ... However, the decision was carried out only at the end of 1950. Since I was single, not very legitimately, I continued to live in the dormitory of the University, and then began to rent an apartment ... However, I spent at work most of the time – the time period for us to make the job was allotted very small: only about a year! And this problem was of the highest level of difficulty, and besides, the physicists had quite inaccurate data... Their models were very rough, approximate... They worked only with such models... And together with Tikhonov, we agreed that I will work with the exact models.

– Apparently, you have come to the finish line at the same time?

– Yes, at the time of the first test of our bombs, we already had the first results... The error was only 30 percent...

– Only?!

– It's a great result! I do not know what it is now, but the Americans had never had errors less than 30 percent. Thus, our calculations were very accurate ... In the future, we have reduced the error to 10 percent...

– How did you manage it?! I think the Americans had always better computer technology?

– Our initial data was taken correctly. We tried to maintain the correct mathematical description of the physical process, and in this, I am convinced, I was helped by the fact that my first education was physical.

– So, math is in the first place after all?

– And that is what has defined our success. While in Los Alamos, the calculations were made by physicists. This is a fundamental difference ... But how to solve the resulting equations? I am proud that I came up with "parallelization". There were 30 girls-solvers. There were several hundred equations. That is about ten equations for each girl ... They calculated as if independently, but passing their data to each other ... Of course, I explain it simplified, but the idea of the method, it seems to me, is clear ... The "parallelization of calculations" enabled us to make the calculations for two months, we have accelerated the process of work by about 15 times ... I consider it was the most important achievement in the first year of work on the atomic bomb.

– Everyone who started working on the bomb, were young, so there were a lot of such "nuclear families" ... That time, of course, comes to mind with good feelings, although it was very difficult, since at the first stage, we worked with primitive computer technology... But it was very interesting, it was a creative work. Numerical methods were improving rapidly: just two years later, I proposed a more accurate mathematical model... Until 1953, we used manual technique and we advanced quite far in this area... I immediately realized that it is necessary to work on the theory of numerical methods, and it was rightly so, as it was possible to promote special calculation methods. By the way, the Americans have lagged behind in this area - they were hoping for the computer equipment and failed.

– They recognized it: they later confirmed that, despite the strong backlog in computers, we have not conceded to them in the main point: in the calculation of complex physical processes that occur in the explosions of atomic and thermonuclear bombs ... You named the date: 1953. Really you were not dealing with arms after that?!

– I deal with it all my life... The Institute of Applied Mathematics was created in 1953 and our laboratory became a department.

– The director was Mstislav Keldysh, wasn't he?

– He was the director, Tikhonov - the deputy director, me - the head of a department. Our department was the largest in the institute. A little later, Okhotsimsky was made the head of another department. That is cosmos...

– And it was one of the achievements of which I am proud. If we turn to the same bomb, the scheme looked something like this. There was some separation of work between the calculation groups. First, they calculated the initial compression process - a kind of preparation for the explosion, and then the data and the calculations were passed to our department, where we calculated all the processes associated with the explosion... It is curious that the task was written right in my office. For example, Sakharov came and then on my desk gave us a task... By the way, I passed recently to Sarov, i.e. to Arzamas-16, my notebook in which Sakharov and Zeldovich and Babaev kept records.

– And how long did you work for the "atomic project"?

– Very actively - up to about 1980. Then we cooperated only sporadically, when there was a need ... In fact, by that time all the fundamental problems were solved.

– You said: "The problems have been solved"... That is what determined the fact that eminent scientists - Sakharov and Zeldovich - left Arzamas-16. By the way, what is your impression of them?

– Sakharov, no doubt, a remarkable man, devoid of any complexes. He combined talents of visionary physicist and mathematician. But he was credited with the definition of "father of the hydrogen bomb," and this is not true, as it offended many physicists who worked with Sakharov. And there were no need for this, because the Sakharov did not need exaltation - by himself he was an outstanding scientist and a man.

– And what about Zeldovich?

– He was unique in his own way ... "reactive type", he was ready to pounce on any problem, but he lacked a general mathematical culture. He grasped the idea quickly, but he was still scattered in his character... And at the same time, was easier to communicate with him than with Sakharov. I would like to say about the theorist Yuri Romanov. He had long since earned the right to be a member of the Academy, but he was not even elected into corresponding members. It's not fair! .. Unfortunately, this applies to many. For example, to Feoktistov, Shelkin...

Reflections on science. The world is nonlinear, that is, the basic laws of development of animate and inanimate nature (from the micro to the macro world), including the social and economic structures, are nonlinear. This means in particular that there are several possible ways of evolution of a complex object, that is the future is non-uniquely determined by the preset time (by the initial conditions), and it can not be predicted based only on prior experience. The optimal path of the evolution should be chosen based on the knowledge of the laws of its development, it is necessary to calculate and control it. This is a complex and difficult task, but the life requires its solution.

Reflections on science. We should not base on the belief that science will evolve spontaneously to meet its internal needs of self-development and self-organization. Science must perform urgent social order, promoting the scientific and technological progress not in the distant future, but today. We should not use such model (which has a considerable number of supporters): first conduct basic research, and then look for where they can be used. We need to find ways of development of science in a given direction, associated with the solution of certain large problems. Apparently, the methods of resource management (material and human) can be applied to this. It is important to remember that all problems should be resolved quickly and at a high scientific level. The required level of applied research is only possible on the basis of fundamental research, which is oriented in character. In connection with the development and application of computer technology, a special responsibility is on the math. Modern applied mathematics must, performing the social order, to decide "what is necessary" and "how it is necessary".

Reflections on science. Exploiting the benefits of mathematical modeling, and of the informatics tools built on it, in technological applications requires serious intellectual and organizational effort. The symptoms of our lag behind the developed countries in this area are perhaps more alarming than in the basic sciences. In the West there has been a move toward mass deployment of mathematical modeling and computer simulation in technology. Purchases of supercomputers by automotive companies to compute complete vehicle designs, particularly in crash situations, have become typical. This is a very profitable business, since the "accidents" involve mathematical models rather than expensive vehicles. Companies that lack the corresponding computational capability become uncompetitive... A European consortium, "Mathematics in Industry", has been created; its goal is the effective use of mathematical modeling in industry and the development of a corresponding catalogue of tasks. Against this background, the unique mathematical modeling experience accumulated by our experts in certain technologies of microelectronics, instrumentation, and laser and thermal processing of materials goes almost unused.


n5321 | 2025-07-19 12:24

The Nature of Mathematical Modeling

Looking at systems engineering from the NASA people, it felt like a discussion of mathematical modeling, so I picked up a book on the nature of mathematical modeling - the methodology of CAE also lives in mathematical modeling!

"The Nature of Mathematical Modeling"

Analytical solutions, numerical solutions, finite elements, partial differential equations, and so on. These are all CAE concepts!

This is a book about the nature of mathematical modeling and about the kinds of techniques that are useful for modeling (both natural and otherwise). It is oriented towards simple, efficient implementations on computers. So our goals are aligned!

The text has three parts. The first (the analytical-solution problem) covers exact and approximate analytical techniques (ordinary differential and difference equations, partial differential equations, variational principles, stochastic processes); the second (the numerical-analysis problem), numerical methods (finite differences for ODEs and PDEs, finite elements, cellular automata); and the third (the experimental-solution problem), model inference based on observations (function fitting, data transforms, network architectures, search techniques, density estimation, filtering and state estimation, linear and nonlinear time series).

This classification is essentially the same as the one in that Altair book, The Practice of ...

Each of these essential topics would be the worthy subject of a dedicated text, but such a narrow treatment obscures the connections among old and new approaches to modeling. By covering so much material so compactly, this book helps bring it to a much broader audience. Each chapter presents a concise summary of the core results in an area, providing an accessible introduction to what they can (and cannot) do, enough background to use them to solve typical problems, and then pointers into the specialized research literature. The text is complemented by a Website and extensive worked problems that introduce extensions and applications. This essential book will be of great value to anyone seeking to develop quantitative and qualitative descriptions of complex phenomena, from physics to finance.

Professor Neil Gershenfeld leads the Physics and Media Group at the MIT Media Lab and codirects the Things That Think research consortium. His laboratory investigates the relationship between the content of information and its physical representation, from building molecular quantum computers to building musical instruments for collaborations ranging from Yo-Yo Ma to Penn & Teller. He has a BA in Physics from Swarthmore College, was a technician at Bell Labs, received a PhD in Applied Physics from Cornell University, and he was a Junior Fellow of the Harvard Society of Fellows.


n5321 | 2025-07-14 23:18

A Methodology to Identify Physical or Computational Experiment Conditions for Uncertainty Mitigation

The best paper I have read recently! By a Turkish author with industry experience at Bosch who then went into academia, earned a PhD at Georgia Tech in the US, and worked on a NASA project along the way. Writing by someone with both an engineering and an academic background really does read differently.

It rests on a few basic points: engineers struggle to grasp, fully and accurately, the physical relationships inside a product - that is the problem he wants to solve - the engineer's knowledge problem!

Simulation analysis is routine practice in the aerospace industry; he has that experience, so he describes the problem in a full set of aerospace-industry language. I agree completely with the overall approach, and in essence it does not go beyond my own understanding. A good many people still stare at element types, degrees of freedom and the like every day - they have not yet reached the practical stage. At bottom, though, he still uses probability theory to reach the goal.

Put simply: simulation results are very easy to get wrong - or rather, they simply are wrong - and yet they are useful. So where does the error come from? The paper's classification is good: one source is manufacturing, the other is knowledge. In theory you can never build a simulation model that is a perfect physical mirror of the real article. One reason is manufacturing: there is a great deal of randomness, and tolerances exist. The second is that your reconstruction of the real world is biased: scientific relationships are mathematical expressions obtained by abstracting the real world, and it is simply impossible to restore every such expression and apply it to engineering practice. His explanation of why simulation is nonetheless useful is also sound: the simulation results end up agreeing with the results of actual use. How is that achieved? The model has to go through a great deal of processing in between. How to process it? By trying things! This is a substantial supplement to that Swanson quote, and it basically matches my own thinking: trial and error on the computer until the response relationship between design parameters and performance results is essentially understood, then process the model so that, in the end, the computer experiment agrees with the physical experiment - achieving predictive value!

But people in traditional industries do not understand IT well enough. If he knew databases and SQL, plus a little data visualization, he would be considerably more formidable!

Complex engineering systems require the integration of sub-system simulations and the calculation of system-level metrics to support informed design decisions. This paper presents a methodology for designing computational or physical experiments aimed at mitigating system-level uncertainties. So engineering problems get two categories of solution here; analytical, numerical and experimental solutions would make three.

The approach is grounded in a predefined problem ontology, where physical, functional, and modeling architectures (these three terms alone already filter out a lot of people!) are systematically established - the framework of the simulation analysis is thereby fixed: purpose, domain and models are all in place. By performing sensitivity analysis using system-level tools, critical epistemic uncertainties can be identified. This is what I used to call the "process parameters", covering both uncertainty and random error. Based on these insights, a framework is proposed for designing targeted computational and physical experiments to generate new knowledge about key parameters and reduce uncertainty. This framework is of some interest - how would it differ from mine? His emphasis is still on mathematical modeling.


 The methodology is demonstrated through a case study involving the early-stage design of a Blended-Wing-Body (BWB) aircraft concept, illustrating how aerostructures analyses can support uncertainty mitigation through computer simulations or by guiding physical testing. The proposed methodology is flexible and applicable to a wide range of design challenges, enabling more risk-informed and knowledge-driven design processes.

1 Introduction and Background

Background - an industry you cannot stay in without credentials:

The design of a flight vehicle is a lengthy, expensive process spanning many years. A needle-in-a-haystack, low-probability business: a workable solution exists, but finding it is hard! With the advance in computational capabilities, designers have been relying on computer models to make predictions about the real-life performance of an aircraft. Analysis is indispensable here; this is the only industry fully covered by CAE, and it was covered very early - MSC's earliest business was aerospace only, and it never managed to expand much beyond it. However, the results obtained from computational tools are never exact due to a lack of understanding of physical phenomena, inadequate modeling and abstractions in product details [1, 2, 3]. The accuracy question everyone cares about! Unlike the solver vendors' script, this is the engineer's perspective: poorly understood physics and inaccurate models make the simulation results wrong. The vagueness in quantities of interest is called uncertainty - the definition of uncertainty. The uncertainty in simulations may lead to erroneous predictions regarding the product, creating risk. If the simulation is not done well, the uncertainty in it creates risk - the engineer's core concern!

The difficulty - money and time:

Because most of the cost is committed early in the design [4], any decision made on quantities involving significant uncertainty may result in budget overruns, schedule delays and performance shortcomings, as well as safety concerns. The risk vocabulary that blind-men-and-the-elephant design brings with it is well put: over budget, delayed, underperforming, unsafe.

The goal - attain omniscience and predict accurately:

Reducing the uncertainty in simulations earlier in the design process will reduce the risk in the final product. The goal of this paper is to present a systematic methodology to identify and mitigate the sources of uncertainty in complex, multi-disciplinary problems such as aircraft design, with a focus on uncertainties due to a lack of knowledge (i.e., epistemic).

The name I would give it is: how to gain the God's-eye view - omniscience!

1.1 The Role of Simulations in Design

Computational tools are almost exclusively used to make predictions about the response of a system under a set of inputs and boundary conditions [5].

Engineers really should throw away the set of slogans that the CAE vendors laid down in the 1980s. This one has a bit of a first-principles flavor: CAE predicts whether an engineering solution matches the design intent!
At the core of computational tools lies a model, representing the reality of interest, commonly in the form of mathematical equations that are obtained from theory or previously measured data. Mathematical modeling. How a computer simulation represents a reality of interest is summarized in Figure 1. Development of the mathematical model implies that there exists some information about the reality of interest (i.e., a physics phenomenon) at different conditions, so that the form of the mathematical equation and the parameters that have an impact on the results of the equation can be derived. The parameters include the coefficients and mathematical operations in the equations, as well as anything related to representing the physical artifact, boundary and initial conditions, and system excitation [6]. A huge number of parameters are involved - if one parameter is wrong, the result is wrong!

A complete set of equations and parameters are used to calculate the system response quantities (SRQ). Depending on the nature of the problem, the calculation can be straightforward or may require the use of some kind of discretization scheme.

If the physics phenomenon is understood well enough that the form of the mathematical representation is trusted, a new set of parameters in the equations (e.g., coefficients) may be sought in order to better match the results with an experimental observation. This process is called calibration. Calibration: essentially, the term "validation" that I had picked for this is not precise enough; this word describes it better. With the availability of data on similar artifacts under similar experimental conditions, calibration enables existing simulations to make more accurate predictions with respect to the measured "truth" model. Points in the neighborhood of the calibration point can then be predicted accurately - nothing new there!
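A minimal sketch of the calibration step described above (not the paper's actual procedure): re-fit the coefficients of an assumed, trusted model form so its output matches a handful of measured points. The model form, data values and coefficient names are all hypothetical.

import numpy as np
from scipy.optimize import curve_fit

def drag_model(q, cd0, k):
    # Assumed model form (hypothetical): trusted structure, uncertain coefficients cd0 and k.
    return (cd0 + k * q) * q

q_measured = np.array([1.0, 2.0, 3.0, 4.0])      # experimental conditions
d_measured = np.array([0.11, 0.26, 0.45, 0.68])  # measured "truth" responses

# Calibration: fit the coefficients so the model reproduces the measurements.
(cd0_cal, k_cal), _ = curve_fit(drag_model, q_measured, d_measured, p0=[0.1, 0.01])
print(cd0_cal, k_cal)  # calibrated coefficients, used for subsequent predictions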

Figure 1: Illustration of a generic computer simulation and how System Response Quantities are obtained. Adapted from [6]

Most models are abstractions of the reality of interest: they do not consider the problem at hand in its entirety but only in the general aspects that yield useful conclusions, without spending time on details that do not significantly impact the information gain. This is another way of saying "every model is wrong, but some are useful"! Generally, models that abstract fewer details of the problem are able to provide more detailed information with better accuracy, but they require detailed information about the system of interest and the external conditions to work with. Such models are called high-fidelity models (HiFi, like the audio term). Conversely, low-fidelity models abstract away larger chunks of the problem and have a quick turnaround time. They may be able to provide valuable information without much effort going into setting up the model; unlike high-fidelity models, they can generally do so at very low computational cost. The choice of the fidelity of the model is typically left to the practitioner and depends on the application.

Read this alongside a passage from Swanson:

Let me go slightly sideways on that. When I first started teaching ANSYS, I said always start small. If you're going to go an auto crash analysis, you have one mass rep a car and one spring. Understand that? Then you can start modifying. You can put the mass on wheels and allow it to rotate a little bit when it hits a wall, and so on. So each, each new simulation should be a endorsement of what's happened before. Not a surprise. Yeah, if you get a surprise and simulation, you didn't understand the problem. Now, I'm not quite I think that relates to what your question was. And that is basically start at the start small and work your way up. Don't try to solve the big problem without understanding all the pieces to it.
Just a different expression of the same idea.

A few decades ago, engineers had to rely on physical experiments to come up with new designs or to tune them, as their computational capabilities were insufficient - build-a-prototype optimization. Such experiments, which involve artifacts and instrumentation systems, are generally time consuming and expensive to develop. In the context of air vehicle design, design is an inherently iterative process and these experiments would need to be rebuilt and reevaluated. Therefore, while they are effective for evaluation purposes, they cannot be treated as parametric design models unless they have been created with a level of adjustability. With the advance of more powerful computers and their widespread use in all industries (even ANSYS has sold itself - clearly the business never spread much beyond these industries), engineers turned to simulations to generate information about the product they are working on. Although detailed simulations can be prohibitively expensive in terms of work-hours and computation time, the use of computer simulations is typically cheaper and faster than following a build-and-break approach for most large-scale systems.

As the predictions obtained from simulations played a larger part in the design, concepts such as "simulation-driven design" have become more prominent in many disciplines [7]. If the physics models are accurate, constructing solution environments with very fine grids to capture complex physics phenomena accurately becomes possible. An accurate mesh is only one factor in the accuracy of a CAE result; it mainly affects convergence. Build an accurate model - broadly speaking, accurate model, accurate simulation results! The cost of making a change in the design increases exponentially from initiation to entry into service [8]. If modeling and simulation environments that accurately capture the physics are used in the design loop, it will be possible to identify necessary changes earlier. This is where the value of simulation lies: only if X is right will Y be right. Because making design changes later may require additional changes in other connected sub-systems, it will lead to an increase in the overall cost [9].

1.2 Modeling Physical Phenomena

When the task of designing complex products involves the design of systems that are unlike their counterparts or predecessors, the capability of known physics-based modeling techniques may come up short. The value of experience: for a completely new product it is hard to build an accurate analysis model! For example, when the goal is making predictions about a novel aircraft configuration, a gap is to be expected between the simulation predictions and the measurements from the finalized design - simulation and test results will essentially always differ - a quantitative difference. If the tools were developed for traditional aircraft concepts (e.g., tube-and-wing configurations), there might even be a physics phenomenon occurring that will not be expected or captured - even a qualitative difference! The source of inaccurate predictions. Even if there is none, the accuracy of models in such cases is still to be questioned - even if the simulation matches the physical test results, the model may still be inaccurate; anyone with deep simulation experience knows this is no exaggeration. There are inherent abstractions pertaining to the design, and the best way to quantify the impact of variations in the quantities of interest when changing geometric or material properties is a comparative assessment with respect to historical data (performance design needs PDM). However, in this case, historical data simply do not exist - there is no known data to fall back on.

Because of a lack of knowledge or inherent randomness, the parameters used in modeling equations, boundary/initial conditions, and the geometry are inexact, i.e., uncertain. In essence the uncertainty cannot be eliminated; he sees two classes of cause - manufacturing plus ignorance - and an industry without laboratories cannot do CAE calibration. The uncertainty in these parameters and in the model itself (model parameters are inherently uncertain!) manifests itself as uncertainty in the model output. As mentioned before, any decision made on uncertain predictions will create risk in the design. This is John A. Swanson's (ANSYS founder) "no surprises in analysis": they essentially treat CAE as a quantitative tool - the trend is already understood, but exactly where the critical point lies is not, and that is where accurate judgment, and hence CAE, is needed. In order to tackle the overall uncertainty, the sources of the individual uncertainties must be meticulously tracked (the real reason "some are useful": the risk is known, like the lymph in your body!) and their impact on the SRQs needs to be quantified. By studying the source and nature of these constituents, they can be characterized and the necessary next steps to reduce them can be identified.

In a modeling and simulation environment, every source of uncertainty has a varying degree of impact on the overall uncertainty - understand the relationships. Each can then be addressed in a specific way depending on its nature. If a source is present because of a lack of knowledge (i.e., epistemic uncertainty - a knowledge problem), it can by definition be reduced [10]. The means to achieve this goal can be a study or experiment designed to generate new information about the model or the parameters in question. In this paper, the focus is on how to design a targeted experiment for uncertainty reduction purposes (so he promises to deliver a new "how"!). Such experiments are not a replication of the same experimental setup in a more trusted domain, but a new setup tailored specifically to generating new knowledge pertaining to that source of uncertainty.

Reduce the impact of ignorance!

An important consideration in pursuing targeted experiments is the allowed time and budget of the program. If a lower-level, targeted experiment to reduce uncertainty is too costly, or carries even more inherent unknowns due to its experimental setup, the designers may find it undesirable to pursue. Therefore, these lower-level experiments must be analyzed on a case-by-case basis and their viability assessed. There will be a trade-off between how much reduction in uncertainty can be expected and the cost of designing and conducting a tailored experiment. Realistically, only a limited number of them can be pursued. Considering the number of simulations used in the design of a new aircraft, trying to validate the accuracy of every parameter or assumption of every tool would lead to an insurmountable number of experiments. For the ultimate goal of reducing the overall uncertainty, the sources of uncertainty that have the greatest impact on the quantities of interest must be identified. The value of engineering is achieving the objective; it is not science - roughly understanding the problem and keeping it under control is enough! Some parameters that have relatively low uncertainty may have a great effect on a response, whereas another parameter with great uncertainty may have little to no effect on the response. This is another way of talking about tolerances.

In summary, the train of thought that leads to experimentation to reduce the epistemic uncertainty in modeling and simulation environments is illustrated in Figure 2. If the physics of the problem is relatively well understood, then a computational model can be developed - a mathematical model built on a clear understanding of the physical relationships is the foundation. If not, one needs to perform discovery experiments, simply to learn about the physics phenomenon [11] - to fix the knowledge problem. Then, if the results of this model are consistent and accurate, it can be applied to the desired problem - simulation results agree with test results. If not, the aforementioned lower-level experiments can be pursued to reduce the uncertainty in the models. The knowledge created should enable the reduction of uncertainty in the parameters or the models, reducing the overall uncertainty - only by clearly understanding the relationships can the uncertainty be controlled.




1.3 Reduced-Scale Experimentation - the aerospace industry's scaled-down prototype testing, itself a form of prediction!

An ideal, accurate physical test would normally require building duplicates of the system of interest in the corresponding domain so that the obtained results reflect actual conditions. Although this poses little to no issue for computational experiments - barring inherent modeling assumptions - since the scale of the simulation artifact does not matter to a computational tool, it has major implications for the design of ground and flight tests. Producing a full-scale replica of the actual product with all the details required for a given test is expensive and difficult in the aerospace industry. Although full-scale testing is necessary in some cases and for reliability/certification tests [12], it is desirable to reduce the number of such tests. Therefore, engineers have always tried to duplicate the full-scale test conditions on a reduced-scale model of the test artifact.

The principles of similitude were laid out by Buckingham in the early 20th century [13, 14]. Expanding on Rayleigh's method of dimensional analysis [15, 16], he proposed the Buckingham Pi theorem. This method makes it possible to express a complex physical equation in terms of dimensionless and independent quantities (Π groups). Although they need not be unique, they are independent and form a complete set. These non-dimensional quantities are used to establish similitude between models of different scales. When a reduced-scale model satisfies certain similitude conditions with the full-scale model, the same response is expected of the two. Similitudes can be categorized into three different groups [17] (a short worked example follows the list below):

  1. Geometric similitude: geometry is equally scaled.

  2. Kinematic similitude: "homologous particles lie at homologous points at homologous times" [18].

  3. Dynamic similitude: homologous forces act on homologous parts or points of the system.
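The worked example referenced above (standard dimensional analysis, illustrative rather than taken from the paper): for the aerodynamic force D on a body of length L moving at speed V through a fluid of density ρ, viscosity μ and speed of sound a, six variables and three base dimensions give 6 - 3 = 3 independent Π groups, which can be chosen as

C_D = \frac{D}{\rho V^2 L^2}, \qquad Re = \frac{\rho V L}{\mu}, \qquad Ma = \frac{V}{a}.

Matching Re and Ma between the sub-scale and the full-scale model then forces C_D (and hence the scaled force) to match as well; this is exactly the role the Reynolds and Mach terms play later in the cost function of Equation 7.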

1.4 Identification of Critical Uncertainties via Sensitivity Analysis - a decent concept

Over the recent decades, the variety of problems of interest has led to the development of many sensitivity analysis techniques. While some of them are quantitative and model-free [19], some depend on the specific type of the mathematical model used [20]. Similar to how engineering problems can be addressed with many different mathematical approaches, sensitivity analyses can be carried out in different ways. For the most basic and low-dimensional cases, even graphical representations such as scatter plots may yield useful information about the sensitivities [21]. As the system gets more complicated however, methods such as local sensitivity analyses (LSA), global sensitivity analyses (GSA) and regression-based tools such as prediction profilers may be used.

Screening and filtering for the key parameters - like the critical and important dimensions on an engineering drawing!

LSA methods provide a local assessment of the sensitivity of the outputs to changes in the inputs, and are only valid near the current operating conditions of the system. GSA methods are designed to address the limitations of LSA methods by providing a more comprehensive assessment of the sensitivity of the outputs to changes in the inputs. Generally speaking, GSA methods take into account the behavior of the system over a large range of input values, and provide a quantitative measure of the relative importance of the different inputs to the system. In addition, GSA methods do not require assumptions about the relationship between the inputs and outputs, such as the function being differentiable, and are well suited to high-dimensional problems. Therefore, GSA methods have been dubbed the gold standard in sensitivity analysis in the presence of uncertainty [22].

Variance-based methods apportion the variance in the model outputs to the model inputs and their interactions. One of the most popular variance-based methods is the Sobol method [23]. Consider a function Y = f(X); the Sobol index of an input X_i is the fraction of the overall output variance that can be attributed to X_i. These variations can be obtained from parametric or non-parametric sampling techniques. Following this definition, the first-order effect index of input X_i can be defined as:

S_{1i} = \frac{V_{X_i}\!\left(E_{X_{\sim i}}(Y \mid X_i)\right)}{V(Y)}    (1)

where the denominator represents the total variability in the response Y, while the numerator represents the variability of Y obtained by changing X_i while averaging over all the other variables (the variance of the conditional expectation). The first-order effect represents the variability caused by X_i alone. Following the same logic, the combined effect of two variables X_i and X_j can be calculated:

S_{1i} + S_{1j} + S_{2ij} = \frac{V_{X_{ij}}\!\left(E_{X_{\sim ij}}(Y \mid X_{ij})\right)}{V(Y)}    (2)

Finally, the total effect of a variable is the sum of its first-order effect and all of its interactions, of all orders, with the other input variables. Because the sum of all sensitivity indices must be unity, the total effect index of X_i can be calculated as [24]:

S_{Ti} = 1 - \frac{V_{X_{\sim i}}\!\left(E_{X_i}(Y \mid X_{\sim i})\right)}{V(Y)}    (3)

Because the Sobol method is tool-agnostic and can be used without any approximation of the objective function, it will be employed throughout this paper for sensitivity analysis purposes.
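As a small worked example (illustrative, not from the paper): for the additive model Y = X_1 + 2 X_2 with independent X_1, X_2 ~ U(0, 1),

V(Y) = V(X_1) + 4\,V(X_2) = \tfrac{1}{12} + \tfrac{4}{12} = \tfrac{5}{12}, \qquad
S_1^{X_1} = \frac{V\!\left(E(Y \mid X_1)\right)}{V(Y)} = \frac{1/12}{5/12} = 0.2, \qquad
S_1^{X_2} = \frac{4/12}{5/12} = 0.8.

Because this model has no interaction terms, Equation 3 gives total indices equal to the first-order ones and the indices sum to one; any shortfall from one in a real problem signals interaction effects.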

2 Development of the Methodology - highly consistent with the logic of science

The proposed methodology is developed with the purpose of identifying and reducing the sources of epistemic uncertainty in complex design projects in a systematic fashion. It is mainly aimed at the knowledge problem - helping engineers understand the problem clearly!

First, the problem for which the mitigation of uncertainty is the objective is defined, and the corresponding high-level requirements are identified. In this step, the disciplines involved in the problem at hand, and how the requirements flow down to the analyses, are noted. Then, the problem ontology is formulated by using functional, physical and modeling decompositions. A top-down decision-making framework is followed to create candidate, fit-for-purpose modeling and simulation environments and to select the most appropriate one for the problem at hand. Upon completion of this step, a modeling and simulation environment that covers every important aspect of the problem and satisfies the set of modeling requirements will be obtained, while acknowledging the abstractions of the environment.

The third step is to run the model to collect data. For the aforementioned reasons, there will be many uncertainties in the model. Therefore the focus of this step is to identify the critical uncertainties that have a significant impact on the model response. This is his core process - a process of roughly working out the relationships. Following it literally would take an enormous amount of time!

If one deems this uncertainty unacceptable, or finds it has a significant impact on the decision-making process, then a lower-level experiment is designed in order to create new knowledge pertaining to that uncertainty. This is his horizontal layering - clarifying the problem step by step. This new knowledge can be carried over to the modeling and simulation environment so that the impact of the new uncertainty characteristics (i.e., probability distribution) on the model response can be observed - science's observe and reason. The main idea of the proposed methodology is to generate new knowledge in a systematic way (another way of saying "science") and to update the appropriate components of the modeling and simulation environment with the newly obtained piece of information (advancing the problem).

This process is illustrated in a schematic way in Figure 3.


The playbook for a brand-new product!

  1. Problem Definition:
     All aircraft are designed to answer a specific set of requirements that are born out of the needs of the market or stakeholders. Nicely summarized.

     Definition of the concept of operations outlines the broad range of missions and operations that this aircraft will be used for. After the overall purpose of the aircraft is determined, the designers can decide on the required capabilities, and the metrics by which these capabilities are going to be measured can be derived. Considering this information, a concept for the aircraft and its rough size can be determined. This process can be completed by decomposing the requirements and tracking their impact on metrics from the capability and operations perspectives [25]. Multi-role aircraft will require additional capabilities, and parallel decompositions may need to be used. Decomposition of objectives!

  2. Formulate the Problem Ontology:
     Following the systems-engineering-based methods given in References [25, 26], a requirements analysis and breakdown can be performed, identifying the key requirements for the aircraft, the mission and the wing structure to be analyzed. Then, a physical decomposition of the sub-assembly is developed, outlining its key components and their functionalities, and the set of abstractions is decided on. Rather like the concepts of object-oriented programming! Finally, a modeling architecture that maps the physical and functional components to the decompositions is created. Analogous to the dimension chain in motor design, or the conceptual framework of a product. This mapping from requirements all the way to the modeling and simulation attributes is called the problem ontology, and is illustrated in Figure 4. With this step it is possible to follow a decision-making framework to select the appropriate modeling and simulation tool among many candidates for this task.

  3. Identification of Critical Uncertainties:
     With a defined M&S environment, it is possible to run cases and rigorously identify the critical uncertainties that have the most impact on the quantities of interest. Fundamentally, you have to be familiar with both simulation and the physics to play this game! As mentioned before, there is a plethora of methods for representing uncertainty mathematically and quantifying its impact - mathematical modeling realized through CAE. For this use case, a sensitivity analysis will be performed to determine which parameters have the greatest impact on the output uncertainty in the corresponding tools, by calculating total Sobol indices.

  4. Design a Lower-level Experiment:
     The next task is to address the identified critical uncertainties in the selected M&S environment - simplify the problem, screen the parameters! To this end, the steps corresponding to designing a lower-level experiment are illustrated in Figure 5. It is essential to note that the primary purpose of this lower-level experiment is not to model the same phenomenon in a subsequent higher-fidelity tool, but rather to use the extra fidelity to mitigate uncertainty for system-level investigation purposes.

     The experimental design is bifurcated into computational experiments (CX) and physical experiments (PX) - both are done at the same time - each serving a unique purpose within the research context. For computational experiments, the focus is on leveraging computational models to simulate scenarios under various conditions and parameters, allowing a broad exploration of the problem space without the constraints of physical implementation. Conversely, the physical experiments involve the design and execution of experiments in a physical environment. This phase is intricately linked to the computational experiments, in that computational models which accurately represent the physical experiments can be used to guide the physical experimentation setups. This entails a careful calibration process, ensuring that the computational models reflect, as closely as possible, the real-world constraints and variables encountered in the physical domain. We are thinking along much the same lines - and solving this does take some technical skill! This step is a standalone research area by itself, and it will only be demonstrated on a single case.

     Upon completion of the experimentation procedure, the execution phase takes place, where the experiments are conducted according to the predefined designs. This stage is critical for gathering empirical data and insights, which are then subjected to rigorous statistical analysis. The interpretation of these results forms the basis for drawing meaningful conclusions, ultimately contributing to the generation of new knowledge pertaining to the epistemic uncertainty in question. This methodological approach, characterized by its dual emphasis on computational and physical experimentation, provides a robust framework for analyzing uncertainties. Our goals differ, though; I think the goal is optimization!

3 Demonstration and Results

3.1 Formulating the Problem Ontology

Development of a next-generation air vehicle platform involves significant uncertainties. To demonstrate how the methodology applies to such a scenario, the problem selected is aerostructures analysis for a Blended-Wing-Body (BWB) aircraft in the conceptual design stage. The goal is to increase confidence in the predictions of the aircraft range by reducing the associated uncertainty in the parameters used in design. The aircraft industry is essentially workshop-style, or what we call build-a-prototype style; its challenges differ in places, but there is real technical depth here. A representative OpenVSP drawing of the BWB aircraft used in this work is given in Figure 6.

Figure 6: BWB concept used in this work.

3.2 Identification of Critical Uncertainties

For the given use case, two tools are found to be appropriate for early-stage design exploration purposes: FLOPS and OpenAeroStruct.

As a low-fidelity tool (an extremely simplified model!), NASA's Flight Optimization System (FLOPS) [27] will be used to calculate the range of the BWB concept for different designs. FLOPS is employed due to its efficiency in early-stage design assessment, providing a quick and broad analysis under varying conditions with minimal computational resources. FLOPS facilitates the exploration of a wide range of design spaces by rapidly estimating performance metrics, which is crucial during the conceptual design phase, where multiple design iterations are evaluated for feasibility and performance optimization. In purpose and methodology it is similar to ANSYS RMxprt; it probably reduces the element degrees of freedom or simplifies the model even further. FLOPS uses historical data and simplified equations (so, still just like RMxprt!) to estimate the mission performance and the weight breakdown of an aircraft. Because it is mainly based on simpler equations, its run time for a single case is very low, making it possible to run a relatively high number of cases. Because FLOPS uses lumped parameters, it is only logical to go to a slightly higher-fidelity tool, appropriate to the conceptual design phase, in order to break down the lumps. A more detailed analysis of the epistemic uncertainty variables then becomes possible.

OpenAeroStruct [28] is a lightweight, open-source tool designed for integrated aerostructural analysis and optimization. It combines aerodynamic and structural analysis capabilities within a gradient-based optimization framework, enabling efficient design of aircraft structures. The tool supports various analyses, including wing deformation effects on aerodynamics and the optimization of wing shape and structure to meet specific design objectives. In this work it will be used as a step following the FLOPS analyses, as it represents a step increase in fidelity. According to the selected analysis tools, the main parameter uncertainties to be investigated are the material properties (e.g., Young's modulus) and the aerodynamic properties (e.g., lift and drag coefficients).

Table 2: Nomenclature for mentioned FLOPS variables

  WENG    Engine weight scaling parameter
  OWFACT  Operational empty weight scaling parameter
  FACT    Fuel flow scaling factor
  RSPSOB  Rear spar percent chord for BWB fuselage at side of the body
  RSPCHD  Rear spar percent chord for BWB at fuselage centerline
  FCDI    Factor to increase or decrease lift-dependent drag coefficients
  FCDO    Factor to increase or decrease lift-independent drag coefficients
  FRFU    Fuselage weight (composite for BWB)
  E       Span efficiency factor for wing

First, among the list of FLOPS input parameters, 31 are selected because they are either not tied to a specific design choice or are highly abstracted parameters that capture a large amount of abstraction and cause variance in the outputs. Of these 31 parameters, 27 are scaling factors and are assigned a range between 0.95 and 1.05. The remaining four are related to the design, such as the spar percent chord of the BWB at the fuselage centerline and at the side of the body, and they are swept over estimated, reasonable ranges. The parameter names and their descriptions will be explained throughout the discussion of the results as necessary, but an overview is listed in Table 2 for the convenience of the reader. Aircraft range is calculated for each combination of sampled input parameters. From these parameters, 4096 samples are generated using Saltelli sampling [19], a variation of fractional Latin Hypercube sampling, due to its relative ease in the calculation of Sobol indices and the availability of existing tools. Calculation of the indices is carried out using the Python library SALib [29].
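A minimal sketch of this sampling-and-analysis step with SALib is shown below; the parameter subset, the bounds and the stand-in range function are illustrative assumptions, not the paper's FLOPS setup.

import numpy as np
from SALib.sample import saltelli
from SALib.analyze import sobol

# Illustrative subset of the scaling factors from Table 2, all swept over [0.95, 1.05].
names = ["WENG", "OWFACT", "FACT", "FCDI", "FCDO", "E"]
problem = {"num_vars": len(names), "names": names, "bounds": [[0.95, 1.05]] * len(names)}

X = saltelli.sample(problem, 512)  # generates N*(2D+2) input combinations

def range_proxy(x):
    # Hypothetical stand-in for a FLOPS range evaluation (not the real tool).
    weng, owfact, fact, fcdi, fcdo, e = x
    return 7000.0 * e / (fact * fcdo) - 500.0 * (weng + owfact) - 200.0 * fcdi

Y = np.apply_along_axis(range_proxy, 1, X)
Si = sobol.analyze(problem, Y)  # first-order, second-order and total indices
for name, st in zip(names, Si["ST"]):
    print(f"{name}: total Sobol index = {st:.3f}")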

Figure 7: Comparison of sensitivity indices calculated by three different methods: Quasi-Monte Carlo, surrogate model with full sampling, and surrogate model with 10% sampling.

In Figure 7, the total sensitivity indices calculated with three different sampling strategies are given. Blue bars represent the Quasi-Monte Carlo (QMC) estimate that uses all 4096 samples. To demonstrate how the sensitivity rankings may change when surrogate modeling techniques are used, two different response surface equations (RSEs) are employed. The first is an RSE constructed using all 4096 points, and the other is constructed using only 10% of the points, representing a substantial saving in computational cost. After verifying that these models fit well, it is seen that they are indeed able to capture the trends, although attention must be paid to the ranking of the most important sensitivities.
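A minimal sketch of this surrogate comparison is given below, reusing X, Y and problem from the SALib sketch above; the quadratic RSE form and the 10% split are assumptions made for illustration.

import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
subset = rng.choice(len(X), size=len(X) // 10, replace=False)  # 10% of the runs

# Quadratic response surface equation (RSE) fitted on the 10% subset.
rse = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
rse.fit(X[subset], Y[subset])

Y_surrogate = rse.predict(X)  # cheap predictions over the full Saltelli design
# Si_surrogate = sobol.analyze(problem, Y_surrogate)  # re-ranked indices, as compared in Figure 7
holdout = np.setdiff1d(np.arange(len(X)), subset)
print("RSE R^2 on held-out runs:", rse.score(X[holdout], Y[holdout]))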

In this analysis, the aerodynamic properties, the material properties and the location of the wing structure are found to have a significant impact on the wing weight and the aerodynamic efficiency. Therefore, they are identified as critical uncertainties. This is indeed expected and consistent with the results found previously for a tube-and-wing configuration in Reference [30].

3.3 Experiment Design and Execution

For demonstration of the methodology, the next step involves the low-fidelity aerostructures analysis tool OpenAeroStruct [28] and shows how such a tool can be utilized to guide the design of a physical experimentation setup.

  1. Problem Investigation:

     •  Premise: The variations in the range calculations are significantly influenced by uncertainties in the Young's modulus, the wingbox location and the aerodynamic properties. These parameters belong to the disciplines of aerodynamics and structures.

     •  Research Question: How can the uncertainties in these parameters, which impact the range, be reduced?

     •  Hypothesis: A good first-order approximation is the Breguet range equation, which is used to calculate the maximum range of an aircraft [31]. A more accurate determination of the probability density function describing the aerodynamic performance of the wing will reduce the uncertainty in the wing weight predictions.

  2. Thought Experiment: Visualizing the impact of more accurately determined parameters on the simulation results, we would expect to see a reduction in the variation of the simulation outputs.

  3. Purpose of the Experiment: This experiment aims to reduce the parameter uncertainty in our wing aerostructures model. There is little to no expected impact of unknown physics that would interfere with the simulation results at such a high level of abstraction; in other words, the phenomenological uncertainty is expected to be insignificant for this problem. In order to demonstrate the proposed methodology, both avenues (computational experiment only, and physical experiment) will be pursued.

  4. Experiment Design: We decide to conduct a computational experiment that represents a physical experimentation setup, where the parameters pertaining to the airflow, the material properties and the structural location are varied within their respective uncertainty bounds and the resulting lift-to-drag ratios are observed. For the subscale physical experiment, the boundary conditions of the experiment need to be optimized for the reduced scale so that the closest objective metrics can be obtained.

  5. Computational Experiments for both cases:

     •  Define the Model: We use the OpenAeroStruct wing structure model with a tubular-spar structural approximation and a wingbox model.

     •  Set Parameters: The parameters to be varied are the angle of attack, the Mach number and the location of the structure.

     •  Design the Experiment: We use Latin Hypercube Sampling to randomly sample the parameter space; Sobol indices are then computed to observe the global sensitivities over the input space (see the sampling sketch after this list).

     •  Develop the Procedure (simulation and prototype testing proceed in parallel!):

        •  For CX only: For each random set of parameters, run the OpenAeroStruct model and record the resulting predictions, case numbers and run times. After enough runs to sufficiently explore the parameter space, analyze the results.

        •  For PX only: Use the wingbox model only in OpenAeroStruct, pose the problem as a constrained optimization problem to get the PX experimentation conditions; the scale is now a design variable, and the dimensionless parameters are scaled accordingly.
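The sampling sketch referenced above, in minimal form; the bounds and the OpenAeroStruct call are placeholders (a stub function stands in for the actual aerostructural run).

import numpy as np
from scipy.stats import qmc

# Parameters varied in the computational experiment: angle of attack [deg],
# Mach number, and chordwise spar (structure) location fraction -- assumed bounds.
l_bounds = [0.0, 0.80, 0.10]
u_bounds = [10.0, 0.87, 0.60]

sampler = qmc.LatinHypercube(d=3, seed=0)
X_cx = qmc.scale(sampler.random(n=64), l_bounds, u_bounds)

def run_aerostruct_stub(alpha, mach, spar_loc):
    # Placeholder for an OpenAeroStruct case; returns a hypothetical L/D value.
    return 20.0 + 0.3 * alpha - 5.0 * (mach - 0.84) - 2.0 * (spar_loc - 0.35) ** 2

results = [run_aerostruct_stub(*row) for row in X_cx]  # record the prediction for each case
print(np.mean(results), np.std(results))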

3.4 Computer Experiments for Uncertainty Mitigation

Reducing the uncertainty in the lift-to-drag ratio would have a direct impact on reducing the uncertainty in range predictions. L/D is a key aerodynamic parameter that determines the efficiency of an aircraft or vehicle in converting lift into forward motion while overcoming drag. By reducing the uncertainty in L/D, one can achieve more accurate and consistent estimates of the aircraft’s efficiency, resulting in improved range predictions with reduced variability and increased confidence.

To calculate L/D and the other required metrics, the low-fidelity, open-source aerostructures analysis software OpenAeroStruct is used. The BWB concept illustrated in Figure 6 is exported to OpenAeroStruct. For simplicity, the vertical stabilizers are ignored in the aerodynamics and structures analyses. The wingbox is abstracted as a continuous structure, spanning from 10% to 60% of the chord, throughout the whole structural grid. This setup is used for both the CX and PX cases, and the model parameters are manipulated according to the problem.

3.4.1 Test conditions

First, it is necessary to set up a simulation that reproduces the full-scale conditions. The simpler approximation models the wingbox structure as a tubular spar with a diameter that decreases from root to tip. The diameters are calculated through optimization loops so that the stress constraints are met. For the wingbox model, the location is approximated from the conceptual design of the structural elements using public-domain knowledge. For both cases, different aerodynamic and structural grids are employed to investigate the variance in the SRQs. Cruise conditions at 10000 meters are investigated, with a constant Mach number of 0.84.

Five different model structures are tested in this experimentation setup with the same set of angle of attack, Mach number, spar location and Young's modulus in order to make an accurate comparison. From these runs, the lift-to-drag ratios are calculated and the histogram is plotted in Figure 8. The first observation is that although the wingbox model is a better representation of reality, its variance is higher than that of the tubular-spar models, and it is hypersensitive to certain inputs in some conditions. It is also seen that the predictions of the tubular-spar model generally lie between the predictions of the two fidelities of the wingbox model.

Furthermore, the runtime statistics have a significant impact on how the results are interpreted, as well as on how many cases can realistically be considered. The overview is presented in Table 3. It is obvious that the mesh size is the dominant factor in the average run time for a single case. An interesting observation is that the coarser wingbox model takes less time to run than the lower-fidelity tubular-spar model, and predicts a higher lift-to-drag ratio; the reason is that the wingbox model with the coarser mesh converged in fewer iterations than the tubular-spar model. Using a shared set of geometry definitions and parameters, as far as the corresponding fidelity level allows, showed that decreasing the mesh size resulted in less variance in the predicted SRQs, as expected. However, increasing the fidelity level comes with a new set of assumptions pertaining to the newly included subsystems or physical behavior. Therefore, one cannot definitively say that increasing the fidelity level would decrease the parameter uncertainty without including the impact of the newly added parameters.

Table 3: OpenAeroStruct runtime and output variation statistics with respect to different model structures, on a 12-core 3.8 GHz, 32 GB RAM machine

  Run type               Std. deviation in L/D   Mean runtime [s]
  Tubular spar - coarse  0.184                   6.79
  Tubular spar - medium  0.169                   15.38
  Wingbox - coarse       0.794                   2.5
  Wingbox - medium       0.376                   20.7
  Wingbox - fine         0.401                   76.67
Figure 8: Probability densities of CL/CD for five different model structures.

3.5 Leveraging Computer Experiments for Guiding Physical Experimentation - build a scaled-down version of the aircraft and design the physical test plan!

3.5.1 Feasibility of the full-scale model

As discussed before, construction and testing of full-scale vehicle models is almost never viable in the aerospace industry, especially in the earlier design phases. For this demonstration, a free-flying sub-scale test will be pursued. The baseline experiment conditions will be the same as in the computational-only experimentation, except for the appropriate scaling of parameters. Therefore, a scale that is optimal for a selected cost function needs to be found, subject to the constraints. For this use case, the following constraints are defined:

  •  Scale of the sub-scale model: n < 0.2.

  •  Mach number: 0.8 < Ma < 0.87. The effects of compressibility become much more dominant as Ma = 1 is approached, so the upper limit for the Mach number is kept at 0.87.

  •  Angle of attack: 0 < α < 10. Because not all similitude conditions will be met, the flight conditions may need to be simulated at a different angle of attack; this is normal practice in subscale testing [32].

  •  Young's modulus: E_S < 3 E_F. The Young's modulus of the model should be less than three times that of the full-scale design.

3.5.2 Optimize for similitude

For this optimization problem, a Sequential Least Squares Programming method is used. SLSQP is a numerical optimization algorithm that is particularly suited for problems with constraints [33]. It falls under the category of sequential quadratic programming (SQP) methods, which are iterative methods used for nonlinear optimization problems. The idea behind SQP methods, including SLSQP, is to approximate the nonlinear objective function using a quadratic function and solve a sequence of quadratic optimization problems, hence the term “sequential". In each iteration of the SLSQP algorithm, a quadratic sub-problem is solved to find a search direction. Then, a line search is conducted along this direction to determine the step length. These steps are repeated until convergence. One advantage of SLSQP is that it supports both equality and inequality constraints, which makes it quite versatile in handling different types of problems. It is also efficient in terms of computational resources, which makes it a popular choice for a wide range of applications.

Algorithm 1: Constrained Optimization for finding Physical Experiment Conditions
procedure Optimization(x)
    Define scaling parameters
    Define x = [n, α, Ma, h, E]
    Define constraints
    Initialize x with initial guess [0.1, 0, 0.84, 10000, 73.1e9]
    while not converged do
        Evaluate cost function f(x) (Equation 7)
        Solve for gradients and search directions
        Run OpenAeroStruct optimization
        if failed case then
            Return high cost function value
            Select new x
        end if
    end while
    return x
end procedure

The algorithm used for this experiment is presented in Algorithm 1. For convenience purposes, the altitude is taken as a proxy for air density. In the optimization process, mass (including the fuel weight and distribution) is scaled according to:

n_{mass} = \frac{\rho_F}{\rho_S}\, n^3    (4)

where ρ_F is the fluid density for the full-scale model, ρ_S the fluid density for the sub-scale model, and n is the geometric scaling factor [32]. Since aeroelastic bending and torsion are also of interest, the following aeroelastic parameters for bending (S_b) and torsion (S_t) must also be satisfied; they are defined as:

S_b = \frac{EI}{\rho V^2 L^4}    (5)
S_t = \frac{GJ}{\rho V^2 L^4}    (6)

These two parameters need to be duplicated in order to ensure the similitude of the inertial and aerodynamic load distributions for the same Mach number or scaled velocity, depending on the compressibility effects in the desired test regime [32]. The cost function is selected to be:

f(x) = \left|\frac{(C_L/C_D)_S - (C_L/C_D)_F}{(C_L/C_D)_F}\right|^2 + 30\left|\frac{Re_S - Re_F}{Re_F}\right|^2 + 3000\left|\frac{Ma_S - Ma_F}{Ma_F}\right|^2    (7)

where the lift-to-drag ratio, the Reynolds number and the Mach number of the sub-scale model are quadratically penalized with respect to their deviation from the simulation results of the full-scale model. Because the magnitudes of the terms are vastly different, the second and third terms are multiplied by coefficients that scale their impact to the same level as the first term. For other problems, these coefficients offer flexibility to the engineer: depending on how strongly certain deviations in the ratios of the similarity parameters are penalized, the optimum scale and experiment conditions will change. In this application, the simulated altitude for the free-flying model is changed, rather than changing the air density directly.
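A minimal sketch of posing this search with SciPy's SLSQP is given below; the full-scale reference values, the variable bounds (taken loosely from Section 3.5.1 and the initial guess in Algorithm 1) and the stub sub-scale evaluation are all assumptions standing in for the actual OpenAeroStruct runs.

import numpy as np
from scipy.optimize import minimize

LD_F, RE_F, MA_F, E_F = 22.0, 4.0e7, 0.84, 73.1e9  # assumed full-scale reference values

def subscale_metrics(x):
    # Placeholder for an OpenAeroStruct evaluation of the sub-scale model.
    n, alpha, mach, h, E = x
    ld = 20.0 + 0.3 * alpha - 5.0 * (mach - MA_F)   # hypothetical L/D response
    re = RE_F * n * (1.0 - h / 20000.0)             # hypothetical Reynolds scaling
    return ld, re, mach

def cost(x):
    # Quadratic penalties on the relative deviations, mirroring Equation 7.
    ld, re, ma = subscale_metrics(x)
    return (abs((ld - LD_F) / LD_F) ** 2
            + 30.0 * abs((re - RE_F) / RE_F) ** 2
            + 3000.0 * abs((ma - MA_F) / MA_F) ** 2)

x0 = [0.1, 0.0, 0.84, 10000.0, E_F]                 # initial guess from Algorithm 1
bounds = [(0.01, 0.2), (0.0, 10.0), (0.80, 0.87), (0.0, 11000.0), (1e9, 3 * E_F)]
res = minimize(cost, x0, method="SLSQP", bounds=bounds)
print(res.x)  # candidate [n, alpha, Ma, altitude, E] for the physical experiment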

3.5.3 Interpret results

Without loss of generality, it can be said that the optimum solution may be unconstrained or constrained, depending on the nature and the boundaries of the constraints. At this step, the solution needs to be verified as to whether it leads to a feasible physical experiment design. Probable causes of infeasibility are conflicts between the design variables, or the impossibility of matching them in a real experimentation environment. In such cases, the optimization process needs to be repeated with these constraints in mind. However, for this problem, the constrained optimum point is found to be:

  •  n = 0.2

  •  Ma = 0.86

  •  α = 10

  •  E_S = 219 GPa

  •  h = 0 m

  •  Re = 6.2 x 10^6

Furthermore, it can be noted that, due to the nature of the constrained optimization problem, the altitude for the sub-scale test was found to be sea level. While this is not completely realistic, it points to an experiment condition where high-altitude flight is not necessary. Finally, the Young's modulus for the sub-scale model is slightly below the upper threshold, which was three times the Young's modulus of the full-scale design. With this solution we can verify that solving a constrained optimization problem to find the experiment conditions is a valid approach, and it provides a baseline for other sub-scale problems as well.
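As a rough cross-check of this trend (an illustrative derivation, not from the paper): if the cross-section scales geometrically so that I_S = n^4 I_F and L_S = n L_F, then enforcing the bending similitude of Equation 5 gives

\frac{E_S\, n^4 I_F}{\rho_S V_S^2\, n^4 L_F^4} = \frac{E_F I_F}{\rho_F V_F^2 L_F^4}
\quad\Rightarrow\quad
E_S = E_F\,\frac{\rho_S V_S^2}{\rho_F V_F^2},

so flying the sub-scale model in the denser sea-level air at a comparable Mach number pushes the required modulus well above E_F, which is consistent with the optimizer landing near the 3 E_F bound.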


4 Conclusion

In this work, a novel approach is introduced for the design of physical experiments, emphasizing the quantification of uncertainty in engineering models (there is a small grammar slip in the original sentence here), with a specific focus on early-stage aircraft design. Sensitivity analysis techniques are used to identify specific computational and physical experimentation conditions and to tackle the challenge of mitigating epistemic uncertainty.

The findings indicate that this methodology not only facilitates the identification and reduction of critical uncertainties through targeted experimentation, but also optimizes the design of physical experiments through computational effort. The author probably does not understand databases well enough - it feels like he does not know SQL; otherwise he could have gone quite a bit further! This synergy enables more precise predictions and efficient resource utilization. Through a case study on a Blended-Wing-Body (BWB) aircraft concept, the practical application and advantages of the proposed framework are exemplified, demonstrating how successive fidelity levels can be leveraged for uncertainty mitigation purposes.

The presented framework for uncertainty management is adaptable to various design challenges. The study highlights the importance of integrating computational models to guide physical testing, fostering a more iterative and informed design process that will save resources. Of course, every problem and testing environment has its own challenges; therefore, dialogue between all parties involved in model development and physical testing is encouraged.

Future research is suggested to extend the application of this methodology to different aerospace design problems, including propulsion systems and structural components. Additionally, the development of more advanced computational tools and algorithms could further refine uncertainty quantification techniques. With more detailed models and physics, and the integration of high-performance computing, it will be possible to see the impact of this methodology in later stages of the design cycle. The reduction of uncertainty in performance metrics can contribute to avoiding program risk: excessive cost, performance shortcomings and delays.






n5321 | 2025-07-11 00:00