Parallel and/or High Performance Computing

  • A. Figueroa and R. Löhner – Experiences Porting a High Order Cartesian Finite Difference Solver to GPUs Using OpenACC; AIAA-2021-1208 (2021). https://doi.org/10.2514/6.2021-1208
  • K.-R. Wichmann, M. Kronbichler, R. Löhner and W. Wall – Practical Applicability of Optimizations and Performance Models to Complex Stencil-Based Loop Kernels in CFD; Int. J. of High Performance Computing Applications 1-17, May (2018).
  • R. Löhner, F. Togashi and J.D. Baum – Simple Load Balancing for Chemically Reacting Flows; AIAA-2018-0366 (2018).
  • R. Löhner and A. Degro – On Finite Difference Solvers With Minimal Memory Access; AIAA-2018-0365 (2018).
  • A. Tiwari, A. Cauble-Chantrenne, A. Jundt, R. Löhner and J.D. Baum – Running Large-Scale CFD Applications on Intel-KNL-Based Clusters; Int. J. Num. Meth. Fluids (2017).
  • R. Löhner and F. Perazzo – Improvements in Speed and Scalability of a DEM Code; Proc. ENIEF 2017, La Plata, Argentina, Nov. 7-11 (2017).
  • R. Löhner, F. Mut, F.F. Camelli, A. Figueroa, J.D. Baum, O.A. Soto, F. Togashi, L. Carrington, A. Jundt, A. Cauble-Chantrenne, J. Peraza and A. Tiwari – Recent Advances in Scaling Up Complex Fluid-Structure Interaction Simulations; AIAA 2017-0079 (2017).
  • R. Löhner, E. Haug, C. Zinggerling and E. Oñate – Real-Time Micro-Modeling of City Evacuations; Comp. Part. Mech. DOI 10.1007/s40571-016-0154-z (2017).
  • R. Löhner – Real-Time Micro-Modeling of a Million Pedestrians; invited plenary lecture, Proc. Workshop I: Mathematical Foundations of Traffic, UCLA, Los Angeles, Sept 28 – Oct 2 (2015).
  • R. Löhner – Fluid-Structure Interaction on High-Performance Computing Systems: What Lies Ahead; AMCA Conf., Santa Fe, Argentina, Nov 16-17 (2015).
  • R. Löhner and J.D. Baum – On Maximum Achievable Speeds for Field Solvers; Int. J. Num. Meth. Heat and Fluid Flow 24, 7, 1537-1544 (2014).
  • R. Löhner – Recent Advances in Parallel Advancing Front Grid Generation; Archives of Computational Methods in Engineering 21, 2, 127-140 (2014).
  • R. Löhner, A. Corrigan, K.-R. Wichmann and W. Wall – On the Achievable Speeds of Finite Difference Solvers on CPUs and GPUs; AIAA-2013-2852 (2013).
  • R. Löhner and J.D. Baum – Load Balancing for Multiphysics; AIAA-2013-2856 (2013).
  • R. Löhner and J.D. Baum – Handling Tens of Thousands of Cores with Industrial/Legacy Codes: Approaches, Implementation and Timings; Computers and Fluids 85, 53-62 (2013).
  • A. Corrigan and R. Löhner – Semi-Automatic Porting of a Large-Scale CFD Code to Multi-Graphics Processing Unit Clusters; Int. J. Num. Meth. Fluids 69, 11, 1786-1796 (2012).
  • A. Corrigan, F. Camelli, R. Löhner and F. Mut – Semi-Automatic Porting of a Large-Scale Fortran CFD Code to GPUs; Int. J. Num. Meth. Fluids 69, 2, 314-331 (2012).
  • R. Löhner – A 2nd Generation Parallel Advancing Front Generator; pp 457-474 in Proc. 21st Int. Meshing Roundtable (X. Jiao and J.-C. Weill eds.), San Jose, CA, October (2012).
  • R. Löhner, F.F. Camelli, J.D. Baum and A. Corrigan – Large-Scale Blast Calculations on GPU Clusters; AIAA-12-0565 (2012).
  • R. Löhner and A. Corrigan – Semi-Automatic Porting of a General Fortran CFD Code to GPUs: The Difficult Modules; AIAA-11-3219 (2011).
  • A. Corrigan, F.F. Camelli, R. Löhner and J. Wallin – Running Unstructured Grid Based CFD Solvers on Modern Graphics Hardware; Int. J. Num. Meth. Fluids 66, 221-229 (2011).
  • R. Löhner, F. Mut and F.F. Camelli – Timings of FEFLO on the SGI-ICE Machines; AIAA-11-1064 (2011).
  • A. Corrigan and R. Löhner – Porting of FEFLO to Multi-GPU Clusters; AIAA-11-0948 (2011).
  • A. Corrigan and R. Löhner – Porting Large Fortran Codebases to GPUs; Proc. NVIDIA GPU Technology Conf. 2010, San Jose, CA, September 20-23 (2010).
  • A. Corrigan, F. Camelli, R. Löhner and F. Mut – Porting of FEFLO to GPUs; Proc. ECCOMAS CFD 2010 Conf., Lisbon, Portugal, June 14-17 (2010).
  • R. Löhner – Cache-Efficient Renumbering for Vectorization; Int. J. Num. Meth. Biomed. Eng. 26, 5, 628-636 (2010).
  • A. Corrigan, F. Camelli and R. Löhner – Porting of an Edge-Based CFD Solver to GPUs; AIAA-10-0523 (2010).
  • A. Corrigan, F. Camelli, R. Löhner and J. Wallin – Running Unstructured Grid Based CFD Solvers on Modern Graphics Hardware; AIAA-09-4001 (2009).
  • F. Camelli, R. Löhner, J. Cebral and E.L. Mestreau – Timings of an Unstructured-Grid CFD Code on Common Hardware Platforms and Compilers; AIAA-08-0477, invited (2008).
  • F. Camelli, R. Löhner and E.L. Mestreau – Timings of an Unstructured-Grid CFD Code on Common Hardware Platforms and Compilers; AIAA-07-1107 (2007).
  • R. Löhner, C. Yang, J.R. Cebral, F.F. Camelli, F. Togashi, J.D. Baum, H. Luo, E.L. Mestreau and O.A. Soto – Moore’s Law, the Life Cycle of Scientific Computing Codes and the Diminishing Importance of Parallel Computing; Parallel CFD05, College Park, MD, May (2005).
  • H. Luo, J.D. Baum and R. Löhner – Parallel Unstructured Grid GMRES+LU-SGS Method for Turbulent Flows; AIAA-03-0273 (2003).
  • R. Löhner – Moore’s Law and the Diminishing Importance of Parallel Computing; Expressions of the IACM 13, 6-8 (2003).
  • R. Löhner and M. Galle – Minimization of Indirect Addressing for Edge-Based Field Solvers; AIAA-02-0967 (2002).
  • R. Löhner and M. Galle – Minimization of Indirect Addressing for Edge-Based Field Solvers; Comm. Num. Meth. Eng. 18, 335-343 (2002).
  • R. Löhner – A Parallel Advancing Front Grid Generation Scheme; Int. J. Num. Meth. Eng. 51, 663-678 (2001).
  • R. Löhner – Renumbering Strategies for Unstructured-Grid Solvers Operating on Shared-Memory, Cache-Based Parallel Machines; Comp. Meth. Appl. Mech. Eng. 163, 95-109 (1998).
  • J. Tuszynski and R. Löhner – Parallelizing the Construction of Indirect Access Arrays for Shared-Memory Machines; Comm. Appl. Num. Meth. Eng. 14, 773-781 (1998).
  • R. Löhner – Renumbering Strategies for Unstructured-Grid Solvers Operating on Shared-Memory, Cache-Based Parallel Machines; AIAA-97-2045-CP (1997).
  • R. Löhner and R. Ramamurti – A Load Balancing Algorithm for Unstructured Grids; Comp. Fluid Dyn. 5, 39-58 (1995).
  • A. Shostko and R. Löhner – Parallel 3-D H-Refinement; AIAA-95-1662-CP (1995).
  • R. Löhner – Robust, Vectorized Search Algorithms for Interpolation on Unstructured Grids; J. Comp. Phys. 118, 380-387 (1995).
  • A. Shostko and R. Löhner – Three-Dimensional Parallel Unstructured Grid Generation; Int. J. Num. Meth. Eng. 38, 905-925 (1995).
  • R. Löhner, A. Shostko and R. Ramamurti – Parallel Unstructured Grid Generation and Implicit Flow Solvers; paper presented at the Parallel CFD’94 Conf., Kyoto, Japan, May (1994).
  • R. Löhner – Edges, Stars, Superedges and Chains; Comp. Meth. Appl. Mech. Eng. 111, 255-263 (1994).
  • R. Ramamurti and R. Löhner – Simulation of Flow Past Complex Geometries Using a Parallel Implicit Incompressible Flow Solver; pp. 1049-1050 in Proc. 11th AIAA CFD Conf., Orlando, FL, July (1993).
  • R. Löhner, R. Ramamurti and D. Martin – A Parallelizable Load Balancing Algorithm; AIAA-93-0061 (1993).