• Title/Abstract/Keywords: computational implementation

REVIEW AND IMPLEMENTATION OF STAGGERED DG METHODS ON POLYGONAL MESHES

  • KIM, DOHYUN;ZHAO, LINA;PARK, EUN-JAE
    • Journal of the Korean Society for Industrial and Applied Mathematics / Vol. 25, No. 3 / pp. 66-81 / 2021
  • In this paper, we review the lowest-order staggered discontinuous Galerkin methods on polygonal meshes in 2D. The proposed method offers many desirable features, including easy implementation, geometrical flexibility, robustness with respect to mesh distortion, and few degrees of freedom. Discrete function spaces for the locally H^1 and H(div) spaces are considered. We introduce special properties of a sub-mesh of a given star-shaped polygonal mesh that can be used in constructing the discrete spaces and implementing the staggered discontinuous Galerkin method. For demonstration purposes, we consider the lowest-order case for the Poisson equation. We emphasize its efficient computational implementation, which uses only geometrical properties of the underlying mesh.

영역 분할에 의한 SIMPLER 모델의 병렬화와 성능 분석 (Implementation and Performance Analysis of a Parallel SIMPLER Model Based on Domain Decomposition)

  • 곽호상;이상산
    • 한국전산유체공학회지 / Vol. 3, No. 1 / pp. 22-29 / 1998
  • A parallel implementation is developed for a SIMPLER finite volume model. The parallelism is based on domain decomposition and explicit message passing using MPI and SHMEM. Two parallel solvers for the tridiagonal matrix equation are employed. The implementation is verified on the Cray T3E system for a benchmark problem of natural convection in a sidewall-heated cavity. The test results show good scalability of the parallel models. Performance issues are discussed in terms of convergence as well as conventional parallel overheads and single-processor performance. The effectiveness of a localized matrix solution algorithm is demonstrated.

LTJ 적응필터의 실용적 구현과 적응반향제거기에 대한 적용 (A Practical Implementation of the LTJ Adaptive Filter and Its Application to the Adaptive Echo Canceller)

  • 유재하
    • 음성과학 / Vol. 11, No. 2 / pp. 227-235 / 2004
  • In this paper, we propose a new practical implementation method for the lattice transversal joint (LTJ) adaptive filter that uses information from the speech codec. The method is applied to the adaptive echo cancellation problem to verify its efficiency. Real-time implementation of the LTJ adaptive filter is very difficult because of the high computational complexity of compensating the filter coefficients. However, when a speech codec is used, the complexity can be reduced because the linear predictive coding (LPC) coefficients are updated every frame or sub-frame instead of every sample. Furthermore, the LPC coefficients can be acquired from the speech decoder and transformed into reflection coefficients, which reduces the computational complexity of updating the reflection coefficients. The effectiveness of the proposed LTJ adaptive filter was verified by experiments on the convergence and tracking performance of the adaptive echo canceller.
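The abstract above mentions transforming LPC coefficients into reflection coefficients. As a minimal sketch of one standard way to do this (the step-down, or backward Levinson, recursion), assuming the predictor polynomial is written as A(z) = 1 + a1 z^-1 + ... + ap z^-p: the function names and the sign convention below are illustrative assumptions, not taken from the paper, and sign conventions differ between references.

```python
def lpc_to_reflection(a):
    """Step-down recursion: a = [a1..ap] for A(z) = 1 + sum(a_i z^-i).

    Returns the reflection coefficients [k1..kp] under the (assumed)
    convention k_m = a_m of the order-m predictor.
    """
    a = list(a)
    k = [0.0] * len(a)
    for m in range(len(a), 0, -1):
        km = a[m - 1]                      # last coefficient of the order-m predictor
        if abs(km) >= 1.0:
            raise ValueError("predictor is not minimum phase: |k| >= 1")
        k[m - 1] = km
        denom = 1.0 - km * km
        # Recover the order-(m-1) predictor from the order-m predictor.
        a = [(a[i] - km * a[m - 2 - i]) / denom for i in range(m - 1)]
    return k


def reflection_to_lpc(k):
    """Step-up recursion, the inverse of lpc_to_reflection."""
    a = []
    for m, km in enumerate(k, start=1):
        a = [a[i] + km * a[m - 2 - i] for i in range(m - 1)] + [km]
    return a


# Round trip on hypothetical values: reflection -> LPC -> reflection.
k = [0.5, -0.3, 0.2]
a = reflection_to_lpc(k)
k_back = lpc_to_reflection(a)
```

Because the conversion costs O(p^2) operations for a predictor of order p, running it once per frame or sub-frame rather than once per sample is where a saving of the kind the abstract describes would come from.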

반복기법을 이용한 대규모, 소선형시스템의 병렬처리에 관한 연구 (An experimental study on parallel implementation of an iterative method for large scale, sparse linear system)

  • 김상원;장수영
    • 한국경영과학회: Conference Proceedings / Papers and Abstracts of the 1991 Spring Joint Conference of 대한산업공학회 and 한국경영과학회; Jeonbuk National University, Jeonju; 26-27 Apr. 1991 / pp. 6-22 / 1991
  • This thesis presents a parallel implementation of an iterative method for large-scale, sparse linear systems and reports the results of computational experiments performed on both single-transputer and multi-transputer parallel computers. To solve the linear system, we use the conjugate gradient method and develop a data storage technique and a data communication scheme. In addition to an explanation of the conjugate gradient method, the results of the computational experiments are summarized.
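For readers unfamiliar with the conjugate gradient method named in the abstract above, the following is a minimal serial sketch, assuming a symmetric positive definite matrix. The row-dictionary sparse storage and the function names are assumptions made for illustration; the thesis's own storage technique and transputer communication scheme are not described in the abstract and are not reproduced here.

```python
def conjugate_gradient(matvec, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for symmetric positive definite A, given only v -> A v."""
    x = [0.0] * len(b)
    r = list(b)                            # residual b - A x for the initial guess x = 0
    p = list(r)                            # first search direction
    rs_old = sum(ri * ri for ri in r)
    for _ in range(max_iter):
        if rs_old ** 0.5 < tol:            # stop once the residual norm is small
            break
        Ap = matvec(p)
        alpha = rs_old / sum(pi * api for pi, api in zip(p, Ap))
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = sum(ri * ri for ri in r)
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


def sparse_matvec(rows):
    """rows[i] is a dict {column: value} holding only the nonzeros of row i."""
    def matvec(v):
        return [sum(val * v[j] for j, val in row.items()) for row in rows]
    return matvec


# Small tridiagonal test system (hypothetical data); the exact solution is [1, 1, 1].
rows = [{0: 2.0, 1: -1.0}, {0: -1.0, 1: 2.0, 2: -1.0}, {1: -1.0, 2: 2.0}]
x = conjugate_gradient(sparse_matvec(rows), [1.0, 0.0, 1.0])
```

The solver touches the matrix only through `matvec`, so a parallel version would mainly need to distribute that product and the dot products across processors.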

3차원 오일러 방정식의 계산 효율성 증대를 위한 Adaptive Wavelet 기법의 적용 (IMPLEMENTATION OF ADAPTIVE WAVELET METHOD FOR ENHANCEMENT OF COMPUTATIONAL EFFICIENCY FOR THREE DIMENSIONAL EULER EQUATION)

  • 조동욱;박경현;강형민;이도형
    • 한국전산유체공학회지 / Vol. 19, No. 2 / pp. 58-65 / 2014
  • The adaptive wavelet method is studied as a way to improve the computational efficiency of three-dimensional flow simulations. To implement the method for the three-dimensional Euler equations, a wavelet decomposition process is introduced based on the previous two-dimensional adaptive wavelet method. The order of numerical accuracy of the original solver is preserved by applying a modified thresholding value. To assess the efficiency of the proposed algorithm, the method is applied to the computation of the flow field around the ONERA-M6 wing in the transonic regime with 4th- and 6th-order interpolating polynomials, respectively. This application confirms that the three-dimensional adaptive wavelet method can reduce the computational time while preserving the numerical accuracy of the original solver.

Assessment of computational performance for a vector parallel implementation: 3D probabilistic model discrete cracking in concrete

  • Paz, Carmen N.M.;Alves, Jose L.D.;Ebecken, Nelson F.F.
    • Computers and Concrete / Vol. 2, No. 5 / pp. 345-366 / 2005
  • This work presents an assessment of the computational performance of a vector-parallel implementation of a probabilistic model for concrete cracking in 3D. This paper continues the code optimization efforts reported in earlier works (Paz et al., 2002a,b and 2003). The probabilistic crack approach is based on the direct Monte Carlo method. Cracking is accounted for by means of 3D interface elements. This approach assumes that all nonlinearities are restricted to the interface elements that model cracks. The heterogeneity governs the overall cracking behavior and the related size effects on concrete fracture. The computational kernels in the implementation are the inexact Newton iterative driver, which solves the nonlinear problem, and a preconditioned conjugate gradient (PCG) driver, which solves the linearized equations using an element-by-element (EBE) strategy to compute matrix-vector products. In particular, the paper analyzes code behavior using OpenMP directives on parallel vector processors (PVP), such as the CRAY SV1 and CRAY T94. The impact of the memory architecture on code performance, and some strategies devised to circumvent this issue, are addressed through numerical experiments.
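The element-by-element (EBE) strategy mentioned in the abstract above computes a matrix-vector product without assembling the global matrix. A minimal sketch of the idea follows, assuming dense per-element matrices and a node connectivity list. The function name and the two-element 1D example are hypothetical and are not taken from the paper, which uses 3D interface elements.

```python
def ebe_matvec(element_matrices, connectivity, x):
    """Compute y = K x where K is the (never assembled) sum of element matrices.

    element_matrices[e] is a dense local matrix and connectivity[e] lists the
    global indices of its degrees of freedom.
    """
    y = [0.0] * len(x)
    for ke, nodes in zip(element_matrices, connectivity):
        local_x = [x[n] for n in nodes]                    # gather local values
        for node, row in zip(nodes, ke):
            # Local product, scatter-added into the global result.
            y[node] += sum(kab * xb for kab, xb in zip(row, local_x))
    return y


# Two 1D two-node elements sharing node 1 (hypothetical data).
ke = [[1.0, -1.0], [-1.0, 1.0]]
connectivity = [(0, 1), (1, 2)]
y = ebe_matvec([ke, ke], connectivity, [1.0, 2.0, 4.0])
```

The result matches the product with the assembled matrix [[1, -1, 0], [-1, 2, -1], [0, -1, 1]]. The scatter-add into shared nodes is the step that needs care when the element loop is parallelized.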

Parallel Implementation of the Recursive Least Square for Hyperspectral Image Compression on GPUs

  • Li, Changguo
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 11, No. 7 / pp. 3543-3557 / 2017
  • Compression is a very important technique for remotely sensed hyperspectral images. Lossless compression based on recursive least squares (RLS), which eliminates redundancy in hyperspectral images using both spatial and spectral correlations, is an extremely powerful tool for this purpose, but its relatively high computational complexity limits its application in time-critical scenarios. To improve the computational efficiency of the algorithm, we optimize its serial version and develop a new parallel implementation on graphics processing units (GPUs). Specifically, an optimized recursive least squares method based on the optimal number of prediction bands is first introduced. We then use this approach as a case study to illustrate the advantages and potential challenges of applying GPU parallel optimization principles to the problem considered. The proposed parallel method exploits the low-level architecture of GPUs and is implemented using the compute unified device architecture (CUDA). The GPU parallel implementation is compared with the serial implementation on a CPU. Experimental results indicate remarkable acceleration factors and real-time performance, while retaining exactly the same bit rate as the serial version of the compressor.

알고리즘 분해방법을 이용한 Linear Window Operator의 구현 (A Linear Window Operator Based Upon the Algorithm Decomposition)

  • 정재길
    • 정보학연구 / Vol. 5, No. 1 / pp. 133-142 / 2002
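  • This paper proposes an efficient implementation method for a linear window operator. Computational primitives are identified using the block state-space function and are used to implement the processor datapath, which enables an efficient implementation of the linear window operator. In addition, a multiprocessor architecture based on a data partitioning technique is proposed for real-time processing of the linear window operator. A performance analysis with respect to block size shows that the proposed processor and multiprocessor perform well.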

저전력 Uniform 필터 뱅크 구현을 위한 블록 필터 아키텍처 (Block Filter Architecture for Low-Power Uniform Filter Banks Implementation)

  • 양세정;장영범
    • 대한전자공학회: Conference Proceedings / Proceedings of the 2001 대한전자공학회 Summer Conference (4) / pp. 123-126 / 2001
  • A block filter implementation technique for uniform filter banks is proposed in this paper. By applying block filters to the decimation and interpolation filters, it is shown that the down- and up-samplers cancel out in the respective filters. Furthermore, by applying block filters to uniform filter banks, a significant reduction in computational complexity is achieved, since the prototype filter can be shared across the channel implementations. It is also shown that the proposed implementation is a reconfigurable structure with respect to order variation.
