moasys

세미나

모아시스는 역량 및 전문성 강화, 기술 트렌드 분석 등을 위하여 내부 세미나를 개최하고 있습니다.

(세미나 자료는 홈페이지에 게재하고 있지 않습니다. 궁금한 점이 있으신 경우 메일로 문의바랍니다.)

LangChain

- LangChain
- LCEL(LangChain Expression Language)
- Model I/O
- Retrieval
- Agents

Scheduler(스케줄러와 스케줄링 알고리즘)

- Scheduler
- Scheduling Algorithm
- SuperComputer

AMD GPU Programming with HIP

- What is HIP?
- Heterogeneous Programming Support
- Common APIs
- Profiling
- HIPFort

AWS에서 SLURM 설치

- SLURM이란?
- SLURM의 구조
- AWS란?
- AWS 프리 티어
- SLURM 설치

Docker/Singularity/Kubernetes 비교 실습

- Docker vs. Singularity
- Google Cloud
- Hands-On
- Kubernetes

LangChain을 활용한 ChatBot 구현과 Custom instructions for ChatGPT

- LangChain Remind
- RAG Remind
- NLP
- ChatBot
- Installation / Environment
- ChatBot Practices
- Custom Instructions for ChatGPT
- GPTs Practices

NVIDIA Grace-Hopper Superchip GH200

- GH200 Superchip 개요
- 이기종 컴퓨팅의 캐시 일관성 문제(CPU/GPU)
- NUMA 시스템으로서의 GH200

Vending Machine Program(상품 구매 및 관리 시스템) 개발

- 프로그램 소개
- 개발 환경
- 시스템 설계
- 데이터베이스 설계
- 사용자 인터페이스 설계
- 구현 및 개발

Docker-SLURM 연동 사례

- Docker-SLURM 연동 이슈
- Docker-SLURM 연동 사례
- 기존 연동 사례의 문제점/대처방안

HPC 클러스터 기본 구조

- HPC 개요
- HPC 간략구조
- HPC 구성요소(노드, 관리망, 고속네트워크, 공유파일시스템)
- HPC 사양표 및 견적
- 용어 해설

Apache 설치를 통한 쉘스크립트 분석 및 WEB/WAS 설치와 구성(1)

- WEB/WAS 구성 개요
- WEB(Apache) 설치
- Apache 내 Configure 스크립트 분석
- 쉘스크립트 개요 및 예시

WEB/WAS 설치와 구성(2)

- WAS(Tomcat) 설치
- AJP 프로토콜을 통한 WEB-WAS 연동
- WAS 설정
- WEB 모듈 설정

AI 반도체 칩 (Intel vs. NVIDIA vs. AMD)

- 반도체 칩
- AI 반도체 칩
- 차세대 반도체 선도기업(Intel, NVIDIA, AMD)

서버 구축 실습 (FTP, NFS 서버 구축)

- 테스트 환경 소개
- 서버 구축 시나리오 및 필요 기술
- 서버 구축(FTP, NFS)

Kubernetes 구성

- Kubernetes
- kubeadm
- hands-on

Spack 개요 및 기본 사용방법

- Spack 개요
- 기본 사용방법
- 추가 테스트 및 명령어
- Spack 가상환경

Generative AI and transformer model (feat. ChatGPT)

- Generative AI
- Transformer model
- ChatGPT
- Microsoft Copilot

Building HPC Software Stacks with Spack

- Basic Usage
- HPC Softwares Stack (Compilers, HPC Libraries, HPC Applications, Modules)
- Module File Generation
- Conclusion

AI/딥러닝 논문 리딩/구현 연습

- AI/딥러닝 논문 구성
- 논문 검색용 추천 웹사이트
- GPT 논문 리딩
- 프로젝트 파일 구조/구현

사용자지원 설명 및 유형

- HW 구축
- SW 구축
- 기타 업무
- 사용자지원

First-principles studies of nano-materials using VASP

- Introduction
- Numerical Methods
- Performance Evaluation
- Defective Carbon Nanotubes
- Defective Monolayer Hexagonal Boron Nitride

Heterogeneous Computing with AMD ROCm

- AMD Flagship supercomputers
- Introduction to ROCm
- Design for efficient data sharing between CPU and GPUs
- Testbed

초보 사용자 지원과 WRF 분석

- 초보 사용자 문의
- 사용자 문의 지원
- WRF에 대하여
- WRF 설치 방법
- 오류 및 내용 분석

WRF 설치 및 CUDA Tutorial

- WRF란?
- WRF를 구성하기 위한 의존성 프로그램
- WRF Directory Structure
- WRF 설치 방법
- CUDA란?
- CUDA Programming with Python

(2021년 오프라인 세미나는 COVID-19로 인하여 진행되지 않음)

반도체 및 프로세서의 이해

- 반도체 원리
- 반도체 공정
- 프로세서 (CISC, RISC, CPU, GPU, FGPA, ASIC)
- AI 반도체

Better Scientific Software

- Intro
- Overview
- Git workflow
- Agile

CUDA, OpenACC에서 CUDA 라이브러리 사용

- Memory Allocation
- CUDA Libraries
- Open ACC

Recent Developments in OpenMPI

- Open MPI
- HCOLL, SHARP, NCCL, UCX
- Conclusion

SLURM 설치 및 환경설정

- Slurm 개요
- Slurm 설치
- Slurm 환경설정

CFD(Computational Fluid Dynamics) 소개

- 전산유체역학 개론
- 개인자료

Automating HPC Benchmarks

- Introduction
- Microbenchmark
- HPC benchmark
- DFT benchmark
- MD benchmark

singularity 이미지 생성 테스트

- singularity 빌드
- def 파일 작성
- 오류 확인 및 해결
- 실행 테스트

HPC Benchmarks on AMD Instinct GPU

- MCM 설계
- 인피니티 패브릭
- MI200/MI250 vs A100
- 테스트 결과 및 AMD의 답변 (STREAM, GPU_P2P, HPL, HPCG, GROMACS, LAMMPS)
- 결과 요약

Introduction to cloud computing

- 클라우드 컴퓨팅 소개
- 클라우드 유형
- 클라우드 서비스
- 가상화 vs. 컨테이너
- 클라우드 서비스 사례
- HPC 클라우드 정의
- HPC 클라우드 서비스 사례
- HPC 클라우드 주요 이슈

NVIDIA Multi-instance GPU

- MIG 소개
- MIG 설정
- MIG 실습

머신러닝기초

- 머신러닝
- 학습종류
- 알고리즘
- Google Colab
- Neural Network
- 딥러닝
- CNN(Convolutional Neural Network), RNN(Recurrent Neural Network)
- Keras
- MNIST

프로세스 및 스레드 기본

- Process
- Thread
- Process vs Thread

Neuron Jupyter 실행 테스트

- Jupyter Project
- Jupyter 환경구성
- Jupyter notebook
- JupyterLab

웹 크롤링 & 데이터 시각화

- 데이터 엔지니어링 & 사이언스
- 웹크롤링
- 데이터 시각화

TCP/IP 기초

- OSI 7-Layer
- OSI 7-Layer & TCP/IP
- TCP/IP

Microsoft Build 2020

- Introduction to Build 2020
- Azure Synapse Link
- Microsoft Teams
- Codespace
- Responsible ML Tools in Azure Machine Learning
- AI supercomputers
- Project Reunion
- Imagine cup
- Microsoft Cloud for Healthcare

Compile 및 Compile Test

- 컴파일
- 컴파일러
- 컴파일 옵션
- 컴파일 옵션 테스트

CentOS 7/8 버전의 차이점

- History of CentOS
- CentOS 8 Distribution & Kernel
- CentOS 7 vs CentOS 8 differences
- Software Management
- Firewall

WRF Build

- Dependencies
- gmp, mpfr, mpc, gcc
- openmpi, jasper ...
- WRF, WPS

Horovod 설치 및 실행 테스트

- horovod 설치
- 실행 테스트 오류
- test set
- test 결과

Compile Option

- Library
- Gromacs 2020.2 Test

Rodinia Benchmark Suite for Heterogeneous Computing

- NVIDIA GPU DIRECT™ Technology
- Rodina Benchmark for HEterogenous System

사용자 애플리케이션 설치 및 테스트

- i-PI
- i-PI installation
- test

Intel OneAPI

- 이기종 컴퓨팅(heterogenous compution)
- Intel OneAPI
- DPC++

slurm 스케쥴러 설치 및 설정

- Slurm Workload Manager History
- Installation Slurm Workload Manager
- Configuration Slurm Workload Manager

회사 홈페이지 리뉴얼 진행사항

- 기존 홈페이지 분석 및 비교
- 개발 도구 및 언어, 프레임워크
- 개발 과정 및 방법
- 참고 사항

VASP Performance Bechmarks on Nurion

- VASP Parallelization Model
- VASP Profiling
- Profiling with MKL_VERBOSE=1
- Available Modules on Nurion System
- Complier Optimization Options
...

계산과학 소프트웨어 동향

- Top 10 Company Market Cap Ranking (1998-2018)
- Technology Trends for 2018 ~ 2019
- FFT libraries

CESM

- About CESM
- 기후 vs 기상
- CESM2 사용자 요청 사항
- 모델 실행 방법
- 실행 스크립트
- 경로 설정

OpenFOAM

- OpenFOAM Overview
- NextFOAM 지원 현황
- SNUTT(선박저항성능연구실) 지원 현황

KAT 시스템에서 GPU 사용방법

- KAT 시스템 GPU Info
- Slurm Workload Manager
- GPU Monitoring 방법
- 응용 SW 5종에 대한 GPU 사용방법

CONDA의 활용

- Conda란?
- Conda vs pip
- Conda Commands
- Conda Environment 생성
- Conda Environment 활성화
...

KAT 시스템 MPI 및 라이브러리 설치

- 시스템 구성, 설치 목록
- gcc-8.3.0 mvapich2 설치, gcc-8.3.0 hdf5 설치
- pgi-19.1 openmpi-3.1.0 설치
- 응용 SW 실행 테스트
- 확정성 테스트
- 속도 비교 테스트

OpenFOAM + PerformanceTools

- arm DDT
- Intel Vtune
- Craypat
- Etc.(APS, IPM)

OpenACC

- OpenACC
- Checking Environment
- PGI Compiler Options
- Pi 예제
- GPU Computing
- OpenACC Target Machine
...

OpenACC+CUDA

- Profiling
- OpenACC + CUDA
- PGI 환경변수
- NVIDA CUDA
...

Process & Trread Affinity

- Thread Affinity (OpenMP 4.0)
- Process Affinity

Neuron 시스템 Conda 문의 테스트

- Conda 환경 구성
- torch 설치
- Conda list --export 사용
- Conda-pack 사용

Introduction to Deep Learning

- Artificial Intellligence
- Machine Learning
- Artificail Neural Networks
- Deep Learning
- Tools for Deep Learning

딥러닝 소개 / 다층 퍼셉트론의 구현 및 학습

- 딥러닝 소개
- AI의 역사
- 인공지능, 머신러닝, 딥러닝
- 딥러닝의 발전 과정
- 딥러닝 모델 종류
...

Benchmark System for Nurion/Neuron

- Motivation: Compilers & Libraries Optimization
- Motivation: Portable Process Pinning across MPI Libraries
- Portable Interfaces for IMPI/OMPI/MVAPICH2
...

Pytorch 기본학습

- TensorFlow vs PyTorch
- Deep Learning Framework
- PyTorch

2019 사용자 지원 내용 정리

- 2019년 누리온, 뉴론 사용자 지원 내역
...

PBS 스케줄러

- PBS Pro 특징
- 명령어 모음
- 도입 사례
- 작업 제출 및 삭제
- 수행 중인 작업 조회
- 종료된 작업 조회

Code Optimization

- Pipelining
- 문자연산
- Cache Conflicts
- Register vs Cache
- Vectorization

WRF-3.9.1 설치

- 설치준비
- 의존성 라이브러리
- 공통 라이브러리 설치
- 병렬 라이브러리 설치

cuda_vector_add

- cuda runtime 함수
- cuda_vector 예제

Docker의 활용

- Docker란?
- 컨테이너란?
- 웹서버 구축 VM / Docker
...

* 프로젝트 정리

...

Intel Xeon Phi KNL Benchmark

- Introduction to KNL
- Benchmark (Quantum Espresso)

OpenMP In CLANG

- 컴파일러
- LLVM
- CLANG
- Accelerator offloading
- How to get it, how to build it:clang

회사 홈페이지의 구조와 진행사항

- Structure
- Code
- Dataflow
- 추가진행사항

OpenACC Part 1

- OpenACC 소개 및 기본 clauses

OpenACC Part 2

- OpenACC 소개 및 기본 clauses

Appendix: Task in OpenMP

- OpenMP 4+
- What's new in 3.1
- What's new in 4.0
- What's new in 4.5

Introduction to Deep Learning

- Artificial Intelligence
- Machine learning
- Artificial neural networks
- Deep learning
- Tools for deep learnig

LVS를 이용한 부하분산

- LVS 소개
- L4 스위치
- Load Balancing 방식 ...

OpenACC

- OpenACC
- Original Code
- 동기 vs 비동기연산
- Pipelining
- Blocking Computation
...

Intel Processor 기술동향

- Intel Xeon Platform Evolution
- Intel Xeon Processor Scalable Family
...

* 프로젝트 진행사항 및 계획

...

Interlanguage Calling

- Interlanguage Calling
- calling C Program
- calling fortran program
- Call by Value
- environment variables

Basic usage of singularity and utilization in HPC

- Singularity
- Docker와 Singularity 차이점
- 설치
- Make a Container
- Pull a Container

Benchmark Test in KAT system

- Quantum Espresso
- VASP

코드 최적화

...

* 병렬화

- ...

* 프로젝트 진행사항

...

* 최적화/병렬화

...

PBS Scheduler 사용

- PBS Pro 특징
- 명령어 모음
- 도입 사례
- 작업 제출 및 삭제
- 수행 중인 작업 조회
- 종료된 작업 조회

NAMD install

- TACHYON2 에서 install
- Virtual Machine에서 install

CUDA Fortran

- GPU Computing
- Basic Concepts
- Determing CUDA HW features and limits
- Compiling CUDA Fortran code
...

NVIDIA CUDA Lecture

- Parallel Programming Models

* 과제

...

Docker 병렬환경 구성

- Docker란?
- Docker 설치하기
- Docker 기본 사용법
- Docker 병렬환경 구성
- 기타

Some of MPI-3

- History of MPI
- Neighborhood Collectives
- Remote Memory Access One-Sided Communications
- Supprot for Clusters of SMP Nodes
- Fortran Bindings
- 기타

Scientific Computing using Python

- Introduction to Python
- Python Basics
- Scientific Modules
- Visualization Modules
- Parallelization

* 과제 진행사항

...

SLURM Scheduler

- SLURM Scheduler

UnderStanding Memory

- Virtual Memory
- Program vs Process
- Storage Class and Scope
- Explicit-share
- Program Size
- Memory Map
- call Stack
...

Fortran 2003 소개

- Fortran 2003
...

OpenHPC

- 소개
- Members
- Current SW Components
- 기본 구성
...

* 방안

...

NIS & PAM 설정

- NIS 설정
- PAM 파일

* 과제 빌드

...

GPU Accelerated Libraries

- CUDA Toolkit
- CUDA-based Libraries

OpenCL 프로그래밍

- 병렬 프로그래밍
- OpenCL, 개발환경
- 호스트와 디바이스
- C언어
- FFT & RNG
- OpenCL의 변화
- OpenCL 함수 레퍼런스

OpenGL for CUDA

- 기본 개념
- 3차원 공간 내의 드로잉 : 지오메트리와 버퍼
- 지오메트리 변환과 파이프라인

The FFT with GPU - cuFFT

- Introduction
- Using the cuFFT API
- cuFFT API Reference
- cuFFT Code Example

CUDA by Example

- CUDA 소개
- CUDA C를 이용한 병렬 프로그래밍
- 쓰레드 협력
- 상수 메모리와 이벤트
- 텍스처 메모리
- 그래픽 상호운용성
- 참고자료

OpenCL 프로그래밍

- 병렬 프로그래밍
- OpenCL, 개발환경
- 호스트와 디바이스
- C언어
- FFT & RNG
- OpenCL의 변화
- OpenCL 함수 레퍼런스

Fortran 2008 Standard Parallel Programming Features in Intel Fortran Composer XE

- Introduction to Fortran 2008
- Submodules
- FORALL -a look back
- DO ConCURRENT
- Coarrays

FEM Fomulation

- FDM vs FEM

Seminar_DomainDecomposition

- 영역 분할법

* OpenCL

- C언어 기초
- OpenCL
- OpenCL Utilities

환경 모듈 패키지

- 환경 변수란?
- 환경 모듈 패키지
- 환경 모듈 정보
- 환경 변수
- 환경 모듈 명령의 이해

OpenStack

- Cloud Computing, 대표서비스
- 가상화
- OpenStack
- OpenStack의 주요 프로젝트, 기타 서비스
...

* OpenCL

- C언어 기초
- OpenCL
- OpenCL Utilities
- OpenCL Runtime API 함수

Introduction to ROOT

- What is ROOT
- Installation
- Usage
- Conventions
- Function
- Histogram
...

OpenMP의 새로운 기능들

- OpenMP 2.0
- OpenMP 2.5
- OpenMP 3.0
- OpenMP 3.1

Introduction to high performance computing for scientists and engineers1

- Introduction to high performance computing for scientists and engineers
...

LUSTRE 병렬 파일 시스템

- LUSTRE 란

최적화사례

- 파이프 난류 유동 코드
- Poisson Solver
- Volume Splines

Optimization On NUMA

- Instruction Parallelism
- Data Parallelism
- Cache
- Out-of-order, Speculative Execution

OPENCL 코드분석

- 사용환경설정
- 코드컴파일
- 실행결과
- 코드설명

OPROFILER

- Tutorial Oprofile
- Install

Software Pipelining, Data Dependency

- Software Pipelining
- Data Dependency

* 기획안

...

Intel Compiler Options

- Optmizing Code in Seven Steps
- Where to Parallelize
- Hotspot analysis using the intel compiler
- Hotspot Analysis Using the Auto-Parallelizer

OPENCL 코드실행

- 사용환경
- 프로파일 설정
- 코드컴파일
- 실행결과
- 코드설명

Wavefront Parallelization

- Pipeline Method
- Gauss-Seidel Algorithm
- Wavefront Parallelization
- Shared Memory Model

코드 최적화/병렬화 사례

...

Introduction of OPENMP 4.0 RC2

- Environment Variables
- target Constructs
- SIMD Constructs

* 프로젝트

...

* 최적병렬화 중간발표

...

Heterogeneous Computing with MIC - Intel Xeon Phi

- Introduction to Intel Architecture
- Optimization and Parallelization for MIC
- Benchmark Test of Quantum Espresso code

Quantum Espresso Optimazation on Intel Xeon Phi

- Introduction to Quantum Espresso code
- Optimization and Parallelization for Xeon Phi

최적/병렬화사례

...

NetCDF 활용

- Overview of netCDF
- Classic Data Model
- Utilities
- Library Architecture
- APIs
- NetCDF-4
- Performance

Profiling & Tracing Tool : TAU

- Tuning and Analysis Utilities
- Profiling & Tracing

NCARG 설치 지원사례

- 필수 라이브러리
- 추가 라이브러리
- NCARG 설치
- 설치 오류 해결
- 예제 실행
- 기타

수치예보의 기초이론 및 WRF-ARW 모델 소개

- 수치예보의 기초이론
- WRF-ARW 모델

Some of MPI-3 & RDMA

- MPI-3
- RDMA

Profiling Tools

- OpenSpeedShop
- Scalasca
- Tau