Showing posts with label s3. Show all posts

Thursday, July 20, 2023

AWS S3 수명 주기 관리: 비용 최적화와 데이터 자동화의 실현

July 20, 2023 / No comments

클라우드 컴퓨팅 시대의 데이터는 비즈니스의 핵심 자산이자 동시에 가장 큰 비용 부담 요인 중 하나입니다. 특히 Amazon S3(Simple Storage Service)는 탁월한 내구성, 무한에 가까운 확장성, 그리고 높은 가용성을 바탕으로 현대적인 애플리케이션의 데이터 저장소로 확고히 자리 잡았습니다. 하지만 이러한 편리함 이면에는 데이터의 폭발적인 증가라는 피할 수 없는 현실이 존재합니다. 매일같이 생성되는 방대한 양의 로그 파일, 사용자 업로드 콘텐츠, 머신러닝 학습 데이터, 임시 데이터, 그리고 수 세대에 걸친 백업 파일들은 시간이 흐름에 따라 관리의 복잡성을 가중시키고 막대한 스토리지 비용을 발생시킵니다. 이른바 '데이터의 무덤'이 되어버린 S3 버킷은 불필요한 비용을 야기할 뿐만 아니라, 정작 중요한 데이터를 찾고 활용하는 데 걸림돌이 되기도 합니다.

이러한 데이터 관리의 딜레마를 해결하기 위해 AWS가 제공하는 가장 강력하고 지능적인 기능이 바로 AWS S3 수명 주기(Lifecycle) 관리입니다. S3 수명 주기 관리는 단순히 오래된 파일을 삭제하는 기능을 넘어, 데이터의 가치 변화와 접근 빈도에 따라 스토리지 비용을 동적으로 최적화하는 데이터 거버넌스의 핵심 도구입니다. 사용자가 사전에 정의한 규칙에 따라 객체(Object)를 비용 효율적인 스토리지 클래스로 자동으로 이동시키거나, 보존 기간이 만료된 데이터를 규정에 맞게 영구적으로 삭제함으로써, 수동 개입 없이도 비용을 절감하고 규정 준수를 자동화하며 데이터 관리의 효율성을 극대화할 수 있습니다. 이 글에서는 S3 수명 주기 관리의 근간을 이루는 핵심 개념부터, 다양한 설정 방법, 그리고 실제 운영 환경에서 마주할 수 있는 고급 활용 전략과 모범 사례까지 심도 있게 탐구합니다.

S3 수명 주기 관리의 기반: 스토리지 클래스와 핵심 구성 요소

효과적인 수명 주기 정책을 수립하기 위해서는 먼저 S3가 제공하는 다양한 스토리지 클래스의 특징과 비용 구조를 이해하고, 수명 주기 규칙을 구성하는 핵심 요소들을 명확히 알아야 합니다. 모든 데이터가 동일한 가치와 접근 빈도를 갖지 않기 때문에, 각 데이터의 특성에 맞는 스토리지 클래스를 선택하는 것이 비용 최적화의 첫걸음입니다.

Amazon S3 스토리지 클래스 심층 분석

S3는 데이터의 접근 패턴과 보존 기간에 따라 최적화된 여러 스토리지 클래스를 제공합니다.

S3 Standard: 가장 일반적인 스토리지 클래스로, 밀리초 단위의 빠른 액세스 속도와 높은 처리량이 특징입니다. 자주 접근하는 웹사이트 콘텐츠, 모바일 애플리케이션 데이터, 실시간 분석 데이터 등 활성 데이터(Hot Data) 저장에 최적화되어 있습니다. 스토리지 비용은 가장 높지만 데이터 검색 비용이 없습니다.
S3 Intelligent-Tiering: 접근 패턴을 예측하기 어려운 데이터나 접근 패턴이 시간에 따라 변하는 데이터에 이상적입니다. S3가 객체의 접근 패턴을 자동으로 모니터링하여, 30일 연속 접근하지 않은 데이터는 'Infrequent Access' 계층으로, 90일 연속 접근하지 않으면 'Archive Instant Access' 계층으로 자동 이동시킵니다. 사용자는 성능에 영향을 받지 않으면서 스토리지 비용을 자동으로 최적화할 수 있습니다. 객체당 소액의 모니터링 및 자동화 비용이 발생합니다.
S3 Standard-IA (Infrequent Access): 자주 접근하지는 않지만 필요할 때 즉시(밀리초 단위) 접근해야 하는 데이터에 적합합니다. 백업, 재해 복구(DR) 파일, 장기 보관 데이터 등이 해당됩니다. S3 Standard보다 스토리지 비용은 저렴하지만, 데이터를 검색할 때 GB당 검색 비용이 부과됩니다. 최소 30일의 보관 기간과 128KB의 최소 객체 크기 요구사항이 있습니다.
S3 One Zone-IA: S3 Standard-IA와 유사하지만, 데이터를 하나의 가용 영역(Availability Zone)에만 저장하여 비용을 약 20% 더 절감한 클래스입니다. 이로 인해 해당 가용 영역에 장애가 발생하면 데이터가 유실될 수 있으므로, 쉽게 재생성할 수 있는 데이터의 사본이나 중요도가 낮은 데이터 저장에 적합합니다.
S3 Glacier Instant Retrieval: 아카이브 데이터 중에서도 분기별 보고서나 로그 분석처럼 가끔이지만 즉시(밀리초 단위) 접근해야 하는 데이터에 최적화된 스토리지 클래스입니다. S3 Standard-IA보다 스토리지 비용이 훨씬 저렴하며, 검색 속도는 동일합니다. 최소 90일의 보관 기간이 요구됩니다.
S3 Glacier Flexible Retrieval: 몇 분에서 몇 시간 내에 검색되어도 괜찮은 장기 아카이브 데이터에 적합합니다. 검색 옵션(Expedited: 1-5분, Standard: 3-5시간, Bulk: 5-12시간)에 따라 비용과 속도가 달라집니다. 데이터 아카이빙의 표준으로, 백업 및 미디어 아카이브에 널리 사용됩니다. 최소 90일의 보관 기간이 필요합니다.
S3 Glacier Deep Archive: S3에서 가장 저렴한 스토리지 클래스로, 12시간 이내에 검색되면 충분한, 거의 접근하지 않는 데이터를 위한 콜드 스토리지입니다. 규제 준수를 위한 데이터(예: 금융 거래 기록, 의료 기록)를 7~10년 이상 보관하는 데 이상적입니다. 최소 180일의 보관 기간이 요구됩니다.

수명 주기 규칙의 세 가지 핵심 질문

모든 S3 수명 주기 규칙은 본질적으로 "어떤 객체를(Filter)", "언제(Timing)", "어떻게 처리할 것인가(Action)"라는 세 가지 질문에 대한 구체적인 답변으로 구성됩니다.

수명 주기 규칙(Lifecycle Rule): 하나 이상의 필터와 액션을 포함하는 정책의 기본 단위입니다. 하나의 버킷에는 여러 개의 독립적인 규칙을 적용할 수 있으며, 각 규칙은 고유한 ID를 가집니다.
필터(Filter): 규칙을 적용할 객체의 범위를 정밀하게 지정합니다. 필터를 지정하지 않으면 버킷 내 모든 객체에 규칙이 적용되므로 매우 신중해야 합니다.
- 접두사(Prefix): 가장 일반적인 필터링 방법으로, 특정 폴더 경로(예: logs/2023/)나 파일명 시작 부분(예: temp-)을 기준으로 객체를 선택합니다.
- 객체 태그(Object Tags): 객체에 할당된 키-값 쌍(예: status: temporary 또는 project: alpha)을 기준으로 규칙을 적용합니다. 접두사와 태그를 함께 사용하여 (AND 조건) "processed-media/ 폴더에 있으면서 tier: archive 태그가 붙은 객체"와 같이 매우 구체적인 조건을 만들 수 있습니다.
- 객체 크기(Object Size): 지정된 크기(최소/최대) 범위에 해당하는 객체에만 규칙을 적용할 수 있습니다. 예를 들어, 1MB보다 작은 객체는 전환하지 않고, 1GB보다 큰 객체만 아카이빙하는 규칙을 설정할 수 있습니다. (전환 작업의 경우 최소 128KB 이상 객체에만 적용 가능)
작업(Action) 및 시점(Timing): 필터 조건에 부합하는 객체에 수행할 작업을 정의합니다. 모든 시점은 객체의 생성일 또는 최신 버전이 된 날을 기준으로 계산됩니다.
- 전환(Transition): 객체를 현재 스토리지 클래스에서 비용이 더 저렴한 다른 스토리지 클래스로 이동시킵니다. 예를 들어, '생성 후 30일이 지난 객체는 S3 Standard-IA로 전환하고, 90일이 지나면 S3 Glacier Flexible Retrieval로 전환'과 같은 다단계 전환 정책을 수립할 수 있습니다.
- 만료(Expiration): 객체를 영구적으로 삭제합니다. 이것이 바로 '객체 자동 삭제' 기능의 핵심입니다. 버전 관리가 활성화된 버킷의 경우, 현재 버전 만료(삭제 마커 생성)와 이전 버전 영구 삭제를 구분하여 설정해야 합니다.
- 불완전한 멀티파트 업로드 중단(Abort Incomplete Multipart Upload): 대용량 파일 업로드 시 사용되는 멀티파트 업로드가 지정된 기간 내에 완료되지 않을 경우, 업로드 조각들을 자동으로 삭제하여 불필요한 비용 발생을 방지합니다. 이는 모든 수명 주기 정책에 기본적으로 포함시키는 것이 좋습니다.

1. AWS Management Console: 시각적이고 직관적인 설정

가장 쉽고 빠르게 수명 주기 규칙을 설정하는 방법은 AWS Management Console의 그래픽 사용자 인터페이스(GUI)를 활용하는 것입니다. 몇 번의 클릭만으로 복잡한 규칙도 시각적으로 정의하고 관리할 수 있어, 처음 접하는 사용자에게 가장 적합한 방법입니다.

1단계: 대상 S3 버킷으로 이동 및 규칙 생성 시작

AWS Management Console에 로그인한 후, 상단 서비스 검색창에서 'S3'를 검색하여 S3 서비스 대시보드로 이동합니다.
수명 주기 규칙을 적용하고자 하는 버킷의 이름을 클릭하여 해당 버킷의 세부 정보 페이지로 진입합니다.
상단 탭 메뉴에서 '관리(Management)' 탭을 선택합니다.
'수명 주기 규칙(Lifecycle rules)' 섹션에서 주황색 '수명 주기 규칙 생성(Create lifecycle rule)' 버튼을 클릭하여 규칙 설정 화면으로 이동합니다.

2단계: 규칙 이름 및 범위(필터) 지정

규칙 이름(Rule name): 규칙의 목적을 명확하게 알 수 있는 이름을 입력합니다. 이름은 버킷 내에서 고유해야 합니다. (예: Log-Files-Auto-Delete-After-180-Days)
규칙 범위 선택(Choose a rule scope):
- 하나 이상의 필터를 사용하여 이 규칙의 범위 제한(Limit the scope of this rule using one or more filters): 대부분의 운영 환경에서 권장되는 옵션입니다. 접두사, 객체 태그, 객체 크기를 조합하여 규칙이 적용될 객체를 정밀하게 타겟팅합니다. 예를 들어, 접두사에 logs/를 입력하면 logs 폴더 내의 객체에만 규칙이 적용되어 다른 중요 데이터에 영향을 미치는 것을 방지할 수 있습니다.
- 버킷의 모든 객체에 적용(Apply to all objects in the bucket): 버킷에 저장된 모든 객체에 예외 없이 규칙을 적용합니다. 의도치 않은 데이터 손실을 유발할 수 있으므로, 버킷의 용도가 매우 명확하고 단일할 때만 신중하게 사용해야 합니다.
화면 하단에서 "이 규칙이 버킷의 객체에 적용될 경우 영구적으로 삭제될 수 있음을 인정합니다."라는 경고문 앞의 확인란에 체크합니다. 이는 사용자가 규칙의 파급 효과를 인지했음을 확인하는 절차입니다.

3단계: 수명 주기 작업(Action) 정의

이 단계에서 필터링된 객체들에게 어떤 작업을 수행할지 구체적으로 정의합니다. 자동 삭제가 목적이라면 '객체의 현재 버전 만료'에 집중합니다.

'수명 주기 규칙 작업(Lifecycle rule actions)' 섹션에서 수행하고자 하는 작업의 확인란을 모두 선택합니다.
- 스토리지 클래스 간 객체의 현재 버전 전환(Transition current versions of objects between storage classes): 비용 최적화를 위해 객체를 다른 스토리지 클래스로 옮길 때 사용합니다.
- 객체의 현재 버전 만료(Expire current versions of objects): 이 옵션을 선택하면 지정된 기간이 지난 객체가 자동으로 삭제됩니다.
- 만료된 객체 삭제 마커 또는 완료되지 않은 멀티파트 업로드 삭제(Delete expired object delete markers or incomplete multipart uploads): 비용 누수를 막기 위해 거의 항상 활성화하는 것을 강력히 권장합니다. '완료되지 않은 멀티파트 업로드 정리' 옵션을 선택하고 정리 시작일을 입력합니다(보통 7일).
'객체의 현재 버전 만료'를 선택했다면, 아래에 나타나는 '객체 생성 후 경과 일수(Number of days after object creation)' 필드에 원하는 일수(예: 180)를 입력합니다. 이는 객체가 S3에 처음 업로드된 시점으로부터 180일이 지나면 삭제 대상이 된다는 것을 의미합니다.

4단계: 규칙 검토 및 생성 완료

마지막 페이지에서 지금까지 설정한 모든 내용을 요약하여 보여줍니다. 규칙 이름, 적용 범위(필터), 수행할 작업, 그리고 각 작업의 시점이 모두 올바른지 다시 한번 꼼꼼하게 확인합니다. 모든 내용이 정확하다면 '규칙 생성(Create rule)' 버튼을 클릭하여 설정을 마칩니다. S3는 일반적으로 규칙 생성 후 24시간에서 48시간 이내에 정책을 시스템에 반영하며, 이후 매일 자정(UTC 기준)에 한 번씩 규칙을 실행하여 조건에 맞는 객체를 처리합니다.

2. AWS CLI: 스크립트를 통한 자동화와 정밀 제어

AWS CLI(Command Line Interface)를 사용하면 수명 주기 규칙을 코드로 정의하고 스크립트를 통해 프로그래밍 방식으로 관리할 수 있습니다. 이는 여러 버킷에 동일한 표준 규칙을 일괄적으로 적용하거나, CI/CD 파이프라인에 통합하여 인프라를 코드로 관리(Infrastructure as Code, IaC)하는 DevOps 환경에서 매우 강력한 힘을 발휘합니다.

1단계: AWS CLI 설치 및 자격 증명 구성

CLI를 사용하기에 앞서, 로컬 개발 환경에 AWS CLI가 설치되어 있어야 합니다. 또한, S3 버킷에 접근할 수 있는 권한(s3:PutLifecycleConfiguration)을 가진 IAM 사용자의 Access Key와 Secret Key로 CLI 프로필이 구성되어야 합니다.

# AWS CLI 설치 여부 및 버전 확인
aws --version

# AWS CLI 프로필 구성 (메시지에 따라 Access Key ID, Secret Access Key, Default region name 입력)
aws configure

2단계: 수명 주기 구성 파일(JSON) 작성

CLI에서는 수명 주기 정책 전체를 JSON 형식의 파일로 정의해야 합니다. 텍스트 편집기를 사용하여 lifecycle-policy.json과 같은 이름으로 파일을 생성합니다. 아래는 복합적인 요구사항을 반영한 정책 예시입니다.

{
  "Rules": [
    {
      "ID": "ComprehensiveDataLifecyclePolicy",
      "Status": "Enabled",
      "Filter": {
        "Prefix": "documents/"
      },
      "Transitions": [
        {
          "Days": 60,
          "StorageClass": "STANDARD_IA"
        },
        {
          "Days": 180,
          "StorageClass": "GLACIER_IR"
        }
      ],
      "Expiration": {
        "Days": 1095
      },
      "NoncurrentVersionTransitions": [
        {
          "NoncurrentDays": 30,
          "StorageClass": "STANDARD_IA"
        }
      ],
      "NoncurrentVersionExpiration": {
        "NoncurrentDays": 365
      },
      "AbortIncompleteMultipartUpload": {
        "DaysAfterInitiation": 7
      }
    }
  ]
}

위 JSON 파일의 각 필드는 다음과 같은 정교한 데이터 관리 정책을 정의합니다.

ID: 규칙을 식별하는 고유한 이름 (ComprehensiveDataLifecyclePolicy).
Status: 규칙을 활성화(Enabled)합니다. 테스트 중에는 Disabled로 설정할 수 있습니다.
Filter.Prefix: documents/ 폴더 아래의 모든 객체에 이 규칙을 적용합니다.
Transitions: 현재 버전 객체에 대한 다단계 전환 규칙입니다.
- 생성 후 60일이 지나면 STANDARD_IA로 전환합니다.
- 생성 후 180일이 지나면 GLACIER_IR(Glacier Instant Retrieval)로 다시 전환합니다.
Expiration: 현재 버전 객체를 생성 후 1095일(약 3년)이 지나면 영구 삭제합니다.
Noncurrent...: (버전 관리가 활성화된 버킷용) 이전 버전 객체에 대한 규칙입니다.
- 이전 버전이 된 지 30일이 지나면 STANDARD_IA로 전환합니다.
- 이전 버전이 된 지 365일이 지나면 영구 삭제합니다.
AbortIncompleteMultipartUpload: 시작된 지 7일이 지나도 완료되지 않은 멀티파트 업로드를 중단하고 관련 데이터를 삭제합니다.

3단계: CLI 명령어로 수명 주기 구성 적용 및 확인

작성한 JSON 파일을 put-bucket-lifecycle-configuration 명령어를 사용하여 대상 버킷에 적용합니다. --bucket 파라미터에는 실제 버킷 이름을, --lifecycle-configuration 파라미터에는 file:// 접두사와 함께 JSON 파일 경로를 지정합니다.

aws s3api put-bucket-lifecycle-configuration \
    --bucket YOUR-BUCKET-NAME \
    --lifecycle-configuration file://lifecycle-policy.json

명령이 성공적으로 실행되면 아무런 출력이 표시되지 않습니다. 현재 버킷에 적용된 구성을 확인하려면 get-bucket-lifecycle-configuration 명령을, 규칙 전체를 제거하려면 delete-bucket-lifecycle 명령을 사용할 수 있습니다.

# 현재 적용된 수명 주기 구성 확인
aws s3api get-bucket-lifecycle-configuration --bucket YOUR-BUCKET-NAME

# 버킷의 모든 수명 주기 규칙 삭제
aws s3api delete-bucket-lifecycle --bucket YOUR-BUCKET-NAME

3. AWS SDK (Boto3): 애플리케이션과의 완벽한 통합

애플리케이션 로직 내에서 동적으로 수명 주기 규칙을 생성, 수정 또는 삭제해야 할 경우 AWS SDK를 사용합니다. 여기서는 가장 널리 사용되는 Python용 AWS SDK인 Boto3를 예로 들어 설명합니다. 이는 멀티 테넌트 SaaS 애플리케이션에서 각 고객의 버킷에 맞춤형 데이터 보존 정책을 프로그래밍 방식으로 적용하는 등의 고급 시나리오에 유용합니다.

1단계: Boto3 라이브러리 설치 및 자격 증명 설정

먼저 Python 개발 환경에 Boto3 라이브러리를 설치해야 합니다. AWS 자격 증명은 AWS CLI와 마찬가지로 환경 변수, EC2 인스턴스 프로필(IAM 역할) 등을 통해 자동으로 인식되도록 구성되어 있어야 합니다.

pip install boto3

2단계: Python 스크립트로 수명 주기 정책 관리

Python 스크립트 내에서 수명 주기 구성을 딕셔너리(Dictionary) 형태로 정의하고, Boto3의 S3 클라이언트를 사용하여 버킷에 적용합니다.

import boto3
from botocore.exceptions import ClientError
import logging

# 로깅 설정
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')

def apply_s3_lifecycle_policy(bucket_name: str, lifecycle_policy: dict) -> bool:
    """
    지정된 S3 버킷에 수명 주기 정책을 적용합니다.

    :param bucket_name: 정책을 적용할 버킷 이름
    :param lifecycle_policy: 적용할 수명 주기 정책 (Python 딕셔너리)
    :return: 성공 시 True, 실패 시 False
    """
    try:
        s3_client = boto3.client('s3')
        s3_client.put_bucket_lifecycle_configuration(
            Bucket=bucket_name,
            LifecycleConfiguration=lifecycle_policy
        )
        logging.info(f"Successfully applied lifecycle policy to bucket '{bucket_name}'.")
    except ClientError as e:
        if e.response['Error']['Code'] == 'NoSuchBucket':
            logging.error(f"Error: Bucket '{bucket_name}' does not exist.")
        else:
            logging.error(f"An unexpected error occurred: {e}")
        return False
    return True

if __name__ == '__main__':
    # 대상 버킷 이름
    target_bucket = 'YOUR-TARGET-BUCKET-NAME'

    # 적용할 수명 주기 정책 정의
    # 임시 이미지 캐시(temp-images/)는 7일 후 삭제,
    # 사용자 업로드 원본(uploads/raw/)은 30일 후 IA로 전환하고 365일 후 삭제
    policy = {
        'Rules': [
            {
                'ID': 'TempImageCacheDeletion',
                'Filter': {
                    'Prefix': 'temp-images/'
                },
                'Status': 'Enabled',
                'Expiration': {
                    'Days': 7
                },
                 'AbortIncompleteMultipartUpload': {
                    'DaysAfterInitiation': 1
                }
            },
            {
                'ID': 'UserUploadsLifecycle',
                'Filter': {
                    'Tag': {
                        'Key': 'data-type',
                        'Value': 'user-upload-raw'
                    }
                },
                'Status': 'Enabled',
                'Transitions': [
                    {
                        'Days': 30,
                        'StorageClass': 'STANDARD_IA'
                    }
                ],
                'Expiration': {
                    'Days': 365
                }
            }
        ]
    }

    # 함수를 호출하여 정책 적용
    apply_s3_lifecycle_policy(target_bucket, policy)

이 스크립트는 두 개의 독립적인 규칙을 정의합니다. 하나는 접두사를 기준으로 7일 된 임시 파일을 삭제하고, 다른 하나는 객체 태그를 기준으로 사용자 원본 파일을 30일 후 Standard-IA로 전환하고 1년 후에 삭제합니다. 이처럼 SDK를 사용하면 애플리케이션의 비즈니스 로직과 긴밀하게 연계하여 매우 동적이고 복잡한 데이터 관리 워크플로우를 완벽하게 자동화할 수 있습니다.

실용적인 수명 주기 전략과 고급 모범 사례

단순히 객체를 삭제하거나 전환하는 것을 넘어, 수명 주기 규칙을 전략적으로 사용하여 비용, 성능, 규정 준수 요구사항의 균형을 맞추는 것이 중요합니다. 실제 운영 환경에서는 다음과 같은 전략과 모범 사례를 고려해야 합니다.

1. 버전 관리(Versioning)와 수명 주기의 공생 관계

중요한 데이터가 저장된 프로덕션 버킷에는 실수로 인한 데이터 삭제나 덮어쓰기를 방지하기 위해 S3 버전 관리를 활성화하는 것이 좋습니다. 버전 관리가 활성화되면 객체를 덮어쓰거나 삭제할 때 이전 버전이 사라지지 않고 보존됩니다. 삭제 시에는 '삭제 마커(Delete Marker)'라는 특수한 객체가 최신 버전으로 생성될 뿐입니다. 이 경우, 비용이 무한정 증가하는 것을 막기 위해 수명 주기 규칙을 반드시 두 단계로 설정해야 합니다.

현재 버전(Current Version)에 대한 규칙: 사용자가 객체를 삭제하면 생성되는 '삭제 마커'를 처리하거나, 최신 버전의 객체를 시간이 지남에 따라 전환/만료시키는 규칙입니다. 예를 들어, '현재 버전 생성 후 365일이 지나면 만료(삭제 마커 생성)'를 설정할 수 있습니다.
이전 버전(Noncurrent Version)에 대한 규칙: 덮어쓰기나 삭제로 인해 생성된 과거 버전들을 처리하는 규칙입니다. 이것이 비용 절감의 핵심입니다. 예를 들어, '이전 버전이 된 지 90일이 지난 객체는 영구 삭제'와 같은 규칙을 추가하여 스토리지 공간을 실제로 확보해야 합니다.
만료된 객체 삭제 마커 정리: 한 객체의 모든 이전 버전이 삭제되고 삭제 마커만 남은 경우, 이 불필요한 삭제 마커를 정리하는 규칙(Delete expired object delete markers)도 함께 설정하는 것이 좋습니다.

이러한 규칙들을 조합해야만 버전 관리의 데이터 보호 기능을 온전히 누리면서도 불필요한 스토리지 비용 증가를 방지할 수 있습니다.

2. 데이터 접근 빈도에 따른 계층적 비용 최적화 전략

모든 데이터를 비싼 S3 Standard에 보관하는 것은 비효율적입니다. 데이터의 가치는 시간에 따라 변하는 경우가 많으므로, 이를 수명 주기 규칙에 반영하여 계층적 스토리지 전략을 자동화할 수 있습니다.

초기 (0 ~ 30일): 자주 접근하는 활성 데이터. S3 Standard에 보관하여 최고의 성능을 보장합니다.
중기 (31 ~ 180일): 접근 빈도가 줄어들지만 즉시 필요할 수 있는 데이터. S3 Standard-IA 또는 S3 Glacier Instant Retrieval로 전환하여 스토리지 비용을 절감합니다. 접근 패턴이 불규칙하다면 S3 Intelligent-Tiering이 훌륭한 대안입니다.
장기 보관 (181일 ~ 수년): 거의 접근하지 않는 아카이브 데이터. 규정 준수 또는 향후 분석을 위해 보관. S3 Glacier Flexible Retrieval 또는 S3 Glacier Deep Archive로 전환하여 비용을 극적으로 낮춥니다.
영구 삭제 (수년 이후): 법적/규제적 보관 기간이 만료된 데이터. Expiration 규칙을 통해 영구 삭제하여 비용 발생을 원천적으로 차단하고 규정 준수 리스크를 줄입니다.

3. '작은 객체 문제(Small Object Problem)' 회피 전략

수명 주기 전환 작업에는 객체 1,000개당 발생하는 요청 비용이 있습니다. 만약 수백만 개의 작은 파일(예: 수 KB 크기의 로그나 센서 데이터)을 S3 Standard-IA 등으로 전환하면, 스토리지 비용 절감 효과보다 전환 요청 비용이 더 커져 오히려 전체 비용이 증가하는 역효과가 발생할 수 있습니다. 이를 '작은 객체 문제'라고 합니다.

사전 집계(Pre-aggregation): 작은 파일들을 S3에 업로드하기 전에, Amazon Kinesis Data Firehose나 AWS Lambda 함수 등을 사용하여 여러 파일을 하나의 큰 파일(예: 수십~수백 MB의 tar.gz 또는 Parquet 파일)로 묶어서 저장하는 것이 가장 근본적인 해결책입니다.
사후 집계(Post-aggregation): 이미 저장된 작은 파일들은 주기적으로 실행되는 AWS Lambda나 AWS Batch 작업을 통해 더 큰 아카이브 파일로 합친 후, 원본 작은 파일들은 삭제하는 전략을 고려할 수 있습니다.
전환 대신 만료: 작은 객체들은 비용 효율적인 전환이 어려우므로, 전환 단계를 건너뛰고 특정 기간이 지나면 바로 만료(삭제)시키는 것이 더 경제적일 수 있습니다.

모니터링, 감사 및 문제 해결

수명 주기 규칙을 설정한 후에는 의도대로 잘 작동하는지, 예상치 못한 부작용은 없는지 지속적으로 확인하는 것이 중요합니다.

Amazon CloudWatch: S3 버킷의 크기(BucketSizeBytes)와 객체 수(NumberOfObjects) 메트릭을 대시보드에서 시각화하고 모니터링합니다. 만료 규칙이 실행될 것으로 예상되는 시점 이후에 이 수치들이 실제로 감소하는지 확인하고, 변화가 없을 경우 알람을 설정하여 문제를 조기에 발견할 수 있습니다.
AWS CloudTrail: S3에서 발생하는 모든 API 호출을 기록합니다. S3 수명 주기에 의해 객체가 실제로 삭제되거나 전환되었는지 확인하려면 CloudTrail 로그에서 S3.LIFECYCLE.DELETE 또는 S3.LIFECYCLE.TRANSITION과 같은 이벤트를 검색할 수 있습니다. 이는 감사 추적 및 문제 해결에 필수적인 정보를 제공합니다.
S3 Storage Lens: 조직 전체의 S3 스토리지 사용량 및 활동 추세에 대한 포괄적인 가시성을 제공하는 강력한 분석 도구입니다. 대화형 대시보드를 통해 데이터 연령 분포, 스토리지 클래스별 비중, 접근 빈도가 높은 접두사 등을 한눈에 파악할 수 있습니다. 이를 통해 새로운 수명 주기 규칙의 기회를 발견하고, 기존 정책의 효과를 시각적으로 분석할 수 있습니다.
S3 Inventory: 버킷 내 모든 객체의 목록과 메타데이터(스토리지 클래스, 생성일 등)를 담은 보고서(CSV, ORC, Parquet 형식)를 정기적으로 생성하는 기능입니다. 이 보고서를 Amazon Athena로 쿼리하여 "지난주에 S3 Standard-IA로 전환된 객체는 총 몇 개인가?"와 같은 복잡한 질문에 대한 답을 얻음으로써 정책의 실행을 정밀하게 검증할 수 있습니다.

결론적으로, AWS S3 수명 주기 관리는 단순히 오래된 파일을 지우는 소극적인 관리 도구가 아닙니다. 이는 데이터의 생애 전체를 아우르는 동적이고 지능적인 거버넌스 프레임워크이며, 클라우드 스토리지의 비용, 성능, 규정 준수를 최적화하는 전략적 필수 요소입니다. 데이터의 특성과 비즈니스 요구사항에 맞는 정교한 규칙을 설계하고 자동화함으로써, 개발자와 클라우드 관리자는 반복적이고 소모적인 작업에서 해방되어 진정으로 비즈니스 가치를 창출하는 데 집중할 수 있게 될 것입니다.

Continue Reading →

AWS S3コスト最適化の核心：ライフサイクル管理の徹底活用

July 20, 2023 / No comments

Amazon Web Services (AWS) が提供するAmazon S3 (Simple Storage Service) は、オブジェクトストレージのデファクトスタンダードとして、世界中のあらゆる規模の組織で利用されています。99.999999999% (イレブンナイン) という驚異的なデータ耐久性、事実上無限のスケーラビリティ、そして比類なき柔軟性により、データのバックアップ、データレイクの基盤、静的ウェブサイトのホスティング、アプリケーションデータの保存など、そのユースケースは枚挙にいとまがありません。しかし、この利便性の裏側で、多くの組織が共通の課題に直面します。それは「データの増殖」とそれに伴う「ストレージコストの増大」および「管理の複雑化」です。データは生成された瞬間からその価値を変化させ、時間とともにその重要性やアクセス頻度は低下していくのが常です。この避けられないデータの時間的価値の変化に対応し、S3を真にコスト効率良く、かつ安全に運用するための鍵となるのが「S3ライフサイクル管理」機能です。本稿では、この強力な機能を基礎から応用まで徹底的に解剖し、不要なオブジェクトの自動削除、ストレージクラスの最適化によるコスト削減、そしてコンプライアンス要件への対応を自動化する方法を、初心者から熟練のクラウドエンジニアまで、すべてのユーザーに役立つ深いレベルで解説します。

第一部：データライフサイクル管理の本質とS3における重要性

S3ライフサイクル管理の詳細に入る前に、まず「データライフサイクル管理（Data Lifecycle Management, DLM）」という概念そのものを理解することが不可欠です。すべてのデータには、その生成から最終的な破棄に至るまでの一連のフェーズ、すなわち「ライフサイクル」が存在します。

例えば、あるeコマースサイトでユーザーが商品を注文した際に生成される注文データを考えてみましょう。

作成とアクティブ利用：注文が作成された直後、このデータは注文処理、在庫引き当て、顧客への通知など、複数のシステムから頻繁にアクセスされます。この段階では、データは即座に利用可能である必要があり、パフォーマンスが最優先されます。
短期的なアクセス：注文処理が完了した後も、数週間から数ヶ月の間は、返品処理、顧客サポートからの問い合わせ対応、短期的な売上分析などでアクセスされる可能性があります。アクセス頻度は低下しますが、依然として比較的迅速なアクセスが求められます。
長期的なアーカイブ：1年が経過すると、個別の注文データが直接参照されることは稀になります。しかし、年次の財務報告、長期的な販売傾向分析、法的な監査要件のために、データを保持しておく必要があります。この段階では、アクセス速度よりも保管コストの低さが重要になります。
破棄：法的に定められた保管期間（例えば7年間）が終了すると、そのデータを保持し続けることは、ストレージコストの無駄遣いであるだけでなく、GDPRなどのプライバシー規制下ではリスクにもなり得ます。この段階で、データは安全かつ確実に削除されるべきです。

データライフサイクル管理とは、このようなデータの価値と利用パターンの変化に合わせて、データを適切な場所に配置し、管理し、最終的に破棄するまでの一連のプロセスを体系的に行うことです。この管理がS3環境において極めて重要である理由は、主に以下の3つの側面に集約されます。

1. 劇的なコスト最適化

AWS S3の料金体系は、主に保存するデータ量（GB単位）と、選択するストレージクラスによって決定されます。S3は多様なアクセスパターンとコスト要件に対応するため、複数のストレージクラスを提供しています。

S3 Standard: 高い可用性とパフォーマンスを誇り、頻繁にアクセスされるデータに最適。最も料金が高い。
S3 Standard-Infrequent Access (Standard-IA): アクセス頻度は低いが、必要時には即座に取り出す必要があるデータ向け。S3 Standardよりストレージ料金は安いが、データ取得料金が発生する。
S3 One Zone-IA: Standard-IAと同様だが、データを単一のアベイラビリティゾーンにのみ保存するため、可用性は低いがさらに低コスト。
S3 Glacier Instant Retrieval: アーカイブデータ向けだが、ミリ秒単位でのアクセスが可能。四半期に一度程度のアクセスを想定。
S3 Glacier Flexible Retrieval: 数分から数時間の範囲でデータを取り出せる、低コストなアーカイブストレージ。
S3 Glacier Deep Archive: S3で最も低コストなストレージクラス。データの取り出しに数時間以上を要し、長期的なデータ保管（規制、コンプライアンス）に最適。
S3 Intelligent-Tiering: アクセスパターンが不明または変動するデータに最適な特殊なクラス。オブジェクトへのアクセスを監視し、アクセスがない期間に応じて自動的に低コストなアクセス階層に移動させる。

ライフサイクル管理を導入することで、「作成後30日はS3 Standardに置き、その後90日間はS3 Standard-IAへ、さらにその後はS3 Glacier Deep Archiveへ移動させ、7年後には完全に削除する」といったポリシーを自動化できます。これにより、手動での介入なしに、データのライフサイクルに合わせて最適なコストのストレージクラスへとデータを移行させ、最終的には不要なデータを削除してコストをゼロにすることが可能になります。これは、テラバイト、ペタバイト級のデータを扱う環境においては、数千ドルから数万ドル、あるいはそれ以上の月額コスト削減に直結します。

2. コンプライアンスとガバナンスの徹底

現代のビジネス環境は、GDPR（EU一般データ保護規則）、CCPA（カリフォルニア州消費者プライバシー法）、HIPAA（医療保険の相互運用性と説明責任に関する法律）、そして日本の個人情報保護法など、厳格なデータ保護規制に囲まれています。これらの規制の多くは、「データ最小化の原則」や「保管期間の制限」を要求しており、事業目的上不要になった個人データを定められた期間内に確実に削除することを義務付けています。 S3ライフサイクルルールによる自動削除ポリシーは、これらのコンプライアンス要件を満たすための強力なメカニズムです。例えば、「`personal-data/` プレフィックスを持つオブジェクトは、作成から5年後に自動的に削除する」というルールを設定すれば、人為的なミスや失念によるコンプライアンス違反のリスクを大幅に低減できます。監査の際には、このライフサイクル設定自体が、データ保持ポリシーを遵守していることの技術的な証拠として機能します。

3. 運用効率の飛躍的な向上

もしライフサイクル管理がなければ、不要なデータのクリーンアップはどのように行われるでしょうか。おそらく、エンジニアが定期的に手動でスクリプトを実行し、特定の基準（作成日など）に基づいて古いファイルをリストアップし、削除することになるでしょう。このプロセスは、以下のような多くの問題を抱えています。

時間とリソースの浪費: エンジニアの貴重な時間を、本来であればもっと価値の高い開発や改善作業に費やすべきところを、ルーチン的なクリーンアップ作業に割くことになります。
ヒューマンエラーのリスク: 手動スクリプトのロジックに誤りがあった場合、あるいは操作ミスを犯した場合、本来削除すべきでない重要なデータを誤って削除してしまう可能性があります。
スケーラビリティの欠如: データ量が数百万、数千万オブジェクトに達すると、手動スクリプトの実行自体が長時間に及び、パフォーマンス上の問題を引き起こす可能性があります。

S3ライフサイクル管理は、このプロセスを完全に自動化し、AWSの堅牢なインフラストラクチャ上で実行します。一度設定すれば、あとはS3が24時間365日、定義されたポリシーに従って粛々とデータを管理してくれます。これにより、運用チームは日々のメンテナンス作業から解放され、より戦略的な業務に集中できるようになります。

第二部：S3ライフサイクルルールの仕組みと設定方法

S3ライフサイクルルールは、「どのオブジェクト（What）」を、「いつ（When）」、「どうするか（Action）」を定義したポリシーの集合体です。S3は、毎日UTC（協定世界時）の午前0時に、バケットに設定されたすべてのライフサイクルルールを評価し、条件に合致するオブジェクトに対して定義されたアクション（ストレージクラスの移行や削除）を実行するキューに追加します。実際の実行は非同期で行われるため、ルールが適用可能になってからアクションが完了するまでに多少の遅延が生じることがあります。

設定方法は、主にAWSマネジメントコンソール（GUI）、AWS CLI（コマンドライン）、AWS SDK（プログラム経由）の3つがあり、それぞれのユースケースに応じて最適な方法を選択します。

設定方法1：AWSマネジメントコンソールを利用した直感的な手順

最も手軽で視覚的に分かりやすいのが、ウェブブラウザからアクセスするAWSマネジメントコンソールを使用する方法です。ここでは、一時的なログファイルを30日後に削除するという一般的なシナリオを例に、具体的な手順を追っていきましょう。

ステップ1：AWSマネジメントコンソールへのログインとS3バケットの選択

まず、ご自身のAWSアカウントでAWSマネジメントコンソールにログインします。サービス一覧から「S3」を選択し、S3のダッシュボードに移動します。バケットのリストが表示されるので、ライフサイクルルールを設定したいバケット名をクリックしてください。

ステップ2：ライフサイクルルールの作成開始

バケットの詳細ページに移動したら、「管理」タブをクリックします。ページの中ほどに「ライフサイクルルール」セクションがありますので、「ライフサイクルルールを作成する」ボタンをクリックして作成画面に進みます。

ステップ3：ルール名とスコープ（適用範囲）の定義

最初に、ルールの目的がひと目で分かるような具体的な名前を付けます。例えば、「log-files-30-day-expiration」などが良いでしょう。

次に、このルールの影響範囲を決定する「スコープ」を定義します。これは誤操作によるデータ損失を防ぐ上で最も重要なステップです。

プレフィックス、または1つ以上のオブジェクトタグに基づいてこのルールをフィルタリングする: このオプションを選択することを強く推奨します。
- プレフィックスによるフィルタ: 特定のフォルダ（S3ではプレフィックスと呼びます）内のオブジェクトのみを対象にします。例えば、`logs/` と指定すれば、`logs/` で始まるキーを持つすべてのオブジェクト（`logs/app-server-1/2023-10-27.log`など）が対象となります。
- オブジェクトタグによるフィルタ: オブジェクトに付与されたタグに基づいてルールを適用します。例えば、`status: temporary` というタグを持つオブジェクトだけを、プレフィックスに関係なくバケット全体から探し出して削除することが可能です。複数のタグを指定してAND条件（例: `project: phoenix` AND `data-class: temporary`）でフィルタリングすることもでき、非常に柔軟な管理が実現します。
このルールをバケット内のすべてのオブジェクトに適用する: バケット全体にルールを適用する場合に選択します。この設定は影響範囲が非常に大きいため、そのバケットが一時ファイル専用であることが明確な場合などを除き、慎重に検討する必要があります。

今回の例では、「プレフィックスによるフィルタ」を選択し、「`logs/`」と入力します。

ステップ4：ライフサイクルルールアクションの選択

次に、フィルタリングされたオブジェクトに対して何を行うかを定義します。自動削除に関連する主要なアクションは以下の通りです。

オブジェクトの現行バージョンを失効させる: オブジェクトが作成されてから指定した日数後（例: 30日）に、そのオブジェクトを期限切れにします。バケットのバージョニングが有効になっていない場合、これはオブジェクトの完全な削除を意味します。バージョニングが有効な場合は、オブジェクト自体は「非現行バージョン」として残り、代わりに「削除マーカー」が最新のバージョンとして作成されます。
オブジェクトの非現行バージョンを完全に削除する: バージョニングが有効なバケットでのみ表示されます。オブジェクトが非現行バージョンになってから（つまり、上書きされるか削除マーカーが作成されてから）指定した日数後に、その古いバージョンを完全に削除します。バージョニングによるストレージコストの増大を防ぐためには必須の設定です。
期限切れのオブジェクト削除マーカーを削除する: バージョニングが有効なバケットで、あるオブジェクトのバージョンがすべて削除され、削除マーカーだけが残っている場合に、その孤立した削除マーカーをクリーンアップします。これを放置すると、LIST操作のパフォーマンスにわずかながら影響を与えたり、意図しない混乱を招いたりする可能性があるため、設定が推奨されます。
不完全なマルチパートアップロードを削除する: S3では大きなファイルをアップロードする際に、ファイルを複数のパートに分割してアップロードする「マルチパートアップロード」が利用されます。何らかの理由でこのアップロードが中断されると、アップロードされなかった不完全なデータ（パート）がS3内に残り、ストレージ料金が発生し続けます。このアクションは、アップロードの開始から指定した日数後（例: 7日後）に、これらの中途半端なデータを自動的にクリーンアップします。これは、データ損失のリスクがほぼゼロでありながらコスト削減効果が高いため、ほぼすべてのバケットで設定すべきベストプラクティスと言えます。

今回のシナリオでは、「オブジェクトの現行バージョンを失効させる」にチェックを入れ、「オブジェクト作成の n 日後」に「30」と入力します。また、安全のために「不完全なマルチパートアップロードを削除する」にもチェックを入れ、「7」日と設定しておくと良いでしょう。

ステップ5：ルールの作成と確認

設定内容に間違いがないかを確認し、画面下部の「ルールを作成」ボタンをクリックします。これで設定は完了です。作成されたルールがリストに表示されます。S3は次のUTC午前0時からこのルールの評価を開始し、条件に合致するオブジェクトを削除キューに追加します。

設定方法2：AWS CLI を利用した Infrastructure as Code (IaC)

AWS CLI (Command Line Interface) を使用すると、ライフサイクル設定をコード（JSONファイル）として管理できます。これにより、設定のバージョン管理、複数のバケットへの一括適用、CI/CDパイプラインへの組み込みなどが可能になり、Infrastructure as Code (IaC) のプラクティスを実践できます。

ステップ1：AWS CLIのインストールと設定

まだインストールしていない場合は、公式ドキュメントを参照してAWS CLIをインストールしてください。インストール後、ターミナルまたはコマンドプロンプトで `aws configure` コマンドを実行し、AWSアクセスキー、シークレットアクセスキー、デフォルトリージョン、出力形式を設定します。

ステップ2：ライフサイクル設定ファイルの作成 (JSON)

ライフサイクルルールはJSON形式のファイルで定義します。お好みのテキストエディタで `lifecycle-config.json` のような名前のファイルを作成します。以下は、複数のルール（移行と削除）を含む、より実践的な例です。

{
  "Rules": [
    {
      "ID": "MoveLogsToGlacierAndExpire",
      "Status": "Enabled",
      "Filter": {
        "Prefix": "audit-logs/"
      },
      "Transitions": [
        {
          "Days": 90,
          "StorageClass": "GLACIER"
        }
      ],
      "Expiration": {
        "Days": 2555
      },
      "NoncurrentVersionExpiration": {
        "NoncurrentDays": 30
      }
    },
    {
      "ID": "AbortIncompleteUploadsRule",
      "Status": "Enabled",
      "Filter": {},
      "AbortIncompleteMultipartUpload": {
        "DaysAfterInitiation": 7
      }
    }
  ]
}

このJSONファイルは2つのルールを定義しています：

MoveLogsToGlacierAndExpire: `audit-logs/` プレフィックス内のオブジェクトを対象とします。
- 作成から90日後にS3 Glacier (Flexible Retrieval) ストレージクラスへ移行します。
- 作成から2555日後（約7年後）に現行バージョンを失効させます。
- 非現行になったバージョンは、30日後に完全に削除します。
AbortIncompleteUploadsRule: バケット全体（`Filter`が空のため）を対象とし、開始から7日以上経過した不完全なマルチパートアップロードを中止・削除します。

ステップ3：ライフサイクル設定の適用

作成したJSONファイルを使って、以下のコマンドを実行します。`YOUR-BUCKET-NAME` の部分とファイルパスは、ご自身の環境に合わせて変更してください。

aws s3api put-bucket-lifecycle-configuration \
    --bucket YOUR-BUCKET-NAME \
    --lifecycle-configuration file://lifecycle-config.json

コマンドが成功すると、通常は何も出力されません。設定が正しく適用されたかを確認するには `get-bucket-lifecycle-configuration` コマンドを、設定をすべて削除するには `delete-bucket-lifecycle` コマンドを使用します。

# 設定の確認
aws s3api get-bucket-lifecycle-configuration --bucket YOUR-BUCKET-NAME

# 設定の削除
aws s3api delete-bucket-lifecycle --bucket YOUR-BUCKET-NAME

設定方法3：AWS SDK を利用したアプリケーションからの動的制御

アプリケーションのロジックに基づいて、バケット作成時や特定のイベント発生時にライフサイクルルールを動的に設定・変更したい場合は、各種プログラミング言語向けのAWS SDKを使用します。ここでは、広く使われているPython用のSDK「Boto3」を例に説明します。

ステップ1：Boto3のインストールと設定

まず、Python環境にBoto3ライブラリをインストールします。

pip install boto3

AWS認証情報の設定は、AWS CLIと同様に `aws configure` を用いるか、IAMロール（EC2インスタンスやLambda関数で実行する場合に推奨）、環境変数などを通じて行います。

ステップ2：ライフサイクル設定を適用するPythonスクリプト

以下のPythonスクリプトは、Boto3を使ってバケットに包括的なライフサイクル設定を適用する例です。

import boto3
from botocore.exceptions import ClientError

# 適用したいライフサイクル設定をPythonの辞書として定義
# 構造はCLIで用いたJSONとほぼ同一
lifecycle_configuration = {
    'Rules': [
        {
            'ID': 'DocumentArchiveAndExpirationRule',
            'Filter': {
                'Tag': {
                    'Key': 'data-lifecycle',
                    'Value': 'archive-7-years'
                }
            },
            'Status': 'Enabled',
            'Transitions': [
                {
                    'Days': 30,
                    'StorageClass': 'STANDARD_IA'
                },
                {
                    'Days': 90,
                    'StorageClass': 'GLACIER_IR' # Glacier Instant Retrieval
                }
            ],
            'Expiration': {
                'Days': 2555 # 7 years
            },
            'NoncurrentVersionExpiration': {
                'NoncurrentDays': 180
            },
            'AbortIncompleteMultipartUpload': {
                'DaysAfterInitiation': 7
            }
        }
    ]
}

# S3クライアントのインスタンスを作成
s3_client = boto3.client('s3')
bucket_name = 'YOUR-BUCKET-NAME' # 対象のバケット名を指定

try:
    # put_bucket_lifecycle_configuration APIを呼び出し
    s3_client.put_bucket_lifecycle_configuration(
        Bucket=bucket_name,
        LifecycleConfiguration=lifecycle_configuration
    )
    print(f"Successfully applied lifecycle configuration to bucket '{bucket_name}'.")

except ClientError as e:
    # エラーハンドリング
    error_code = e.response['Error']['Code']
    print(f"Error applying lifecycle configuration: {error_code} - {e}")

この例では、`data-lifecycle: archive-7-years` というタグが付いたオブジェクトのみを対象としています。オブジェクトは作成30日後にStandard-IAへ、90日後にGlacier Instant Retrievalへ移行し、最終的に7年後に失効するという、より洗練されたルールを設定しています。このようにSDKを使えば、アプリケーションの要件に応じて、タグベースの非常にきめ細かいライフサイクルポリシーをプログラムで管理することが可能になります。

第三部：鉄壁の運用を実現するベストプラクティスと注意点

自動オブジェクト削除と移行は非常に強力ですが、設定ミスは意図しないデータ損失や予期せぬコストに直結する可能性があります。以下のベストプラクティスを遵守し、安全かつ効果的にライフサイクル管理を運用しましょう。

本番適用前の徹底的なテスト: いきなり本番環境の重要なデータが入ったバケットにルールを適用することは絶対に避けるべきです。まず、テスト専用のバケットを作成し、少量のダミーデータを使ってシミュレーションを行いましょう。ライフサイクル期間を「1日」のような非常に短い期間に設定すれば、ルールが意図通りに動作するか（オブジェクトが翌日に移行または削除されるか）を迅速に確認できます。CloudTrailログやS3サーバーアクセスログで、実際にライフサイクルによってアクションが実行されたことを確認するまでがテストです。
S3バージョニングの積極的な活用: 重要なデータを扱うバケットでは、必ず「バージョニングを有効化」してください。バージョニングが有効であれば、ユーザーが誤ってオブジェクトを上書きしたり削除したりしても、以前のバージョンは「非現行バージョン」として保持されるため、容易に復旧が可能です。ライフサイクルルールにおける「現行バージョンを失効させる」アクションも、実際には削除マーカーを作成するだけであり、即座にデータが失われるわけではありません。これは、誤ったライフサイクルルールを適用してしまった際の最後のセーフティネットとして機能します。ただし、非現行バージョンを削除するルール (`NoncurrentVersionExpiration`) を設定している場合は、その日数経過後にはデータが完全に失われるため注意が必要です。
重要なデータのバックアップ戦略: バージョニングはヒューマンエラーに対する強力な防御策ですが、リージョン規模の障害や悪意のあるアカウント乗っ取りなど、より広範な脅威には対応できません。最高レベルのデータ保護を実現するためには、S3クロスリージョンレプリケーション（CRR）を使用して、データのコピーを物理的に離れた別のAWSリージョンに保持することを検討してください。注意点として、デフォルトではソースバケットのライフサイクル設定はレプリカバケットには複製されません。これにより、ソースとレプリカで異なるデータ保持ポリシー（例：ソースは7年、レプリカは10年保持）を適用する柔軟性が得られます。
ルールのスコープを常に最小限に: 「バケット内のすべてのオブジェクトに適用」という設定は、そのバケットの用途が明確に一時的なものに限定されている場合を除き、極力避けるべきです。可能な限り、プレフィックスやタグを使ってルールの適用範囲を厳密に限定することで、設定ミスによる影響範囲を最小限に抑え、事故のリスクを大幅に低減できます。
厳格なIAMポリシーによる権限管理: ライフサイクル設定を変更できる権限 (`s3:PutLifecycleConfiguration`) は非常に強力であり、誤用されると大規模なデータ損失につながる可能性があります。この権限は、インフラ管理者など、本当に必要なIAMユーザーやロールにのみ付与するようにIAMポリシーを設計してください。また、`s3:GetLifecycleConfiguration` 権限をより広範なユーザー（監査担当者など）に与えることで、設定内容の透明性を確保することも重要です。
小さなオブジェクトの罠を理解する: ライフサイクルによるストレージクラスの移行（例: StandardからStandard-IAへ）には、オブジェクトごとに小さな料金（トランジションリクエスト料金）が発生します。数キロバイトの小さなオブジェクトが数百万、数千万とある場合、ストレージ料金の削減効果よりも移行コストの方が高くなってしまう可能性があります。このような場合は、小さなファイルをアーカイブ（tar, zip）して1つの大きなオブジェクトにまとめる、あるいはライフサイクル移行の対象から除外するなどの対策を検討する必要があります。S3 Intelligent-Tieringは、このような小さなオブジェクトを自動的にアーカイブアクセス階層に移動させないという保護機能も備えています。

第四部：監視、監査、そして可視化

ライフサイクルルールを設定した後は、「設定して終わり」ではありません。ルールが期待通りに機能しているか、そしてどのようなオブジェクトがいつ、なぜ削除・移行されたのかを追跡することは、セキュリティ、コンプライアンス、そしてコスト管理の観点から不可欠です。

AWS CloudTrailによるAPIコールの監査: CloudTrailは、お使いのAWSアカウント内で行われたほぼすべてのAPIコールを記録します。ライフサイクルによるオブジェクトの削除は、ユーザーやロールではなく、S3サービス自体が内部的に実行します。そのため、CloudTrailのログには、`userIdentity` フィールドに `s3.amazonaws.com` というサービスプリンシパルが記録され、`eventName` が `DeleteObject` となっているイベントとして記録されます。`userAgent` に "lifecycle" といった文字列が含まれるため、どのオブジェクトがライフサイクルルールによって削除されたかを正確に監査できます。
S3サーバーアクセスログによる詳細なリクエスト追跡: バケット単位でサーバーアクセスログを有効にすると、そのバケットへのすべてのリクエスト（GET, PUT, DELETEなど）が、指定した別のバケットにログファイルとして記録されます。ライフサイクルによる削除は、このログファイル内では `REST.DELETE.LIFECYCLE.OBJECT` というオペレーションとして記録されます。CloudTrailよりも詳細なリクエスト情報（リクエスタ、時刻、キー名など）が含まれており、監査証跡として非常に有用です。
Amazon CloudWatchによるメトリクスの監視: CloudWatchでは、S3バケットの基本的なメトリクス（オブジェクト数 `NumberOfObjects`、合計サイズ `BucketSizeBytes`）を時系列で監視できます。ライフサイクルルールが適用され、オブジェクトが削除されたり、より安価なストレージクラスに移行したりすると、これらのメトリクスが期待通りに減少または変化するはずです。メトリクスに予期せぬ変化がないかを監視することで、ルールの正常な動作を視覚的に確認できます。さらに、S3 Storage Lensは、組織全体のS3利用状況を可視化し、コスト最適化の機会を発見するための高度なダッシュボードを提供します。
S3インベントリによるオブジェクトレベルの棚卸し: S3インベントリは、バケット内の全オブジェクトのリストとそのメタデータ（ストレージクラス、暗号化ステータス、オブジェクトサイズなど）を、CSVやORC, Parquet形式で定期的に出力する機能です。このインベントリレポートを定期的に分析することで、ライフサイクルルールが適用される前後のオブジェクトの状態を詳細に比較したり、ルールが適用されていない「忘れられた」オブジェクトを発見したりするのに役立ちます。大規模なバケットの棚卸しや、詳細なデータ分析を行う上で不可欠なツールです。

まとめ

AWS S3のライフサイクル管理は、単に不要なファイルを削除するだけの機能ではありません。それは、データの時間的価値の変化にインテリジェントに対応し、ストレージコストを継続的に最適化し、厳格なコンプライアンス要件を自動で満たし、そして運用チームを煩雑な手作業から解放するための、データ管理戦略の中核をなす強力なツールです。本稿で解説した、マネジメントコンソールによる直感的な設定、CLIやSDKを用いたIaCアプローチ、そしてバージョニングや監視といったベストプラクティスを組み合わせることで、あらゆる規模のS3環境を安全かつ効率的に運用することが可能になります。もし、まだライフサイクル管理を活用していないのであれば、まずはテストバケットで小さなルールから試してみてください。その効果を一度実感すれば、S3運用に欠かせない機能であることを確信するはずです。

Continue Reading →

Intelligent S3 Data Management: Automating Object Deletion

July 20, 2023 / No comments

Amazon S3, the Simple Storage Service, forms the bedrock of modern cloud infrastructure. Its promise of virtually infinite scalability, exceptional durability, and granular security has made it the default repository for an immense spectrum of data—from static website assets and application logs to multi-petabyte data lakes and critical long-term archives. However, this ease of storage comes with an inherent challenge: as data accumulates relentlessly, it can lead to spiraling costs, increased operational complexity, and significant compliance risks. Proactive data management is not just a best practice; it is an economic and strategic imperative. At the heart of this discipline lies a powerful AWS feature: S3 Lifecycle policies, the primary mechanism for automating the entire lifecycle of your data, including its eventual, necessary deletion.

Automating the deletion of S3 objects transcends simple digital housekeeping. It is a foundational component of a well-architected cloud strategy. This article delves into the strategic importance of automated deletion, provides detailed, step-by-step implementation guides using the AWS Console, CLI, and SDK, and explores advanced concepts and best practices to ensure you wield this powerful tool effectively and safely.

The Compounding Dangers of Unmanaged Data Growth

Before exploring the solution, it's crucial to understand the multifaceted problems that S3 Lifecycle policies address. Allowing data to accumulate indefinitely in an S3 bucket creates a form of "digital debt" that incurs real and substantial costs across multiple domains.

1. The Financial Drain: More Than Just Storage Costs

The most obvious consequence of retaining unnecessary data is the direct cost of storage. While S3 storage prices per gigabyte are low, they become significant at scale. Data that no longer provides business value—such as transient log files from a debugging session months ago, intermediate data from a completed ETL job, or session data from anonymous users—directly inflates your monthly AWS bill. This is compounded by the fact that many organizations store this data in the S3 Standard storage class by default, which is the most expensive tier. The cost of inaction is a perpetually increasing operational expense for zero return.

2. Compliance and Governance Failures

In today's regulatory landscape, data retention is not optional. Frameworks like the General Data Protection Regulation (GDPR) in Europe, the Health Insurance Portability and Accountability Act (HIPAA) in the US, and the California Consumer Privacy Act (CCPA) impose strict mandates on how long personal and sensitive data can be stored. GDPR, for example, enshrines the "right to erasure" (or "right to be forgotten"), requiring organizations to delete user data upon request or after it's no longer needed for its original purpose. Manually tracking and deleting this data across potentially billions of objects is not just impractical but virtually impossible to execute reliably. Failure to comply can result in severe financial penalties, reputational damage, and legal action. Automated deletion is the only scalable way to enforce data retention policies and demonstrate compliance.

3. Operational Drag and Complexity

A bucket containing billions of objects is inherently more difficult to manage than a lean one. Performing simple operations like listing objects, running inventory reports, or searching for specific data becomes slower and more computationally expensive. This operational friction slows down development and administrative tasks. Furthermore, it burdens engineering teams with the manual, error-prone chore of cleanup, diverting their valuable time from innovation and core business logic to routine maintenance.

4. An Expanded Security Attack Surface

From a security perspective, every piece of stored data represents a potential liability. The more data you retain, the larger your "attack surface." In the event of a security breach or an inadvertently misconfigured bucket permission, the impact is directly proportional to the amount and sensitivity of the data exposed. Old, forgotten data that has no business value can become a significant liability if compromised. By systematically deleting data that is no longer required, you minimize your data footprint and reduce the potential blast radius of a security incident.

Anatomy of an S3 Lifecycle Policy

An S3 Lifecycle policy is a declarative set of rules defined in an XML (or represented as JSON for API/CLI interactions) document attached to a bucket. These rules instruct S3 to automatically perform specific actions on objects when they reach a certain point in their lifetime. Each rule within a policy consists of several key components:

Rule ID: A unique name for the rule (e.g., Log-File-Cleanup-30-Days) for easy identification.
Status: Either Enabled or Disabled, allowing you to turn rules on or off without deleting them.
Filter: Defines the scope of the rule, specifying which objects it applies to. This is the most critical part for targeting. You can filter by:
- Prefix: Applies the rule to all objects under a specific path (e.g., logs/, images/temp/).
- Object Tags: Applies the rule to objects that have a specific tag key-value pair (e.g., { "Key": "status", "Value": "temporary" }).
- AND Gate: Allows you to combine a prefix and multiple tags to create highly specific filters.
- Note: If no filter is specified, the rule applies to all objects in the bucket. Use this with extreme caution.
Actions: The operations S3 will perform on the filtered objects. The two main categories of actions are:
1. Transition Actions: These move objects between different S3 storage classes to optimize costs. For example, you can move data from S3 Standard to S3 Standard-Infrequent Access (IA) after 30 days, and then to S3 Glacier Deep Archive after 180 days for long-term archiving.
2. Expiration Actions: These actions are responsible for automatic deletion and are the focus of our discussion.

Deep Dive into Expiration Actions

Expiration actions are nuanced, especially when S3 Versioning is enabled on a bucket. Understanding the different types is key to achieving your desired outcome.

Expire current versions of objects: This action targets the active, current version of an object.
- In a non-versioned bucket, this action permanently deletes the object after the specified number of days from its creation.
- In a versioned bucket, this action does not delete the object data. Instead, it places a special "delete marker" on top of the object, making it the new "current" version. The previous version becomes a non-current (previous) version, and the object is no longer visible in standard listings, effectively hiding it.
Permanently delete noncurrent versions of objects: This is the crucial action for reclaiming storage space in a versioned bucket. It permanently deletes previous object versions after they have been non-current for a specified number of days. For example, you can set a rule to delete non-current versions 90 days after they are superseded.
Delete expired object delete markers: When you delete an object in a versioned bucket (either manually or via an expiration action), a delete marker is created. If there are no older, non-current versions of that object, the delete marker is considered "expired." This action cleans up these lone delete markers, which simplifies bucket listings and can improve performance for some applications.
Abort incomplete multipart uploads: This is a universally recommended best practice. When a large file upload fails, it can leave behind orphaned parts that are invisible in the console but still accrue storage costs. This rule instructs S3 to clean up any multipart uploads that haven't completed within a specified number of days (e.g., 7 days) from their initiation. Every S3 bucket should have this rule.

Method 1: Configuring Policies via the AWS Management Console

The AWS Console provides an intuitive, wizard-driven interface for creating lifecycle rules. It's an excellent method for visual learners and for managing a small number of policies.

Step 1: Navigate to Your S3 Bucket

Log into the AWS Management Console, go to the S3 service, and select the bucket you wish to configure from the list.

Step 2: Access the Management Tab

Within your bucket's page, click on the "Management" tab. This is the central hub for policies that govern your bucket's data, such as replication, inventory, and lifecycle rules.

Step 3: Create a New Lifecycle Rule

Scroll down to the "Lifecycle rules" section and click the "Create lifecycle rule" button.

Step 4: Define the Rule Name and Scope

Rule name: Provide a descriptive name that clearly states its purpose, like Temp-Upload-Cleanup-7-Days.
Choose a rule scope: This is a critical decision.
- Apply to all objects in the bucket: This global option is powerful but dangerous. Only choose this if you are absolutely certain every object should be subject to this rule (e.g., for a bucket dedicated solely to temporary logs).
- Limit the scope... using one or more filters: This is the safer and more common choice. You can define a filter based on a prefix (e.g., temp-data/) or by one or more object tags. For instance, you could create a rule that only applies to objects tagged with data-class=transient.

After defining the scope, you must acknowledge the warning that this rule will affect all matched objects.

Step 5: Configure the Lifecycle Rule Actions

This screen is where you define what the rule will do. Check the boxes for the actions you want to enable. For automatic deletion, you'll focus on the expiration actions:

Expire current versions of objects: Check this and specify the number of days after object creation for the expiration to occur (e.g., 30 days).
Permanently delete noncurrent versions of objects: If your bucket is versioned, this is essential for cost savings. Specify the number of days after an object becomes non-current to delete it (e.g., 60 days).
Delete expired object delete markers: A good housekeeping option for versioned buckets.
Delete incomplete multipart uploads: Check this and specify a short duration, such as 7 days, to clean up failed uploads.

Step 6: Review and Create

The final page presents a summary of your entire rule: its name, scope, and all configured actions. Review this information with extreme care. A typo in the number of days or an incorrect prefix can have irreversible consequences. Once you are confident, click "Create rule". S3 will begin evaluating the policy, and actions will typically start taking effect within 24 to 48 hours.

Method 2: Automation with the AWS Command Line Interface (CLI)

The AWS CLI is the tool of choice for automation, scripting, and incorporating infrastructure changes into CI/CD pipelines. It allows you to define your entire lifecycle policy in a JSON file and apply it with a single command.

Step 1: Install and Configure the AWS CLI

If you don't have it set up, follow the official AWS documentation to install the CLI. Then, configure it with your credentials by running aws configure and providing your Access Key ID, Secret Access Key, and default AWS Region.

Step 2: Craft the Lifecycle Configuration JSON File

Using a text editor, create a JSON file (e.g., lifecycle.json) that defines your rules. The structure must be precise.

Example 1: Rule for Temporary Application Logs

This policy contains two rules. The first deletes objects under the app-logs/ prefix after 90 days. The second is a bucket-wide rule to clean up failed multipart uploads after 7 days.

{
  "Rules": [
    {
      "ID": "AppLogExpiration",
      "Status": "Enabled",
      "Filter": {
        "Prefix": "app-logs/"
      },
      "Expiration": {
        "Days": 90
      }
    },
    {
      "ID": "GlobalMultipartUploadCleanup",
      "Status": "Enabled",
      "Filter": {},
      "AbortIncompleteMultipartUpload": {
        "DaysAfterInitiation": 7
      }
    }
  ]
}

Example 2: Advanced Rule for Versioned, Tagged Data

This complex policy targets objects with the prefix processed-data/ AND tagged with retention=short-term. It transitions the current version to Infrequent Access after 30 days, expires it (creates a delete marker) after 365 days, and permanently deletes any non-current versions 60 days after they are superseded.

{
  "Rules": [
    {
      "ID": "ProcessedDataLifecycle",
      "Status": "Enabled",
      "Filter": {
        "And": {
          "Prefix": "processed-data/",
          "Tags": [
            {
              "Key": "retention",
              "Value": "short-term"
            }
          ]
        }
      },
      "Transitions": [
        {
          "Days": 30,
          "StorageClass": "STANDARD_IA"
        }
      ],
      "Expiration": {
        "Days": 365
      },
      "NoncurrentVersionExpiration": {
        "NoncurrentDays": 60
      }
    }
  ]
}

Step 3: Apply the Policy

From your terminal, execute the put-bucket-lifecycle-configuration command, referencing your bucket and the JSON file.

aws s3api put-bucket-lifecycle-configuration \
    --bucket your-production-bucket-name \
    --lifecycle-configuration file://lifecycle.json

A successful command will return no output. You can verify the policy was applied using:

aws s3api get-bucket-lifecycle-configuration --bucket your-production-bucket-name

Method 3: Programmatic Control with the AWS SDK (Boto3)

For deep integration within your applications or custom Infrastructure as Code (IaC) solutions, the AWS SDKs provide the ultimate flexibility. We'll use Boto3, the AWS SDK for Python.

Step 1: Install and Set Up Boto3

Install the library using pip: pip install boto3. Boto3 automatically discovers credentials from your environment, including those configured for the AWS CLI.

Step 2: Construct and Apply the Policy in Python

The core logic involves creating a Python dictionary that mirrors the JSON structure and passing it to the Boto3 client.

import boto3
from botocore.exceptions import ClientError
import logging

# Configure logging
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')

def configure_s3_lifecycle(bucket_name: str, policy: dict):
    """
    Applies a lifecycle policy to a specified S3 bucket using Boto3.

    Args:
        bucket_name: The name of the target S3 bucket.
        policy: A dictionary representing the lifecycle policy rules.
    
    Returns:
        bool: True if the policy was applied successfully, False otherwise.
    """
    try:
        s3_client = boto3.client('s3')
        s3_client.put_bucket_lifecycle_configuration(
            Bucket=bucket_name,
            LifecycleConfiguration=policy
        )
        logging.info(f"Successfully applied lifecycle policy to bucket '{bucket_name}'.")
        return True
    except ClientError as e:
        logging.error(f"Failed to apply lifecycle policy to '{bucket_name}': {e}")
        return False

if __name__ == '__main__':
    # --- IMPORTANT: Change this to your bucket name ---
    TARGET_BUCKET = 'your-application-data-bucket'

    # Define a comprehensive lifecycle policy
    lifecycle_policy_definition = {
        'Rules': [
            {
                'ID': 'ExpireTemporaryUploads',
                'Filter': {
                    'Prefix': 'uploads/temp/'
                },
                'Status': 'Enabled',
                'Expiration': {
                    'Days': 7
                },
                # This action is useful for versioned buckets
                'NoncurrentVersionExpiration': {
                    'NoncurrentDays': 1
                }
            },
            {
                'ID': 'ArchiveAndExpireReports',
                'Filter': {
                    'And': {
                        'Prefix': 'reports/',
                        'Tags': [
                            {'Key': 'data-class', 'Value': 'auditable'}
                        ]
                    }
                },
                'Status': 'Enabled',
                'Transitions': [
                    {'Days': 90, 'StorageClass': 'GLACIER_IR'}, # Glacier Instant Retrieval
                    {'Days': 365, 'StorageClass': 'DEEP_ARCHIVE'}
                ],
                'Expiration': {
                    'Days': 2555 # ~7 years
                }
            },
            {
                'ID': 'CleanupFailedMultipartUploads',
                'Filter': {}, # Applies to the entire bucket
                'Status': 'Enabled',
                'AbortIncompleteMultipartUpload': {
                    'DaysAfterInitiation': 3
                }
            }
        ]
    }

    # Execute the function
    if TARGET_BUCKET != 'your-application-data-bucket':
        configure_s3_lifecycle(TARGET_BUCKET, lifecycle_policy_definition)
    else:
        logging.warning("Please update the TARGET_BUCKET variable before running the script.")

Save this as a Python script (e.g., s3_lifecycle_manager.py), update the TARGET_BUCKET variable, and run it with python s3_lifecycle_manager.py. This script is robust, provides clear logging, and can be easily adapted for use in automated workflows, such as being triggered by an AWS Lambda function.

Crucial Precautions and Monitoring Strategies

Automated deletion is an irreversible action. A misconfigured policy can lead to catastrophic data loss. Adhering to a strict set of best practices is non-negotiable.

Test in Isolation: Never deploy a new or modified lifecycle policy directly to a production environment. Create a dedicated test bucket, upload sample objects that match your intended filters (prefixes and tags), apply the policy, and wait 24-48 hours to verify that the correct objects—and only the correct objects—are deleted.
Embrace Versioning and Backups: Before enabling any expiration rule on a critical bucket, enable S3 Versioning. It acts as a safety net. If an expiration rule incorrectly places a delete marker on an object, the previous version remains recoverable. For ultimate protection, use S3 Cross-Region Replication to maintain a complete, independent copy of your data in another AWS Region.
Favor Granularity Over Globality: Always prefer specific filters (prefixes, tags) over bucket-wide rules. This principle of "least privilege" for data management reduces the potential blast radius of an error. Structuring your bucket with well-defined prefixes for different data types (e.g., /logs, /user-assets, /temp) is a foundational best practice that makes lifecycle management far safer and more effective.
Implement Robust Monitoring and Auditing: Do not "set and forget." Continuously verify that your policies are working as expected.
- AWS CloudTrail: CloudTrail captures all API calls. You can query your logs for s3:DeleteObject events initiated by the s3.amazonaws.com principal to audit every deletion performed by the lifecycle service.
- Amazon CloudWatch Alarms: Monitor the NumberOfObjects and BucketSizeBytes metrics for your bucket. After a lifecycle rule is scheduled to run, you should observe a predictable drop. Configure CloudWatch Alarms to trigger notifications if the number of objects drops unexpectedly or by an unusually large amount, which could signal a misconfigured rule.
- S3 Inventory and Athena: For large-scale auditing, configure S3 Inventory to generate daily reports of all object metadata. You can then load these reports into Amazon Athena and use standard SQL queries to analyze object ages, verify tag application, and confirm that objects meeting expiration criteria are indeed being removed over time.

By implementing S3 Lifecycle policies with care, precision, and a robust monitoring framework, you can transform your S3 usage from a reactive cost center into a proactively managed, optimized, and compliant asset. It is the key to mastering cloud storage at scale.

Continue Reading →

실전 AWS: S3 정적 호스팅부터 CloudFront CDN 최적화까지

July 20, 2023 / No comments

현대 웹 개발 환경은 끊임없이 진화하고 있습니다. 과거에는 데이터베이스와 서버사이드 로직이 복잡하게 얽힌 동적 웹사이트가 주류를 이루었지만, Jamstack 아키텍처의 부상과 함께 클라이언트 측 렌더링 및 사전 렌더링(Pre-rendering) 기술이 보편화되면서 정적 웹사이트(Static Website)가 다시금 주목받고 있습니다. 정적 웹사이트는 단순한 HTML, CSS, JavaScript 파일의 집합을 넘어, 강력한 사용자 경험과 뛰어난 성능, 그리고 견고한 보안을 제공하는 현대적인 웹 애플리케이션의 핵심 기반으로 자리 잡았습니다.

이러한 정적 웹사이트를 배포하고 운영하는 데 있어 Amazon Web Services(AWS)의 S3(Simple Storage Service)는 가장 신뢰할 수 있고 비용 효율적인 솔루션 중 하나로 꼽힙니다. 하지만 단순히 S3에 파일을 올리는 것만으로는 현대 웹이 요구하는 속도, 보안, 안정성을 모두 충족시키기 어렵습니다. 진정한 프로덕션 수준의 정적 웹사이트를 구축하기 위해서는 S3를 기반으로 AWS의 다른 강력한 서비스들, 특히 콘텐츠 전송 네트워크(CDN) 서비스인 CloudFront와 DNS 서비스인 Route 53을 유기적으로 결합하는 과정이 필수적입니다. 이 글에서는 AWS S3를 이용한 기본적인 정적 웹사이트 호스팅 설정부터 시작하여, 커스텀 도메인을 연결하고, 최종적으로 CloudFront를 통해 전 세계 사용자에게 빠르고 안전하게 콘텐츠를 제공하는 전체 과정을 심도 있게 다룹니다.

1장: 왜 정적 웹사이트 호스팅에 AWS S3를 선택하는가?

1.1 정적 웹사이트의 부활과 그 본질적 가치

정적 웹사이트는 서버가 요청을 받을 때마다 데이터베이스 쿼리나 템플릿 렌더링 같은 복잡한 처리 과정 없이, 미리 생성된 HTML, CSS, JavaScript, 이미지 등의 정적 파일을 그대로 사용자에게 전달하는 방식의 웹사이트를 의미합니다. 이는 다음과 같은 명확하고 강력한 장점을 가집니다.

압도적인 속도: 서버 측 처리 과정이 없기 때문에 응답 시간이 매우 빠릅니다. 파일은 요청 즉시 전달되며, CDN과 결합될 경우 사용자와 가장 가까운 위치에서 콘텐츠를 제공받아 로딩 속도를 극적으로 단축시킬 수 있습니다.
강화된 보안: WordPress나 Joomla 같은 CMS(콘텐츠 관리 시스템) 기반의 동적 사이트에서 흔히 발견되는 서버사이드 언어(PHP, Python 등)나 데이터베이스 관련 보안 취약점으로부터 자유롭습니다. 공격 표면(Attack Surface) 자체가 매우 작아 보안 관리가 용이합니다.
비용 효율성: 고성능의 애플리케이션 서버나 데이터베이스 서버를 운영할 필요가 없습니다. 저렴한 객체 스토리지와 트래픽 비용만으로 운영이 가능하여, 개인 프로젝트부터 대규모 기업 사이트까지 비용 부담을 크게 줄일 수 있습니다.
뛰어난 확장성: 정적 파일은 캐싱이 매우 용이하므로, 갑작스러운 트래픽 급증에도 CDN을 통해 손쉽게 대응할 수 있습니다. 서버 증설과 같은 복잡한 과정 없이도 글로벌 규모의 트래픽을 감당할 수 있습니다.

Gatsby, Next.js(SSG 모드), Hugo, Jekyll과 같은 현대적인 정적 사이트 생성기(Static Site Generator, SSG)의 등장은 이러한 정적 웹사이트의 가능성을 한 단계 끌어올렸습니다. 개발자들은 React나 Vue와 같은 최신 프레임워크를 사용하여 동적인 개발 경험을 유지하면서도, 빌드 시점에는 최적화된 정적 파일들을 생성하여 정적 호스팅의 모든 이점을 누릴 수 있게 되었습니다.

1.2 객체 스토리지의 거인, AWS S3 심층 분석

AWS S3는 'Simple Storage Service'라는 이름에서 알 수 있듯이 파일을 저장하는 서비스이지만, 그 내부 구조와 철학은 일반적인 파일 시스템과는 근본적으로 다릅니다. S3는 객체 스토리지(Object Storage) 모델을 따릅니다.

파일 스토리지(File Storage) vs. 객체 스토리지(Object Storage): 일반적인 컴퓨터의 파일 시스템(NTFS, ext4 등)은 디렉터리 계층 구조를 가집니다. 반면, 객체 스토리지는 모든 데이터를 '객체(Object)'라는 단위로 취급하며, 이 객체들은 평평한 주소 공간(Flat Address Space)에 저장됩니다. 각 객체는 데이터 본체, 고유한 키(Key, 파일 이름과 경로의 조합), 그리고 풍부한 메타데이터로 구성됩니다. 이러한 구조는 무한에 가까운 확장성을 제공하는 비결입니다.

S3의 핵심 가치:

99.999999999% (11-nines)의 내구성: S3에 저장된 객체는 여러 가용 영역(Availability Zone)에 자동으로 복제되어 저장됩니다. 이는 1만 개의 객체를 1,000만 년 동안 저장했을 때 1개의 객체가 유실될까 말까 한 수준의 극도로 높은 데이터 내구성을 보장합니다.
높은 가용성과 확장성: S3는 전 세계 수백만 개의 애플리케이션에서 발생하는 요청을 처리하도록 설계되었습니다. 저장 공간의 한계나 성능 저하에 대한 걱정 없이 원하는 만큼의 데이터를 저장하고 트래픽을 처리할 수 있습니다.
다양한 스토리지 클래스: 데이터의 접근 빈도와 보관 기간에 따라 S3 Standard, S3 Intelligent-Tiering, S3 Glacier 등 다양한 스토리지 클래스를 선택하여 비용을 최적화할 수 있습니다.

이러한 S3의 특성은 정적 웹사이트의 파일(HTML, CSS, JS, 이미지 등)을 보관하고 서비스하기에 완벽한 환경을 제공합니다. 서버 관리에 대한 부담 없이, 극도의 안정성과 무한한 확장성을 바탕으로 웹사이트를 운영할 수 있는 것입니다.

2장: 첫걸음 떼기: S3 버킷 생성과 파일 업로드

이론을 알았으니 이제 직접 실습해볼 차례입니다. AWS 계정이 있다는 가정 하에, 정적 웹사이트를 호스팅할 S3 버킷을 생성하고 파일을 업로드하는 과정을 단계별로 진행합니다. AWS의 프리 티어는 S3에 대해 상당한 양의 무료 스토리지와 요청 횟수를 제공하므로, 작은 프로젝트는 비용 부담 없이 시작할 수 있습니다.

2.1 S3 버킷 생성: 상세 가이드

AWS Management Console에 로그인한 후, 서비스 검색창에서 'S3'를 검색하여 S3 대시보드로 이동합니다. 그리고 '버킷 만들기' 버튼을 클릭하여 생성 프로세스를 시작합니다.

버킷 이름: 버킷 이름은 S3의 모든 리전, 모든 계정을 통틀어 전역적으로 고유해야 합니다. 즉, 다른 누군가가 이미 사용 중인 이름은 사용할 수 없습니다. 또한, DNS 규약을 준수하는 것이 좋습니다.
- 소문자, 숫자, 하이픈(-)만 사용하세요.
- 마침표(.)를 포함할 수 있지만, SSL 인증서 문제나 다른 서비스와의 연동 시 예기치 않은 문제를 일으킬 수 있으므로, 커스텀 도메인을 연결할 계획이라면 마침표는 피하는 것이 좋습니다.
- 나중에 커스텀 도메인(예: www.example.com)을 연결할 계획이라면, 버킷 이름을 도메인 이름과 똑같이 www.example.com으로 만드는 것이 과거에는 일반적인 관행이었으나, CloudFront를 사용할 경우에는 필수가 아닙니다. 혼동을 피하기 위해 my-awesome-static-site와 같이 프로젝트를 설명하는 이름으로 짓는 것을 권장합니다.
AWS 리전(Region): 버킷이 물리적으로 생성될 데이터 센터의 위치를 선택합니다. 웹사이트의 주 사용자층과 가장 가까운 리전을 선택하는 것이 지연 시간(Latency)을 줄이는 데 유리합니다. 예를 들어, 주 사용자가 한국에 있다면 '아시아 태평양(서울) ap-northeast-2' 리전을 선택하는 것이 좋습니다. 리전별로 약간의 요금 차이가 있을 수 있습니다.
객체 소유권: 기본값인 'ACL 비활성화됨(권장)'을 유지합니다. 이는 버킷의 모든 객체 소유권을 버킷 소유자가 가지며, 접근 제어는 IAM 정책과 버킷 정책으로만 관리하겠다는 의미입니다. 이는 최신 AWS 보안 모범 사례입니다.
이 버킷의 모든 퍼블릭 액세스 차단: 이 부분이 매우 중요합니다. 기본적으로 모든 항목이 체크되어 있어 버킷에 대한 모든 퍼블릭 액세스가 차단됩니다. 정적 웹사이트를 호스팅하려면 인터넷상의 모든 사용자가 파일에 접근할 수 있어야 하므로, '모든 퍼블릭 액세스 차단' 체크박스를 해제해야 합니다. 체크를 해제하면 하위 4개의 개별 설정이 나타나는데, 지금은 모두 해제된 상태로 둡니다. 이후 버킷 정책을 통해 세밀하게 접근을 제어할 것이므로 걱정하지 않아도 됩니다. AWS는 이 설정 변경에 대한 경고와 확인을 요청할 것입니다. 내용을 이해했음을 확인하는 체크박스에 표시합니다.
기타 설정: 버킷 버전 관리, 태그, 기본 암호화 등은 지금 단계에서는 기본값으로 두어도 무방합니다. '버킷 만들기'를 클릭하여 생성을 완료합니다.

2.2 웹사이트 파일 준비 및 업로드

이제 생성된 버킷에 웹사이트를 구성하는 파일들을 업로드해야 합니다. 테스트를 위해 간단한 HTML, CSS, 오류 페이지 파일을 준비해 보겠습니다.

index.html:

<!DOCTYPE html>
<html lang="ko">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>My Awesome Static Site</title>
    <link rel="stylesheet" href="styles.css">
</head>
<body>
    <h1>Welcome to My S3 Hosted Website!</h1>
    <p>This website is served directly from an AWS S3 bucket.</p>
</body>
</html>

styles.css:

body {
    font-family: sans-serif;
    background-color: #f0f0f0;
    color: #333;
    display: flex;
    flex-direction: column;
    justify-content: center;
    align-items: center;
    height: 100vh;
    margin: 0;
}

h1 {
    color: #007BFF;
}

error.html:

<!DOCTYPE html>
<html lang="ko">
<head>
    <meta charset="UTF-8">
    <title>Page Not Found</title>
    <link rel="stylesheet" href="styles.css">
</head>
<body>
    <h1>404 - Page Not Found</h1>
    <p>Oops! The page you are looking for does not exist.</p>
</body>
</html>

파일이 준비되었다면, S3 콘솔에서 방금 생성한 버킷을 클릭하여 들어간 후 '업로드' 버튼을 누릅니다. 파일 추가 또는 폴더 추가 버튼을 이용해 준비한 파일들을 모두 업로드합니다. 업로드 과정에서 권한이나 속성 등은 변경할 필요 없이 기본값으로 두고 업로드를 완료합니다.

전문가를 위한 팁: AWS CLI를 이용한 자동화

매번 콘솔을 통해 수동으로 파일을 업로드하는 것은 비효율적입니다. AWS Command Line Interface(CLI)를 사용하면 이 과정을 자동화할 수 있습니다. 로컬 컴퓨터에 AWS CLI를 설치하고 자격 증명을 설정한 후, 터미널에서 다음 명령어를 실행하면 로컬 폴더의 내용을 S3 버킷과 동기화할 수 있습니다.

aws s3 sync . s3://your-bucket-name --delete

.은 현재 디렉터리를 의미하며, s3://your-bucket-name은 대상 버킷을 지정합니다. --delete 옵션은 로컬에는 없지만 버킷에 존재하는 파일을 삭제하여, 로컬 폴더와 버킷의 상태를 완전히 동일하게 만듭니다. 이 명령어 하나로 웹사이트 배포를 간단하게 처리할 수 있습니다.

3장: 웹사이트의 심장: 권한과 호스팅 활성화

파일 업로드가 끝났다고 해서 웹사이트가 바로 열리지는 않습니다. S3에게 이 버킷을 웹사이트처럼 동작하도록 지시하고, 외부 사용자가 파일에 접근할 수 있도록 권한을 설정하는 두 가지 중요한 과정이 남아있습니다.

3.1 보안과 개방성의 균형: 버킷 정책(Bucket Policy) 설정

과거에는 객체별로 접근 제어 목록(ACL, Access Control List)을 설정하여 퍼블릭 읽기 권한을 부여했지만, 이는 관리가 복잡하고 실수가 발생하기 쉽습니다. 현대적인 AWS 환경에서는 버킷 정책을 사용하여 버킷 전체에 대한 접근 규칙을 중앙에서 관리하는 것을 강력히 권장합니다.

버킷 정책은 JSON 형식의 문서로, '누가(Principal)', '어떤 리소스(Resource)에 대해', '어떤 조건(Condition)에서', '무슨 행동(Action)을', '허용(Allow)할지 또는 거부(Deny)할지'를 명시적으로 정의합니다. 우리의 목표는 인터넷상의 모든 사용자(익명 사용자)가 우리 버킷의 모든 객체를 읽을 수 있도록(다운로드할 수 있도록) 허용하는 것입니다.

버킷 상세 화면에서 '권한' 탭으로 이동합니다.
'버킷 정책' 섹션에서 '편집' 버튼을 클릭합니다.
정책 편집기에 아래의 JSON 코드를 붙여넣습니다. your-bucket-name 부분은 실제 버킷 이름으로 반드시 변경해야 합니다.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "PublicReadGetObject",
            "Effect": "Allow",
            "Principal": "*",
            "Action": "s3:GetObject",
            "Resource": "arn:aws:s3:::your-bucket-name/*"
        }
    ]
}

버킷 정책 분석:

Version: 정책 언어의 버전을 명시합니다. 일반적으로 "2012-10-17"을 사용합니다.
Statement: 실제 정책 규칙들의 배열입니다.
Sid: (Statement ID) 이 규칙을 식별하기 위한 이름입니다. 원하는 대로 지정할 수 있습니다.
Effect: 이 규칙의 효과를 지정합니다. "Allow"(허용) 또는 "Deny"(거부)가 있습니다.
Principal: 정책의 영향을 받는 대상을 지정합니다. "*"는 '모든 사용자(익명 포함)'를 의미합니다.
Action: 허용하거나 거부할 작업을 지정합니다. "s3:GetObject"는 S3 객체를 읽는(다운로드하는) 권한입니다.
Resource: 정책이 적용될 자원을 지정합니다. "arn:aws:s3:::your-bucket-name/*"는 'your-bucket-name' 버킷 내부의 모든 객체(/*)를 의미합니다.

정책을 붙여넣고 '변경 사항 저장'을 클릭합니다. 이제 S3 버킷은 외부에서 파일을 읽어갈 수 있도록 준비되었습니다.

3.2 정적 웹사이트 호스팅 기능 활성화

이제 마지막으로 S3에게 이 버킷이 웹 서버처럼 동작해야 함을 알려주어야 합니다.

버킷 상세 화면에서 '속성' 탭으로 이동합니다.
페이지 맨 아래로 스크롤하여 '정적 웹 사이트 호스팅' 카드에서 '편집' 버튼을 클릭합니다.
'정적 웹 사이트 호스팅'을 '활성화'로 변경합니다.
인덱스 문서: 사용자가 디렉터리 경로(예: /)로 접근했을 때 보여줄 기본 파일의 이름을 입력합니다. 일반적으로 index.html입니다.
오류 문서: 사용자가 존재하지 않는 경로로 접근했을 때(404 Not Found 등) 보여줄 파일의 이름을 입력합니다. 준비해 둔 error.html을 입력합니다.
'변경 사항 저장'을 클릭합니다.

설정이 완료되면 '정적 웹 사이트 호스팅' 카드에 버킷 웹 사이트 엔드포인트가 표시됩니다. http://your-bucket-name.s3-website.ap-northeast-2.amazonaws.com과 같은 형식의 URL입니다. 이 URL을 복사하여 웹 브라우저 주소창에 붙여넣으면, 방금 업로드한 웹사이트가 성공적으로 나타나는 것을 확인할 수 있습니다! 존재하지 않는 주소(예: /non-existent-page.html)를 입력하면 error.html의 내용이 나타나는지도 확인해 보세요.

핵심 개념: 웹사이트 엔드포인트 vs. REST API 엔드포인트

S3 객체에 접근하는 엔드포인트는 두 종류가 있으며, 이 둘의 차이를 이해하는 것이 중요합니다.

REST API 엔드포인트: https://your-bucket-name.s3.ap-northeast-2.amazonaws.com/index.html 과 같은 형식입니다. 이는 S3의 API와 통신하기 위한 일반적인 엔드포인트로, 인덱스 문서나 오류 문서 기능을 지원하지 않습니다. 이 URL로 루트에 접근하면 파일 목록이 보이거나(권한에 따라) 에러가 발생합니다.
웹사이트 엔드포인트: http://your-bucket-name.s3-website.ap-northeast-2.amazonaws.com 과 같은 형식입니다. 이 엔드포인트는 웹 브라우저의 요청에 최적화되어 있으며, 위에서 설정한 인덱스 문서 및 오류 문서 기능을 처리해 줍니다. 따라서 정적 웹사이트를 호스팅할 때는 반드시 이 엔드포인트를 사용해야 합니다.

4장: 세상과 연결하기: Route 53으로 커스텀 도메인 연결

s3-website...로 시작하는 긴 URL은 전문성이 떨어져 보입니다. 이제 소유하고 있는 커스텀 도메인(예: example.com)을 S3 웹사이트에 연결해 보겠습니다. 이 과정에서는 AWS의 DNS(Domain Name System) 서비스인 Amazon Route 53을 사용합니다.

도메인을 아직 소유하고 있지 않다면 Route 53, GoDaddy, 가비아 등 다양한 도메인 등록 기관을 통해 구매할 수 있습니다. 이미 도메인을 가지고 있다면 Route 53을 해당 도메인의 DNS 서버로 사용하도록 네임서버를 변경하는 과정이 필요합니다.

4.1 Route 53 호스팅 영역(Hosted Zone) 설정

Route 53에서 도메인의 DNS 레코드를 관리하기 위해서는 먼저 '호스팅 영역'을 생성해야 합니다. 호스팅 영역은 특정 도메인에 대한 DNS 레코드의 컨테이너 역할을 합니다.

AWS 콘솔에서 'Route 53'으로 이동합니다.
왼쪽 메뉴에서 '호스팅 영역'을 선택하고 '호스팅 영역 생성'을 클릭합니다.
'도메인 이름'에 소유한 도메인(예: my-awesome-site.com)을 입력합니다.
'유형'은 '퍼블릭 호스팅 영역'으로 선택합니다.
'호스팅 영역 생성'을 클릭합니다.

생성이 완료되면 NS(네임서버) 레코드와 SOA(권한 시작) 레코드가 자동으로 생성됩니다. 만약 다른 곳에서 도메인을 구매했다면, NS 레코드에 표시된 4개의 네임서버 주소를 해당 도메인 등록 기관의 네임서버 설정에 입력해야 합니다.

4.2 도메인 연결의 핵심: ALIAS 레코드 생성

이제 Route 53에 "my-awesome-site.com으로 들어오는 요청을 아까 만든 S3 웹사이트 엔드포인트로 보내라"는 규칙을 추가해야 합니다. 이를 위해 레코드를 생성합니다.

여기서 중요한 개념이 등장합니다. 루트 도메인(my-awesome-site.com과 같이 'www'가 없는 도메인, Zone Apex라고도 함)은 DNS 규약상 CNAME 레코드를 가질 수 없습니다. CNAME은 다른 도메인 이름의 별칭을 만드는 것인데, 루트 도메인에는 NS나 SOA 같은 다른 중요한 레코드들이 이미 존재하기 때문입니다. 이 문제를 해결하기 위해 AWS는 ALIAS(별칭)라는 특별한 레코드 유형을 제공합니다.

ALIAS 레코드는 CNAME과 유사하게 동작하지만, 내부적으로는 대상 AWS 리소스(S3 웹사이트 엔드포인트, CloudFront 배포, ELB 등)의 IP 주소로 해석됩니다. 따라서 A 레코드처럼 루트 도메인에 설정할 수 있으면서도, 대상의 IP가 변경되어도 자동으로 추적하는 CNAME의 유연성을 가집니다.

생성한 호스팅 영역의 상세 페이지로 들어가 '레코드 생성'을 클릭합니다.
루트 도메인 (my-awesome-site.com) 설정:
- 레코드 이름: 비워 둡니다.
- 레코드 유형: 'A - IPv4 주소로 트래픽 라우팅'을 선택합니다.
- '별칭(Alias)' 토글을 활성화합니다.
- 트래픽 라우팅 대상 선택: 'S3 웹 사이트 엔드포인트에 대한 별칭'을 선택합니다.
- 리전 선택: S3 버킷을 생성한 리전(예: ap-northeast-2)을 선택합니다.
- S3 엔드포인트 선택: 드롭다운 목록에 나타나는 S3 웹사이트 엔드포인트 (s3-website.ap-northeast-2.amazonaws.com으로 시작하는 주소)를 선택합니다.
- '레코드 생성'을 클릭합니다.
'www' 서브도메인 (www.my-awesome-site.com) 설정 (선택 사항):
사용자가 'www'를 붙여서 접속해도 동일한 사이트로 연결되도록 설정하는 것이 좋습니다. 'www'는 서브도메인이므로 일반적인 CNAME을 사용할 수 있습니다.
- 다시 '레코드 생성'을 클릭합니다.
- 레코드 이름: www를 입력합니다.
- 레코드 유형: 'CNAME - 다른 도메인 이름 및 일부 AWS 리소스로 트래픽 라우팅'을 선택합니다.
- 값: 루트 도메인 이름(예: my-awesome-site.com)을 입력합니다. 이렇게 하면 www 주소는 루트 주소의 별칭이 됩니다.
- '레코드 생성'을 클릭합니다.

DNS 변경 사항이 전 세계로 전파되는 데는 몇 분에서 최대 48시간까지 걸릴 수 있지만, 보통은 수 분 내에 완료됩니다. 잠시 후 웹 브라우저에서 자신의 커스텀 도메인 주소로 접속했을 때 S3 웹사이트가 나타난다면 성공입니다!

5장: 성능과 보안의 날개를 달다: CloudFront 연동

지금까지의 설정만으로도 훌륭한 정적 웹사이트를 운영할 수 있습니다. 하지만 두 가지 치명적인 약점이 남아있습니다.

HTTP 전용: S3 웹사이트 엔드포인트는 HTTPS를 지원하지 않습니다. 요즘 웹 환경에서 HTTPS는 선택이 아닌 필수입니다. 브라우저는 HTTP 사이트를 '안전하지 않음'으로 표시하며, SEO에도 불이익이 있습니다.
지연 시간: 사용자가 S3 버킷이 있는 리전에서 멀리 떨어져 있을수록 웹사이트 로딩 속도는 느려집니다.

이 모든 문제를 한 번에 해결해 주는 서비스가 바로 Amazon CloudFront입니다. CloudFront는 전 세계 수백 개의 엣지 로케이션(Edge Location)에 웹사이트 콘텐츠의 복사본(캐시)을 저장해두는 CDN 서비스입니다. 사용자가 웹사이트에 접속하면, 가장 가까운 엣지 로케이션에서 콘텐츠를 전달받아 지연 시간을 최소화합니다.

5.1 CloudFront를 사용해야 하는 결정적 이유

무료 HTTPS/SSL: CloudFront는 AWS Certificate Manager(ACM)와 통합되어, 커스텀 도메인에 대한 무료 SSL/TLS 인증서를 손쉽게 발급하고 적용할 수 있습니다. 이를 통해 모든 트래픽을 암호화하여 보안을 강화할 수 있습니다.
글로벌 성능 향상: 한국의 사용자는 서울 엣지 로케이션에서, 미국의 사용자는 캘리포니아 엣지 로케이션에서 콘텐츠를 받게 되므로 전 세계 어디서든 빠른 로딩 속도를 경험할 수 있습니다.
강화된 보안 (OAC): Origin Access Control(OAC)라는 기능을 사용하면, S3 버킷에 대한 퍼블릭 액세스를 완전히 차단하고 오직 특정 CloudFront 배포만을 통해 접근하도록 설정할 수 있습니다. 이는 S3 버킷을 외부로부터 완벽하게 보호하는 가장 강력한 보안 구성입니다.
비용 절감 가능성: 대부분의 트래픽이 엣지 로케이션의 캐시에서 처리되므로 S3로의 데이터 전송 요청(GET 요청)이 줄어들어 S3 데이터 전송 요금을 절약할 수 있습니다.

5.2 CloudFront 배포(Distribution) 생성: 상세 가이드

이제 S3 버킷을 원본(Origin)으로 하는 CloudFront 배포를 생성해 보겠습니다.

SSL 인증서 발급 (선행 작업):
- AWS 콘솔에서 AWS Certificate Manager(ACM)로 이동합니다.
- 중요: CloudFront와 함께 사용할 인증서는 반드시 미국 동부(버지니아 북부) us-east-1 리전에서 생성해야 합니다. 다른 리전에서 생성한 인증서는 CloudFront 배포에 연결할 수 없습니다.
- '인증서 요청'을 클릭하고 '퍼블릭 인증서 요청'을 선택합니다.
- '도메인 이름'에 웹사이트의 루트 도메인(my-awesome-site.com)과 www 서브도메인(*.my-awesome-site.com 또는 www.my-awesome-site.com)을 모두 추가합니다.
- 검증 방법은 'DNS 검증'을 선택합니다. Route 53을 사용하고 있다면, ACM이 자동으로 필요한 CNAME 레코드를 생성해주므로 검증이 매우 간편합니다.
- 요청이 완료되고 상태가 '발급됨'으로 변경될 때까지 기다립니다.

CloudFront 배포 생성:
- AWS 콘솔에서 'CloudFront'로 이동하여 '배포 생성'을 클릭합니다.
- 원본 도메인: 드롭다운 목록에서 이전에 생성한 S3 버킷을 선택합니다. 주의: 목록에는 your-bucket-name.s3.ap-northeast-2.amazonaws.com과 같은 REST API 엔드포인트가 표시됩니다. 이것을 선택하는 것이 맞습니다. (웹사이트 엔드포인트가 아닙니다.)
- 원본 액세스: 'Origin access control settings(recommended)'를 선택합니다. 그리고 'Control setting 생성' 버튼을 클릭하여 새로운 OAC 설정을 만듭니다. 기본 이름으로 생성하면 됩니다.
- 생성 후 'Copy policy' 버튼을 클릭하여 S3 버킷에 적용할 정책을 복사하고, 'Go to S3 bucket permissions' 링크를 통해 S3 버킷의 권한 페이지로 이동하여 기존 정책을 이 내용으로 대체합니다. 이 새로운 정책은 오직 이 CloudFront 배포에게만 s3:GetObject 권한을 부여합니다. 이제 S3 버킷의 '모든 퍼블릭 액세스 차단' 설정을 다시 활성화하여 버킷을 비공개로 만듭니다.
- 뷰어 프로토콜 정책: 'Redirect HTTP to HTTPS'를 선택하여 모든 사용자가 안전한 HTTPS 연결을 사용하도록 강제합니다.
- 캐시 키 및 원본 요청: 'Cache policy and origin request policy(recommended)'에서 'CachingOptimized'와 'Managed-CORS-S3Origin'를 기본으로 사용합니다.
- 설정(Settings):
  - 가격 분류: '모든 엣지 로케이션 사용(최고 성능)'을 선택하거나, 특정 지역에만 서비스를 제공한다면 비용 절감을 위해 다른 옵션을 선택할 수 있습니다.
  - 대체 도메인 이름(CNAME): '항목 추가'를 눌러 커스텀 도메인(my-awesome-site.com과 www.my-awesome-site.com)을 모두 입력합니다.
  - 사용자 정의 SSL 인증서: 방금 us-east-1 리전에서 발급받은 ACM 인증서를 선택합니다.
  - 기본값 루트 객체: index.html을 입력합니다.
- 모든 설정이 완료되면 '배포 생성'을 클릭합니다. 배포가 전 세계 엣지 로케이션에 전파되는 데 5~15분 정도 소요될 수 있습니다.

5.3 마지막 단계: Route 53 레코드 업데이트

CloudFront 배포가 완료되면, 이제 사용자의 도메인 요청을 S3가 아닌 CloudFront로 보내도록 Route 53 설정을 변경해야 합니다.

Route 53의 해당 호스팅 영역으로 돌아갑니다.
이전에 생성했던 루트 도메인(my-awesome-site.com)의 A(ALIAS) 레코드를 선택하고 '레코드 편집'을 클릭합니다.
트래픽 라우팅 대상 선택: 기존의 'S3 웹 사이트 엔드포인트에 대한 별칭' 대신 'CloudFront 배포에 대한 별칭'을 선택합니다.
드롭다운 목록에서 방금 생성한 CloudFront 배포(d1234abcd.cloudfront.net과 같은 주소)를 선택합니다.
'변경 사항 저장'을 클릭합니다.
'www' 서브도메인의 CNAME 레코드도 마찬가지로 편집하여, 값이 루트 도메인(my-awesome-site.com)을 가리키도록 하거나, 직접 CloudFront 배포 도메인을 가리키도록 설정할 수 있습니다.

모든 설정이 완료되고 DNS가 전파된 후, https://my-awesome-site.com으로 접속해 보세요. 주소창에 자물쇠 아이콘과 함께 안전한 HTTPS 연결이 표시되고, 웹사이트는 전 세계 엣지 로케이션을 통해 그 어느 때보다 빠르게 로딩될 것입니다. 이로써 비용 효율적이고, 확장 가능하며, 빠르고 안전한 현대적인 정적 웹사이트 아키텍처가 완성되었습니다.

Continue Reading →

Building and Deploying Scalable Static Websites on AWS S3

July 20, 2023 / No comments

The Modern Web: A Paradigm Shift to Static Architectures

In the evolving landscape of web development, a significant trend has emerged: the resurgence of static websites, albeit in a far more powerful and flexible form than their 1990s ancestors. This modern approach, often associated with the JAMstack (JavaScript, APIs, and Markup) architecture, decouples the frontend user interface from the backend logic and database. Instead of generating pages on a server for every single request, websites are pre-built into a collection of static HTML, CSS, and JavaScript files. These files can then be served directly from a global network, offering unparalleled performance, security, and scalability at a fraction of the cost of traditional dynamic hosting. This architectural shift has unlocked new possibilities for developers, from personal blogs and portfolios to complex e-commerce platforms and enterprise-level applications.

At the heart of this revolution lies the need for a robust, reliable, and cost-effective solution for storing and delivering these static assets. This is where Amazon Web Services (AWS) and its foundational storage service, Amazon Simple Storage Service (S3), enter the picture. By leveraging S3's vast infrastructure, developers can host static websites that are automatically distributed, highly available, and can handle virtually any amount of traffic without the need for managing servers, patching operating systems, or worrying about infrastructure scaling. This document explores the comprehensive process of hosting a static website on AWS S3, from the initial bucket creation to advanced optimizations using a Content Delivery Network (CDN) and custom domains.

Deconstructing Amazon S3: More Than Just Storage

Before diving into the practical steps of website hosting, it's crucial to understand the fundamental nature of Amazon S3. To call it merely "storage" would be an oversimplification. S3 is an object storage service, which distinguishes it from file storage (like an external hard drive) or block storage (used for databases and operating systems). In an object storage model, data is managed as objects. Each object consists of the data itself, a unique identifier (or key), and metadata—a set of descriptive attributes about the object.

Core Concepts of Amazon S3

Objects: The fundamental entities stored in S3. An object can be any type of file: an HTML page, a CSS stylesheet, a JavaScript file, an image, a video, a PDF, or even a data backup. The maximum size for a single object is 5 terabytes.
Buckets: Objects are organized into containers called buckets. A bucket is analogous to a top-level folder or directory. Each bucket must have a globally unique name across all AWS accounts in the world. This is because bucket names can be used as part of the DNS address to access the objects within them. Bucket names must be DNS-compliant—they should not contain uppercase letters or underscores.
Keys: The key is the unique identifier for an object within a bucket. You can think of it as the full path and filename. For example, in the S3 URL `s3://my-unique-website-bucket/css/style.css`, "my-unique-website-bucket" is the bucket name, and "css/style.css" is the object key. The use of slashes in the key creates a logical folder hierarchy, even though S3's internal structure is flat.
Regions: When you create an S3 bucket, you must choose an AWS Region where it will reside. A region is a physical geographic location, such as `us-east-1` (N. Virginia) or `eu-west-2` (London). Choosing a region close to your primary audience can reduce latency for data access. However, as we'll see later, using a CDN like Amazon CloudFront makes the bucket's physical location less critical for end-user performance.
Durability and Availability: S3 is engineered for extreme durability and availability. It achieves this by automatically storing your objects across multiple devices in a minimum of three Availability Zones (AZs) within a region. An AZ is one or more discrete data centers with redundant power, networking, and connectivity. This design provides a durability of 99.999999999% (eleven 9s), meaning that if you store 10,000,000 objects, you can on average expect to lose a single object once every 10,000 years. It also provides 99.99% availability over a given year.

This underlying architecture makes S3 an ideal foundation for static website hosting. It eliminates the single points of failure common in traditional hosting environments and provides a level of data integrity that would be prohibitively expensive to replicate on your own.

Phase 1: Creating and Configuring the S3 Bucket

The first practical step is to create the S3 bucket that will hold your website's files. This process involves more than just picking a name; it requires careful consideration of security and access settings to ensure your website is both publicly accessible and secure from unauthorized modifications.

Step 1: AWS Account and Console Navigation

To begin, you need an AWS account. If you don't have one, you can sign up on the official AWS website. Many services, including a certain amount of S3 usage, are available under the AWS Free Tier for the first 12 months, making it easy to experiment. Once logged into the AWS Management Console, use the search bar at the top to find and navigate to the S3 service dashboard.

Step 2: The Bucket Creation Process

On the S3 dashboard, click the "Create bucket" button.
Bucket Name: Enter a globally unique, DNS-compliant name. A common and recommended practice is to name the bucket after the domain you intend to use, such as `www.my-awesome-site.com` or `my-awesome-site.com`.
AWS Region: Select a region. While a CDN will mitigate latency issues, choosing a region where you'll be doing most of your administrative work (uploads, etc.) can be convenient.
Block Public Access settings for this bucket: This is the most critical security setting during bucket creation. By default, AWS enables all four settings under "Block all public access." This is a security best practice designed to prevent accidental data exposure. For now, leave these settings enabled. We will not be making the bucket itself public. Instead, we will later use a more secure method (a bucket policy or a CloudFront Origin Access Identity) to grant the necessary read access. Disabling these settings carelessly is a leading cause of data breaches.
Leave other settings like Bucket Versioning and Tags at their default values for now. We can enable them later if needed.
Click "Create bucket."

Step 3: Uploading Your Website Files

With your bucket created, you can now upload your website's content. This includes your `index.html` file, any other HTML pages, CSS folders, JavaScript files, images, and other assets.

Click on the name of your newly created bucket in the S3 console.
Click the "Upload" button.
You can either drag and drop your files and folders directly onto the console window or use the "Add files" and "Add folder" buttons. Ensure you upload the entire project structure, maintaining the relative paths between your files.
Click "Upload" to begin the transfer. The time it takes will depend on the size of your website and your internet connection speed.

At this point, your files are securely stored in S3, but they are not yet accessible to the public, nor is the bucket configured to behave like a web server.

Phase 2: Enabling Static Website Hosting and Public Access

Now that the files are in place, we need to instruct S3 to serve them as a website. This involves two key steps: enabling the static website hosting feature on the bucket and then creating a policy that allows the public to read the files.

Step 4: Activating Static Website Hosting

In your S3 bucket, navigate to the "Properties" tab.
Scroll down to the bottom of the page to the "Static website hosting" card and click "Edit."
Select "Enable" for Static website hosting.
In the "Index document" field, enter the name of your website's main entry file. This is almost always `index.html`. When a user navigates to a directory (like the root of your site), this is the file S3 will serve.
Optionally, you can specify an "Error document." This is the HTML file that S3 will serve if a user requests a page that does not exist (resulting in a 404 Not Found error). A common name for this is `error.html` or `404.html`. This is highly recommended for a professional user experience.
Click "Save changes."

After saving, S3 will provide you with a unique "Bucket website endpoint" URL on the same card. It will look something like `http://.s3-website..amazonaws.com`. If you try to visit this URL now, you will receive a 403 Forbidden error. This is expected, as we have not yet granted public access to the objects.

Step 5: Granting Public Read Access with a Bucket Policy

The outdated and insecure method for granting access is to manually make each file or folder public using Access Control Lists (ACLs). This is tedious, error-prone, and not recommended. The modern, secure, and scalable way is to apply a bucket policy.

A bucket policy is a JSON-based document that defines who can perform what actions on the objects within the bucket.

Navigate to the "Permissions" tab of your S3 bucket.
Confirm that "Block all public access" is still turned on. We will now create a specific exception. Click "Edit" in the "Bucket policy" section.
You will see a policy editor. Paste the following JSON policy into the editor. You must replace YOUR-BUCKET-NAME with the actual name of your S3 bucket.


{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "PublicReadGetObject",
            "Effect": "Allow",
            "Principal": "*",
            "Action": "s3:GetObject",
            "Resource": "arn:aws:s3:::YOUR-BUCKET-NAME/*"
        }
    ]
}

Let's break down this policy:

"Effect": "Allow": This statement is permissive; it grants permissions.
"Principal": "*": The principal is the entity being granted permission. The asterisk (`*`) is a wildcard that means "everyone" or "anonymous users."
"Action": "s3:GetObject": This specifies the allowed action. `s3:GetObject` is the permission to read or download an object from S3.
"Resource": "arn:aws:s3:::YOUR-BUCKET-NAME/*": This defines which resources the policy applies to. The Amazon Resource Name (ARN) specifies your bucket. The `/*` at the end is a wildcard that means the policy applies to all objects inside the bucket, but not the bucket itself.

Click "Save changes."

Now, if you go back to the "Properties" tab, copy the bucket website endpoint, and paste it into your browser, your website should load successfully! You have now deployed a globally available static website.

Phase 3: Connecting a Custom Domain with Amazon Route 53

While the S3 endpoint URL works, it's not professional or memorable. The next logical step is to connect your website to a custom domain that you own, such as `www.my-awesome-site.com`. The AWS service for managing DNS is Amazon Route 53.

Step 6: Setting Up a Hosted Zone in Route 53

If you don't already own a domain, you can purchase one through Route 53 or any other domain registrar like GoDaddy or Namecheap. Once you have a domain, you need to tell Route 53 to handle its DNS requests by creating a "Hosted Zone."

Navigate to the Route 53 service in the AWS Management Console.
In the left navigation pane, click "Hosted zones."
Click "Create hosted zone."
In the "Domain name" field, enter your root domain name (e.g., `my-awesome-site.com`).
Select "Public hosted zone" as the type.
Click "Create hosted zone."

Route 53 will now create a hosted zone for your domain and provide you with four "NS" (Name Server) records. If you purchased your domain from a third-party registrar, you must log in to their management portal and update the domain's name servers to these four AWS values. This delegation process can take up to 48 hours to propagate across the internet, but is often much faster.

Step 7: Creating DNS Records to Point to S3

Once your hosted zone is active, you need to create records that map your domain name to your S3 website endpoint. We will create an "A" record. An "A" record maps a domain name to an IP address. However, since the IP address of the S3 endpoint can change, Route 53 has a special feature called an "Alias" record.

An Alias record is an AWS-specific type of record that lets you map a domain name to a specific AWS resource, like an S3 bucket or a CloudFront distribution. Route 53 automatically resolves the alias to the resource's current IP address, making it resilient to changes.

In your hosted zone, click "Create record."
Leave the "Record name" blank to create a record for the root domain (`my-awesome-site.com`).
For "Record type," select "A – Routes traffic to an IPv4 address and some AWS resources."
Toggle the "Alias" switch to "on."
In the "Route traffic to" dropdown, choose "Alias to S3 website endpoint."
In the next dropdown, select the region where your S3 bucket is located.
A final dropdown will appear, which should auto-populate with your S3 bucket's endpoint. Select it.
Leave the "Routing policy" as "Simple routing."
Click "Create records."

Optionally, you can create a `www` version of your site. The best way to do this is to create a separate S3 bucket named `www.my-awesome-site.com`, configure it for static website hosting, but instead of uploading content, you configure it to *redirect* all requests to your root domain bucket (`my-awesome-site.com`). Then, you would create another Alias A record in Route 53 for the `www` subdomain pointing to this new redirecting bucket.

After a few minutes for the DNS changes to take effect, you should be able to type your custom domain into your browser and see your S3-hosted website.

Phase 4: Global Performance and Security with Amazon CloudFront

Your website is now live on a custom domain, but it's served from a single AWS region. This means users far from that region will experience higher latency. Furthermore, the connection is over HTTP, not the secure HTTPS protocol, which is a standard for modern websites. We can solve both of these issues, and more, by using Amazon CloudFront, AWS's global Content Delivery Network (CDN).

The Role of a CDN

A CDN works by caching copies of your website's content (images, CSS, JS, etc.) in a worldwide network of data centers called "Edge Locations." When a user requests your website, they are routed to the nearest edge location, which serves the cached content. This drastically reduces latency. CloudFront has hundreds of edge locations globally. It also provides a layer of security, absorbing traffic spikes and protecting against certain types of DDoS attacks.

Step 8: Securing Your Domain with AWS Certificate Manager (ACM)

Before creating a CloudFront distribution, we need an SSL/TLS certificate to enable HTTPS. AWS Certificate Manager (ACM) provides free public SSL/TLS certificates for use with AWS services like CloudFront.

Navigate to the AWS Certificate Manager (ACM) service. Important: You must be in the `us-east-1` (N. Virginia) region to request a certificate for use with CloudFront. This is a CloudFront requirement, regardless of where your other resources are located.
Click "Request a certificate."
Select "Request a public certificate."
For "Domain names," add both your root domain (`my-awesome-site.com`) and a wildcard version (`*.my-awesome-site.com`) to cover all subdomains like `www`.
Choose "DNS validation" as the validation method. This is generally the easiest method when using Route 53.
After requesting, ACM will ask you to create specific CNAME records in your DNS to prove you own the domain. If you are using Route 53 for the same account, ACM will present a button that says "Create records in Route 53," which will do this for you automatically.
It may take a few minutes to a few hours for the certificate status to change from "Pending validation" to "Issued."

Step 9: Creating the CloudFront Distribution

With the certificate ready, we can now set up the CloudFront distribution.

Navigate to the CloudFront service in the AWS Console.
Click "Create distribution."
In the "Origin domain" field, do not select your S3 bucket from the dropdown list. Instead, paste your S3 bucket's **website endpoint URL** (the one from the static hosting properties page) into the field. This ensures CloudFront correctly handles index documents and redirects.
Under "Viewer protocol policy," select "Redirect HTTP to HTTPS." This enforces a secure connection for all users.
In the "Cache key and origin requests" section, leave the "Cache policy and origin request policy" as their defaults for now (`CachingOptimized` and `Managed-CORS-S3Origin`).
Under "Settings," in the "Alternate domain name (CNAME)" field, add your domain names (e.g., `my-awesome-site.com` and `www.my-awesome-site.com`).
For "Custom SSL certificate," select the certificate you created in ACM from the dropdown list.
Set the "Default root object" to `index.html`. This tells CloudFront what file to request from the origin when a user requests the root URL.
Click "Create distribution."

The distribution will take 5-15 minutes to deploy globally. Its status will change from "InProgress" to the date and time it was last modified. Once deployed, you will be given a CloudFront domain name, such as `d12345abcdef.cloudfront.net`.

Step 10: Updating Route 53 to Point to CloudFront

The final step is to update your Route 53 records to point to the new CloudFront distribution instead of the S3 bucket.

Go back to your Hosted Zone in Route 53.
Edit the "A" record you created earlier for your root domain.
Ensure the "Alias" toggle is still on.
In the "Route traffic to" dropdown, now choose "Alias to CloudFront distribution."
In the final dropdown, your new CloudFront distribution's domain name should appear. Select it.
Save the record. Repeat this process for your `www` record if you have one.

After a few minutes for the DNS to update, your custom domain will now serve traffic through the global CloudFront network, complete with HTTPS encryption. Your website is now faster, more secure, and incredibly scalable.

Advanced Security: Locking Down the S3 Bucket

One final, crucial step for a production-ready setup is to ensure that users can only access your website content through CloudFront, not by going directly to the S3 bucket's website endpoint. This prevents bypassing the CDN and its security features.

This is achieved using an Origin Access Control (OAC). OAC is a feature that creates a special CloudFront identity and a corresponding S3 bucket policy that only allows that identity to access the objects.

In your CloudFront distribution settings, go to the "Origins" tab and edit your S3 origin.
For "Origin access," select "Origin access control settings."
Click "Create control setting." A new setting will be created with default options, which are fine.
After creating it, CloudFront will display a bucket policy that you must apply. Click the "Copy policy" button.
Go to your S3 bucket's "Permissions" tab and edit the bucket policy. Replace the public read policy we created earlier with this new policy from CloudFront. It will look something like this, granting `s3:GetObject` permission only to the CloudFront service principal associated with your specific distribution.
Save the new bucket policy. Now, if you try to access the S3 website endpoint directly, you will get a 403 Forbidden error, but your site will continue to work perfectly through the CloudFront URL and your custom domain.

By following these steps, you have architected a professional, production-grade static website. It leverages the durability and low cost of Amazon S3 for storage, the global performance and security of Amazon CloudFront for delivery, the reliability of Amazon Route 53 for DNS, and the security of AWS Certificate Manager for HTTPS encryption. This serverless architecture provides a powerful platform that can scale to millions of users without any infrastructure management, allowing you to focus solely on creating compelling web content.

Continue Reading →