AWS Solution Architect Professional Notes
Starting with Few Important components/services:
EBS is persistent storage ideal for database, file system.
EBS is made fault tolerant by creating snapshot. To have snapshot consistent, stop the instance or flush memory
Backup strategy
Retention period of snapshot.
Snapshots are stored in S3.
Snapshot needs to be copied in other region and then volume can be created out of it.
EBS volumes are AZ specific.
Note: DR strategy for all DBs are around snapshot creation and creating DB in another region.
Resilient: handle exceptions, graceful handling
Modular: High Cohesion (keeping similar kind of entities together + low coupling)
AWS Step Functions for Orchestration
Use SQS or messaging/aync functions to decouple services
ELB and ASG: Regional services. If EC2s in different region, use R53.
Mulitple AZs to be used for fault tolerant.
DR Approaches
-Backup & Restore (Slowest + low cost)
-Pilot Light: only few critical components are on cloud
-Warm Standby: Smaller env with all processes
-Multi Site (fastest + highest cost)
With a S3 Stored Storage G/W Snapshot, we can launch an EBS volumes or another storage g/w on an EC2
Storage G/W can also provide TCP data for EC2 on AWS
S3: Object storage (photos, videos, files). AMI, snapshots are stored in S3. By default, multiple AZs, cross regions.
RDS: Multi AZ , read replicas. Automatic backups. Transactional logs are stored in RDS with logs 5 mins back. RDS DB Snapshot (manual backup)..no transactional logs
Multi AZ has synchronous Slave for a Master for Multi AZ RDS.
Consolidated Billing: (AWS Organizations)
Turn off sharing of RI instances
Master account is called Payer account.
AWS budget uses cost explorer.
Redshift doesn't work with spot instances.
AWS Organizations:
Creates a Root account which would have OUs or individual account under it. Root is a logical entity.
OU is an container of OU and accounts. OU can only have single parent.
An account can be member of only one OU.
Only 1 Master account which is the Payer account. Single payer account. Logging, control.
All other accounts are member account.
Service Control Policies
AWS Organizations is eventually consistent to replicate settings to all regions.
SCP has no effect on the Master/primary account. Apply SCP at OU level.
AWS organizations allows us to have common LDAP services, shared services.
AWS Organization creates service linked role for Master account with member account.
RTO: Recovery Time Objective- Time taken to restore service after crash.
File G/W: Files are stored in S3.
Storage Gateway: is Block storage. You take snapshot which is copied to S3. Create ec2 from that snapshot. iSCSI interface is placed on on-prem and interface interacts with AWS Storage Gateway service in AWS. Storage G/W is for backup…store.
Volume G/W:
Cached Volume G/W would cache frequently access data on-prem and rest of the stuff on AWS.
Stored volumes: on-prem has main set of data which gets backed-up on AWS asynch
Glacier: Lowest cost storage. Retrieval times needs to be considered.
Snowball: If data takes more than 6-7 days to transfer/copy , use snowball.
VM Import/Export to S3 is free of cost by AWS.
CloudFormation:
Works as Infra as Code. Template creates a Stack.
Change set is created based on delta based on template update.
Resources must always exist in Cloudformation template.
Naming conventions for "Type": AWS::Name of the service: :Instance
Conditions is how you control the template.
Intrinsic functions can be used in Resources, metadata, outputs and update policy attributes.
GetAtt, Ref, FindInMap, GetAZs, ImportValue are few intrinsic values.
CreationPolicy: injects dependency on other resource creation. waitCondition: allows template to wait/delay for creation on policy.
Delete Stack Template would delete all the components created.
Deletion Policy attribute: For DB, snapshot or deletes.
Nested Stacks: reuse common templates. For eg. ALB
Elastic Beanstalk:
Least control on Infra. OpsWork sits in middle . CloudFormation highest control on Infra
Beanstalk created EC2 doesn't have EC2 volumes backedup (ethereal storage). So, do not create RDS with beanstalk
Beanstalk focuses on Application. An application version points to S3 which has the deployable code. (war, ear, jar, zip)
Environment (runs one application version)
Beanstalk runs an Agent Host Manager on EC2 for application monitoring, log files rotation and publishing logs to S3. Only available with Beanstalk.
Packer open source tool use to create AMI.
Single container/ multiple container docker image.
Beanstalk creates S3 bucket with beanstalk-region-accountid
Beanstalk allows custom web server deployment.
AWS Ops Works: Deploying and monitor instances, ALB, DB and application. You CANNOT change region of the stack once created.
Chef : Configs are universally applied.
Cookbook : Contains the configs and instructions called recipes.
Recipe: written in Ruby is set of instructions abt resources and their order.
Stack needs region and operating system (win or linux). Can have multiple layers.
Layers work on same set of instructions. Each layer has its own recipes.
Ops Work Lifecycle: Setup, Deploy, Configure, UnDeploy and Shutdown
Instances: OpsWork installs the agent on instances.
Instance type: 24/7, Time based & load-based.
When communication between OpsWork and OpsWork agent on instance breaks --> Auto healing starts
OpsWork doesn't support Application Load Balancer..only classic.
OpsWork supports CloudTrail logs, event logs and chef logs.
Only 1 CLB per layer in Stack.
AWS Config:
Helps in audit, maintenance and compliance
Overall view of resource.
Configuration item is created whenever a change is recorded on the resource.
History is collection of items. Stored in S3 and SNS.
AWS Service Catalog: creates Portfolio which uses cloudformation
Product is application. Catalog is collection of products. Products are delivered as Portfolios.
Service catalog has constraints which restricts cost and components.
Launch, notification and template constraints.
Cloudwatch:
Metrics are Data points created regionally.
Namespace is container for cloudwatch metrics.
Alarms: OK, ALARM & INSUFFICIENT DATA
Period ( in seconds), evaluation period ( no. of datapoints/ per period), datapoints to alarm: how many datapoints to raise alarm.
Alarms can trigger EC2 actions, ASG or SNS actions..NO LAMBDA or SQS
Empty Log stream retention period is 2 months only.
Log retention from 1 day to 10 years. Logs are stored indefinite.
Cloudwatch logs insight needs JSON based events.
Encryption and metric filters are applied at Log Group level.
Unified cloudwatch agent: works with windows as well. Faster than old one.
Cloudwatch logs --> kinesis, kinesis streams or lambda (real time)
Cross account logging possible
Aws events can be shared with other aws accounts.
Synthetics: Canaries scripts tries to mimic Customer actions. Checks API latency and endpoints.
ServiceLens: integrated with Xray to provide end to end view of application.
Systems Manager:
Needs ssm agent to be deployed and running on host.
Actions: run command, session manager, patch mgr, automation, state mgr
Maintenance window
SSM resource groups are collection of resources in a region
State manager to run scripts on a recurring basis, patch updates, software updates
Amazon QuickSight for visualization
Resource DataSync syncs data from multiple accounts.
Symmetric key better than asymmetric keys
CSR Request --> X509 Types --> private key --> certificate chain
CSR should have CN (FQDN), Country and etc
PEM format (Privacy Encoded mail)
SSL or Session Keys are generated for the session only
AWS VPC:
5 IPs of every subnet is reserved. First 4 and last.
1st - base n/w
2nd- vpc
3rd - router
4th - future use
5th - last ip
VPN setup over internet ipsec based is fastest to achieve connection with on-prem. Performance could be slow. VPN has option of static and dynamic routing.
Direct connect only supports BGP Dynamic routing.
10 customer G/W can connect to 1 VGW (Soft limit) through ipsec tunnel
BGP is Dynamic Routing Mechanism
Autonomous System Number (ASN) ( Eg. Customer G/W and AWS G/W) used by BGP
LAG (Link Aggregator Groups) joins the links together as one. The links must be of same bandwidth.
BGP Communities for routing preferences.
Route table needs to point to VGW. There's no target with direct connect or vpn.
BGP prefers Direct Connect over vpn site to site
Static route is preferred over BGP (Dynamic)
Longest subnet mask is preferred ( /24 over /16)
From customer g/w to vpc, we need to configure local preference or route for BGP
Direct Connect G/W is a global resource and is not linked to a single region. This cloud be attached to multiple regions VGW.
You can ONLY attach one VGW with one Direct Connect G/W.
Private or public VIF could be max 50 per direct connect.
Cannot have more than 4 Dedicated connections per LAG. 10 LAGs per Region.
200 Direct connect gateways per account.
Inter region peering is allowed.
Transit gateway are regional resource. Direct connect g/w is Global.
Transit G/W attaches a VPC or VPN. Doesn't connect to Direct connect
Enhanced EC2 networking -IOPS - SR I/OV- Low latency- HVM AMI
Spread placement group provides Distinct underlying hardware. Max 7 instances each AZ.
NAT G/W is per AZ.
EC2 creates ENI primary by default. ETH0 types
Interface endpoint is an ENI entry.
Load Balancer:
NLB supports TCP and TLS
handles million of request
access log, cross zone LB are disabled by default
lambda as target type not supported for NLB
if instance id is the target type, ec2 instances get the client ip directly
in case of IP address, we need to use proxy protocol
microservices works with IP address
Proxy protocol enables actual client headers to be sent ahead (Only for TCP /layer 4)
For https/http, use x-forwarded- for header
ELB is region specific
Non standard webserver health check should be done with TCP instead of http
Session affinity/stickiness is cookies based
ELB doesn't support 2 way authentication over HTTPS. Client side certificates are not checked.
TCP would allow via proxy protocol settings .
ALB supports Server Name Indication (SNI) certificates (multiple certs pointing to same IP). CLB doesn't support SNI.
Re-resolving of DNS is important for ALB to respond correctly. Caching caches IP
Access logs are disabled by default. Details of client, protocol and etc.
API calls are in Cloudtrail.
100 rules per ALB
One Target Group Can only be attached to one Load balancer.
Target of target group can be ec2 instance, ecs application, Private IP address or one lambda function.
Cannot register the IP of another ALB in same vpc as target.
On-prem instance's IP address can be used as well as same ip address with different ports (microservice).
DNS:
CLB/ALB/NLB, CDN, S3 are routed using DNS. IPs can change so use Alias record.
Cannot create CNAME for apex/naked domain name.
Routing policy: Simple, failover , geo-location, latency, weighted routing policy.
Weighted routing policy has weights defined from 1 -255
No health check would be treated as healthy target. Evaluate target health no would not care about health check.
About 8 recordset as part of multi-value answer r53 policy
R53 Resolvers: (regional service)
Inbound(on-prem to AWS) and outbound (aws vpc to on-prem)
Internet resolver is created by default.
CloudFront:
Global service - not regional
PCI DSS and HIPAA compliant
For dynamic content loading, ttl =0 and use query strings
Streaming distribution is RTMP CDN, progressive/download is web distribution
Rtmp is for adobe media server streaming
CDN cache get and head type request
Signed URLs , cookies and object access identifier
URL should have valid end date and time for validity
Cloudfront Access logs needs to be enabled
Cannot use signed urls or cookies if existing url uses expires, policy, signature, key-pair-id
TTL Cache keeps sending GET/Header call to Origin with a flag isModifiedSince
Default TTL is 1 day 86400
S3 bucket only allow http connection. No https
Server Name Indication (SNI) for multiple certificates . SNI should be supported by browser.
Chunk encoding is supported by Apple HLS (HTTP live streaming), Adobe http dynamic streaming (HDS), Microsoft smooth streaming
Use elastic transcoder to convert video to HLS.
Signed cookies better choice than signed URLs for media streaming.
Signed URLs are more appropriate for static content.
RTMP should have only S3 bucket as origin and should also have web distribution for media player
Cloudfront viewer reports --> user location, devices, browsers
Compute:
AutoScalingGroup:
PCI DSS compliant
One EC2 instance can be part of only one ASG
Instance states -> pending, health check --> in service --> terminating --> terminated
In service to standby or detaching state
Regional component
Merge ASG only via CLI
Suspend ASG policies to troubleshoot EC2s
VMs concept: Traditional approach is Physical H/W --> OS --> Apps
VMs: Physical H/W --> hypervisor --> VM (OS + Apps) ..supports diff OS
Containers: Applications and binaries (libraries) are packaged together
Docker has application, libraries and runtime.
ECS -> Task Definition --> Docker Image
Kubernetes is container management system
Docker Enterprise Edition is also container management system used by ECS
Fargate launch type --> serverless container
AWS Glue is for ETL.
ECS is regional service
Task Definition can have upto 10 containers defined
Create Task Definition (max 10 containers) --> create service (no. of tasks required) --> run the service --> creates tasks which are running containers --> accessed by ENIs
Container agent runs on every container which is EC2 type and reports running tasks and resource utilization.
Clusters are region specific. Can contain both EC2 and Fargate launch types.
ECS Service is kind of ASG. Runs behind a ALB
Only one LB/TargetGroup per Service
AWS doesn't recommend one CLB infront of multiple services.
Mapping of container port to host port is dynamic port mapping in alb
If host port is 0 then it becomes dynamic host port
Lambda:
Trigger based. Passes events. S3, SNS, API triggers.
Configuration for lambda: Memory (128 -3 gb), Maximum execution time (3- 900 sec), IAM execution role.
Networking of lambda would need the vpc, subnets and security group
Invocation of Lambda:
Event sources: SNS, S3, Dynamo DB and etc
Https/ rest: API Gateway backend
Aws sdk: codes to call lambda
Events: scheduled or cron job with lambdas
Event source mapping is done for lambda triggers.
Event sources (S3, CDN) maintains the mapping for lambda
But streams based (Kinesis and DynamoDB) are maintained at lambda
1000 concurrent execution for lambdas per region
AWS layers is zip packaged with runtime, libraries and code.
For Async lambda, the function retries twice.
Dead Letter Queue is configured to handle the lambda async responses.
Stream based polling will stop the lambda processing if there's an issue.
SQS based polling would return the message in the queue if not processed and would be available in the queue after visibility timeout
Lambda@Edge needs to be created in us-east-1. Assign upto 4 lambda on CDN
Also used for http redirect, auth functions
API Throttling is ..how many get/put request being allowed per sec. after that http respoce 429 is returned.
API GW Caching is chargeable per gig of storage
API GW Proxy integration passes the client info to backend system
AWS SAM is based on cloud formation for serverless application
AWS Batch uses containers to run
Batch --> Job --> Job Definition --> Job Queues --> Priorities
Storage Service:
5GB file upload needs multipart. Upto 5TB
Once version is enabled on S3 , it cannot be disabled only suspended.
Delete marker if file is selected not version
Storage Classes:
S3 Standard --> highly durable. Suited for frequent access
RRS (Reduced Redundancy Storage) is not recommended and is on demise path
Infrequent access, IA_One Zone. Suited for infrequent access
Intelligent tiering. Expensive then IA
Glacier -> suited for archival. Retrieval time is upto 5 mins with special provisions
Glacier deep archive needs 12 hours
IA, IA_One_Zone and intelligent tiering have minimum storage for 30 days and size of 128kb
Glacier for 90 days and deep archive for 180 days
Glacier by defaults encrypts data being stored.
Data cannot be uploaded directly via console into Glacier
Not to use Glacier when we have real time data retrieval or frequently changing data
S3 Static hosting URL : bucketname.s3-website.region.amazonaws.com
For S3 Pre-sign URL: from CLI, aws s3 presign --expires-in (secs) s3 location of file
S3 not suited for dynamic data, or long archival (Glacier) data
S3 Object replication also copies metadata, ACL and object tags. Supports Cross Region as well as Same region replication
Delete marker not replicated
AES 256 Encryption is also called SSE S3 .
S3 allows 3500 requests per sec for PUT, COPY, POST and DELETE
5500 for GET and HEAD per prefix
Amazon s3 and glacier select to query s3 data
EBS Backed EC2 volumes. Instance store is ephemeral.
Instance store provides higher IOPS.
For snapshot of root volumes, stop and take snapshot. Cannot detach root volumes
Redundant Array of Independent Disks (RAID) volumes.
RAID 0 has the best IOPS performance. RAID1 has redundancy. RAID10 is mix of both
EFS follows NFS protocol. Shared by multiple EC2s in a VPC
Mount targets for EFS are ENIs. ENI to be created per AZ not per subnet
EFS Suited for BigData and Analytics (high throughput, readafterwrite consistency and low latency operations)
Media processing workflows (video and audio), Content management (web serving)
And home directories
EFS allows 2 storage class (Standard and Infrequent Access) both highly durable but retrieval charges higher in IA storage and cost of storage is less.
EFS lifecycle policies shift files from Standard to IA. File metadata always stored in Standard.
NFS Port 2049 should be allowed for EFS
EFS mount points are created on EC2 instances for sharing
AWS Backup Service used to back EFS data
AWS DataSync to help migrate data from on-prem to EFS or EFS to EFS
Open after close consistency and strong consistency
Performance modes: general purpose (low latency) and max i/o. cannot change once created
Burst and provisioned throughput modes
FSx Windows File Server supports windows and linux mode. Supports SMB(Server Message Block) windows and CIFS(Common Internet File System) protocol.
EFS supports only Linux whereas FSx supports Windows as well. Needs AD Integration for win
SMB port for TCP/UDP 445, RPC at 135
FSx also works with ENI
FSx Lustre is High Performance Computing, distributed, low latency,
Works with linux servers. Needs fsx client to be installed on the linux servers with mount for lustre
Not a repo to store long term data..use S3.
port 988 to be opened for FSx Lustre. No data replication or Multi AZ support for Lustre
SQL Database:
RDS allows Multi AZ synchronous replication and standby instance
Use provisioned iops for multi AZ setup
Loss of primary DB, patching, ec2 or ebs failure would lead to Failover
RDS Failover happens using DNS name
Automated backups are taken from StandBy Instances
RDS Read Replicas are created from primary instance for read operations and data is copied asynchronously.
Read Replica can also be Multi AZ.
Aurora read replicas are synchronously replicated. No standby.
Separate storage and compute for Aurora. Read replicas can be promoted to Primary
Aurora supports PostGreSQL and MySQL
Cluster endpoint is connected to Primary DB. Gets updated incase of failover.
Reader endpoint --> load balances to all reader replicas
Instance endpoint --> direct connection to instance
Custom endpoint --> logical grouping created by user
Aurora can have 15 read replicas. ASG should have at least 1 replica.
Aurora failovers to read replica
For Encrypting Aurora Cluster, we have to take a snapshot and create cluster from snapshot and choose Encryption.
Aurora Global DB: Primary in one region and secondary (read only) in another region. Secondary has 16 Aurora Replicas for readonly
Aurora MySQL can query data from S3 directly
5 Cross Region Read replicas MySQL
Cross region replication happens when clusters r publicly accessible
Aurora DB has Multi Master clusters. Master has read/write both
Aurora serverless works with Aurora Capacity Units..combo of vCPU and Storage
Aurora serverless a good option for reporting or unpredictable loads
Supports only 1 AZ
ElasticCache:
In memory data. Reduces read workload
Has EC2 clusters running in backend. Automatic failover happens
Memcached - plain cache..no DB
Redis - noSQL DB
Memcached is not persistent. Good for stateless app transient data
Doesn't support MultiAZ. No encryption support.
Redis: multiAZ, persistent, snapshotting of cache. Supports Pub/Sub.
Copying Redis snapshot to different region needs copying snapshot in s3 and moving it in that region
Have 2 nodes in Redis Cluster with Multi AZ setup for Automatic Failover.
Complex data operations
Dynamodb:
Supports 3 AZ replication of data
Max 400KB datatype storage
Partition storage back by SSD drives.
Global Secondary Index can be any 2 attributes
GSI has separate storage
Local Secondary Index needs to be with same Primary Key (type and name)
DynamoDB backups cannot be copied to other regions
DynamoDB restore happens to new table
DAX (DynamoDB Accelerator ) for caching micro seconds response
DAX deployed in VPC. Runs in clusters with 9 read replicas and 1 primary
DAX TTL is 5 mins
DynamoDB streams are stored for 24 hrs
TransactWriteItems and TransactGetItems commits or fails all
Sparse indexes doesn’t copy GSI data if value of sort key is empty
Analytics:
Redshift: OLAP DB for Data Warehousing purpose.
AWS Managed. DWH solution for structured data (RDBMS).
Supports replication. Doesn't support huge real time streams ingestion.
Columnar data storage. Min 160GB size
Snapshots stored in S3. Retention 0-35 days
Redshift supports single AZ
Athena:
Queries S3 using SQL kind statements. AWS Managed.
Integrated with QuickSight for visualization support.
For fast retrieval , store data in columnar fashion using EMR. (Apache Parquet & ORC)
Kinsesis:
Streams of data. Multiple sources sending KB /MB of data.
Kinesis is AWS managed. Terabytes of data per hr
Kinesis stream take the data real time and convert them into shards.
Data from stream can be sent to dynamodb, S3 or redshift
Kinesis stream retention by default is 24 hrs
Replicates synch across 3 Azs
Firehose:
Firehose takes streams of data (logs, IoT or Kinesis streams) as the input and delivers the streams to services such as S3, ElasticSearch, Splunk and Redshift
Firehose also transforms the data. Encrypts, compress and batch is also available.
For sending streams to Redshift, first send the streams of data to S3 and then transform
Kinesis Analytics:
Used for running analytics on streams data. Allows running queries to your streams data
EMR (Elastic Map reduce):
Uses hadoop framework with EC2 (clusters) and S3(store input and output).
EMR is used for web indexing, data mining, machine learning, bioinformatics etc
Apache Hive is for DWH, hBase is distributed DB, Spark is compute engine and Apache Hadoop is Software
Hive is used for Querying the DWH using Hive QL.
Pig is used for Analytics ..Pig Latin
EMR has Master node (distributing load. Has software and hbase DB)
Core node..used in Multi node setup. Has software and hbase DB..no distribution logic
Task node..optional component. Only has software..apache hadoop..no persistence. Spot instanc
EMR runs cluster in single AZ.
EMR is not for real time ingestion..use kinesis. It needs S3 to store the input data.
Kinesis connector allows ingestion of streams data into EMR using Hive QL or Pig script
AWS DataPipeLine:
Orchestration which allows on-prem and cloud operations. Moving data and tansforming.
Task Runner: Needs to be installed on computes (on-prem VMs or EC2 or EMR Clusters). Communicates with the Pipeline
Data Nodes: Specifies the input and output data nodes. SQLDN, RedShiftDN, DynamoDBDN
Database supported: RDS, DynamoDB and JDBC
Quicksight:
DataSources --> DataSets (transformed data) --> Analyses -->Visuals --> Dashboard
SPICE is the engine for Quicksight. Superfast Parallel InMemory Calculation Engine
Import the data into SPICE
GLUE:
Fully managed ETL service. Glue Crawlers, Job Authoring (Transform), Job Execution
Glue Data Catalog: Central repo for storing metadata
Glue Crawler connects to datasource and identifies the type of DS and populates Data catalog
Use Glue when needs to run ETL with S3 and query using Amazon Athena and Redshift Spectrum
Glue is based on Apache Spark..unlike Data Pipeline (EMR)
Kinesis Video Streams:
Intakes streams of data from video, images producers and sends it to EC2s
Usecase: need to stream videos to many devices real time
Put media supports only mkv format
AWS X-Ray:
Helps in tracking operations of distributed applications.
Has a trace ID. Segment
Not an audit or compliance tool
X-Ray Agent- installed on service to send traces. EC2, ECS, Lambda
X-Ray Group is group of traces to generate service graph
Security:
IAM Role: Assumes permissions using STS for the policies attached.
Resource based policy: allows to have cross account policies without IAM roles being compromised
Service Role: role specific to services. Have permissions defined
S3,Glacier, SNS and SQS allows Resource Based Policies
AWS STS: Secure Token Service to grant temp credentials to trusted users.
Assume role : using IAM roles
Assume role SAML: SAML 2.0 Identity Provider..corporate AD
Assume role with WebIdentity: OIDC web identities such as Google, FB
GetSessionToken: used for MFA
GetFederationToken: used by IAM users
For Web identity federation login, register as a developer with IDP. You'll get a developer id
Register ur application with these IDPs to get an application id. Create roles for them
OIDC ID from IDP as trust
SAML 2.0 IDP. For enterprise based AD login
AssumeRoleWithSAML call includes ARN of IDP XML, ARN of role requested and SAML assertion from IDP (ADFS)
AWS Signin URL: https://signin.aws.amazon.com/saml
Identity Broker calls GetFederationToken directly to STS
Active Directory Services:
- AWS Microsoft AD: Suitable for a fresh AD setup.
- AD Connector: connect to on-prem AD for AWS services. Existing
- Simple AD. Good if we have users less than 5000. Samba 4 AD
AWS Microsoft AD supports Simple Edition (upto 5000 users/30000 objects) and Enterprise Edition (upto 500K objects).
This is only setup compatible with RDS Microsoft SQL DB.
For authentication via AD between AWS and on-prem, we need VPN setup. Direct connect is not encrypted but VPN is.
AD Connector is a proxy service for on-prem AD setup.
Key Management System:
AWS KMS is FIPS compliant. Cloudtrail monitors the usage
KMS is Global but keys are regional
CMK is used to create Data key which is used for encrypt and decrypt
Customer Managed CMK and AWS Managed CMK (S3 default)
CMK keys have key policy to decide user permissions
GenerateDataKeyWithoutPlaintext
AWS Server Side Encryption: SSE-S3, SSE-C & SSE-KMS
SSE-S3 is free
Header x-amz-server-side-encryption:aws-kms
Data key/volume key/object key are all same
Glacier automatically encrypts data at rest with AES 256.
Storage G/W uses SSL in transit
EMR uses SSE-S3 while copying data in S3
RDS optionally uses AWS CMK to encrypt
Cloudtrail:
CloudTrail trail can be created to send trail logs to S3 bucket
Cloudtrail logging for all regions is sent to one S3 bucket
Allows 5 trails per region
Stores 90 days of data
Cloudtrail to S3 is SSE-S3 encrypted
Cloudtrail can be configured to receive logs from different AWS account
For putting the logs in S3 from different account, use bucket policies
For reading the logs in S3 from different account, use IAM role
Limit access to S3 bucket storing cloudtrail logs. Provide readOnly and MFA
CloudTrail allows log integrity check using SHA 256 hashing and digital signing.
DDoS: Distributed Denial of Service Attacks
Attacks happen at layer 3,4,6 & 7 (network/transport and application)
Spoof attack: Spoofing of Target IP as src sent out to mediators/reflectors which then tries to respond back to Src creating multiple magnified responses.
TCP SYN Attack: SYN --> SYN-ACK --> ACK. In this case, ACK is not sent by the attacker and connection are reserved making them unusable for any other use.
HTTP Flood Attacks: Emulates human interaction and sends HTTP traffic
Cache busting attack: by using different query string, bypassing the CDN to fetch results from origin servers
DNS Query Flood attacks: flooding the DNS with different DNS names
Mitigation Techniques:
AWS Shield: Free Standard edition by AWS for network atacks
Use R53, Cloudfront, WAF
AWS Shield Advance: provides may features for DDoS attacks
ALB/ELB doesn't allow UDP traffic
CDN closes SYN Attacks..half connections
WAF allows Web ACL rules
Reduce Surface Attacks: Minimize Internet access and reach. Use CDN ..no EIP..No DB in public subnet
Use CDN, ELB, Private Subnets, API GW for obfuscating resources.
WAF works at HTTP HTTPS Application layer for CDN and ALB, API GW
WAF blocks Cross Site Scripting XSS attack, SQL injection attacks
WACL have rules to allow, block or monitor (count)
3rd Party WAF is also a good solution to obfuscate internal components
Intrusion Detection System & Intrusion Protection System
Host Based and Network Based Intrusion Detection System
Promiscuous port/ network tapping/sniffers not allowed in AWS
Introduce IDS/IPS 3rd Party sandwich setup (ELB --> EC2 with IDS --> ELB (APP)) for security
RAM: Resource Access Manager...sharing of AWS resources among AWS accounts
Sharing of Transit G/W, Subnets, R53 rules allowed to share.
RAM eliminates the need to create duplicate resources in multiple accounts, reducing the operational overhead of managing those resources in every single account you own.
SNS: Push based
By Default, only topic creator can publish message but can provide access to other AWS users to publish message
SNS Mobile Push Notifications for Mobile notifications. Pop up
Register ur app and device with SNS
SQS: Pull based,used for decoupling the application
Delete the messages in the queue without deleting the queue
SQS messages can be processed and then forwarded to other SQS
Standard Queue: no sequence, duplication of msg possible, at least one delivery, high throughput
FIFO queue: limited throughput 300 TPS, exactly one delivery, FIFO
Polling mechanism: Short and Long Polling (preferred)
Short returns the response immediately even if queue is empty. Receivemessagewaittime is = 0
Short is default
LongPolling is preferred option. ReceiveMessageWaitTime> 0 and <=20 secs
Default retention period is 4 days with max 14 days and min 1 min
SQS queues can have priorities ..higher ones are handled fast
Create as many queues you want
SQS Visbility timeout is the time for which the message is locked by an instance for processing and no other instance would be able to pick it up. Only after visibility time out is done, message is sent back to queue.
30 secs bydefault upto 12 hours
Once ACK is received the message should be deleted from queue
SQS is regional component .HA in Multi AZ setup
IAM policies on SQS can decide who can send and receive messages
SNS topic can publish message to SQS as subscriber
Amazon Mechanical Turk:
Online crowdsourcing platform where requester can post work and worker
can accept and work on the same.
Amazon Rekognition:
Rekognition reads images and videos. Input can be uploaded as binary to rekognition or uploaded to S3.
Output of rekognition can be S3, Kinesis streams & SNS.
If input is Live video stream so need to use Kinesis Video Streams and output would be Kinesis streams.
Has recognize celebrities API. Can detect labels like tree, flower, table etc. Events like wedding, party. Landscapes.
DetectLabels API (Images) StartLabelDetection (Videos)
DetectFaces(Images) StartFaceDetections(Videos) CreateStreamProcessor(Streams)
People path ..activity tracking..only in videos. StartPersonTracking API
DetectText API to read from Images.
Jpg or png allowed for S3 stored images. Base64 for direct image
AWS Simple Workflow System: SWF
Orchestration of an asynchronous workflow system
SWF has workers. Workers performs the tasks and reports back
Workers perform tasks. Tasks can be done by person or machine
SWF has deciders which decides which activity tasks to perform.
SWF does long polling.
3 types of tasks:
Activity tasks: used by workers
Lambda Tasks: executes lambda
Decisions Tasks: used by Deciders to decide next actions
One domain SWF Workflow is independent of others
SWF is Task Oriented. SQS is Message Oriented
Workflow execution can run upto 1 year
AWS Step Function: Better SWF with Visual workflow
Automatic trigger and retries
Amazon states language
Activity Tasks (polls for activity ) and Service Tasks (calls another service push)
API G/W can trigger Step function with a specific state
AWS Application Discovery Service: Helps in migration effort
Integrated with Migration HUB
Agentless Discovery Service: Sits on VMWare and collects data on cpu, ram and disk io
Agent Based Discovery: Installed on Physical servers. Windows and linux supported
Information pushed to S3. Integrated with Amazon athena and quicksight.
Data exploration must be Turned On.
AWS Storage G/W:
Connects on-prem using iSCSI interface. VMDK file is installed on on-prem servers
File G/W: NFS type. Transfers file to AWS S3
Volume G/W:
Storage G/W: iSCSI type. Uses AWS for storage perspective (backups)
Cached G/W: Frequently accessed data is on-prem and rest of the data on AWS. Block mode
VTL G/W: Virtual Tapes storage in AWS
File G/W connection would need gateway to be created. Hyper V, vmware or computes.
Create a file share and mount the same on the servers.
Snowball Family:
Devices to do large scale data migration
Good choice for data > 10 TB
If the link can transfer data in a week..use link..more than that use Snowball
Snowball--> 50 (Only US) - 80 TB device. Plain import/export jobs
Snowball Edge: 100TB device with compute
Snowmobile: Exabyte storage truck. 1Exabyte = 1000 petabyte = 1000000 TB
AWS Migration HUB:
Services to do the migration. DB and Server Migration Services. Allows central tracking of these services. Used with Discovery services as well.
3 ways Migration HUB gets the data from on-prem:
- Migration HUB import
- Agentless Discovery Agent (VM)
- Agent Based Discovery Agent
AWS Server Migration Services:
Used for on-prem migration of server VMWare, HyperV or Azure virtual machines.
Server migration Connector gets on-prem data as AMIs. Uses Cloud Formation template to create the stack. So, AMIs --> CloudFormation . Template defines DB, App and other layers.
AWS Database Migration Services (DMS):
DMS is used for migration from on-prem to AWS and vice versa.
Also used to maintain the replica of DB.
Supports security at rest and in-transit
Creating indexes, primary keys and etc on target DB can be done with the help of SCT tool
Replication Task has the rules and actual tables defined.
SCT Tool is installed on-prem. Clones the source DB and uses the agent to copy data to S3. DMS copies from S3 to target.
DMS instance comes in 50GB or 100GB sizes.
DMS supports MultiAZ support.
Source & Target endpoints to have the connection of DB. You can test the connections
No CDC enabled for MS Azure SQL migration. IBM DB2 as source and not target.
MongoDB is document DB NoSQL. Document is row and collection of docs is collections ..table
MongoDB as source and not as target. Document and Table mode supported
Document DB, Redshift, Elasticsearch , kinesis cannot be source DB
MySQL, Oracle can be source as well as target for DMS.
AWS Amplify is a development platform for building secure, scalable mobile and web applications. It makes it easy for you to authenticate users, securely store data and user metadata, authorize selective access to data, integrate machine learning, analyze application metrics, and execute server-side code. Amplify covers the complete mobile application development workflow from version control, code testing, to production deployment, and it easily scales with your business from thousands of users to tens of millions.
Key Notes:
--> Custom Application should have listeners configured for TCP.
--> Additional ENIs have a mixed MAC address that does not change. ETH0 is the Primary ENI which cannot be detached.
--> you cannot point an A record to Load Balancer. Either Alias or CNAME.
--> A low RPO solution is asynchronous replication
--> for encrypting an EBS running volume, create a snapshot and launch a volume from that
--> On-prem and AWS CIDR Overlaps won't allow communication
--> Enhanced networking instance and EC2 IOPS types for low latency and hight n/w throughput
--> RAID 0 increases Read and write capacity
--> SSE-KMS is envelope encryption
--> SSH/RDP to limit access to ec2 NOT IAM roles
--> Advertise AWS Public Ips over Public VIF not Private VIF
--> During RDS Failover, DNS Record changes but not the RDS endpoint
--> Cross Region Read Replica exists for RDS and Aurora
--> EBS volume replication in the same region is also via snapshot
--> NAT G/W cannot be assigned security group. Can be assigned Elastic IP but not Public IP
--> 2 tunnels configured for each VPN Connection
--> you need one private VIF(VPC) and one public VIF (AWS services) for Direct connect.
--> DX connection with a VPN backup
--> IPV6 is globally unique..same as Public IP. Egress is related to IPV6.
--> instance store are virtual hard drive..ephemeral storage
--> you can increase the size of volume but cant scale down
--> bastion host allows SSH or RDP access. Can be assigned Public IP or Elastic IP. Sits in Public subnet and allows access to private subnet.
--> EC2 can have secondary ENI which can be detached and assigned to another EC2 incase of failover.
--> Copying the EBS volume from one AZ to another, create a snapshot and launch the EBS from it. EBS volumes are AZ specific.
--> For sharing encrypted snapshot with others , one would have to share the CMK key permissions as well
--> ELB routes the traffic to Eth0 primary IP of ur EC2. ELB needs 8 IP address to scale.
--> NLB doesn't support Lambda target type
--> you cannot read/write to StandBy RDS instance
--> Snapshots and Auto Backups done on StandBy Instance. Only Manual snapshots can be shared..not automatic ones
--> The first snapshot is full and after that it's incremental backups. DB Transaction logs allows upto 5 mins of RPO
--> you cannot restore into an existing DB..it has to be a new DB instance.
--> Automatic backups must be enabled for Read Replica to work
--> RDS Read Replicas can be in a different region as well. Based on automatic snapshots being copied asynchronously.
--> you cannot encrypt the DB instance on the go. Create a snapshot--> copy the snapshot and encrypt it and create a DB instance out of it.
--> you can't disable encryption as well.
--> IAM DB authentication works with My SQL and PostGre SQL.
--> AWS doesn't support IP Multicast
--> Redshift can load data only from S3 Standard.
--> Aurora spans Multi AZ in one region..maintains at least 6 copies
--> Failover priority for Aurora replicas. 15 read replicas supported.
--> Aurora Clusters can have single instance or multiple instance behind it. Aurora clusters are created inside the VPC.
--> Aurora Global cluster has a primary cluster in one region and readonly cluster in another region. 16 replicas in other region.
--> Backtracking takes the data backup to a point in time. Doesn't need a new DB instance to recover.
--> Cloudtrail event history is logged for 90 days. Cloud trail logging integrity check via SHA hashing on S3
--> Log retention of cloudwatch 1 day to 10 years. Auto delete. CW Logs encrypted at Rest and In-transit.
--> Unified cloudwatch agent is preferred over old one.
--> Create custom metric from CW logs --> Create Alarm using that metric filter --> Attach an EC2 instance/ASG or SNS topic with that
--> Logs could be sent to cross account using kinesis streams
--> S3 Max size is 5TB. Multipart upload for more than 5GB. Preferred for > 100MB
--> Standard IA for backups. 30 day minimum charge for storage
--> Glacier for Archives with retrieval time of minutes. Min storage is 90 days. Can sustain data loss in 2 facilities. Doesn't maintain metadata. Use a DB for that.
--> Deep Archive ..not for real time retrieval. 180 days of min storage.
--> RRS. Frequently accessed non critical data. Not recommended.
--> S3 uses SSE and KMS encryption. S3 Static hosting allows redirection but only http
--> Pre-Signed URLs to provide specific object access to users for limited time.
--> For S3 Cross Region or same region replication, the bucket versioning must be enabled.
--> S3 replicated the objects, metadata and the tags as well. Replicates the delete marker but doesn't delete.
--> S3 transfer acceleration used for uploading objects over internet.
--> S3 provides 3500 PUT/COPY/POST/DELETE per seconds and 5500 GET/HEAD. Prefixes created multiply the capacity.
--> Elemental MediaStore is caching and distribution for video workflows. Delivers videos from S3.
--> S3 Select to query data in S3 if stored in CSV, JSON format. Glacier Select for Glacier.
--> Permission to enable server accesss log only via bucket ACL not by policy. ACLs are handy for object level permissions.
--> Requester pay buckets for charging requester to pay for access or downloading
-->EFS is POSIX (UNIX) compliant and needs to be mounted on EC2 or On-prem servers. Suited for BigData & Analytics. Low latency file operations.
--> EFS can be mounted on linux on-prem only
--> Amazon FSx is windows file share. Windows shadow copies are point in time snapshot stored in S3.
--> Lustre is Open source high performance computing. Linux based DFS. Not durable
--> cname cannot be created for naked zone apex.
--> Traffic flow policy only for public hosted zone
--> S3 only allows HTTP protocol for Origin access from Cloudfront.
--> Invalidate Only Web objects from CDN
--> EMR is not for real time ingestion or large data
--> Redshift supports Single AZ
--> EMR leverages Apache Hadoop + Hive . SQL based query and support for unstructured data.
--> EMR kinesis connector to read streaming data from kinesis to EMR for processing
--> Glue Crawlers identifies metadata and populates the glue catalog used by Glue to transform the data
--> SQS Queue can be encryption enabled.
--> AWS Serverless Application Module is an extension of Cloudformation.
--> AWS Batch uses ECS Container to run jobs
--> You cannot have mount points for the same EFS file systems in different VPC. Only 1 at a time.
--> Amazon FSx has Standby file server in a different AZ synchronously replicated.
--> NAT GW uses only Elastic IPs. No Public IP
--> Session stickiness/affinity can be application controlled or LB controlled. Cookie header inserted.
--> ALB supports only load balancer cookies. Cookie name AWSALB
--> DAX for Dynamo DB is Only for eventual consistent and deployed on EC2 instance with a DAX agent
--> SCP has no effect on the Primary/Master account though it is applied at root level. Whitelisting or Blacklisting policies at SCP.
--> AWS Organization creates Service Linked Role in each of member accounts to access.
--> IAM roles Trust Policies have principals which are Trusted Entities. Such as federated: saml-provider/ADFS or service: ec2 or lambda
--> RDS Read Replicas are created using the snapshot from primary DB or Standby (MultiAZ). Asynchronously update
--> Aurora Serverless clusters are always encrypted
--> Cloudformation has Creation policy which allows waitCondition to delay resource creation. Deletion policy to delete, retain or snapshot resource.
--> blue green deployment is an Active StandBy configuration
--> Cloudwatch Synthetics are used for running Canaries Script to automate user/customer actions for the service.
--> Cloudwatch ServiceLens is used for endtoend debugging with Cloudwatch + Xray
--> AWS Service Catalog allows organizations to create and manage catalogs of IT services that are approved for use on AWS. Also, maintain the catalog
portfolio stack which are cloud formation stack ready to build common stacks.
--> AWS Resource DataSync allows to collect data configuration from multiple resources.
--> AWS DNS Doesn't support DNSSEC (Domain Name System Security Extensions). Use 3rd Party DNS Provider for this support.
--> AWS Systems Manager Patch Manager automates the process of patching managed instances with security-related updates.
Patch Manager uses patch baselines, which include rules for auto-approving patches. A patch group is an optional means of organizing instances for patching.
For example, you can create patch groups for different operating systems (Linux or Windows), different environments (Development, Test, and Production),
or different server functions (web servers, file servers, databases).
--> If you want a single store for configuration and secrets, you can use Parameter Store. If you want a dedicated secrets store with lifecycle management, use Secrets Manager.
--> To use a certificate with Elastic Load Balancing for the same site (the same fully qualified domain name, or FQDN, or set of FQDNs) in a different Region,
you must request a new certificate for each Region in which you plan to use it. To use an ACM certificate with Amazon CloudFront,
you must request the certificate in the US East (N. Virginia) region.
--> DynamoDB Global Table. A replica table (or replica, for short) is a single DynamoDB table that functions as a part of a global table. Each replica stores the same set of
data items. Any given global table can only have one replica table per AWS Region.
--> AWS CloudHSM is a cloud-based hardware security module (HSM) that enables you to easily generate and use your own encryption keys on the AWS Cloud.
Handles SSL offloading as well and for HA, needs 2 subnets
--> can't deploy an application to your on-premises servers using Elastic Beanstalk
--> AWS Resource Access Manager (AWS RAM) enables you to share specified AWS resources that you own with other AWS accounts. To enable trusted access
with AWS Organizations: From the AWS RAM CLI, use the enable-sharing-with-aws-organizations command.
Name of the IAM service-linked role that can be created in accounts when trusted access is enabled: AWSResourceAccessManagerServiceRolePolicy
--> Rehost (“lift and shift”) - In a large legacy migration scenario where the organization is looking to quickly implement its migration and scale to meet a business case,
we find that the majority of applications are rehosted. Most rehosting can be automated with tools such as AWS SMS although you may prefer to do this manually as
you learn how to apply your legacy systems to the cloud.
You may also find that applications are easier to re-architect once they are already running in the cloud. This happens partly because your organization will have developed better skills to do so and partly because the hard part - migrating the application, data, and traffic - has already been accomplished.
--> Replatform (“lift, tinker and shift”) -This entails making a few cloud optimizations in order to achieve some tangible benefit without changing the core architecture of the application. For example, you may be looking to reduce the amount of time you spend managing database instances by migrating to a managed relational database service such as Amazon Relational Database Service (RDS), or migrating your application to a fully managed platform like AWS Elastic Beanstalk.
--> By default, the data in a Redis node on ElastiCache resides only in memory and is not persistent. If a node is rebooted, or if the underlying physical
server experiences a hardware failure, the data in the cache is lost.
If you require data durability, you can enable the Redis append-only file feature (AOF). When this feature is enabled, the node writes all of the commands that change cache data to an append-only file. When a node is rebooted and the cache engine starts, the AOF is "replayed"; the result is a warm Redis cache with all of the data intact.
-->Turn off the Reserved Instance (RI) sharing on the master account for all of the member accounts.
--> You can use AWS SAM with a suite of AWS tools for building serverless applications. To build a deployment pipeline for your serverless applications, you can use CodeBuild, CodeDeploy, and CodePipeline. You can also use AWS CodeStar to get started with a project structure, code repository, and a CI/CD pipeline that's automatically configured for you. To deploy your serverless application, you can use the Jenkins plugin, and you can use Stackery.io's toolkit to build production-ready applications.
--> You can improve performance by increasing the proportion of your viewer requests that are served from CloudFront edge caches instead of going to your origin servers for content; that is, by improving the cache hit ratio for your distribution. To increase your cache hit ratio, you can configure your origin to add a Cache-Control max-age directive to your objects, and specify the longest practical value for max-age. The shorter the cache duration, the more frequently CloudFront forwards another request to your origin to determine whether the object has changed and, if so, to get the latest version.
--> you can set up an origin failover by creating an origin group with two origins with one as the primary origin and the other as the second origin which CloudFront automatically switches to when the primary origin fails.
--> Modifying the enableDnsHostNames attribute of your VPC to false and the enableDnsSupport attribute to true is incorrect because with this configuration, your EC2 instances launched in the VPC will not get public DNS hostnames.
--> SCPs DO NOT affect any service-linked role. Service-linked roles enable other AWS services to integrate with AWS Organizations and can't be restricted by SCPs.
--> You can bring part or all of your public IPv4 address range from your on-premises network to your AWS account. You continue to own the address range, but AWS advertises it on the Internet. After you bring the address range to AWS, it appears in your account as an address pool. You can create an Elastic IP address from your address pool and use it with your AWS resources, such as EC2 instances, NAT gateways, and Network Load Balancers. This is also called "Bring Your Own IP Addresses (BYOIP)".
--> Amazon Connect provides a seamless omnichannel experience through a single unified contact center for voice and chat. Contact center agents and managers don’t have to learn multiple tools, because Amazon Connect has the same contact routing, queuing, analytics, and management tools in a single UI across voice, web chat, and mobile chat.
--> Amazon Lex is a service for building conversational interfaces into any application using voice and text. Amazon Lex provides the advanced deep learning functionalities of automatic speech recognition (ASR) for converting speech to text, and natural language understanding (NLU) to recognize the intent of the text, to enable you to build applications with highly engaging user experiences and lifelike conversational interactions.
--> Amazon Redshift workload management (WLM) enables users to flexibly manage priorities within workloads so that short, fast-running queries won't get stuck in queues behind long-running queries. Amazon Redshift WLM creates query queues at runtime according to service classes, which define the configuration parameters for various types of queues, including internal system queues and user-accessible queues.
--> By setting up cross-account access in this way, you don't need to create individual IAM users in each account.
--> installing the SSM Agent to all of your instances is also required when using the AWS Systems Manager Patch Manager.
--> the best solution is to use a combination of CloudFront, Elastic Load Balancer and SQS to provide a highly scalable architecture. SQS decouples the service. Reduces cost.
--> Inter-Region VPC Peering provides a simple and cost-effective way to share resources between regions or replicate data for geographic redundancy.
--> Adding tags to the EC2 instances in the production environment and adding resource-level permissions to the developers with an explicit deny on terminating the instance
which contains the tag
--> SSE-S3 provides strong multi-factor encryption in which each object is encrypted with a unique key. It also encrypts the key itself with a master key that it rotates regularly accurately describes how SSE-S3 encryption works.
--> SCP does not grant any permissions, unlike an IAM Policy. SCP policy simply specifies the services and actions that users and roles can use in the accounts.
--> If you have a VPC peered with multiple VPCs that have overlapping or matching CIDR blocks, ensure that your route tables are configured to avoid sending response traffic
from your VPC to the incorrect VPC. Add a static route on VPC A's route table with a destination of 10.0.0.0/24 and a target of pcx-aaaabbbb. The route for 10.0.0.0/24 traffic
is more specific than 10.0.0.0/16, therefore, traffic destined for the 10.0.0.0/24 IP address range goes via VPC peering connection pcx-aaaabbbb instead of pcx-aaaacccc.
--> In AWS Storage Gateway, your iSCSI initiators connect to your volumes as iSCSI targets. Storage Gateway uses Challenge-Handshake Authentication Protocol (CHAP) to
authenticate iSCSI and initiator connections. CHAP provides protection against playback attacks by requiring authentication to access storage volume targets
--> AWS Systems Manager Patch Manager automates the process of patching managed instances with security-related updates.
--> won't be able to upload their articles to the Read Replicas in the event that the primary database goes down.
--> Cloudfront signed cookies feature is primarily used if you want to provide access to multiple restricted files, for example, all of the files for a video in HLS format or all
of the files in the subscribers' area of website.
--> If you have multiple VPN connections, you can provide secure communication between sites using the AWS VPN CloudHub. This enables your remote sites to communicate with each other, and not just with the VPC.
--> The deployment services offer two methods to help you update your application stack, namely in-place and disposable. An in-place upgrade involves performing application updates on live Amazon EC2 instances. A disposable upgrade, on the other hand, involves rolling out a new set of EC2 instances by terminating older instances.
--> Hybrid Deployment model: combines the simplicity of managing AWS infrastructure provided by Elastic Beanstalk and the automation of custom network segmentation
provided by AWS CloudFormation.
--> AWS Control Tower is best suited if you want an automated deployment of a multi-account environment with AWS best practices. If you want to define your own custom
multi-account environment with advanced governance and management capabilities, we would recommend AWS Organizations
--> AWS organization entities are globally accessible, similar to how AWS Identity and Access Management (IAM) works today.
--> You cannot change which AWS account is the master account.
--> ELB cannot have EIP or Static IP attached.
--> OpsCenter is a Systems Manager capability that provides a central location where operations engineers, IT professionals, and others can view, investigate, and
resolve operational issues related to their environment.
--> EMR doesn't provide Detailed Monitoring. ELB, R53, RDS does
--> IPSec Tunnel provides data encryption across internet, protection of data in transit, peer identity auth vpn GW and Customer GW and data integrity protection.
--> ARN syntax arn:aws:service:region:account:resource. For IAM, region is left blank.
--> Default Security Group allows No Inbound Traffic. Allows All Oubound Traffic. Allows instances to communicate which have same SG.
--> ApplyImmediately to apply changes on RDS intances.
--> Amazon cognito maintains the last written version of the data using synchronize() method. Cognito uses SNS Push () to send notifications to devices.
-->AWS IAM passwords can contain the Basic Latin characters. Policy names cannot have /,\, * or ?. These are reserved. Path names should start and end with /. Max length 64 chars
--> AWS Doesn't Auto assign Public IP to Instance with Multiple ENIs.
--> you can create a VPC with multiple subnets and assign users to have access to only their specific subnet.
--> ELB supports only 1 Subnet from 1 AZ
--> IAM also has NotPrincipal in Policy.
--> Minimum storage for Provisioned IOPS MySQL RDS is 100 GB and min IOPS is 1000.
--> DataPipeline retries are allowed upto 10 retries.
--> SNS doesn't push notifications to Microsoft Windows Mobile Messaging. Microsoft Push Notification is supported.
--> as-describe-launch-configs-show-long shows launch config name, instance type, ami id.
--> ElasticCache stores critical piece of data in memory for low latency access.
--> Exception to a list of actions in IAM policy is provided by NotAction. MultiFactorAuthAge to check in seconds last MFA action.
--> Amazon ElasticCache's cache security group are applicable to cache clusters running outside of VPC.
--> PIOPS EBS Volumes supports 4GiB to 16 TiB and provision upto 20000 IOPS. But the ratio should be max 30. Now, it is 50:1
--> AWS Elastic Beanstalk provides several options for how deployments are processed, including deployment policies
(All at once, Rolling, Rolling with additional batch, and Immutable)
--> API GW Throttling error code is 429 while lambda or integration connection gives 504
--> In CloudFront, there are 3 options that you can choose as the value for your Origin Protocol Policy: HTTP Only, HTTPS Only and Match Viewer.
--> Amazon Inspector enables you to analyze the behavior of your AWS resources and helps you to identify potential security issues. Using Amazon Inspector, you can
define a collection of AWS resources that you want to include in an assessment target. You can then create an assessment template and launch a security assessment
run of this target. Just assessment..no capability to change or update.
--> In Redshift, if your query operation hangs or stops responding: Connection to the Database Is Dropped--> Reduce the size of maximum transmission unit (MTU)
Connection to the Database Times Out--> hang or timeout when running long queries, such as a COPY command
Client-Side Out-of-Memory Error--> ODBC or JDBC Out of Memory. There Is a Potential Deadlock --> Check STV_LOCKS and STL_TR_CONFLICT.
Use the PG_CANCEL_BACKEND and PG_TERMINATE_BACKEND.
--> Amazon MQ is a managed message broker service for Apache ActiveMQ that makes it easy to set up and operate message brokers in the cloud. Connecting your current
applications to Amazon MQ is easy because it uses industry-standard APIs and protocols for messaging, including JMS, NMS, AMQP, STOMP, MQTT, and WebSocket
--> ALB Only supports http/https protocol..no tcp/tls
--> Amazon AppStream 2.0 is a fully managed application streaming service. Suited for standalone desktop applications.
--> There are a lot of available AWS Managed Policies that you can directly attach to your IAM Users, such as Administrator, Billing, Database Administrator, Data Scientist,
Developer Power User, Network Administrator, Security Auditor, System Administrator and many others
--> The following scenarios highlight patterns that may not be well suited for blue/green deployments:
Are your schema changes too complex to decouple from the code changes? Is sharing of data stores not feasible?
Does your application need to be "deployment aware"?
Does your commercial off-the-shelf (COTS) application come with a predefined update/upgrade process that isn’t blue/green deployment friendly?
--> Oracle RAC is not supported by RDS. RMAN
--> Create a new CloudFront web distribution and configure it to serve HTTPS requests using dedicated IP addresses in order to associate your alternate domain names with
a dedicated IP address in each CloudFront edge location.
--> Code deploy provides Canary deployment configuration, the traffic is shifted in two increments.
--> Tape Gateway which will back up your data in Amazon S3 and archive in Amazon Glacier using your existing tape-based processes.
--> A service control policy (SCP) is a policy that specifies the services and actions that users and roles can use in the specified AWS accounts.
SCPs are similar to IAM permission policies except that they don't grant any permissions. Instead, SCPs specify the maximum permissions for an organization,
organizational unit (OU), or account.
--> you can configure more than one load balancer with an autoscaling group.
--> If an Auto Scaling group is launching more than one instance, the cool down period for each instance starts after that instance is launched. The group remains locked until
the last instance that was launched has completed its cool down period. In this case the cool down period for the first instance starts after 3 minutes and finishes at the 10th
minute (3+7 cool down), while for the second instance it starts at the 4th minute and finishes at the 11th minute (4+7 cool down). Thus, the Auto Scaling group will receive
another request only after 11 minutes
--> Tags are assigned automatically to the instances created by an Auto Scaling group.
--> If you have a running instance using an Amazon EBS boot partition, you can also call the Stop Instances API to release the compute resources but preserve the data on the boot
partition
--> When a user launches an Amazon EBS-backed dedicated instance, the EBS volume does not run on single-tenant hardware.
--> If there is more than one rule for a specific port, we apply the most permissive rule. For example, if you have a rule that allows access to TCP port 22 (SSH) from IP address
203.0.113.1 and another rule that allows access to TCP port 22 from everyone, everyone has access to TCP port 22.
--> The tag key cannot have a prefix as "aws:", although it can have only "aws".
--> The Amazon EC2 console provides a "Launch more like this" wizard which copies Instance Type, AMI, user-data, tags, placement group
--> You are charged for the stack resources for the time they were operating (even if you deleted the stack right away)
--> CloudFormation: Actual resource names are a combination of the stack and logical resource name.
--> To connect to Amazon Virtual Private Cloud (Amazon VPC) by using AWS Direct Connect, you must first do the following:
Provide a private Autonomous System Number (ASN) to identify your network on the Internet. Amazon then allocates a private IP address in the 169.x.x.x range to you.
Create a virtual private gateway and attach it to your VPC
--> IKE Security Association is established first between the virtual private gateway and customer gateway using the Pre-Shared Key as the authenticator.
--> To establish redundant VPN connections and customer gateways on your network, you would need to set up a second VPN connection. However, you must ensure that the
customer gateway IP address for the second VPN connection is publicly accessible.
--> DynamoDB Local Secondary Indexes can only be created while Table creation. DynamoDB uses JSON only as a transport protocol, not as a storage format.
--> you can copy data from an Amazon DynamoDB table into Amazon Redshift.
--> To construct the mount target's DNS name, use the following generic form: availability-zone.file-system-id.efs.aws-region.amazonaws.com
--> Component is code--> Workload is set of components --> technical capability is set of workloads.
--> using the AWS Server Migration Service (SMS) and installing the Server Migration Connector to your on-premises virtualization environment.
--> Amazon WorkDocs is a fully managed, secure content creation, storage, and collaboration service. With Amazon WorkDocs, you can easily create, edit, and share content, and because it’s stored centrally on AWS, access it from anywhere on any device. Amazon WorkDocs makes it easy to collaborate with others, and lets you easily share content, provide rich feedback, and collaboratively edit documents.
--> EC2Rescue can help you diagnose and troubleshoot problems on Amazon EC2 Linux and Windows Server instances. You can run the tool manually or you can run the tool automatically by using Systems Manager Automation and the AWSSupport-ExecuteEC2Rescue document. The AWSSupport-ExecuteEC2Rescue document is designed to perform a combination of Systems Manager actions, AWS CloudFormation actions, and Lambda functions that automate the steps normally required to use EC2Rescue.
--> AWS Step Functions provides serverless orchestration for modern applications. Orchestration centrally manages a workflow by breaking it into multiple steps, adding flow logic, and tracking the inputs and outputs between the steps.
--> Data Pipeline is for batch jobs.
--> Implementing database caching with CloudFront is incorrect because you cannot use CloudFront for database caching. CloudFront is primarily used to securely deliver data, videos, applications, and APIs to customers globally with low latency and high transfer speeds.
--> Snowball is suitable for the following use cases:
Import data into Amazon S3, Export from Amazon S3,
On the other hand, Snowball Edge is suitable for the below:
Import data into Amazon S3, Export from Amazon S3, Durable local storage, Local compute with AWS Lambda, Local compute instances, Use in a cluster of devices
Use with AWS Greengrass (IoT), Transfer files through NFS with a GUI
--> If you got your certificate from a third-party CA, import the certificate into ACM or upload it to the IAM certificate store. Hence, AWS Certificate Manager and IAM certificate store are the correct answers.
--> You can use an AWS Direct Connect gateway to connect your AWS Direct Connect connection over a private virtual interface to one or more VPCs in your account that are located in the same or different regions. You associate a Direct Connect gateway with the virtual private gateway for the VPC, and then create a private virtual interface for your AWS Direct Connect connection to the Direct Connect gateway.
--> All at once – Deploy the new version to all instances simultaneously. All instances in your environment are out of service for a short time while the deployment occurs. If the deployment fails, a system downtime will occur.
Rolling – Deploy the new version in batches. Each batch is taken out of service during the deployment phase, reducing your environment's capacity by the number of instances in a batch. If the deployment fails, a single batch will be out of service.
Rolling with additional batch – Deploy the new version in batches, but first launch a new batch of instances to ensure full capacity during the deployment process. This is quite similar with Rolling option. If the first batch fails, the impact would be minimal.
Immutable – Deploy the new version to a fresh group of instances by performing an immutable update. If the deployment fails, the impact is minimal.
Blue/green deployment – Deploy the new version to a separate environment, and then swap CNAMEs of the two environments to redirect traffic to the new version instantly. If the deployment fails, the impact is minimal.
--> Memacached allows MultiThreaded execution unlike Redis.
--> You can change the placement group for an instance in any of the following ways:
Move an existing instance to a placement group. Move an instance from one placement group to another. Remove an instance from a placement group.
Before you move or remove the instance, the instance must be in the stopped state. You can move or remove an instance using the AWS CLI or an AWS SDK.
--> Cognito User Pool handles the user AuthN to provide temp credentials to access EC2, ECS, API. Cognito Identity pool provides AuthZ to allows other aws service access (roles)
--> EFS storage is 47.9 TB, S3 5TB and EBS size of EBS.
--> Only NLB provides Static and Elastic IP. SNI provided by ALB and NLB.
--> gp2 16000 IOPS, PIOPS 64000 IOPS
--> EC2 health check watches for instance availability from hypervisor and networking point of view. For example, in case of a hardware problem, the check will fail. Also, if an instance was misconfigured and doesn't respond to network requests, it will be marked as faulty.
ELB health check verifies that a specified TCP port on an instance is accepting connections OR a specified web page returns 2xx code. Thus ELB health checks are a little bit smarter and verify that actual app works instead of verifying that just an instance works.
--> Redis single AZ setup has Append Only File . MultiAZ setup has redis read replica. You can have only 1 active at a time
--> Redis cluster mode enabled(multiple shards and multi AZ setup) and disabled (single shard and multi AZ setup).
--> If data needs to be transferred from multiple location then use Transfer Acceleration (Cloudfront edge) or use Snowball or edge. If repeated file transfers, use Direct connect
--> S3 Transfer Acce supports downloads as well
--> SSM State Manager maintains the state of EC2 and Hybrid infra.
--> CloudFormation has templates, stack and change sets
--> OpsWork can auto heal ur stack.
--> CloudFormation and Beanstalk can Only create infra in AWS not on-prem. AWS OpsWork and CodeDeploy can create infra on-prem and in AWS.
--> Elastic N/W Adapter better for enhanced n/wing. Multiple ENIs shouldn't be used.
--> The maximum size of the data payload of a Kineis Data Stream record before base64-encoding is up to 1 MB.
--> AMIs are a regional resource. Therefore, sharing an AMI makes it available in that region. To make an AMI available in a different Region, copy the AMI to the Region and then share it.
--> AWS Organizations comes with All Features(includes SCP and etc) or Consolidated Billing mode (only billing share)
--> The native tools allow you to migrate your data with minimal downtime. For eg. Mysqldump for MySQL migration
--> Tape gateway in AWS Storage Gateway service is primarily used as an archive solution. Cannot access the files on Tape Gateway directly..use File GW.
--> You share resources in one account with users in a different account. By setting up cross-account access in this way, you don't need to create individual IAM users in each account. U create role in Prod account , assign dev account role as trustee.
--> SCP Policies: Users and roles must still be granted permissions with appropriate IAM permission policies. A user without any IAM permission policies has no access, even if the applicable SCPs allow all services and all actions.
--> AWS tags are case sensitive. Pro-Active taggings are done using AWS Cloudformation , AWS Service Catalog. AWS IAM can allow/disallow service creation if tags are not there.
--> You can use AWS Config with CloudWatch Events to trigger automated responses to missing or incorrect tags.
--> Direct Connect provides consistent performance and latency for hybrid workloads and predictable performance.
--> If your exising backup software does not natively support cloud storage for backup or archive, you can use a storage gateway device, such as a bridge, between the backup software and Amazon S3 or Amazon Glacier.
--> AWS DataSync is a data transfer service that makes it easy for you to automate moving data between on-premises storage and Amazon S3, Amazon Elastic File System (Amazon EFS), or Amazon FSx for Windows File Server.
--> When migrating from one database source or version to a new platform or software version, AWS Database Migration Service keeps the source database fully operational during the migration, minimizing downtime to applications that rely on the database.
-->EC2 Failover to a replacement instance or (running) spare instance by remapping your elastic IP address to the new instance. An Elastic IP address is a static, public, IPv4 address allocated to your AWS account.
--> EBS volumes can be attached to a running EC2 instance and can persist independently from the instance.
--> Because snapshots represent the on-disk state of the application, care must be taken to flush in-memory data to disk before initiating a snapshot.
--> When you create a member account using the AWS Organizations console, AWS Organizations automatically creates an IAM role named OrganizationAccountAccessRole in the account. This role has full administrative permissions in the member account.
--> You can use trusted access to enable an AWS service that you specify, called the trusted service, to perform tasks in your organization and its accounts on your behalf.When you enable access, the trusted service can create an IAM role called a service-linked role in every account in your organization whenever that role is needed.
 
Brilliant Notes thanks so much !!
ReplyDelete