Hey Hey Hey



AWS - Redshift

Posted on 2020-12-31 |

Reference

https://youtu.be/lj8oaSpCFTc

Terminology

  • Massively parallel, shared-nothing, columnar architecture

Best Practices: Encoding & Compression

https://youtu.be/lj8oaSpCFTc?t=657

  • Use AZ64

Basics

Read more »

AWS - Reference Case, API First

Posted on 2020-12-29 |

https://aws.amazon.com/blogs/aws/introducing-amazon-managed-workflows-for-apache-airflow-mwaa/

https://aws.amazon.com/blogs/containers/how-affirm-uses-aws-fargate-and-apache-airflow-to-manage-batch-jobs/

Read more »

AWS - BlogList

Posted on 2020-12-29 |

https://aws.amazon.com/blogs/big-data/accessing-and-visualizing-external-tables-in-an-apache-hive-metastore-with-amazon-athena-and-amazon-quicksight/

https://aws.amazon.com/blogs/big-data/setting-up-automated-data-quality-workflows-and-alerts-using-aws-glue-databrew-and-aws-lambda/

https://aws.amazon.com/blogs/big-data/optimizing-amazon-emr-for-resilience-and-cost-with-capacity-optimized-spot-instances/

https://aws.amazon.com/blogs/big-data/amazon-emr-studio-preview-a-new-notebook-first-ide-experience-with-amazon-emr/

https://aws.amazon.com/blogs/big-data/apply-record-level-changes-from-relational-databases-to-amazon-s3-data-lake-using-apache-hudi-on-amazon-emr-and-aws-database-migration-service/


https://aws.amazon.com/blogs/big-data/orchestrating-analytics-jobs-by-running-amazon-emr-notebooks-programmatically/

Read more »

AWS - HPC

Posted on 2020-08-05 |

References

  • AWS re:Invent 2019: [REPEAT 1] HPC on AWS: Innovating without infrastructure constraints (CMP204-R1)

https://youtu.be/g70bvcGlPY4

  • AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cloud (CMP318)

https://youtu.be/x7M3m1jZ7L8

  • https://youtu.be/0bGZdqx6w1Q
  • https://youtu.be/tHylCR0NIwU
Read more »

AWS - KMS

Posted on 2020-08-03 |
  • With AWS managed keys, you cannot control key rotation; at the time of writing, rotation happens automatically every 3 years.
  • With customer managed keys (CMKs), you can turn on automatic rotation for symmetric keys; rotation then happens every year.
  • A symmetric CMK and the private key of an asymmetric CMK never leave KMS unencrypted.
  • How to choose between symmetric and asymmetric keys:

https://docs.aws.amazon.com/kms/latest/developerguide/symm-asymm-choose.html

Read more »

LoRaWAN

Posted on 2020-07-18 |

References

https://youtu.be/8Oxcp9wQQnk

Terminology

LoRa vs LoRaWAN

  • LoRa (Long Range) is the radio modulation technique; it is the physical layer (Layer 1)
  • LoRaWAN is the IoT networking protocol (MAC layer, Layer 2) built on LoRa technology

LoRa Pros/Cons

  • Pros: open ISM frequencies (433, 868, 915 MHz); free, no license required
  • Cons: interference; low data rate

Limitations / Parameters

Target: transmit messages over about 10 km with a battery that lasts for 2 years.

  • Frequency: pay attention to the band requirements of each country
  • Tx power (transmission power): 2-14 dBm / 5-20 dBm; the higher the power, the longer the distance the signal can cover
  • Bandwidth (125/250/500 kHz): the higher the bandwidth, the more data fits in one transmission, but the shorter the battery life and range and the greater the interference (to be verified); check the local laws
  • Spreading factor (7-12): the larger the spreading factor, the longer the range and the shorter the battery life
  • Coding rate: 4/5, 4/6, 4/7, 4/8.
    4/5 means every 4 bits of data are sent as 5 bits, so 1 bit is redundancy for error correction. More redundancy (for example 4/8) makes the transmission more robust, so data can be received over a longer distance, at the cost of battery life.
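
The trade-off between spreading factor, bandwidth, and coding rate can be made concrete with the commonly cited nominal LoRa bit-rate formula, Rb = SF × (BW / 2^SF) × CR. The sketch below is only an illustration under that assumption; the function name `lora_bit_rate` and its parameter names are hypothetical and do not come from the video.

```python
def lora_bit_rate(spreading_factor: int, bandwidth_hz: float,
                  coding_rate_denominator: int) -> float:
    """Nominal LoRa bit rate in bits per second (hypothetical helper).

    Assumes the commonly cited formula Rb = SF * (BW / 2**SF) * (4 / d),
    where d is the coding-rate denominator (5 for 4/5, 8 for 4/8).
    """
    if not 7 <= spreading_factor <= 12:
        raise ValueError("spreading factor must be between 7 and 12")
    if coding_rate_denominator not in (5, 6, 7, 8):
        raise ValueError("coding rate must be one of 4/5, 4/6, 4/7, 4/8")
    # One symbol carries SF bits and lasts 2**SF / BW seconds.
    symbol_rate = bandwidth_hz / (2 ** spreading_factor)
    # The coding rate 4/d is the fraction of transmitted bits that is payload.
    return spreading_factor * symbol_rate * 4 / coding_rate_denominator


if __name__ == "__main__":
    for sf in range(7, 13):
        print(f"SF{sf}, 125 kHz, CR 4/5: {lora_bit_rate(sf, 125_000, 5):.1f} bit/s")
```

Under this formula, SF7 at 125 kHz with coding rate 4/5 gives roughly 5.5 kbit/s, while SF12 gives under 300 bit/s. The longer airtime per message at high spreading factors is what costs battery life.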
Read more »

AWS - Security

Posted on 2020-06-16 |

Reference

https://youtu.be/u6BCVkXkPnM

AWS re:Inforce 2019: Security Best Practices the Well-Architected Way (SDD318)

Incident response

https://d1.awsstatic.com/whitepapers/aws_security_incident_response.pdf

Playbook vs Runbook: a runbook has more detail

https://docs.aws.amazon.com/guardduty/latest/ug/guardduty_finding-types-active.html

  • Predefined queries against CloudWatch Events

IAM

Read more »

AWS - Reference Case, API First

Posted on 2020-02-23 |

Reference

https://youtu.be/TKgml4bSiZA

Key Takeaways

  • No IT / Business separation
  • Cross functional teams
  • Born agile (DevOps)
  • TDD, automation, and ChatOps
  • Customer-centric design
  • CD

Architecture

Reference: Open Banking with HSBC

https://youtu.be/QNM9LVV_eI0

Read more »

AWS - Protection Ring

Posted on 2020-01-24 |

https://en.wikipedia.org/wiki/Protection_ring

Read more »

AWS - Kinesis

Posted on 2020-01-18 |

Reference

https://youtu.be/jKPlGznbfZ0

Why Streaming

  • Data loses value quickly over time
    • “Time critical decisions” need streaming data
    • Ingest data as it is generated, process it on the fly, and run real-time analytics, ML, alerts, or actions
  • Common streaming use cases
    • Smart home / automation / log / Data Lake / IoT
  • Real time analytics demo (User Dashboard)

Streams Producers and Consumers

Producer limits

  • Bandwidth limit: 1 MB/sec/shard
  • Record limit: 1,000 records/sec/shard; if many small records would hit this limit before the bandwidth limit, aggregate them into larger records
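
The two producer limits above imply a minimum shard count for a given write load. The sketch below is a rough sizing aid only; the function `shards_for_producers` and the constant names are hypothetical, and the sketch ignores consumer-side limits and headroom for traffic spikes.

```python
import math

# Per-shard write limits quoted in the notes above.
MAX_MB_PER_SEC_PER_SHARD = 1.0
MAX_RECORDS_PER_SEC_PER_SHARD = 1000


def shards_for_producers(mb_per_sec: float, records_per_sec: float) -> int:
    """Minimum shard count that satisfies both producer limits (hypothetical helper)."""
    by_bandwidth = math.ceil(mb_per_sec / MAX_MB_PER_SEC_PER_SHARD)
    by_records = math.ceil(records_per_sec / MAX_RECORDS_PER_SEC_PER_SHARD)
    # Whichever limit is hit first decides the shard count; never go below 1.
    return max(1, by_bandwidth, by_records)


if __name__ == "__main__":
    # 0.5 MB/sec spread over 10,000 tiny records is bound by the record limit.
    print(shards_for_producers(0.5, 10_000))
    # The same volume aggregated into 100 larger records fits in one shard.
    print(shards_for_producers(0.5, 100))
```

The second call illustrates why aggregation helps: the data volume is unchanged, but the record limit no longer drives the shard count.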

Normal consumer

  • The slowest consumer also affects the number of shards: you may need to add shards so that the slowest consumer can process messages concurrently and keep up with all of them

  • The fastest you can read data is one transaction per 200 ms

  • Multiple consumers share the limits of 5 transactions/sec/shard and 2 MB/sec/shard.

    • Multiple consumers decrease the throughput and increase the latency of each consumer
  • Workaround: use a master stream and copy it into slave streams, so that consumers do not share the limits of a single stream
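
The latency effect of sharing the read limit can be worked out with simple arithmetic. The sketch below assumes every consumer polls every shard at an equal rate; the function name `min_poll_interval_ms` is hypothetical.

```python
# Shared read limit per shard quoted in the notes above.
READ_TX_PER_SEC_PER_SHARD = 5


def min_poll_interval_ms(num_consumers: int) -> float:
    """Shortest interval at which each consumer can poll one shard.

    Assumes all consumers share the 5 transactions/sec/shard limit equally.
    """
    if num_consumers < 1:
        raise ValueError("need at least one consumer")
    return 1000.0 * num_consumers / READ_TX_PER_SEC_PER_SHARD


if __name__ == "__main__":
    for consumers in (1, 2, 3, 5):
        print(f"{consumers} consumer(s): poll every {min_poll_interval_ms(consumers):.0f} ms")
```

A single consumer can poll every 200 ms, while three consumers on the same stream can each poll only every 600 ms, which is why copying the stream per consumer reduces latency.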

Read more »
Rachel Rui Liu

© 2021 Rachel Rui Liu