Archives:

bigdata OLTP , OLAP No ratings yet.

row-based vs col based db or format row based –> good for OLTP ( transcation),   e.g: cassendra col based –> good for OLAP (? easy to aggreation etc?), druid Parquet ( column based data format): https://www.jumpingrivers.com/blog/parquet-file-format-big-data-r/ https://www.upsolver.com/blog/apache-parquet-why-use   hadoop: big data storage, what is the alternatives? S3 on cloud? https://www.alluxio.io/learn/hdfs/basic-file-operations-commands/ https://stackoverflow.com/questions/31011078/data-retention-in-hadoop-hdfs   pinot vs cassandra • Read More »


grpc deep drive No ratings yet.

GRPC   client side msg: header, msg, EOS server side msg:  header, msg, msg,  Trailer over http2 keep live? c++ async or sync? https://grpc.io/docs/languages/cpp/async/ , does it provider more performance than sync one? https://github.com/grpc/grpc/blob/v1.41.0/examples/cpp/helloworld/greeter_async_server.cc https://stackoverflow.com/questions/68767309/difference-between-sync-and-async-grpc https://docs.microsoft.com/en-us/dotnet/standard/async-in-depth https://www.hellsoft.se/understanding-cpu-and-i-o-bound-for-asynchronous-operations/ CPU Bound means the rate at which process progresses is limited by the speed of the CPU. A task that • Read More »


bigdata platform with Kubernets or Hadoop No ratings yet.

Hadoop: Hadoop kubernets MapReduce Spark on K8s Flink stream HDFS S3? any better one Resource manager Yarn/Mesos K8s itself   During its evolution phase, Hadoop provided three main functionalities that made it a Big Data-ready solution: a distributed computer mechanism (MapReduce), a robust data storage (HDFS), and a resource manager (YARN/Mesos). But modern technologies now • Read More »