Episode 43 : DevoxxFr, Kafka, AWS, Microsoft CosmosDB, AML

15/05/2017    bigdata 

Kafka

Confluent Cloud : Managed Apache Kafka par Confluent
https://www.confluent.io/confluent-cloud/
https://www.forbes.com/sites/alexkonrad/2017/05/08/confluent-brings-kafka-to-cloud-and-challenges-aws/amp/

Kafka with Docker: A Docker introduction
https://ngeor.wordpress.com/2017/03/25/kafka-with-docker-a-docker-introduction/amp/

Apache Flink and Apache Kafka Streams: a comparison and guideline for users
https://www.confluent.io/blog/apache-flink-apache-kafka-streams-comparison-guideline-users/

The Continued Rise of Apache Kafka
https://redmonk.com/fryan/2017/05/07/the-continued-rise-of-apache-kafka/

Kafka Summit - Introduction to Kafka Streams with a Real-Life Example by Alexis Seigneurin
https://speakerdeck.com/aseigneurin/kafka-summit-introduction-to-kafka-streams-with-a-real-life-example

Webinar Boontadata avec @benjguin du 10/05/17 (replay bientôt disponible)
https://aka.ms/wp-boontadata

Microsoft

Serving AI with data: A summary of Build 2017 data innovations
https://blogs.technet.microsoft.com/dataplatforminsider/2017/05/10/serving-ai-with-data-a-summary-of-build-2017-data-innovations/

Azure Cosmos DB: The industry’s first globally-distributed, multi-model database service
https://azure.microsoft.com/en-us/blog/azure-cosmos-db-microsofts-globally-distributed-multi-model-database-service/

Using Jupyter notebooks and Pandas with Azure Data Lake Store
https://medium.com/azure-data-lake/using-jupyter-notebooks-and-pandas-with-azure-data-lake-store-48737fbad305

End-to-End Scenarios Enabled by the Data Science Virtual Machine: Webinar Video
https://blogs.technet.microsoft.com/machinelearning/2017/05/02/end-to-end-scenarios-enabled-by-the-data-science-virtual-machine-video/

AWS

AWS now lets you migrate MongoDB databases to DynamoDB
https://venturebeat.com/2017/04/10/aws-now-lets-you-migrate-mongodb-databases-to-dynamodb/amp/

Deep Dive on Amazon EC2 Instances - January 2017 Online Tech Talks
https://www.youtube.com/watch?v=29QZPttiKJA

Datascience

Automated Machine Learning — A Paradigm Shift That Accelerates Data Scientist Productivity @ Airbnb https://medium.com/airbnb-engineering/automated-machine-learning-a-paradigm-shift-that-accelerates-data-scientist-productivity-airbnb-f1f8a10d61f8

Divers

Managed Service for Elassandra provided by Instaclustr https://www.instaclustr.com/blog/2017/05/09/managed-service-elassandra-provided-instaclustr/

The new BigData file format for Faster Data analysis http://carbondata.apache.org/

Elasticsearch succombe au machine learning http://www.silicon.fr/elasticsearch-succombe-au-machine-learning-174421.html

GDPR

La conformité un avantage compétitif http://www.zdnet.fr/actualites/la-conformite-un-avantage-competitif-39850544.htm

Privacy by design http://www.zdnet.fr/actualites/privacy-by-design-kezako-39850666.htm

Contenus liés

Nuage de tags

bigdata cloud aws ai news postgresql kubernetes azure cassandra interview databricks snowflake timeseries spark kafka france warp10 python apache dbt google grafana llm microsoft ovhcloud hadoop sql bigquery docker nosql pulsar trino data-science ia mongodb flink foundationdb duckdb influxdb timescale clickhouse googlecloud redis rust terraform elastic scaleway arm datastax gcp java mysql s3 sqlite confluent data database datalake ml nvidia quickwit rgpd serverless github influxdata iot machine-learning mlops clever-cloud databases europe lakehouse superset vscode cdc cloudera cnil cockroach facebook hashicorp machine_learning opensource prometheus search spanner sécurité arrow aurora catalog cockroachdb datascience dataviz datawarehouse delta gdpr haskell huggingface jupyter meta notebook openai pandas parquet pinot redshift souveraineté streaming yugabyte airbyte airflow apple architecture cloud-souverain cncf copilot couchbase data-mesh delta-lake devoxxfr docker-compose etcd etl feature-store gaia-x gke golang jetbrains kestra metabase nocode oss palantir pycaret raft redpanda scikit-learn senx agpl aiven beam bookkeeper chatgpt cloudflare compose datacontract dataiku datamesh datatask dynamodb eks elasticsearch firebolt french genai gitpod gpu ibm iceberg jepsen kernel lambda lucene mesos mimir netapp netflix opensearch pgsql postgres powerbi privacy pytorch questdb r raspberrypi rds scylla storage streamlit talend timescaledb traefik vector zookeeper algolia amd analytics analytique anthos api atlassian bi biglake blockchain bloom-filter consul containerd covid19 cve dagger dagster data-engineering datadog dataflow dataquality datastack delta-live delta-sharing discovery doctolib ebpf ec2 elixir elt excel exoscale fabric flows git google-analytics helm hudi istio json linux log4j maif malloy mapr mqtt neo4j nlp nomad oracle orchestration ovh phoenix presto prestosql privacy-shield prophet quantique rabbitmq risc-v salesforce shell slack sncf ssd synapse tableau tikv time-series 2019 accord aks alertes amazon astradb atos automl azure-ml babelfish benchmark bintray cabourotte centos chine citus cloud-de-confiance collibra cookie cosmosdb couchdb cube dash data-catalog data-engineer data-quality datageek datagouv dataops dataplex dataproc datarobot deep-fake deep-learning deltalake deployment dhakira dremio druid emr euclidia event faq fastapi faunadb flask frenchtech gaiax gartner gitlab gladia gpt-3 health-data-hub hive iam ibis ingestion inria instacluster instaclustr intellij ipv6 jcenter jdk jfrog julia k8ssandra kapacitor knative kotlin licence live log log4shell lsf m3db machinelearning memcached memsql metaverse micro-service microservice microsoft-sql-server minio mirabelle mistral mlflow n8n nft nodejs noel npm nrtsearch okta openjdk openmetadata operator orange pandera paxos planetscale podman prestodb prompt qemu qovery r2 radar registryops reverse-etl rlang rocksdb rook sagemaker scalabilité scylladb secnumcloud sigfox small-data smalldata solr spring sql-server ssh stack-overflow starburst starbust stargate state statefulset streamnative système-distribué tabular telegraf tempo test thales thematique timestream uber usa vault vectordb velero vitess voltron warpstudio wasm wifi zig 2017 accenture actors actu acv adoptium adoptopenjdk aeure agi agrocd akami akhp akka alerts alibaba alloydb allydb almalinux amado analyse android angular anniversaire anomalie anomaly-detection anthropic apache-arrow apache-druid apache-pinot apache-yunikorn apachespark arcadedb archive archlinux argo-cd articdb assembly astria astro astronomer atlas audacity augly aurads auth0 authentication authorization authz automatisation automerge autopilot avanade aws-summit back-market backblaze backup ballisa bash berkeleydb bert bgp biais biscuit bitcoin bleu bnp bodywork bootstrap bpi bpifrance broadcom business calcite calvin cap-theorem carbondata carrefour castordoc cdn celery ceph ceresdb cgroups chaosdb chiffrement cicd classification clevercloud cli clockhouse cloudact cluster-api clusterset cobol code-whisperer codecov codeurs-en-seine collecte colossus comptabilité conduktor conference conseil consensus consul-connect container conteneurs cookies cortex coscreen course-au-large covid-19 cpu craftsmanship criteo crux crypto cryptomonnaie csi csv cuda cue culture cybersécurité d1 dall-e dalle dashboard dask data-discovery data-gouvernance data-ops data-platform data-prep data-vault data-wrangling datacatalog datacenter dataform dataframes datafusion datagouvernance datahub datakin datalakehouse datamodeling dataops.rocks datapreparation datasearch datasketches dbscan ddos debezium delos devfest-lille dewitt diagram diagrams digital direct distinct distributed distributed-systems django dlt docker-desktop dockershim documentdb dolt dragonfly drift driftctl drill ebs echantillonage echart ecs egress entreprise entreprise4.0 epyc erlang eurybia evidence exadata expert-comptable explicabilité exploration falco faster fb feast feature finalizer fintech fiscal flaml flight forecast foundationndb fourier francais freebsd freenode french-tech ftp fugue fundings futur gafam gc geopandas geospatial gil github-actions gitlab-ci gitops glitchtip glue gobblin google-ads google-app-engine google-appengine google-font gourvernance gouvernance gp3 gpg gpt graph graphql graviton gravitron gunicorn hamilton hashicorrp hasicorp haskel hbase hdd hdh hex hfactory hfiles hibernate hop http husky image impala imply incident indexes indexima industrie industrie-4.0 inflation influx infomaniak internet interopérabilité iops iouring iox ipo ipv4 jedi jespen jinja jpa jquery jvm k-means k6 k8s k8saandra kaggle kalman-filter kappa kapsule kata-container kensu kibana kinesis komodor ksqldb kubecon kubectx kubeflow kubens kuma kyutai lake-formation langchain leap-second ledger lens letsencrypt letsencrypy license lighton ligthdash lineage linkbynet linkedin linky linode linter litestream llama lobe logica logiciel-libre loki low-code lowcode lru lsm-tree légal m1 maestro mangodb manticore-search mapie markov mathématiques matillion matrix medusa memcache memorydb messaging metadata metrics meuse microsoft-build microsoftazure mirantis mmap modeling moderndatastack modèle-relationnel modélisation monolith monolithe mpp msgpack msgspec multi-cloud musk méthodologie namespace netgear network newsletter newsql nicegui nifi no-code nodb notebooks nsa ntp numérique nvme object-storage observability observabilitycon ocaml olap onehouse onetable opacus open-policy-agent opendata opendatasoft openlineage opensourcesoftware p99conf paas pagnol partitionning password pcie performance pex pgcon pgrest pi pinterest pixie pixley plateforme pluralith podcast poetry polardb polars pony popsink posgresql pranadb predictions process processeur prolog prospective psp pub-sub pubsub pulumi pushmetrics pyre pyscript qlik qualité quantmetry quasardb query querybook quic quorum r2devops radix ram rancher rapport-gauvain re-invent readme readyset reapder recommendation redash redhat reed-salomon reinvent replibyte retention-policy revue rhel ribbon-filter riscv roblox rockset rockylinux rondb rpgd rppd rsync rtc rust-vmm salaire salon santé satellite scikitlearn scrapping security.txt segmentation select server-less service service-mesh servicediscovery servicemesh shapash shapsh shard shards shotover simulation singer slideshare snapash snapshot snoflake snowpark software souveraineté-numérique sowflake splunk spot sqlmesh sre srecon stable starlight startree startup statistiques steamsets streams sudo suisse supply-chain-attack suse syntec sysdig tanzu tar tdengine teads tech terality tesla text2speech the-last-pickle theseus thoughtworks thématique tiered-storage tigerbeetle tigergraph tigris tika tla+ tls tomcat tpc transformation trasnformers trifacta trinot tsfr twitter u-sql ua-parser-js ubeeko udap udf ui unikernel union-européenne upsert usage usb vc vectodb vectorized vertex vie-privée vm vmware voile voilà voix warehouse warp.dev wasi web web-components webassembly webassmelby wikimedia workflow ydb yelp youtube zanzibar zeenea zepl zeppelin zevent zstd éthique

Syndication

Restez informé(s) de notre actualité en vous abonnant au flux des épisodes, des brèves ou abonnez-vous au podcast dans votre application favorite

Le podcast est sponsorisé par Affini-Tech et CérénIT

À compter de l'épisode 104, le générique a été composé et réalisé par Maxence Lecointe

© 2014-2023 | Contenus sous licence Creative Commons BY-SA