Episode 55 : News

05/03/2018    french bigdata data-science data-engineering 

Building Reliable Reprocessing and Dead Letter Queues with Kafka
https://eng.uber.com/reliable-reprocessing/

Data Lineage sur Apache Spark avec Spline
http://blog.ippon.fr/2018/02/19/data-lineage-spark-avec-spline/

Elastic - Doubling Down on Open
https://www.elastic.co/blog/doubling-down-on-open
https://www.elastic.co/products/x-pack/open

JupyterLab is Ready for Users
https://blog.jupyter.org/jupyterlab-is-ready-for-users-5a6f039b8906

Cherami: Uber Engineering’s Durable and Scalable Task Queue in Go
https://eng.uber.com/cherami/

Streams in and out of Pravega
http://blog.pravega.io/2018/02/12/streams-in-and-out-of-pravega/
http://pravega.io/

Migrating Batch ETL to Stream Processing: A Netflix Case Study with Kafka and Flink
https://www.infoq.com/articles/netflix-migrating-stream-processing

Machine Learning pour les grand-mères
https://www.saagie.com/fr/blog/machine-learning-pour-les-grand-meres

AUTOMATED ML : IS IT THE END OF THE SEXIEST JOB OF THE 21ST CENTURY ?
http://blog.xebia.fr/2018/02/20/automated-machine-learning-is-it-the-end-of-the-sexiest-job-of-the-21st-century/

Google Cloud Auto ML
https://cloud.google.com/automl/

Apache MXNet - A flexible and efficient library for deep learning.
http://mxnet.incubator.apache.org/

Confluent and Apache Kafka in 2017
https://www.confluent.io/blog/confluent-apache-kafka-2017/

Oracle : l’insulte faite aux DBA
https://www.dsfc.net/infrastructure/base-de-donnees-infrastructure/oracle-insulte-faite-aux-dba/amp/

Apache Cassandra 3.11.2 release
https://www.mail-archive.com/dev@cassandra.apache.org/msg12075.html

Docker Meet Cassandra. Cassandra Meet Docker.
http://thelastpickle.com/blog/2018/01/23/docker-meet-cassandra.html

Autoscaling Dataproc clusters
https://blog.doit-intl.com/autoscaling-google-dataproc-clusters-21f34beaf8a3

Lisez le blog d’affini-Tech
http://blog.affini-tech.com


http://www.bigdatahebdo.com

Vincent : https://twitter.com/vhe74

Alexander : https://twitter.com/alexanderdeja

Cette publication est sponsorisée par Affini-Tech ( http://affini-tech.com https://twitter.com/affinitech ) On recrute ! venez cruncher de la data avec nous ! écrivez nous à recrutement@affini-tech.com

Nuage de tags

bigdata cloud aws ai news postgresql kubernetes azure cassandra interview databricks snowflake timeseries spark kafka france warp10 python apache dbt google grafana llm microsoft ovhcloud hadoop sql bigquery docker nosql pulsar trino data-science ia mongodb flink foundationdb duckdb influxdb timescale clickhouse googlecloud redis rust terraform elastic scaleway arm datastax gcp java mysql s3 sqlite confluent data database datalake ml nvidia quickwit rgpd serverless github influxdata iot machine-learning mlops clever-cloud databases europe lakehouse superset vscode cdc cloudera cnil cockroach facebook hashicorp machine_learning opensource prometheus search spanner sécurité arrow aurora catalog cockroachdb datascience dataviz datawarehouse delta gdpr haskell huggingface jupyter meta notebook openai pandas parquet pinot redshift souveraineté streaming yugabyte airbyte airflow apple architecture cloud-souverain cncf copilot couchbase data-mesh delta-lake devoxxfr docker-compose etcd etl feature-store gaia-x gke golang jetbrains kestra metabase nocode oss palantir pycaret raft redpanda scikit-learn senx agpl aiven beam bookkeeper chatgpt cloudflare compose datacontract dataiku datamesh datatask dynamodb eks elasticsearch firebolt french genai gitpod gpu ibm iceberg jepsen kernel lambda lucene mesos mimir netapp netflix opensearch pgsql postgres powerbi privacy pytorch questdb r raspberrypi rds scylla storage streamlit talend timescaledb traefik vector zookeeper algolia amd analytics analytique anthos api atlassian bi biglake blockchain bloom-filter consul containerd covid19 cve dagger dagster data-engineering datadog dataflow dataquality datastack delta-live delta-sharing discovery doctolib ebpf ec2 elixir elt excel exoscale fabric flows git google-analytics helm hudi istio json linux log4j maif malloy mapr mqtt neo4j nlp nomad oracle orchestration ovh phoenix presto prestosql privacy-shield prophet quantique rabbitmq risc-v salesforce shell slack sncf ssd synapse tableau tikv time-series 2019 accord aks alertes amazon astradb atos automl azure-ml babelfish benchmark bintray cabourotte centos chine citus cloud-de-confiance collibra cookie cosmosdb couchdb cube dash data-catalog data-engineer data-quality datageek datagouv dataops dataplex dataproc datarobot deep-fake deep-learning deltalake deployment dhakira dremio druid emr euclidia event faq fastapi faunadb flask frenchtech gaiax gartner gitlab gladia gpt-3 health-data-hub hive iam ibis ingestion inria instacluster instaclustr intellij ipv6 jcenter jdk jfrog julia k8ssandra kapacitor knative kotlin licence live log log4shell lsf m3db machinelearning memcached memsql metaverse micro-service microservice microsoft-sql-server minio mirabelle mistral mlflow n8n nft nodejs noel npm nrtsearch okta openjdk openmetadata operator orange pandera paxos planetscale podman prestodb prompt qemu qovery r2 radar registryops reverse-etl rlang rocksdb rook sagemaker scalabilité scylladb secnumcloud sigfox small-data smalldata solr spring sql-server ssh stack-overflow starburst starbust stargate state statefulset streamnative système-distribué tabular telegraf tempo test thales thematique timestream uber usa vault vectordb velero vitess voltron warpstudio wasm wifi zig 2017 accenture actors actu acv adoptium adoptopenjdk aeure agi agrocd akami akhp akka alerts alibaba alloydb allydb almalinux amado analyse android angular anniversaire anomalie anomaly-detection anthropic apache-arrow apache-druid apache-pinot apache-yunikorn apachespark arcadedb archive archlinux argo-cd articdb assembly astria astro astronomer atlas audacity augly aurads auth0 authentication authorization authz automatisation automerge autopilot avanade aws-summit back-market backblaze backup ballisa bash berkeleydb bert bgp biais biscuit bitcoin bleu bnp bodywork bootstrap bpi bpifrance broadcom business calcite calvin cap-theorem carbondata carrefour castordoc cdn celery ceph ceresdb cgroups chaosdb chiffrement cicd classification clevercloud cli clockhouse cloudact cluster-api clusterset cobol code-whisperer codecov codeurs-en-seine collecte colossus comptabilité conduktor conference conseil consensus consul-connect container conteneurs cookies cortex coscreen course-au-large covid-19 cpu craftsmanship criteo crux crypto cryptomonnaie csi csv cuda cue culture cybersécurité d1 dall-e dalle dashboard dask data-discovery data-gouvernance data-ops data-platform data-prep data-vault data-wrangling datacatalog datacenter dataform dataframes datafusion datagouvernance datahub datakin datalakehouse datamodeling dataops.rocks datapreparation datasearch datasketches dbscan ddos debezium delos devfest-lille dewitt diagram diagrams digital direct distinct distributed distributed-systems django dlt docker-desktop dockershim documentdb dolt dragonfly drift driftctl drill ebs echantillonage echart ecs egress entreprise entreprise4.0 epyc erlang eurybia evidence exadata expert-comptable explicabilité exploration falco faster fb feast feature finalizer fintech fiscal flaml flight forecast foundationndb fourier francais freebsd freenode french-tech ftp fugue fundings futur gafam gc geopandas geospatial gil github-actions gitlab-ci gitops glitchtip glue gobblin google-ads google-app-engine google-appengine google-font gourvernance gouvernance gp3 gpg gpt graph graphql graviton gravitron gunicorn hamilton hashicorrp hasicorp haskel hbase hdd hdh hex hfactory hfiles hibernate hop http husky image impala imply incident indexes indexima industrie industrie-4.0 inflation influx infomaniak internet interopérabilité iops iouring iox ipo ipv4 jedi jespen jinja jpa jquery jvm k-means k6 k8s k8saandra kaggle kalman-filter kappa kapsule kata-container kensu kibana kinesis komodor ksqldb kubecon kubectx kubeflow kubens kuma kyutai lake-formation langchain leap-second ledger lens letsencrypt letsencrypy license lighton ligthdash lineage linkbynet linkedin linky linode linter litestream llama lobe logica logiciel-libre loki low-code lowcode lru lsm-tree légal m1 maestro mangodb manticore-search mapie markov mathématiques matillion matrix medusa memcache memorydb messaging metadata metrics meuse microsoft-build microsoftazure mirantis mmap modeling moderndatastack modèle-relationnel modélisation monolith monolithe mpp msgpack msgspec multi-cloud musk méthodologie namespace netgear network newsletter newsql nicegui nifi no-code nodb notebooks nsa ntp numérique nvme object-storage observability observabilitycon ocaml olap onehouse onetable opacus open-policy-agent opendata opendatasoft openlineage opensourcesoftware p99conf paas pagnol partitionning password pcie performance pex pgcon pgrest pi pinterest pixie pixley plateforme pluralith podcast poetry polardb polars pony popsink posgresql pranadb predictions process processeur prolog prospective psp pub-sub pubsub pulumi pushmetrics pyre pyscript qlik qualité quantmetry quasardb query querybook quic quorum r2devops radix ram rancher rapport-gauvain re-invent readme readyset reapder recommendation redash redhat reed-salomon reinvent replibyte retention-policy revue rhel ribbon-filter riscv roblox rockset rockylinux rondb rpgd rppd rsync rtc rust-vmm salaire salon santé satellite scikitlearn scrapping security.txt segmentation select server-less service service-mesh servicediscovery servicemesh shapash shapsh shard shards shotover simulation singer slideshare snapash snapshot snoflake snowpark software souveraineté-numérique sowflake splunk spot sqlmesh sre srecon stable starlight startree startup statistiques steamsets streams sudo suisse supply-chain-attack suse syntec sysdig tanzu tar tdengine teads tech terality tesla text2speech the-last-pickle theseus thoughtworks thématique tiered-storage tigerbeetle tigergraph tigris tika tla+ tls tomcat tpc transformation trasnformers trifacta trinot tsfr twitter u-sql ua-parser-js ubeeko udap udf ui unikernel union-européenne upsert usage usb vc vectodb vectorized vertex vie-privée vm vmware voile voilà voix warehouse warp.dev wasi web web-components webassembly webassmelby wikimedia workflow ydb yelp youtube zanzibar zeenea zepl zeppelin zevent zstd éthique

Syndication

Restez informé(s) de notre actualité en vous abonnant au flux des épisodes, des brèves ou abonnez-vous au podcast dans votre application favorite

Le podcast est sponsorisé par Affini-Tech et CérénIT

À compter de l'épisode 104, le générique a été composé et réalisé par Maxence Lecointe

© 2014-2023 | Contenus sous licence Creative Commons BY-SA