Data Berlin #39
Sun, warmth, and relaxation. The last chance to recharge your batteries before summer.
We didn’t want this, but the long weekend was stronger than us: issue number 39 of our newsletter is only going out today, Tuesday. Apologies for the delay; we hope we can make up for it. It was also stupidly hot in Berlin, so if you spent the break melting on a balcony instead of catching up on newsletters, fair enough.
The one evening still worth planning around is Thursday, 28 May, when Adsquare hosts our May We Talk About Data? meetup: audience data at scale, measuring campaigns in the physical world, and offline impact when the dashboard is not the whole story. If you have been meaning to RSVP (or update your RSVP), this is the polite last call.
For everything after this week, subscribe on Luma so you do not miss the next one.
Looking further out: WeAreDevelopers World Congress is in Berlin 8–10 July at CityCube, Messe Berlin. If you are going, use our community code Community_DataBerlin.
The rest of this issue is the usual stack — reads, jobs, things shipping — for skimming between whatever got rescheduled because of the heat.
What’s new on our job board
databerlin.net/jobs is now at more than 600 open data and AI roles across Berlin — probably the lowest-friction way to see what is actually hiring here without opening five tabs and three LinkedIn searches.
The bit we want your eyes on this week: “Tailor my CV to this job” on any role page. One click gives you a prompt with the company, title, and description already filled in (and your CV too, if you have uploaded it on the site). Paste it into Claude, ChatGPT, or whatever you use, answer the follow-up questions, and you get a CV tuned to that posting. We are still tuning the prompt — if you try it, tell us what worked and what felt off (reply here, email, Slack, or grab us at the meetup).
Early evidence it is not completely cursed: at least one friend used it, got an offer, and their tailored CV did not scare recruiters off. Your mileage may vary; we would still like the feedback.
If you have not poked around the board lately: bookmark roles, compare saved jobs, skim Skill DNA on job pages — all still on the site, still free, still in your browser. And if we are missing a company you keep seeing on LinkedIn, ping us — finding new sources is the boring part of the job.
📖Interesting Readings
Near-universal AI adoption has not moved the real bottlenecks: practitioners still rank modeling, ownership, and leadership above new pipelines, which matters because faster codegen without semantics mostly scales mess, not value. – AI Is Here, But The Hard Parts Haven’t Changed
That same thread continues in a new one-minute survey on how those org problems show up where you work, which matters if you want the next public dataset to reflect Berlin teams, not just SF keynote vibes. – The Organizational State of Data Engineering
Experimental evidence and practitioner notes suggest that leaning on agents to learn new skills cuts retention without saving time, while unvetted skills files and weak CLIs expose security gaps, which matters when your moat is process design, not generated lines of code. – ctrl+r #12: AI learning cost, workflow moats, skills security, and the CLI disappointment
You can scrub through 300-plus procedural visualizations and run Python against 100-plus problems offline after a one-time purchase, which matters when interview prep or fundamentals need hands-on pacing instead of passive video. – The Interactive Handbook on Data Structures and Algorithms
Company-scale rollouts amplify culture that was already strong or weak: juniors spend more tokens with less payoff, review depth collapses under giant PRs, and maintenance lands on fewer people who still read the code, which matters before leadership treats output metrics as proof of quality. – AI’s impact on software engineers in 2026: key trends, Part 2
William Playfair’s lesson still holds: encode quantitative and categorical fields with the right retinal and spatial channels and cap how many you stack, which matters because the difference between insight and noise is usually design discipline, not a fancier chart type. – The Art and Science of Data Visualization
🚀 Shipping Now
dltHub Pro is GA as a Claude/Codex/Cursor-native platform where agents build dlt pipelines locally and Pro runs them in production with scheduling, observability, and managed runtime.
pg_infer 1.0 treats small transformer internals as SQL relations with an index-backed similarity operator on PostgreSQL 18+.
Lance in DuckDB adds vector, full-text, and hybrid search over versioned lakehouse tables inside the same SQL workflow you already use for analytics.
Google’s Data Agent Kit open-sources skills and MCP tools for BigQuery, Spark, and GCS in VS Code, Claude Code, Codex, and Gemini CLI.
nao’s MCP app renders signed interactive charts inside ChatGPT and Claude so agent answers match what you see in the nao UI.
dbt’s May ship list puts the Developer Agent in preview, opens self-serve Fusion upgrades, and lands Core v1.12 beta with UDF overloads and a --sqlrun-operation flag.
duckle compiles drag-and-drop pipelines to SQL on DuckDB with a local assistant and 290+ connectors for a visual, local-first studio.
☕ Upcoming Meetups & Events
May 27 — AI Agent Builders Berlin · powered by Dataiku 🛰️
May 28 — Tech Europe · Applied AI Conf Berlin 🤖
May 28 — Data Berlin · May We Talk About Data? 🐻
Jun 3 — The AI Native Enterprise · Agent Gateway Deep Dive w Christian Posta 🌐
Jun 4 — Berlin Startup School · AI Prototyping: idea to app 🚀
Jun 16 — AI for Non-Techies · AI for Non-Techies 🧠
Jun 18 — Data Berlin · Agentic Analytics Meetup 🐻
Jul 8–10 — WeAreDevelopers · World Congress — CityCube, Messe Berlin; Data Berlin code Community_DataBerlin 🎪
💼 This Week’s Job Picks
📊 Data Analyst
Mid Data Analyst - Finance – SumUp → Apply
Staff Data Analyst – Delivery Hero → Apply
Full Stack Engineer - Analytics – Contentful → Apply
And more here.
📈 Analytics Engineer
Senior Analytics Engineer - Web Tracking (Storefront) – About You → Apply
Analytics Engineer – Distribusion → Apply
Senior Analytics Engineer – Intercom → Apply
And more here.
⚙️ Data Engineer
Senior Data Platform Engineer – FlixMobility → Apply
Data Engineer – Deel → Apply
Senior Platform Engineer - Kafka (d/f/m) – emnify → Apply
And more here.
🤖 AI / ML / LLM + Data Scientist
Senior Machine Learning Engineer, Recommendations (Product) – SoundCloud → Apply
Machine Learning Engineer – Wolt → Apply
Senior ML Engineer – Redcare Pharmacy → Apply
And more AI/ML roles here and Data Scientist roles here.
If something here saved you a click or started an argument worth having, like the post, leave a comment, or share it with someone who still thinks “offline measurement” is optional.
Cannot make Thursday? No drama — the next Data Berlin meetup will be announced soon. Subscribe on Luma and you will get it in your inbox instead of hearing about it three days late from a colleague’s calendar invite.
— Data Berlin team








