Shanidar

Nombre de resultats 25 per a i

19/01/2023 - Roskilde

Considerada una de les ciutats més antigues de dinamarca, Roskilde té el Museu dels vaixells vikings, on es poden veure el que queda de cinc vaixells enfonsats al segle XI. Al museu et pots vestir com els vikings de l’època, … Continua llegint →

18/01/2023 - Castell de Frederiksborg

Situat a la població de Hillerod, al nord de Selandia, la més gran de les illes de Dinamarca, el Castell de Frederiksborg es troba a una quarantena de quilòmetres de la capital danesa, i està connectat amb Copenhaguen per tren. … Continua llegint →

17/01/2023 - Copenhaguen

Copenhaguen, amb més d’un milió d’habitants, és la capital de Dinamarca. Situada estratègicament a l’entrada de la mar Bàltica a l’illa de Sjælland, és una ciutat molt tranquil·la, perfecte per perdre’s pels seus carrers i disfrutar dels seus parrcs. La millor … Continua llegint →

16/01/2023 - Dinamarca 2022

Reprenem els viatges d’estiu, aquest 2022 hem visitat el país de l’escriptor Hans Christian Andersen El passat estiu vam visitar un país que feia temps que li teníem ganes, Dinamarca. Tot i que ens vam perdre molts llocs xulos, vam … Continua llegint →

06/09/2019 - Trieste

El vol de tornada el teníem des de Triste, per això vam aprofitar per quedar-nos una nit a la ciutat i visitar-la. 24 hores per fer un tastet d’Itàlia! La primera idea per anar de Piran a Trieste era fer-ho, bé … Continua llegint →

05/09/2019 - Piran

Eslovènia té només 48 km de costa, però no ens la volíem perdre, així que l’última parada d’Eslovènia va ser a Piran, una ciutat petita, tranquil·la, amb aires venecians, que es troba a la península d’Istria. El casc antic és pràcticament … Continua llegint →

04/09/2019 - Coves de Postojna

El sisè dia de viatge ja vam fer les maletes per marxar de la zona dels Alps Julians i marxar cap a la costa, propera parada: Piran, però pel camí va fer una parada per visitar les coves de Postojna, … Continua llegint →

31/08/2019 - Llac Bohinj

El quart i últim dia que estàvem per la zona de Bled, vam decidir anar-se’n la zona del llac Bohinj. La nit abans ens havia diluviat, així que el dia no era per banyar-nos-hi, tot i que va quedar un … Continua llegint →

30/08/2019 - Parc Nacional de Triglav

El Triglavski Narodni Park té una superfícies de 840 km2 i és una de les reserves nacionals més grans d’Europa.Un parc espectacular de muntanyes rocoses, gorges fluvials, barrancs, llacs, coves, rius, cascades i carreteres vertiginoses. De camí a Kranka Gora, … Continua llegint →

29/08/2019 - Gorges Vintgar i llac Bled

El tercer dia de viatge va ser un dels mes esperats, i el que ens vam trobar més gent, també s’ha de dir, ja que amb anar a visitar dos dels llocs més emblemàtics del país, les gorges de … Continua llegint →

28/08/2019 - Ljubelj

A mitja tarda vam arribar al Kamp Poldjubelji, i ens vam instal·lar a la que seria la nostra casa els propers quatre dies, una caseta de fusta en un càmping molt bàsic però on vam poder allunyar-nos de les aglomeracions … Continua llegint →

27/08/2019 - Liubliana

Amb una hora de retard, la veritat és que ens pensàvem que perdríem el segon vol, vam arribar per la tarda a Liubliana, la capital d’Eslovènia i una de les ciutats més verdes i agradables d’Europa. El centre té el … Continua llegint →

07/08/2019 - Anem a descobrir Eslovènia!

Aquest any, natura a dojo! En breu començarem la nostra aventura per terres eslovenes. 9 dies de viatge on esperem descobrir els racons de la desconeguda Ljubljana, capbussar-nos als llacs Bled i Bohing i a les aigües de Piran, caminar … Continua llegint →

08/07/2019 - 2019#19 Readings of the week

NOTE: The themes are varied, and some links below are affiliate links. History, haskell. Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here. You can also get these as a weekly newsletter by subscribing here.

Photo by Chris Marquardt on Unsplash

When Pepsi Had a Navy

From the title, I thought it was related to the “Sugar Wars”, but the reality is weirder.

Your Work Peak Is Earlier Than You Think

Mildly depressing. A quote I found interesting:

Careers that rely primarily on fluid intelligence tend to peak early, while those that use more crystallized intelligence peak later. For example, Dean Keith Simonton has found that poets—highly fluid in their creativity—tend to have produced half their lifetime creative output by age 40 or so. Historians—who rely on a crystallized stock of knowledge—don’t reach this milestone until about 60.

Why I (as of June 22 2019) think Haskell is the best general purpose language (as of June 22 2019)

Although I can somewhat read Haskell code, I still can’t write much. So, can’t really agree... Yet.

Easy Parsing with Parser Combinators

I have a small project I wrote with AWK that I want to rewrite “properly” in Scala, and I’d need to do some real parsing (not the ad-hoc parsing you always end up writing in AWK), so learning to use FastParse is worth it. Also Li is a very clear writer.

Sam and Max Hit the Road Development History

So many fond memories from this game, I’m getting it again from GOG any day soon. Also this post made me look up where my copy of the ??? was.

Comonads for Life

Implementing the Game of Life using comonads (the dual of monads)

Real-world dynamic programming: seam carving

I have always been fascinated by seam carving. It’s so amazing

The surprising story of the Basque language

Uhm, Armenia is not even close.

History Will Not Be Kind to Jony Ive

Flexgate, antennagate, keyboardgate. There have been many f-ups that can be traced back to looking for a specific design. I hope Apple gets back to top-design and top-usability now.

'It's getting warmer, wetter, wilder': the Arctic town heating faster than anywhere

Longyearbyen (Svalbard’s capital) is in danger.

Reasons of State: Why Didn't Denmark Sell Greenland?

An analysis by Gwern Branwen about why Denmark didn’t sell Greenland to the US after WWII (they wanted it as a base).

'Like a military operation': restoration of Rembrandt's Night Watch begins

The weight of the painting will surprise you.

📚 Sprint

The nitty gritty details of how to run design sprints (a Google practice). Interesting.

📚 Diaspora

This is the second Egan book I read (after Clockwork Rocket) and I found Diaspora much better, although with a weak ending and a somewhat erratic plot. The book has some strong Star maker vibes. By the way, when adding the link above I thought the narrator for Audible was this Adam Epstein (a pretty darn good mathematician I know)

📚 Thinking in bets

The general idea is sound, but it’s one of those books that could be written as a long blog post or short essay.

📚 Set your voice free

Exercises to improve your voice. I have recently started to do the daily warm-ups, but so far have felt no difference. Of course, other people should be the ones to tell me.

🎥 Wardley mapping interview

I was recently interviewed by John Grant and Ben Mosior as part of a set of interviews leading up to MapCamp London 2019. If you are interested in my thoughts about mapping the tech landscape, I wrote my train of thought in this blog post as well.

Newsletter?

These weekly posts are also available as a newsletter. These days (since RSS went into limbo) most of my regular information comes from several newsletters I’m subscribed to, instead of me going directly to a blog. If this is also your case, subscribe by clicking here.

07/07/2019 - A (section) of a map of the data engineering space

Table of contents

The map and problem described here were part of my presentation Mapping as a tool for thought, and mentioned in my interview with John Grant and Ben Mosior (to appear sometime soon in the Wardley Maps community youtube channel). I’m looking for ideas on how to make this map easier to understand and useful, so I posted it to the Wardley Maps Community forums requesting comments.

Problem I'm trying to solve

As a consultant (and as someone always trying to keep up with technology) I'm interested in being able to answer three questions of a language or technology:

How easy it is to find work/workers in the area right now?
How hard it is to learn?
How easy is it going to be to find work/workers in the area once I'm proficient enough?

Also, I need to know the relationship between any of them.

This problem has been on the back of my mind for many years, and upon getting proficient with Wardley mapping, I thought I could just map it. Of course, it's not a Wardley map, because the axes are completely different, but having anchors and movement, it is spiritually close enough for me.

In the diagram I will show in a while, I have placed technologies I am proficient in, currently learning, or looking forward to learn. In all of them I am at least a beginner in the sense that I know what they are used for and have done some minor PoC (proof of concept) to get an idea of how the work.

Looking for axis metrics

There are several ways you can address this technology space to answer the questions above. The first and easiest metric, and one of the axes I have used (Y axis) is Difficulty. Since I know something about each technology I can rank them on Difficulty, at least in relationship with each other. It's only a qualitative metric of difficulty, because in any new technology there are always unknown unknowns. There is no movement assumed in this axis, because Difficulty is supposed to be consistent throughout (leaving aside the more you know the easier it is to learn as well as familiarity with similar concepts that offset that, you could think of these two concepts as doctrine in such a map).

One natural metric for the other axis could be popularity, as measured by any of the several programming language/framework popularity rankings. You can use popularity as one of the axes, and use arrows to indicate whether it is growing in popularity or diminishing in popularity. But, popularity alone does not help in answering questions 1 and 3. What we need is knowing how large the market for this technology is, and how large the pool of workers in this market is. Could we use either as an axis?

If we were to use market size as X axis, we would probably have large markets on the right and small markets on the left, we would likely use arrows to indicate growing markets and shrinking markets. But, market size alone won't answer the questions either. A small thought experiment: imagine we have the largest market possible, it is growing... but the pool of workers for that technology is 2x the size of the market. It would be impossible to find work there (but, would be easy to find workers). This suggests that a possible correct for the X axis is market saturation, i.e. the ratio of market size with worker pool. Highly saturated markets are uninteresting to look for work, but are very interesting if you are starting a company: you'd have an easy time finding hires for that technology. Market saturation is related to flows (as in Systems thinking flow analysis) of users into a variable-sized container.

Markets become saturated in one of 3 ways:

Market is growing, but the pool of workers grows faster
Market is stagnating with a growing pool of workers
Market is shrinking faster than the pool of workers is shrinking

Cases 1 and 2 are the most usual (I'd put Python as type 1 and Java as type 2), but 3 is an interesting situation: it would indicate a technology that has died in favour of another. Workers in that pool have retrained in the new technology, but are still in pool for the dying technology (for instance, traditional MapReduce).

Markets become desaturated in one of 3 ways as well:

Market is growing faster than the pool of workers is growing
Market is stagnating with a shrinking pool of workers
Market is shrinking slower than the pool of workers is shrinking

Likewise, here, cases 1 and 3 may be the most interesting. I'd put Kubernetes in 1, and Scala in either 2 or 3. Please note that not only are these subjective evaluations, but are not meant to be negative. Scala has been my preferred language for a long while.

We can represent all these with slanted arrows: slant up covers growing markets, slant down covers shrinking markets. And then the arrow points left or right whether it is becoming saturated or desaturated.

With market saturation as an axis and arrows to indicate evolution of a technology, we can now almost answer questions 1 and 3. There is still the question of market size, which can't be represented with such a relative measure. Although we could add circles to represent current market size, that would bring an already weird map to more weirdness. Hence, market size is not considered.

The map

Here you can see the map. Before I get a bit into the topography of it, let me quickly define some of the technologies:

APL: programming language based on non-ASCII symbols designed in the 70s. Not extensively used, but in use
Airflow: workflow scheduler for data operations
Akka: Scala/JVM actor framework used for reactive programming, clustering, stream processing, etc
Arrow: cross-language platform for in-memory data. Used in Spark, Pandas, etc
Awk: special-purpose programming language designed for text processing
Beam: unified model for data processing pipelines. Can use Spark, Flink and others as execution engines
Dask: cluster-capable, library for parallel computation in Python.
Datafusion: rust-based, Arrow-powered in-memory data analytics
Docker: containerisation solution
FP: as in Functional Programming. Software development paradigm based on immutable state, among other things. Scala and Haskell are some the most mainstream languages for it
Flink: cluster computing framework for big data, stream focused
Forth: stack based, low level programming language. Not in common use.
FoundationDB: multi-model distributed NoSQL database, offering "build your own abstraction" capabilities
Go: statically typed, compiled programming language
Haskell: statically typed, purely functional programming language
Hive: data warehousing project over Hadoop, roughly based in "tables"
Kafka: cluster based stream processing platform (often used as a message bus) written in Scala
Kubernetes: container orchestration system for managing application deployment and scaling. Written in Go, depending (non-strictly) on Docker
Presto: distributed SQL engine for big data
Python: interpreted high level programming language, very extended in data science and engineering
Rust: memory safe, concurrency safe programming language. Has some functional capabilities
Scala: JVM based language offering strong typing and functional and OOP capabilities
Spark: cluster computing framework for big data, batch and stream (stronger in batch)

These cover a range of the data engineering space (Flink, Spark), as well as technologies I want to get better at and are close enough (Kubernetes, FoundationDB) and technologies I know but are not directly related (AWK, APL, Forth) and are used as anchors.

Anchors

To display relative positions, I needed to anchor some of the technologies. For instance, Haskell and APL set the bar for difficulty, with AWK setting the minimum, and Python and Forth set the extremes for saturation. Everything else is placed in relation with these.

Links and colours

In the map, I have used colours to distinguish languages, frameworks, libraries, containerisation and databases. Colour is not fundamental though, links are. Related technologies are linked: Spark is written in Scala, and can be used with Scala, Python and other languages. Hence, changes in market saturation for Spark indirectly affect market saturation for Scala.

Empirical map division

We can think of the map as divided in 2 areas vertically (high barrier to entry and low barrier to entry) and 3 areas horizontally (saturated, accessible, desaturated).

Quick overview

If you were a CTO, you'd probably be interested in:

Low barrier to entry, saturated market for high turnover positions (easier and cheaper to hire and train)
Accessible currently becoming saturated for more stable positions (becoming easier to hire in the future)
High barrier to entry, with growing markets for proof of concepts, exploration (exploring new technologies that may make an impact in the future)

A map like the one above can help make these decisions.
For deciding on future work, I'd be (as a consultant) interested in:

First, growing market, high barrier to entry for current learning. If the barrier to entry is too low and the market is lucrative enough any return on time investment will tend to 0 in the end otherwise.
Second, accessible with low-to-mid barrier to entry for imminent work opportunities.
Finally, to hedge bets on a long term plan, anything which is very low saturation, and where the market is unknown or may be in growth.

With this map (which is suited to my current knowledge and interests) I can definitely answer the question of where I should put my efforts right now.

Curiosities

Haskell and Rust raise interesting questions. Haskell has a very high barrier to entry, the market is not very large and might not be growing fast, but there are developers working in other languages (Scala, Rust, Kotlin, Go, even Python) that would love the opportunity to work in Haskell. This makes the Haskell job market actually saturated (or at least saturated if you consider worldwide market). Thus, starting a company focused on Haskell might not be as bad of an idea as it might sound. Similarly with Rust: Rust is growing as a side project language, the amount of developers familiar with the language is growing faster than the market and thus is an interesting target for a starting company.

We'd have Python on the other side: since it is taking over as the lingua franca of data science and engineering, and becoming one of the teaching languages at universities, the amount of developers with enough knowledge to become part of the work pool is growing faster than the market (even if the market for Python is growing at a fast pace). It makes it an ideal language for creating a company or a consultancy company (large pool of candidate workers), but not so interesting for being an independent consultant, since competition could be too large

Questions and further ideas

This is the approach and train of thought I have followed to trace these ideas, but it’s still a work in progress. I’d like to hear what you have to say about this: what would you change? What would you do different? I'm still unsure about using market saturation and arrows to show market and pool of workers behaviour, but I have not found anything easier to represent. Ideas? And here are some areas I’m unsure or where I have more questions.

What other approaches would you have taken to explore this questions?

There are probably many other ways you can take to approach these questions. What would be yours?

Do you think market size would need to be shown?

It could be shown with a circle (different sizes to be able to compare) below the arrow indicating behaviour of saturation but that could make the map way too complex. Any other ideas? Do you think it is that important, as long as saturation is taken into account?

Links between related technologies are a bit hazy

The links between technologies are a bit too abstract. An "increase" in "Python" moves "higher" Airflow, Spark, Dask and any related technologies... but in what sense? Popularity? Market share? Market saturation? I suspect the link is useful to see, and it is supposed to bring some dynamic/movement, but I'm still unsure how.

Flows

An interesting approach I didn't pursue is using flow maps. For each programming language, there is a set of flows into other languages. For instance, developers in Scala have a tendency to be interested in Kotlin, Rust and Haskell, with some making the jump as soon as market is able to absorb them (and for each of these flows we can assume there is a non-zero flow to the other side). Similarly, we'd have flows from Python to Go and Scala, from Go to Rust. These could inform on market trends and behaviours, but they are not only hard to show on a map (what would be the axes? what would be the anchors?) but also might not be interesting enough on their own. What do you think?

25/06/2019 - 2019#18 Readings of the week

Photo by Nik Shuliahin on Unsplash

NOTE: The themes are varied, and some links below are affiliate links. Software engineering, history, planning, data engineering. Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Fresh look at mysterious Nasca lines in Peru

An analysis of what kind of birds they represented.

4 Simple Steps To Set-up Your WLM in AWS Redshift

Good suggestions on workload management and process queues for Redshift

Adventure Games and Eigenvalues

Finding dead ends in a game using Markov processes (instead of a formal language approach)

When pigs fly: optimising bytecode interpreters

Quite a meaty post about... well, optimising interpreters (incidentally, bytecode based)

See through words

Did you know metaphor design is a thing? Read this article for more.

Decision tables

Calling decision tables a formal method may be a stretch, but they can clarify your thinking. And that is one of the powerful things formal methods bring.

Monad Transformers aren’t hard!

No, they aren't, but your heap can suffer in Scala!

🎥 Wardley Maps Saved The Day - How Stack Overflow Enterprise automated all the things...

I’ve been into Wardley mapping for several months (even gave a presentation on Mapping at SoCraTes UK 2019) and I’m basically consuming any content related to it. This short video is a very good intro to Wardley mapping, by the way.

🎥 Cartoons are about how drawing and writing work together on the page

A quite funny video presentation by Tom Gauld

📚 The goal: a process of ongoing improvement

It’s like Sophie’s World but for Theory of Constraints (with a whiz of the Toyota Production System probably)

📚 Make your contacts count

Networking tips. It describes very directed ways of building a network and actually being useful in it, not just a parasite.

Newsletter?

17/06/2019 - 2019#17 Readings of the week

NOTE: This week is a bit light on technical content because I was attending Scala Days 2019 in Lausanne and I had enough with the talks.

NOTE: The themes are varied, and some links below are affiliate links. Software engineering, psychology, history. Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Photo by Eduardo Romero on Unsplash

The mindfulness conspiracy

It’s a trap!

One Diagram To Mind Them All: Hyperspace in the 1970s

The beginnings and history of mind mapping, and Tony Buzan in particular.

A tale of lost WW2 uranium cubes shows why Germany’s nuclear program failed

Splitting your efforts won’t lead to faster splitting of the atom.

Exploring Domain-Driven Design at CircleCI

DDD is an interesting concept, and knowing how companies implement it is always good. This lacks a bit of more material though.

A Year’s Worth

An interesting way of visualizing a year’s worth of technical effort, by Kent Beck.

The Making of Lemmings

I enjoy a lot tales of the dark ages of gaming. If you haven’t, read The Making of Prince of Persia (you may enjoy the “prequel”, The Making of Karateka) or Masters of Doom).

Feynman’s Breakthrough, Disregard Others!

If you focus too much in what others do, you’ll lose your uniqueness.

📚 Tribe of mentors

A book by Tim Ferris, where he quick-interviews a lot of people, asking tips. It’s a kind of a not-amusing read, because it’s not designed to be “read”, I suspect.

📚 How to Draw Fantasy Art and RPG Maps: Step by Step Cartography for Gamers and Fans

I had a lot of fun with this one. Super-readable and easy to follow.

Newsletter?

10/06/2019 - 2019#16 Readings of the week

Photo by Roman Mager on Unsplash

NOTE: The themes are varied, and some links below are affiliate links. Wardley mapping, data engineering and big data, maths. Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Meet the Money Whisperer to the Super-Rich N.B.A. Elite

It surely has to be an interesting job

Thought as a Technology

An essay by Michael Nielsen. Very recommended.

A tale of lost WW2 uranium cubes shows why Germany’s nuclear program failed

Curio

Counting to infinity at compile time – The Startup – Medium

You can do really weird things with Scala at compile time, and base 3 is pretty cool.

A visual introduction to Morse theory

I was surprised to see this posted on Hacker News, so decided to read it for the "good old times". Learnt something (since I never had a class on Morse theory, it was just one of those intriguing sections in the library)

The cutting-edge of cutting: How Japanese scissors have evolved

I have some weird scissors, but these go beyond weird.

A different kind of string theory: Antoni Gaudi

That idea was clearly genius.

Formally Specifying a Package Manager

By using Alloy. I really like Alloy.

Maker's Schedule, Manager's Schedule

I was talking with a friend earlier about a previous post in my Weekly Readings series (Do I truly want to become a manager?) and he hadn't read this one, so I'll share it here too.

Schema evolution in Avro, Protocol Buffers and Thrift

I kind of like the sound of how Avro handles schemas. Seems an efficient way, although seems... prone to possible disasters.

Improve Apache Spark write performance on Apache Parquet formats with the EMRFS S3-optimized committer

This is optional in EMR 5.19 and will be the default in 5.20 so we won't see much difference, except in performance.

Why Enthusiasm is a Bad Thing in a CTO

Hey, look a squirrel is 10x more dangerous when it's the CTO spotting squirrels.

🖥 Mapping as a Tool for Thought

A presentation I gave on Wardley mapping and "mapping" in general at SoCraTesUK 2019. I really enjoy this conference.

🎥 Skip the first three months of development for your next app

This ties very well with some of the Wardley mapping concepts.

Newsletter?

26/05/2019 - 2019#14 Readings of the week

Photo by Darius Soodmand on Unsplash

NOTE: The themes are varied, and some links below are affiliate links. Data engineering, adtech, functional programing, formal specification. Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Schema Management With Skeema

How SendGrid manages schema updates internally. A pity it’s focused on MySQL and family and I prefer Postgres

How to do hard things

I wasn’t aware, but this is exactly how I approach anything I don’t know how to do. Don’t miss this one.

The bullshit I had to go through while organizing a software conference

I’ve been an organiser, luckily didn’t encounter this. Crossing fingers, since we are preparing stuff at PyBCN.

The reason I am using Altair for most of my visualization in Python

I usually like having the possibility of maximum power and expressiveness... But eventually I just want to make easy things easy. I’ll try Altair next time I need to plot anything. Lately I’ve gone a lot to gnuplot, to be fair: nothing beats it to just plot a text file you have lying around.

Introducing Argo — A Container-Native Workflow Engine for Kubernetes

This is what is now being used at BitPhy, after I recommended them to... well, not use Airflow. To be fair they were considering Argo and Airflow, and given they are heavy on Kubernetes, Argo sounds a better fit.

Do I truly want to become a manager?

Six questions you should consider before thinking of making the leap out of the IC route. Thanks to CM for sharing it.

Ask HN: What overlooked class of tools should a self-taught programmer look into

You can find some suggestions in these answers.

Give meaning to 100 Billion events a day

How Teads leverages AWS Redshift.

Performant Functional Programming to the max with ZIO

You were wondering: which ZIO post is he going to share this week?

Meet Matt Calkins: Billionaire, Board Game God And Tech's Hidden Disruptor

A friend of mine is designing and producing a board game (I’ll share the Kickstarter when is ready), so this was a fun read. He’s not a billionaire though.

Open-sourcing the first OpenRTB Scala framework

I’m not sure what the performance of a RTB can be in Scala, but I’m definitely interested.

Zero Cost Abstractions

Rust is always sold as a zero-cost abstraction language. What does that exactly mean?

A novel data-compression technique for faster computer programs

Don’t know, this sounds pretty much what Blosc does.

🎥 Fast Data with Apache Ignite and Apache Spark

(this is an oldie) I tried Ignite+Spark around 1 year ago, and couldn’t get it to work properly (segmentation fault!). I’ll try again: it can open up a lot of things if it works as promised.

🎥 Thinking for Programmers

Leslie Lamport selling you the why of specifications. Very recommended, specially for people who think specifying is complex, long, unnecessary or anti-agile.

🎥 Live Coding with Rust and Actix

I was very impressed by this video (I watched it as “background”, not actively). Very focused, and all written in one go.

🎥 Solving every-day data problems with FoundationDB

FoundationDB is on my list of technologies to watch/follow and learn. Get a glimpse about why here.

📚 Soft Skills

It’s not a bad book, but is basically a summary of stuff I got from other places already. You can give it a go.

🎼 Time Out by the Dave Brubeck Quartet

Saw this recommended somewhere, and I’m liking it a lot (won’t remove Bill Evans as my favourite jazz pianist though)

Newsletter?

13/05/2019 - 2019#12 Readings of the week

Photo by Zsolt Palatinus on Unsplash

NOTE: The themes are varied, and some links below are affiliate links. Big data and data engineering, metallurgy, Rust (not the metallurgy-related rust). Expect a similar wide range in the future as well. You can check all my weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Testing Incrementally with ZIO Environment

I’ve been reading a lot about ZIO lately, even to the point of writing a few simple thinks with the help of a more advanced Scala friend. As I’ve mentioned before, it looks solid

Medieval Africans Had a Unique Process for Purifying Gold With Glass

No plans on smelting anything soon, but this was interesting.

Rust Runtime

The proposed approaches for async-await in Rust are shaping up.

Moving from Ruby to Rust (at Deliveroo)

It should surprise no-one, except maybe hardcore rubyists that moving from Ruby to Rust yielded terrific speed-ups. Something similar should happen in Python, though.

Zero-cost futures in Rust

One of the core tenets of Rust is having zero-cost abstractions (as in, being as close to manually written). Here you can read how futures fit that goal.

A quick look at trait objects in Rust

Traits will be familiar to Scala or even Go or Java developers, but there are a few gotchas/differences you need to be aware.

Druid Design

A friend of mine has used Apache Druid before, and has been really happy with its performance, so I’ve been looking into possible use cases to road-test it.

Make your own GeoIP API (in Python)

I’ve linked before to GeoIP approaches, this one is interesting as long as you only want country-level resolution, I usually need city or better.

An in-depth look at the HBase architecture

Similarly to Druid, I’ve been checking several things to speed up some areas. Apache HBase might be one of them, and I wanted to see how it works under the hood, I love these details

🎥 ZIO Schedule

As mentioned above, ZIO is shaping up nicely, and the scheduling helpers presented here are one of the selling points, depending on what you do.

🎥 Pure functional programming in Excel

Duh... but... to be fair... there is a point in here.

🎥 Gradual typing of production applications

Incremental approach suggestions for typing in Python, at Facebook (Instagram to be precise)

🎥 Effects as Data

Everything here should sound familiar if you are anything into functional programming

Newsletter?

19/02/2019 - Apache Hive and java.lang.ClassCastException on start

Photo by Annie Spratt on Unsplash

A couple of days ago I installed Hive from Homebrew on my Mac. Sadly, when I tried to run the hive command, I got the weird-looking error

Exception in thread "main" java.lang.ClassCastException: 
  class jdk.internal.loader.ClassLoaders$AppClassLoader 
   cannot be cast to 
  class java.net.URLClassLoader 
  (jdk.internal.loader.ClassLoaders$AppClassLoader 
    and 
  java.net.URLClassLoader are in module java.base of loader 'bootstrap')

That looked like a JVM incompatibility, so I switched from GraalVM (the one I use by default) to Java 8 (I have aliases jgrce, jgree, j8 and j11 to switch JVMs). Still, the same error regardless. Weird. Maybe Java 11 (the other JVM I have installed)? Nope, same error.

A quick Googling confirmed that this was related to Hive picking up Java 11, but only working with 7, 8 or 9 (not sure about 9). This in turn is due to the Hive boot scripts looking for the latest JRE which is at least 7, like the hive command here:

JAVA_HOME="$(/usr/libexec/java_home --version 1.7+)" \ 
HIVE_HOME="/usr/local/Cellar/hive/3.1.1/libexec" exec \
"/usr/local/Cellar/hive/3.1.1/libexec/bin/hive" "$@"

This will pick 11, which no longer has URLClassLoader (I think this was changed in Java 9). So, won't start.

Sadly the only reasonable fix is modifying the scripts after installation, unless you want to just uninstall Java > 1.8. For me this was not an option, so I just modified the scripts by removing the JAVA_HOME condition (since I set my JAVA_HOME globally when I switch between JVMs). And crossing fingers to remember I did so next time I upgrade HomeBrew.

27/01/2019 - 2019-3 Readings of the week

NOTE: The themes are varied. Software/data engineering, history, formal systems. Expect a similar wide range in the future as well. You can check all weekly readings by checking the tag here . You can also get these as a weekly newsletter by subscribing here.

Fermentation and Daily Life

I’m not a fan of fermented food, but my girlfriend is. The article is interesting even for me.

Why Are Young People Pretending to Love Work?

The “grind and hustle” gets old pretty quick.

📚 Specifying Systems

The book for learning TLA+ (and, free to download from the link above). I’m reading it right now, step by step. You can also get a paperback version from Amazon (affiliate link) but it's kind of expensive.

The Perils and Pleasures of Bartending in Antarctica

It kind of makes sense.

“O Uommibatto”: How the Pre-Raphaelites Became Obsessed with the Wombat

Whomwhat?

Why Don’t People Use Formal Methods?

As you may have realized, I’m interested in formal methods and verification. I’m not the only one, and since I now pay more attention to articles on the subject, I find more articles to share. Hillel is the author of Practical TLA+ (affiliate link), the book that finally got me to write specs.

The 1859 Carrington Event

The idea of this happening “now” is actually scary.

Ask HN: What are your “brain hacks” that help you manage everyday situations?

You may pick up one or two tricks that can be useful.

What if the Placebo Effect Isn’t a Trick?

Here, take this sugar pill. You’ll be cured.

Kafka at Criteo

Slides from Slideshare. The scale is astounding. Note that the engineering blog at Criteo is top notch, but your adblocker is probably going to give you a hard time reading it.

When Rust is safer than Haskell

I’m closer to doing useful stuff in Rust than in Haskell, so it’s always good to know Rust has some nice tricks up its sleeve.

Inductive invariants

More TLA+ goodness, from Lorin Hochstein.

Spark Barcelona Meetup: Speeding up PySpark with Arrow

This Thursday I’m speaking about how PySpark got faster by using Arrow internally. If you are around Barcelona please join us! Note that the slides for this talk are not up yet!

📚 Meditation for Fidgety Skeptics by Dan Harris

(note there are affiliate links in here) This is the follow-up to 10% Happier. MfFS is good, offering a more practical take than the previous one. As books to stand on its own, 10% Happier is better though.

📽 10 tips for failing badly at microservices

This is a very fun talk about what you should do if you want to prevent (in an ironic way) your company from moving to a microservices-based architecture. You may get flashbacks to the Simple Sabotage Field Manual from the CIA.

Newsletter?

I’m considering converting this into a weekly newsletter in addition to a blog post. These days (since RSS went into limbo) most of my regular information comes from several newsletters I’m subscribed to, instead of me going directly to a blog. If this is also your case, subscribe by clicking here and if enough people join I’ll send these every Sunday night or so.

05/05/2018 - Notifications from Spark on an Apple Watch (via IFTTT)

This week I have been working a lot with a relatively large dataset on a Spark shell. It was a graph with 1 billion nodes and 2 billion edges that I wanted to analyse with GraphFrames (the successor of GraphX on Spark). This is quite large: before running the graph algorithms I did some exploratory analysis, and each step took at least 10 minutes. Checking stage/task progress bars or generated analysis plans is only interesting for the first fee... I wanted a way to get a subtle notification when the process finished. This way I could work on something else while the process is doing its thing, and I could come back for the next step as soon as the data is ready.

I have most of my notifications deactivated, though. No emails, Twitter, WhatsApp. Nothing shows on my Mac screen, very few show on my phone or watch, and only a handful are allowed to vibrate (none to make a sound). What could I do to get a notification? Even if I overrode some of my settings, I needed something that either could work cross-device or could make my watch vibrate, since it’s the only device I have always with me.

Well, IFTTT can actually do that. IFTTT is a service to plumb external services with iOS/Android devices, to build workflows. It also has a very handy webhook you can use as a trigger for workflows. And the IFTTT app can send notifications, to the phone or watch. Ticks all the boxes.

To use the webhook (a POST endpoint) from Scala I used a library I had never used before: scalaj-http It seems very convenient for these quick-and-dirty “make a request” in a program that includes no other http library.

When I have some action I expect to run for a while, I’ll finish the command with ; notif(“Process X finished”) This way, when the command finishes my wrist will gently buzz and I’ll know I can go back to work more on it.

It is worth noting that this would also work for a long running bash or sbt command (I’m looking at you, Spark test suite), or compiling boost or anything else that can, basically, run curl against an endpoint at the end of the process.

By the way, to run a spark shell with this library, use

spark-shell --packages org.scalaj:scalaj-http_2.11:2.3.0

Remember that multiple packages are separated by commas in case you also happen to, you know, use GraphFrames.

Oh, and if you happen to want notifications after an sbt task, you can use the sbt-ifttt plugin.

31/08/2017 - Solta

De totes les illes que hi ha prop de Split vam decidir buscar la tranquil·litat de Solta. Hvar i Brac ens van semblar massa turístiques, i per tant massa gent. En una hora de vaixell van arribar a Rogac, a … Continua llegint →

30/08/2017 - Split

Vam arribar de Zadar a Split per la tarda, i ens vam dedicar una estona a posar ordre a la maleta, bàsicament a rentar roba, que ja se’ns havia acabat, i comprar alguna cosa de menjar per aquests tres dies … Continua llegint →

Totes les seccions | Seccions | Tots els articles |

19/01/2023 - Roskilde

18/01/2023 - Castell de Frederiksborg

17/01/2023 - Copenhaguen

16/01/2023 - Dinamarca 2022

06/09/2019 - Trieste

05/09/2019 - Piran

04/09/2019 - Coves de Postojna

31/08/2019 - Llac Bohinj

30/08/2019 - Parc Nacional de Triglav

29/08/2019 - Gorges Vintgar i llac Bled

28/08/2019 - Ljubelj

27/08/2019 - Liubliana

07/08/2019 - Anem a descobrir Eslovènia!

08/07/2019 - 2019#19 Readings of the week

📚 Sprint

📚 Diaspora

📚 Thinking in bets

📚 Set your voice free

🎥 Wardley mapping interview

Newsletter?

07/07/2019 - A (section) of a map of the data engineering space

Problem I'm trying to solve

Looking for axis metrics

The map

Anchors

Links and colours

Empirical map division

Quick overview

Curiosities

Questions and further ideas

What other approaches would you have taken to explore this questions?

Do you think market size would need to be shown?

Links between related technologies are a bit hazy

Flows

25/06/2019 - 2019#18 Readings of the week

Newsletter?

17/06/2019 - 2019#17 Readings of the week

Newsletter?

10/06/2019 - 2019#16 Readings of the week

Newsletter?

26/05/2019 - 2019#14 Readings of the week

Newsletter?

13/05/2019 - 2019#12 Readings of the week

🎥 ZIO Schedule

🎥 Pure functional programming in Excel

🎥 Gradual typing of production applications

🎥 Effects as Data

Newsletter?

19/02/2019 - Apache Hive and java.lang.ClassCastException on start

27/01/2019 - 2019-3 Readings of the week

Newsletter?

05/05/2018 - Notifications from Spark on an Apple Watch (via IFTTT)

31/08/2017 - Solta

30/08/2017 - Split

Últimament llegeixo...

Núvol d'etiquetes

Últims posts

Jo col·laboro