ben_w 2 days ago

Given what happened with DeepSeek, "not state of the art" can still be simultaneously really close to the top, very sudden, very cheap, and from one small private firm.

1
rafaelmn 2 days ago

Not really with the EU data sources disclosure mindset, GDPR and all that. China has a leg up in the data game because they care about copyright/privacy and IP even less than US companies. EU is supposedly booting US companies because of this.

ben_w 2 days ago

The data sources is kinda what this court case is about, and even here on HN a lot of people get very annoyed by the application of the "open source" label to model weights that don't have the source disclosure the EU calls for.

GDPR is about personally identifiable humans. I'm not sure how critical that information really is to these models, though given the difficulty of deleting it from a trained model when found, yes I agree it poses a huge practical problem.

rafaelmn 1 day ago

> and even here on HN a lot of people get very annoyed by the application of the "open source" label to model weights that don't have the source disclosure the EU calls for.

That's because they are obviously trained on copyrighted content but nobody wants to admit it openly because that opens them to even more legal trouble. Meanwhile China has no problem violating copyright or IP so they will gladly gobble up whatever they can.

I don't think you can really compete in this space with the EU mindset, US is playing it smart and leaving this to play out before regulating. This is why EU is not the place for these kinds of innovations, the bureaucrats and the people aren't willing to tolerate disruption.