
  • The Turing test is flawed: while it is supposed to test for intelligence, it really just tests for a convincing fake. Depending on how you set it up, I wouldn’t be surprised if a modern LLM could pass it, at least some of the time. That doesn’t mean they are intelligent (they aren’t), but I don’t think the Turing test is good justification.

    For me, the only justification you need is that they predict one word (or even letter!) at a time. ChatGPT doesn’t plan a whole sentence out in advance; it works token by token… The input to each prediction is just everything so far, up to the last word. When it starts writing “As…” it has no concept of the fact that it’s going to write “…an AI language model” until it gets through those words.
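    To make that concrete, here’s a minimal sketch of autoregressive decoding (using GPT-2 via Hugging Face’s transformers library purely as an illustration; any causal LM works the same way). The model scores every possible next token given only the text so far; one token gets picked and appended, and the loop repeats:

    ```python
    # Minimal greedy decoding loop: each step sees only the tokens so far
    # and predicts exactly one next token. GPT-2 is just a stand-in here.
    import torch
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    model.eval()

    input_ids = tokenizer("As", return_tensors="pt").input_ids

    with torch.no_grad():
        for _ in range(10):
            logits = model(input_ids).logits   # scores for every vocab token at every position
            next_id = logits[0, -1].argmax()   # greedy pick for the last position only
            input_ids = torch.cat([input_ids, next_id.view(1, 1)], dim=1)

    print(tokenizer.decode(input_ids[0]))
    ```

    Note there’s no lookahead anywhere in that loop: the choice of token 3 can’t take into account what token 7 will end up being.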

    Frankly, given that fact, it’s amazing that LLMs can be as powerful as they are. They don’t check anything, think about their answer, or even consider how to phrase a sentence. Everything they do comes from predicting the next token… An incredible piece of technology, despite its obvious flaws.

  • S24 Ultra. I have one for a similar reason, although I also really like a lot of its other features. While you can certainly get good photos with other phones, it is among the best on the market.

    I was considering a £500-600 DSLR like I’ve had in the past, but ultimately I like to take photos when an opportunity arises, not just at the times I happen to have my expensive camera on me. If you take a lot of photos but you aren’t a professional, the best thing is a high-end phone. Doubly so because, unless you’re very experienced at setting up the camera correctly for the conditions, your phone camera is almost certainly going to do a better job than you would on a manual camera.

    So, in the end, rather than getting a cheaper phone and a camera, I combined the two. I know Samsung suck in a lot of ways, but when it comes to actually using my phone, they’re excellent compared to other brands I’ve tried.

  • AI models don’t resynthesize their training data. They use their training data to determine parameters which enable them to predict a response to an input.

    Consider a simple model (too simple to be called AI, but the underlying concepts are really very similar) - a linear regression. In linear regression we produce a model which follows a straight line through the “middle” of our training data. We can then use this to predict values outside the range of the original data - albeit with less certainty about the likely error.
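    As a toy illustration of that analogy (the numbers here are made up), fitting the line gives you two learned parameters, and those same parameters happily produce predictions well outside the range they were fitted on:

    ```python
    # Fit y = a*x + b on data in the range [0, 10], then extrapolate.
    import numpy as np

    rng = np.random.default_rng(0)
    x_train = np.linspace(0, 10, 50)
    y_train = 2.0 * x_train + 1.0 + rng.normal(0, 1.0, size=x_train.shape)

    slope, intercept = np.polyfit(x_train, y_train, deg=1)  # the "learned parameters"

    # Prediction at a point far outside the training range:
    x_new = 25.0
    print(slope * x_new + intercept)  # ~51, but with more expected error than inside [0, 10]
    ```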

    In the same way, an LLM can give answers to questions that were never asked in its training data. It’s not taking that data and shuffling it around; it’s synthesising an answer by predicting tokens. Also similarly, it does this less well the further outside the training data you go. Feed one the right gibberish and it doesn’t know how to respond. ChatGPT is very good at dealing with nonsense, but if you’ve ever worked with simpler LLMs you’ll know that typos can throw them off notably… They still respond OK, but things get weirder as they go.
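    One way to see where that fragility comes from, sketched with GPT-2’s tokenizer (any BPE tokenizer shows the same effect): a misspelling typically shatters a common word into several rarer subword tokens, pushing the input away from patterns the model saw often in training:

    ```python
    # A common word is usually one familiar token; its typo is several odd ones.
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    print(tokenizer.tokenize(" language"))  # typically a single token
    print(tokenizer.tokenize(" langauge"))  # the typo splits into multiple subword pieces
    ```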

    Now it’s certainly true that (at least some) models were trained on CSAM, but it’s also definitely possible that a model that wasn’t could still produce sexual content featuring children. Its training set need only contain enough disparate elements for it to correctly predict what the prompt is asking for. For example, if the training set contained images of children it will “know” what children look like, and if it contains pornography it will “know” what pornography looks like - conceivably it could mix these two together to produce generated CSAM. If I had to guess, the result would probably look odd: like LLMs struggling with typos, and regression models being unreliable outside their training range, image generation of something totally outside the training set is going to be a bit weird, but it will still work.

    None of this is to defend generating AI CSAM, to be clear, just to say that it is possible to generate things that a model hasn’t “seen”.