Test Information Space

8903 bookmarks
Custom sorting
Inside China's AI Ecosystem: A View From Beijing
Inside China's AI Ecosystem: A View From Beijing
In this episode, we explore the Chinese AI ecosystem with 'L-squared,' an anonymous tech worker based in Beijing. We discuss major players, model quality, public engagement, regulation, and the US 'chip ban.' Discover the similarities and differences between US and Chinese AI landscapes, and gain a nuanced perspective on the current state of AI in China.
·youtube.com·
Inside China's AI Ecosystem: A View From Beijing
OpenEQA: From word models to world models
OpenEQA: From word models to world models
OpenEQA combines challenging open-vocabulary questions with the ability to answer in natural language. This results in a straightforward benchmark that demonstrates a strong understanding of the environment—and poses a considerable challenge to current foundational models. We hope this work motivates additional research into helping AI understand and communicate about the world it sees.
·ai.meta.com·
OpenEQA: From word models to world models
How LLMs Can Boost Legal Productivity (with Accuracy and Privacy)
How LLMs Can Boost Legal Productivity (with Accuracy and Privacy)
Generative AI is significantly advancing modern legal technology, and we see two primary groups using it to build new solutions. The first, unsurprisingly, is the legal tech market, which is estimated to exceed $45 billion by 2030. Legal tech firms can give solo lawyers and small firms the technological leverage they need to compete against their larger counterparts. The second user group is large legal firms. They have the resources needed to deploy generative AI solutions themselves.
·txt.cohere.com·
How LLMs Can Boost Legal Productivity (with Accuracy and Privacy)
Benchmarking the leading AI chat experience | You.com
Benchmarking the leading AI chat experience | You.com

In February 2024, You.com conducted a benchmarking study to evaluate the performance of its AI chat experience compared to competitors. You.com partnered with an independent vendor, Invisible Technologies, where independent evaluators rated responses from eight AI models, including free and paid offerings, across five criteria using a set of 120 representative user queries.

YouPro Modes, the premium offerings from You.com, outperformed ChatGPT 4 and Perplexity Pro in overall user preference. YouPro Modes also scored higher on comprehensiveness, factual accuracy, and faithfulness to the prompt’s intent. You.com’s free Smart Mode was the top-performing free model, beating ChatGPT 3.5 and Perplexity in overall user preference as well as accuracy and clarity.

·about.you.com·
Benchmarking the leading AI chat experience | You.com
Which other authors appear to be the most influential to the answerer in the QUORA CONTENT?
Which other authors appear to be the most influential to the answerer in the QUORA CONTENT?

Was able to upload and interrogate the pair of HTML files downloaded for a combined 12.3 MB from Quora content. The apps may eventually be more integrated. At least, chatbots have benefitted elsewhere from editors or notebooks. Although it did think the voice was more Gen Xer than Boomer. Incidentally, Google AI Studio also offers access to Gemini 1.5. However, neither could parse YouTube links like the Gemini Pro/Ultra site.

·poe.com·
Which other authors appear to be the most influential to the answerer in the QUORA CONTENT?
AI to Make Originalist Historical Analysis Easier, US Judge Says
AI to Make Originalist Historical Analysis Easier, US Judge Says
“Perhaps judges may rely on AI for assistance—a form of expert opinion if you will,” Bush said. “For instance, one could argue that it would be permissible for a judge to use AI analysis of statistical probability that a word or phrase had a particular sense or meaning in a particular historical period.”
·news.bloomberglaw.com·
AI to Make Originalist Historical Analysis Easier, US Judge Says
In a future with more ‘mind reading,’ thanks to neurotech, we may need to rethink freedom of thought
In a future with more ‘mind reading,’ thanks to neurotech, we may need to rethink freedom of thought
But one thing is certain: With or without neurotech, our control over our own minds is already less absolute than many of us like to think.
But one thing is certain: With or without neurotech, our control over our own minds is already less absolute than many of us like to think.
·theconversation.com·
In a future with more ‘mind reading,’ thanks to neurotech, we may need to rethink freedom of thought
Google releases ‘prompting guide’ with tips for Gemini in Workspace
Google releases ‘prompting guide’ with tips for Gemini in Workspace
Coming in at 45 pages, there are example personas and prompts that go through refinements for: Customer service, Executives and entrepreneurs, Human resources, Marketing , Project management, Sales. Ultimately, Google says to review outputs for “clarity, relevance, and accuracy” before using it.
download the guide here
·9to5google.com·
Google releases ‘prompting guide’ with tips for Gemini in Workspace
Symbolica hopes to head off the AI arms race by betting on symbolic models | TechCrunch
Symbolica hopes to head off the AI arms race by betting on symbolic models | TechCrunch
In a memo this year, two executives at TSMC, the semiconductor fabricator, said that, if the AI trend continues at its current pace, the industry will need a 1-trillion-transistor chip — a chip containing 10x as many transistors as the average chip today — within a decade.
Symbolica AI
·techcrunch.com·
Symbolica hopes to head off the AI arms race by betting on symbolic models | TechCrunch
Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.
Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.

Long contexts represent a new front in the struggle to control LLMs. We explored a family of attacks that are newly feasible due to longer context lengths, as well as candidate mitigations. We found that the effectiveness of attacks, and of in-context learning more generally, could be characterized by simple power laws. This provides a richer source of feedback for mitigating long-context attacks than the standard approach of measuring frequency of success

·www-cdn.anthropic.com·
Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.