TechNews City

Showing posts with label openai. Show all posts

AI News Roundup: Luma AI's Dream Machine, Apple AI, Leonardo's New Model, Pika Upgrades their Image to Video model and MORE...

AI Weekly Roundup: Exciting Developments in Image and Video Generation

This week saw a flurry of activity in the AI world, with several new tools and updates released for image and video generation. Let's dive into the most noteworthy developments:

Luma AI Launches Dream Machine

Luma AI released Dream Machine, their competitor to AI video tools like Sora, Veo, Pika, and Runway. While some claim it's on par with Sora, initial testing suggests it's not quite there yet in most scenarios. However, it does excel in certain areas.

Key points about Dream Machine:

- Initially had long queue times (up to 7 hours) due to high demand

- Performs better with image-to-video rather than text-to-video

- Generates 5-second videos

- Excels at flyover shots and time-lapses

- Pricing: 30 free generations per month, then about $0.25 per video

Pika and Stability AI Updates

Pika also updated their image-to-video model, though details on the improvements are scarce. Meanwhile, Stability AI finally released the weights for Stable Diffusion 3, making it available for download on Hugging Face. Initial tests show improvements in text generation within images, but it still requires detailed prompts for optimal results.

Leonardo Unveils Phoenix Model

Leonardo introduced their new custom AI model, Phoenix. This foundational model, trained from the ground up, offers:

- Enhanced prompt adherence

- Coherent text in images

- Superior image quality

- More creative control

Early tests suggest Phoenix may outperform Stable Diffusion XL in some aspects.

Midjourney Introduces Model Personalization

Midjourney launched a new feature called Model Personalization. By ranking at least 200 images, users can train the AI to generate images more aligned with their preferences.

Google's Gen Type Tool

Google Labs released Gen Type, a tool that creates letters in various styles based on user prompts. This free tool offers similar functionality to Adobe Firefly's text features.

Suno Updates Audio Generation

Suno, the AI music generation tool, now allows users to upload or record audio snippets and extend them into full songs. This feature is available for Suno Premium users.

Apple's AI Integration

At WWDC, Apple announced extensive AI integration across its ecosystem:

- On-device AI for iOS, iPadOS, and macOS

- AI-powered features in Notes and Calculator apps

- Email inbox overhaul with AI-generated summaries

- Smart replies and notification summaries

- Image Playground for generating illustrations and animations

- Gen Emoji feature for creating custom emojis

- Enhanced photo editing and searching capabilities

- Improved Siri with typing option and context awareness

Other Notable News

- Adobe is revising its terms of service to clarify they won't train AI on customers' work without permission.

- Apple briefly became the world's most valuable company following the WWDC announcements.

- Elon Musk expressed concerns about Apple's integration with OpenAI, though his claims were later clarified as misrepresentations.

- OpenAI hired two new C-level executives: Sarah Friar as CFO and Kevin Weil as CPO.

- Microsoft is removing custom GPTs from Copilot Pro due to low usage.

- A new open-source AI model, Qwen2, outperforms Llama 3 in various benchmarks.

- A photographer was disqualified from an AI image contest after winning with a real photo, highlighting the ongoing debate between AI-generated and human-created art.

As AI tools continue to evolve and become more accessible, creators have an expanding array of options for generating images, videos, and music. Stay tuned for more exciting developments in the world of AI.

-----------
Author: Trevor Kingsley
Tech News CITY // New York Newsroom

technology news

The Latest News in AI's Evolution: 6 Recent Stories or Announcements You Need to Stay Up To Date...

Like every week, there has been a torrent of AI news, reflecting the rapid progress and growing implications of this transformative technology. While many updates represent incremental advancements, some developments carry profound future implications, and a few are simply downright bizarre and amusing.

The AI Landscape's Overwhelming Complexity

An infographic from First Mark Capital vividly illustrates the staggering complexity of the current AI landscape. Dubbed the "2024 ML/AI/Data Landscape," the image depicts the sheer number of companies involved in this space, encompassing both established giants and numerous smaller players. This visual representation serves as a stark reminder of just how monumental and widespread the AI revolution has become.

Microsoft and OpenAI's Ambitious Data Center Plans

Unconfirmed reports suggest that Microsoft and OpenAI are planning a $100 billion data center project, which would be a staggering 100 times more costly than some of the largest existing data centers. The proposed facility would house an artificial intelligence supercomputer dubbed "Stargate." If realized, this endeavor could propel OpenAI and Microsoft to an unprecedented lead, making it challenging for other companies or open-source models to catch up.

OpenAI's Synthetic Voice Capabilities

OpenAI has unveiled its ability to generate realistic synthetic voices from a single 15-second audio sample. The quality of these AI-generated voices surpasses even the impressive capabilities of tools like Elevenlabs. However, while showcasing this remarkable feat, OpenAI has refrained from making the technology publicly available due to potential misuse concerns. The company is advocating for measures to protect individuals' voices, educate the public about AI-generated content, and develop techniques to track the origin of audiovisual media.

Advancements in AI Art and Music Generation

Several developments in AI-powered art and music generation have emerged. OpenAI has introduced an inpainting feature for its DALL-E model, allowing users to selectively modify specific areas of generated images. Stability AI has unveiled Stable Audio 2.0, enabling the generation of three-minute songs and audio-to-audio generation based on hummed or instrument sounds. However, the quality of AI-generated music remains a subject of debate, with a group of musicians, including Nicki Minaj and Katy Perry, signing a letter expressing concerns about the irresponsible use of AI in music.

Anthropic's Research and Apple's AI Ambitions

Anthropic researchers have discovered that repeatedly asking harmless questions to large language models can eventually lead them to provide potentially harmful information, a phenomenon they are actively investigating. Meanwhile, Apple appears to be deepening its involvement in AI, revealing the "Realm" language model designed to enhance voice assistants like Siri by improving context understanding and reference resolution.

Ethical Concerns and Regulatory Developments

Ethical and regulatory issues surrounding AI continue to surface. A court in Washington has banned the use of AI-enhanced video evidence, citing concerns about the potential for inaccuracies introduced by upscaling algorithms. Additionally, the company behind the AI-generated George Carlin standup comedy set has agreed to remove all related audio and video content following a settlement with Carlin's estate.

Bizarre and Amusing AI Applications

Among the more unusual AI developments, an autonomous electric scooter called the Ola Solo has been introduced in India, claiming to be the first fully self-driving scooter. In Phoenix, Waymo vehicles are now delivering Uber Eats orders, allowing customers to retrieve their food from self-driving cars. Furthermore, an upcoming season of the Netflix reality show "The Circle" will feature an AI catfish participant, adding an intriguing twist to the dating-focused premise.

As the AI news cycle continues to accelerate, it becomes increasingly evident that we are witnessing a technological revolution of unprecedented scale and impact. Stay tuned for more developments, insights, and discussions as we collectively navigate the challenges and opportunities of this "next wave" of innovation.

-----------
Author: Trevor Kingsley
Tech News CITY // New York Newsroom

tech news

AI Companies Are Breaking Their Promises - Public Safety an "Afterthought" in Race To Build More Powerful AI Models...

A new report finds that big tech companies are falling short when it comes to keeping their promises around developing artificial intelligence (AI) responsibly. Researchers at Stanford University looked into how companies that have published ethics rules and hired experts are putting those principles into practice.

What they found is concerning. Even with all the talk of "AI ethics," many companies still prioritize performance and profits over safety and responsibility when building new AI products.

The Stanford Team Interviewed 25 People Working Inside the Current Top AI Companies...

These employees said they lack support and are isolated from other teams. Product managers often see them as hurting productivity or slowing down product releases. One person said "being very loud about putting more brakes on [AI development] was a risky thing to do."

Governments, academics and the public have raised worries about issues like privacy, bias, and AI's impacts on jobs and society. Tools like chatbots are advancing very quickly, with new releases from companies like Google and OpenAI.

Promises to Develop AI Responsibly Seems to Have Been Empty Words, Meant To Calm Public Concern...

Employees within the AI companies say ethical considerations are an afterthought, happening "too late, if at all" - instead, they're told to focus on the numbers, such as user engagement and AI performance. These are the metrics that dominate decision-making, rather than equally important measures around fairness or social good.

In short, despite public commitments to ethics, tech companies are deprioritizing real accountability as they race to build the latest, most advanced artificial intelligence.

Companies Focus on Winning the Race to Release the 'Most Powerful AI' of the Moment, then Learn What it Is Capable Of...

Instead, AI development should be guided by a clear understanding of what the AI their building can and should be able to do, rather than focusing solely on maximizing profits or building the most powerful version.

There's no downplaying the massive challenge for-profit AI companies face as they need to consider innovation, profitability, and ethics - falling short in any of these categories greatly increases the odds that a company will not survive.

It is vital that the AI industry understands it must function differently than any other segment of the tech industry, with investor satisfaction no longer the top priority. This may actually be a fairly simple change to implement, they just need to educate their investors. Informed investors will actually demand that public safety come first, as all would regret funding a company that, for example, triggered a global forced internet shut-down because that was the only way to stop their creation from self-replicating and spreading, or worse.

-----------
Author: Trevor Kingsley
Tech News CITY // New York Newsroom

tech news

Google's Response To OpenAI and ChatGPT is Coming, And They've Named It...

The increasing number of users of ChatGPT made it necessary for other tech giants, such as Google, to develop their very own conversational AI tools in order to maintain their position as leaders in the AI race, and keep their investors interested.

We now have the first details on the product Google refers to as "Bard" and will offer in the near future.

It's Official - Google Bard Does Exist...

In a post on the company's official blog, Google's CEO Sundar Pichai revealed that the Language Model for Dialogue Applications, abbreviated as LaMDA, from which Bard is derived is a more lightweight version than the original.

This is done to ensure that it does not demand a significant amount of computational power in order to allow for a greater number of people to use it. Pichai also claims that work on Bard started quite some time ago, and that the company is now making the application available to "trusted testers" so that they can evaluate it.

As Google describes it, Bard is an experimental conversational AI tool that, similar to Google Search, may be used to obtain information on a wide variety of subjects. The distinction is that rather than just providing a list of article links, more nuanced answers are anticipated. This is extremely similar to the way in which Microsoft intends to include ChatGPT into Bing.

The Bard AI tool will pull material from the internet and then, by utilizing the LaMDA powers, it will give 'new, high-quality responses.' They may also include explanatory articles on more complicated subjects. "Bard can be an outlet for creativity and a launchpad for curiosity," said Google executive Sundar Pichai. "It can help you explain new discoveries from NASA's James Webb Space Telescope to a 9-year-old, or learn more about the best strikers in football right now, and then get drills to build your skills."

Much is still unknown...

Google also hints that it will offer AI-powered features in Search (most likely by integrating Bard into Search), which would enable access to information about the question in a format that is easy to comprehend and will also contain multiple points of view. In the blog post, there is a screenshot that demonstrates how the artificial intelligence responds to the question "which is easier to learn, a guitar or piano?" by providing thorough information of its own before listing articles.

We don't know very lot about Bard at this point, and we are waiting for Google to provide us with accurate details. In the following weeks, it is anticipated that it will reach a greater number of people, and until that happens, Google will use feedback from both the outside and inside to make sure that Bard is a safe and high-quality product.

It is yet unknown how effectively it will compete with ChatGPT and how the availability component will operate. A commercial version of ChatGPT with additional functionality is also available in addition to the free version.

-----------
Author: Alex Benningram
Tech News CITY // New York Newsroom

Subscribe to: Posts ( Atom )

The Most/Recent Articles