Hot
text
<p>An assistant that enhances human intelligence. Our four main features: <br>① 🤖 Support for more than 100 AI bots. You can invoke AI commands with @, the same way you would in other IM software; it is as convenient as @-mentioning someone in a group chat. We currently support more than 100 AI bots, covering topics such as education, writing, and entertainment. For example, you can ask a tarot card master to read today's fortune without having to learn and set up a tedious AI prompt. <br>② 🌅 Support for text-to-image models. We support not only text models but also text-to-image models. To reduce your learning cost, we preset some image styles by default, so you can generate attractive results without entering long commands. <br>③ 💬 Message quoting. Although AI supports contextual chat, we found that it is sometimes very difficult to make the AI understand references such as "this" or "the above" accurately. That is why PoleStar Chat provides a quoting function, which lets you give commands to the AI precisely. You can also use quoting to link bots together. <br>④ 🔖 Multiple result displays. AI technology is still at an early stage and its output is not yet very stable, so it is often necessary to ask the AI to regenerate a response. Most products on the market handle retries by overwriting or resending: overwriting loses the historical data, and resending disrupts the context. To solve this problem, PoleStar Chat displays multiple results in the same bubble for easy comparison.</p>
Canva is an online design platform, founded in 2013, that focuses on being simple, convenient, and quick to get started with. Users can choose different design modes according to their needs and design layouts, backgrounds, and text from templates. Users can also search Canva's online resource library, which includes images, icons, charts, and other materials.
image
ComfyUI is a powerful and modular Stable Diffusion GUI with a graph/nodes interface. It allows users to design and execute advanced Stable Diffusion pipelines with a flowchart-based interface. It supports SD1.x and SD2.x and offers many optimizations, such as re-executing only the parts of the workflow that change between executions. It also supports loading checkpoints and safetensors models, and various upscaling models (ESRGAN, ESRGAN variants, SwinIR, Swin2SR, etc.). It can also save/load workflows as JSON files, and generate and load full workflows from PNG files.
A new way to quickly generate 3D models from text. Normally, it can take multiple hours to generate one 3D model from text, but this new method can generate a 3D model in only 1-2 minutes. To do this, it first generates a picture from the text then uses a different model to turn the picture into a 3D point cloud. The 3D models produced with this method are not as good as the ones made with the normal method, but it is much faster.
DeepFaceLab is a tool that uses deep learning to swap faces in images and videos. In specific situations, it can produce very realistic and natural face-swap videos. DeepFaceLab is one of the easiest, most convenient, and fastest-to-install tools of its kind.
Shap-E is designed to create 3D objects that are dependent on specific text or images. It utilizes advanced technology to generate 3D objects based on user input and can produce a wide variety of shapes and designs. This tool is particularly useful for designers, artists, and architects who require complex 3D models for their work. It can streamline the creative process and save a significant amount of time and effort in the design and production of 3D objects. With Shap-E, users can generate high-quality 3D models that are conditioned on specific text or images, making it an invaluable tool for various industries.
InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac, and Linux machines, and on GPU cards with as little as 4 GB of RAM.
You can use trained face models to swap faces from a webcam or in a video. The DeepFaceLive application also includes a Face Animator module, which lets you use your own face from a video or camera to control a static face image. The quality is not the best, and each pair of faces requires precise face matching and parameter tuning, but it is good enough for fun videos and memes, or for real-time streaming at 25 fps on a 35 TFLOPS GPU.
Canva is an online design platform, founded in 2013, that focuses on being simple, convenient, and quick to get started with. Users can choose different design modes according to their needs and design layouts, backgrounds, and text from templates. Users can also search Canva's online resource library, which includes images, icons, charts, and other materials.
video
Runway is a next-generation content creation suite that provides powerful AI tools and real-time collaboration to help users create content faster. It offers a range of features including automated background removal, text to image and 3D texture generation, erase and replace, motion tracking, green screen masking, noise removal, and more. It also has a library of professionally crafted templates, animations, effects, and filters. Additionally, Runway provides secure collaboration and user management tools, asset hub, and audio editing capabilities.
D-ID is a key building block of the Generative AI ecosystem. Our deep-learning technology enables users to create videos featuring talking avatars. With a self-service studio and an API, content creators from all industries can bring still images of faces to life by animating them to match any written or recorded script, with super-fast streaming-ready 60FPS rendering.
DeepFaceLab is a tool that uses deep learning to swap faces in images and videos. In specific situations, it can produce very realistic and natural face-swap videos. DeepFaceLab is one of the easiest, most convenient, and fastest-to-install tools of its kind.
VEED is an online video editing platform that helps users create professional-quality videos quickly and easily. It offers a range of tools, including a video editor, a screen recorder, and subtitle and transcription features. It also provides resources, such as video tutorials and templates, to help users get started.
You can use trained face models to swap faces from a webcam or in a video. The DeepFaceLive application also includes a Face Animator module, which lets you use your own face from a video or camera to control a static face image. The quality is not the best, and each pair of faces requires precise face matching and parameter tuning, but it is good enough for fun videos and memes, or for real-time streaming at 25 fps on a 35 TFLOPS GPU.
Kapwing is an AI-powered video editing tool that provides creative features to help users create videos faster and better. Features such as Smart Cut, remove video background, AI-generated voice narration, and automatic subtitles can help users supercharge their content creation workflow. Kapwing also offers free templates and paid plans with additional features, storage, and support.
D-ID uses generative AI to create customized videos featuring talking avatars at the touch of a button for businesses and creators. The Creative Reality Studio uses the latest AI tools to generate talking avatars from images, audio, or text. Additionally, the Live Portrait product creates videos from photos, and the Speaking Portrait product creates talking-head videos from text or audio.
Kaiber.ai is an amazing tool that allows you to animate your photos and bring your memories to life. This incredible technology uses artificial intelligence to bring your photos to life in a unique and exciting way. With just a few simple steps, you can turn your still photos into dynamic moving masterpieces. Kaiber.ai makes it easy to relive your favorite moments and keep your memories forever. Whether you're looking to create a special video to share on social media, or just want to keep your memories in a fun and innovative way, Kaiber.ai is the perfect tool for you.
audio
Polymath uses machine learning to convert any music library (e.g. from a hard drive or YouTube) into a music production sample library. The tool automatically separates songs into stems (beats, bass, etc.), quantizes them to the same tempo and beat grid (e.g. 120 bpm), and analyzes musical structure (e.g. verse, chorus, etc.), key (e.g. C4, E3, etc.), and other information (timbre, loudness, etc.). The result is a searchable sample library that streamlines the workflow for music producers, DJs, and ML audio developers.
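The tempo and beat-grid step described above comes down to simple arithmetic, sketched here in Python. This is an illustration only, not Polymath's actual code: the function names `stretch_rate`, `stretched_duration`, and `snap_to_grid` are hypothetical, and a real tool would apply the resulting rate with an audio time-stretching library.

```python
def stretch_rate(source_bpm: float, target_bpm: float) -> float:
    """Playback-rate factor that moves a stem from source_bpm to target_bpm."""
    if source_bpm <= 0 or target_bpm <= 0:
        raise ValueError("tempos must be positive")
    return target_bpm / source_bpm


def stretched_duration(duration_s: float, source_bpm: float, target_bpm: float) -> float:
    """Length of the stem after time-stretching; a faster tempo gives shorter audio."""
    return duration_s / stretch_rate(source_bpm, target_bpm)


def snap_to_grid(onset_s: float, bpm: float) -> float:
    """Snap an onset time (in seconds) to the nearest beat of a grid at `bpm`."""
    beat_s = 60.0 / bpm  # duration of one beat in seconds
    return round(onset_s / beat_s) * beat_s


if __name__ == "__main__":
    # Hypothetical stem: 8 seconds long, recorded at 90 bpm, moved to a 120 bpm grid.
    print(stretch_rate(90, 120))
    print(stretched_duration(8.0, 90, 120))
    print(snap_to_grid(1.1, 120))
```

Under these assumptions, a stem at 90 bpm is played about 1.33 times faster to sit at 120 bpm, and its onsets can then be aligned to half-second beats.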
MIDI-GPT is a MIDI-GPT Fork Repl37 that uses GPT-3.5-turbo and few-shot prompting to generate MIDI files from natural language. It features a calculate() function that uses NumPy to output the mean, variance, standard deviation, max, min, and sum of the rows, columns, and elements in a 3x3 matrix. It also includes a for loop creative project that displays all the numbers from 1-100 except for one number, with the user needing to input the missing number in order to exit the loop. The MIDI export titling has been changed so that the MIDI filename is now set to track_name.
This powerful online voice generator offers an extensive range of 130+ AI voices across different accents and tonalities, so you can easily find the right voice for your videos, presentations, brand commercials, e-learning content, and more. Leveraging advanced AI algorithms and deep learning, Murf's AI voices sound realistic rather than robotic or monotonous. Plus, with Murf's easy-to-use interface, sleek design, and high-end features, you can generate realistic-sounding voice-overs in just minutes. Try Murf today and experience the power of AI-generated speech.
Musiclips is an AI-powered music discovery app that helps users find new songs and create personalized playlists based on their musical preferences. It allows users to integrate their Spotify accounts and swipe right to add songs to their library or left to skip them. The app features a vast library of tracks from various genres and provides tailored recommendations to match users' interests. It also has a simple and intuitive interface that makes it easy to explore new artists and sounds.
Provides two free quick tools to enhance audio for your content. Enhance Speech removes background noise and echo from speech. Mic Check helps you get quality sound from your microphone. The main product, which promises AI-powered audio recording and editing entirely on the web, is currently behind a waitlist.
A tool that allows users to turn text into a song. It uses natural language processing to convert textual input into an audio composition. The tool allows the user to choose from a variety of music styles and instruments, as well as adjust parameters such as tempo, key, and dynamics. The resulting track can be exported as a high-quality audio file.
MuseNet is a deep neural network created by OpenAI that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles. It uses the same general-purpose unsupervised technology as GPT-2, a large-scale transformer model trained to predict the next token in a sequence, whether audio or text. The model is trained on data from MIDI files and can generate samples in a chosen style by starting with a prompt. It uses several embeddings such as positional embeddings, a timing embedding, and structural embeddings to give the model more context.
Epidemic Sound's Soundmatch tool lets users quickly and easily find the right soundtrack for their video project. It uses AI to identify the scenes in a video and generates relevant keywords for a semantic search, which returns recommendations that match the visuals. Other recent product releases from Epidemic Sound include #Vibey playlists, ad blockers for YouTube, and a music license model with pricing.
This free online application helps users remove vocals from a song and create a karaoke version. It uses artificial intelligence to separate the vocals from the instrumental components. Once the song is chosen, processing usually takes 10 seconds. The user will receive two tracks - one with no vocals and one with isolated vocals.
Whisper is an open-source automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It is designed to be robust to accents, background noise, and technical language, and can transcribe speech in multiple languages as well as translate it into English. It takes a simple end-to-end approach, implemented as an encoder-decoder Transformer. It can also perform language identification and produce phrase-level timestamps. It is designed to be easy to use and highly accurate, allowing developers to add voice interfaces to more applications.
TTS Voice Wizard is a tool that enables users to convert their speech to text, and then back to speech, through Microsoft Azure Voice Recognition and TTS. It also sends OSC messages to VRChat to display text on an avatar. The tool has a number of customization options, including 100+ different voices, 20+ supported languages, and the ability to show a song title, artist, and progress above the user.
business
The intended use for the AI Text Classifier is to foster conversation about the distinction between human-written and AI-generated content. The results may help, but should not be the sole piece of evidence, when deciding whether a document was generated with AI. The model is trained on human-written text from a variety of sources, which may not be representative of all kinds of human-written text.
ChatGPT File Uploader Extended is a Google Chrome Extension that allows users to upload and process various file types directly in the ChatGPT interface. It offers support for PDFs, Word documents, Excel spreadsheets, and now even image files. The extension can automatically extract text content from these files and provides configurable chunked processing for handling large files. It also generates conversation prompts based on the file context, offers a user-friendly interface for file selection and progress monitoring, and is compatible with Google Chrome. This extension enhances the file processing workflow in ChatGPT, providing convenience and unlocking new possibilities for users.
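The configurable chunked processing mentioned above can be pictured with a short Python sketch. It is not the extension's own code (which runs in the browser): the names `chunk_text` and `build_prompts`, the default chunk size, and the `[Part n/total]` label format are all assumptions made for illustration.

```python
def chunk_text(text: str, chunk_size: int = 2000) -> list[str]:
    """Split extracted text into consecutive pieces of at most chunk_size characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def build_prompts(text: str, chunk_size: int = 2000) -> list[str]:
    """Label each chunk so the model can tell which part of the file it is reading."""
    chunks = chunk_text(text, chunk_size)
    total = len(chunks)
    return [f"[Part {n}/{total}]\n{chunk}" for n, chunk in enumerate(chunks, start=1)]


if __name__ == "__main__":
    # Hypothetical file content, split into very small chunks for demonstration.
    for prompt in build_prompts("abcdefghij", chunk_size=4):
        print(repr(prompt))
```

Splitting on a fixed character count is the simplest choice; a real implementation might instead split on sentence or paragraph boundaries so that no chunk ends mid-word.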
<p>Notion is an excellent collaborative tool that integrates notes, a knowledge base, and task management. It applies the idea that "everything is an object" to notes, allowing users to create, drag, and link content freely. Its functions also cover project management, wikis, documents, and more. For programmers, it holds even more appeal: Markdown syntax support and code highlighting for dozens of languages are among its attractions compared with other note-taking software.</p>
code
Polymath uses machine learning to convert any music library (e.g. from a hard drive or YouTube) into a music production sample library. The tool automatically separates songs into stems (beats, bass, etc.), quantizes them to the same tempo and beat grid (e.g. 120 bpm), and analyzes musical structure (e.g. verse, chorus, etc.), key (e.g. C4, E3, etc.), and other information (timbre, loudness, etc.). The result is a searchable sample library that streamlines the workflow for music producers, DJs, and ML audio developers.
GPT Engineer is a code-generation tool that enables users to specify what they want to build, and the AI will ask for clarification and then build it. It is designed to be simple and easy to adapt, extend, and make the agent learn how users want their code to look. It generates an entire codebase based on a prompt and comes with features such as identity customization, fast handovers between AI and human, and resumable and persistent computation.
GitHub Copilot is an AI-driven programming assistant that helps developers code faster, focus on solving bigger problems, and stay in the flow longer. It integrates directly into editors such as Neovim, JetBrains IDEs, Visual Studio, and Visual Studio Code, and suggests code and entire functions in real-time. It also shares recommendations based on project context and style conventions.
ChatGPT Demo is built on the structure of ChatGPT-4. We are revolutionizing the way people interact with artificial intelligence. With advanced machine learning algorithms and a flexible design, one of the most important advantages of ChatGPT Demo is that users can use it for free without logging in.
3d
instaVerse is a 3D world creation tool that uses AI to create a playable 3D world with just one click. It produces a 3D environment with realistic terrain, terrain textures, trees, buildings, and other objects that can be used to create a fun and engaging virtual world. The tool also allows users to customize the look of the environment with different lighting, foliage, and textures.
A 3D modelling platform that simplifies the process of creating 3D models. It has a vast library of generators that can be used to create any 3D model, and these models can be quickly customized to fit the user's style. The models are also UV-unwrapped and optimized for real-time use. Sloyd also offers an SDK for real-time generation within game engines. There are also FAQs, a Discord server and other help and support resources.
Infinigen is an open-source procedural generator for creating diverse and high-quality 3D scenes. Developed by Princeton Vision & Learning Lab, it is optimized for computer vision research and generates realistic training data. Infinigen uses randomized mathematical rules to generate unlimited variations of shapes, materials, and details. It offers a wide range of generators for natural objects and scenes, with a focus on accurate geometry rather than faking details. Infinigen also provides automatic annotations for various computer vision tasks and encourages community contribution and expansion of its capabilities.
Common Sense Machines provides APIs, interfaces, and open source software to translate multi-modal inputs and experiences into a digital simulator for AI training and content creation. We believe that learning generative world models is a systematic path towards achieving AGI, similar to how a child learns about its world from experience.
Movmi is free motion capture software that supports many lifestyle locations and different camera devices. It was developed to make human animation easier for animators and game developers, so they do not need to spend a lot of time animating a single character. It is built on advanced computer vision algorithms to handle different humanoid scenes, and it supports face and body capture of any person. The Movmi store supplies animators with art materials for their work: it contains a collection of free 3D characters and animations, and captured motion can be applied to any Movmi character.
Vossle is a cloud-based SaaS platform for businesses & agencies to create Web-based augmented reality experiences. Reach millions of users instantly with App-less Augmented Reality (WebAR) Experience that works on every modern smartphone browser the moment you publish! No app installs required!
other
Infinigen is an open-source procedural generator for creating diverse and high-quality 3D scenes. Developed by Princeton Vision & Learning Lab, it is optimized for computer vision research and generates realistic training data. Infinigen uses randomized mathematical rules to generate unlimited variations of shapes, materials, and details. It offers a wide range of generators for natural objects and scenes, with a focus on accurate geometry rather than faking details. Infinigen also provides automatic annotations for various computer vision tasks and encourages community contribution and expansion of its capabilities.
GPT Researcher is an autonomous agent designed to perform comprehensive online research on a variety of tasks. It uses GPT-4 language models to generate research questions, trigger a crawler agent to scrape online resources for information, summarize based on relevant information, filter and aggregate the information, and generate a research report. It includes an easy-to-use web interface and can export research reports to PDF.
You can use trained face models to swap faces from a webcam or in a video. The DeepFaceLive application also includes a Face Animator module, which lets you use your own face from a video or camera to control a static face image. The quality is not the best, and each pair of faces requires precise face matching and parameter tuning, but it is good enough for fun videos and memes, or for real-time streaming at 25 fps on a 35 TFLOPS GPU.