| | id | status | question | skyvern link | skyvern summary | skyvern output | workflow_run_id |
|---|---|---|---|---|---|---|---|
| 384 | GitHub--0 | completed | Search for an open-source project related to 'climate change data visualization' on GitHub and report the project with the most stars. | https://eval.skyvern.com/wpid_348594987306512010/wr_348594987306512012/overview | The AI assistant successfully identified the open-source project related to 'climate change data visualization' with the most stars on GitHub. The project is 'resource-watch/resource-watch' with 71 stars. | {'top_project': {'repository_name': 'resource-watch/resource-watch', 'stars': 71}} | wr_348594987306512012 |
| 385 | GitHub--1 | completed | Search for an open-source repository for machine learning in Python, specifically focused on decision trees, updated within the last 2 days. | https://eval.skyvern.com/wpid_348595016094780354/wr_348595016094780356/overview | The user searched for open-source repositories related to machine learning in Python, specifically focused on decision trees, and updated within the last 2 days. Two repositories matching the criteria were identified: 'catboost/catboost' and 'microsoft/LightGBM'. Both repositories are relevant to decision trees, with 'catboost/catboost' being implemented in Python and updated most recently. | {'repositories': [{'repository_name': 'catboost/catboost', 'description': 'A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks.', 'programming_language': 'Python', 'stars': 8200, 'last_updated': '2025-01-14T11:44:00Z'}, {'repository_name': 'microsoft/LightGBM', 'description': 'A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms.', 'programming_language': 'C++', 'stars': 16900, 'last_updated': '2025-01-13T00:29:00Z'}]} | wr_348595016094780356 |
| 386 | GitHub--2 | failed | Look for the trending Python repositories on GitHub with most stars. | https://eval.skyvern.com/wpid_348595037569616852/wr_348595037569616854/overview | The AI assistant successfully identified and extracted a list of trending Python repositories on GitHub along with their respective star counts. This information helps the user identify the most popular repositories based on stars. | {'trending_python_repositories': [{'repository_name': 'doforce/github-trending', 'stars': 56}, {'repository_name': 'manojkarthick/git-trend', 'stars': 53}, {'repository_name': 'Call-for-Code/Prometeo-Pyrrha', 'stars': 27}, {'repository_name': 'taranjeet/github-trending-cli', 'stars': 19}, {'repository_name': 'monk1337/Facebook-Trending-News-with-Python', 'stars': 17}, {'repository_name': 'blurred-machine/RNN-based-Stock-Price-Prediction-using-LSTM', 'stars': 15}, {'repository_name': 'evyatarmeged/github-trending', 'stars': 13}, {'repository_name': 'whitone/awesome-discoveries', 'stars': 8}, {'repository_name': 'mzhangyb/Github-Network-Prediction', 'stars': 5}, {'repository_name': 'codewithrajranjan/scrape-trending-repositories-from-github-using-python', 'stars': 2}]} | wr_348595037569616854 |
| 387 | GitHub--3 | completed | Find out how much more package storage the Enterprise version has over Team in GitHub Pricing. | https://eval.skyvern.com/wpid_348595068910890652/wr_348595068910890654/overview | The user has successfully determined the difference in package storage between the GitHub Enterprise and Team plans. The Enterprise plan offers 50GB of package storage, while the Team plan offers 2GB, resulting in a difference of 48GB. | {'team_package_storage': '2GB', 'enterprise_package_storage': '50GB', 'storage_difference': '48GB'} | wr_348595068910890654 |
| 388 | GitHub--4 | failed | Find a popular JavaScript repository created in the last 30 days on GitHub with a Readme file. | https://eval.skyvern.com/wpid_348595111860563630/wr_348595116155530928/overview | nan | nan | wr_348595116155530928 |
| 389 | GitHub--5 | completed | Find a Python repository on GitHub that has been updated in the past 2 days and has at least 500 stars. | https://eval.skyvern.com/wpid_348595140648831976/wr_348595140648831978/overview | The search for Python repositories on GitHub has been completed. A repository named 'kubernetes-client/python' meets the user goal as it has been updated in the past 2 days (13 hours ago) and has over 500 stars. | {'repository': {'name': 'kubernetes-client/python', 'description': 'Official Python client library for kubernetes', 'stars': 2100, 'last_updated': '2025-01-13T20:00:00Z', 'url': 'https://github.com/kubernetes-client/python'}} | wr_348595140648831978 |
| 390 | GitHub--6 | completed | Search for an open-source project related to 'cryptocurrency wallet' updated in the past 30 days and provide the top three contributors. | https://eval.skyvern.com/wpid_348595187893472250/wr_348595187893472252/overview | The user searched for an open-source project related to 'cryptocurrency wallet' and identified the Valora mobile cryptocurrency wallet repository, which has been updated in the past 30 days. The top three contributors to the repository, along with their usernames and profile links, have been extracted. | {'repository': {'name': 'Valora mobile cryptocurrency wallet', 'last_updated': '2025-01-15', 'top_contributors': [{'username': 'valora-bot-crowdin', 'profile_link': 'https://github.com/valora-bot-crowdin'}, {'username': 'valora-bot', 'profile_link': 'https://github.com/valora-bot'}, {'username': 'MuckT', 'profile_link': 'https://github.com/MuckT'}]}} | wr_348595187893472252 |
| 391 | GitHub--7 | completed | Find the official GitHub repository for ALBERT and show me what files the repo changed in the most recent commit. | https://eval.skyvern.com/wpid_348595210644811458/wr_348595214939778756/overview | The user successfully located the official GitHub repository for ALBERT and reviewed the details of the most recent commit. The commit, made on April 12, 2023, replaced deprecated TensorFlow functionality (tf.contrib.data.map_and_batch) with the updated tf.data.experimental.map_and_batch. Two files were modified in this commit: classifier_utils.py and squad_utils.py, with a total of 2 additions and 4 deletions. | {'repository': 'https://github.com/google-research/albert', 'most_recent_commit': {'commit_id': 'b772393', 'date': '2023-04-12', 'author': 'bert-copybara', 'message': 'Replace deprecated tf.contrib.data.map_and_batch(...) with tf.data.experimental.map_and_batch(...).', 'files_changed': [{'file_name': 'classifier_utils.py', 'changes': {'additions': 1, 'deletions': 2}}, {'file_name': 'squad_utils.py', 'changes': {'additions': 1, 'deletions': 2}}]}} | wr_348595214939778756 |
| 392 | GitHub--8 | completed | Look up the latest stable release version of Vuex and find out when it was published. | https://eval.skyvern.com/wpid_348595243728047116/wr_348595243728047118/overview | The user successfully located the Vuex repository and extracted the latest stable release version (v4.0.2) along with its publication date (June 17, 2021). | {'latest_release': {'version': 'v4.0.2', 'publication_date': '2021-06-17'}} | wr_348595243728047118 |
| 393 | GitHub--9 | completed | Locate a repository on GitHub that was created in the last week and has 50 or more stars. Provide brief details about the project's purpose and its programming language. | https://eval.skyvern.com/wpid_348595266479386324/wr_348595266479386326/overview | The user successfully located repositories on GitHub that were created in the last week and have 50 or more stars. Details about the repositories, including their names, purposes, programming languages, and star counts, were extracted. | {'repositories': [{'name': 'SamuelSchmidgall/AgentLaboratory', 'description': 'Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research.', 'language': 'Python', 'stars': 2723, 'last_updated': '2025-01-13T13:42:00Z'}, {'name': 't3dotgg/stripe-recommendations', 'description': "I've built with Stripe for years. This is how I do it without going mad.", 'language': None, 'stars': 2337, 'last_updated': '2025-01-13T02:12:00Z'}, {'name': 'henrythe9th/AI-Crash-Course', 'description': 'AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks.', 'language': None, 'stars': 1400, 'last_updated': '2025-01-11T08:47:00Z'}, {'name': 'NovaSky-AI/SkyThought', 'description': 'Sky-T1: Train your own O1 preview model within $450.', 'language': 'Python', 'stars': 1334, 'last_updated': '2025-01-14T00:28:00Z'}, {'name': 'David-patrick-chuks/Riona-AI-Agent', 'description': 'Riona 🌸 is built with Node.js and TypeScript 🛠️. Designed to run jobs 📸 effortlessly. Lightweight, efficient, and a work in progress 🚧—more to come.', 'language': 'TypeScript', 'stars': 1128, 'last_updated': '2025-01-14T06:12:00Z'}, {'name': 'PollensAI/Pollens', 'description': 'Buzz Together in Real Time: Your BeeSync for Collaborative Creation and Connection.', 'language': 'TypeScript', 'stars': 932, 'last_updated': '2025-01-09T15:01:00Z'}, {'name': 'hybridgroup/go-haystack', 'description': "Track personal Bluetooth devices via Apple's 'Find My' network using OpenHaystack and Macless-Haystack with tools written in Go/TinyGo.", 'language': 'Go', 'stars': 858, 'last_updated': '2025-01-13T14:46:00Z'}, {'name': 'zhangchenchen/self-consistent-coder', 'description': '如何成为一名自洽的程序员', 'language': 'HTML', 'stars': 766, 'last_updated': '2025-01-13T08:37:00Z'}, {'name': 'paulreapz/lumen_validators_ai', 'description': 'LUMEN: AI-Powered Solana Validator Manager.', 'language': 'Python', 'stars': 500, 'last_updated': '2025-01-09T20:53:00Z'}]} | wr_348595266479386326 |
| 394 | GitHub--10 | completed | If I start using Copilot Individual, how much US dollars will it cost per year and what features does it have? | https://eval.skyvern.com/wpid_348595295267654688/wr_348595299562621986/overview | The user has successfully obtained information about the yearly cost and features of the GitHub Copilot Individual (Pro) plan. The plan costs $120 per year and includes a wide range of features such as unlimited messages and interactions, access to 4 models, context-aware coding support, debugging assistance, and more. | {'plan': 'Pro', 'yearly_cost_usd': 120, 'features': ['Unlimited messages and interactions', 'Access to 4 models', 'Context-aware coding support and explanations', 'Debugging and security remediation assistance', 'Access to knowledge from top open source repositories', 'Generate tests, docs, and more with slash commands', 'Web search powered by Bing', 'Explain failed Actions jobs', 'Answers about issues, PRs, discussions, files, commits, etc.', 'Multi-file editing in VS Code', 'Real-time code suggestions', 'Comments to code', 'Tailor chat conversations to your private codebase', 'Unlimited integrations with GitHub Copilot Extensions', 'Build a private extension for internal tooling', 'Personalize responses with custom instructions', 'Inline chat and prompt suggestions', 'Slash commands and context variables', 'Commit message generation', 'Summaries for pull requests, issues, and discussions', 'Code feedback in VS Code', 'Explanations in Visual Studio’s Quick Info', 'Debug assistant in Visual Studio', 'Upgrade assistant for Java in VS Code', 'Code review in GitHub', 'Copilot Workspace in pull requests', 'Technical Preview access to Copilot Workspace', 'Supported in editors and IDEs', 'Supported on github.com', 'Supported on GitHub Mobile', 'Supported on GitHub CLI and Windows Terminal', 'Public code filter with code referencing']} | wr_348595299562621986 |
| 395 | GitHub--11 | failed | Find a newly created open-source project on GitHub related to 'climate change' that has been initiated in January 2023; check the main programming language used and the project's description. | https://eval.skyvern.com/wpid_348595326608928486/wr_348595326608928488/overview | nan | nan | wr_348595326608928488 |
| 396 | GitHub--12 | completed | Retrieve the latest release from the 'electron/electron' repository on GitHub and note down the release version number and date. | https://eval.skyvern.com/wpid_348595352378732280/wr_348595352378732282/overview | The latest release from the 'electron/electron' repository on GitHub has been retrieved. The release version is 'v34.0.0', and it was published on '2025-01-14T10:10:00Z'. | {'release_version': 'v34.0.0', 'release_date': '2025-01-14T10:10:00Z'} | wr_348595352378732282 |
| 397 | GitHub--13 | completed | Identify the latest top-trending open-source project in the category of 'Machine Learning' on GitHub, and check the number of stars it has received. | https://eval.skyvern.com/wpid_348595381167000626/wr_348595381167000628/overview | The AI assistant identified the latest top-trending open-source project in the 'Machine Learning' category on GitHub based on the most recent update date. The project is 'lazyprogrammer/machine_learning_examples', which has received 8,500 stars and was last updated on 2025-01-05. | {'repository_name': 'lazyprogrammer/machine_learning_examples', 'stars': 8500, 'last_updated': '2025-01-05T08:25:00Z'} | wr_348595381167000628 |
| 398 | GitHub--14 | completed | Locate the repository for the open-source project "vscode" and identify the top three contributors. | https://eval.skyvern.com/wpid_348595408213307146/wr_348595408213307148/overview | The repository for the open-source project 'vscode' has been located on GitHub, and the top three contributors have been identified along with their usernames, profile URLs, and avatar URLs. | {'repository': {'name': 'vscode', 'organization': 'microsoft', 'url': 'https://github.com/microsoft/vscode'}, 'top_contributors': [{'username': 'bpasero', 'profile_url': 'https://github.com/bpasero', 'avatar_url': 'https://avatars.githubusercontent.com/u/900690?s=64&v=4'}, {'username': 'jrieken', 'profile_url': 'https://github.com/jrieken', 'avatar_url': 'https://avatars.githubusercontent.com/u/1794099?s=64&v=4'}, {'username': 'Tyriar', 'profile_url': 'https://github.com/Tyriar', 'avatar_url': 'https://avatars.githubusercontent.com/u/2193314?s=64&v=4'}]} | wr_348595408213307148 |
| 399 | GitHub--15 | completed | Locate a repository on GitHub related to 'quantum computing' that has been updated within the last week and has at least 50 stars. Provide a brief description of the project. | https://eval.skyvern.com/wpid_348595459752914716/wr_348595459752914718/overview | The user successfully located a GitHub repository related to 'quantum computing' that has been updated within the last week and has at least 50 stars. The repository 'PennyLaneAI/pennylane' meets the criteria, with 2445 stars and a recent update on 2025-01-14. It is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. | {'repository': {'name': 'PennyLaneAI/pennylane', 'description': 'PennyLane is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. Train a quantum computer the same way as a neural network.', 'stars': 2445, 'last_updated': '2025-01-14T04:36:00'}} | wr_348595459752914718 |
| 400 | GitHub--16 | completed | Find the GitHub Skill section and how many courses are under the 'First day on GitHub' heading. | https://eval.skyvern.com/wpid_348595515587489590/wr_348595515587489592/overview | The user successfully navigated to the GitHub Skills section and identified the number of courses listed under the 'First day on GitHub' heading. There are 3 courses under this heading. | {'section': 'GitHub Skills', 'heading': 'First day on GitHub', 'numberOfCourses': 3} | wr_348595515587489592 |
| 401 | GitHub--17 | completed | Locate a C++ project on GitHub that has been recently updated in the last week and has at least 500 stars, then describe its main purpose. | https://eval.skyvern.com/wpid_348595570145561668/wr_348595574440528966/overview | The user successfully located multiple C++ projects on GitHub that were updated in the last week and have at least 500 stars. Key details about these repositories, including their names, descriptions, main purposes, star counts, and last updated timestamps, were extracted. | {'repositories': [{'name': 'grpc/grpc', 'description': 'The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)', 'main_purpose': ['C++', 'Python', 'Ruby', 'Objective-C', 'PHP', 'C#'], 'stars': 42300, 'last_updated': '2025-01-14T07:32:00', 'language': 'C++'}, {'name': 'ggerganov/llama.cpp', 'description': 'LLM inference in C/C++', 'main_purpose': ['llama', 'ggml'], 'stars': 70700, 'last_updated': '2025-01-14T09:47:00', 'language': 'C++'}, {'name': 'ggerganov/whisper.cpp', 'description': "Port of OpenAI's Whisper model in C/C++", 'main_purpose': ['inference', 'transformer', 'speech-recognition', 'openai', 'speech-to-text'], 'stars': 36900, 'last_updated': '2025-01-14T03:38:00', 'language': 'C++'}, {'name': 'danmar/cppcheck', 'description': 'static analysis of C/C++ code', 'main_purpose': ['c', 'c-plus-plus', 'cross-platform', 'cpp', 'static-analysis'], 'stars': 5900, 'last_updated': '2025-01-14T08:03:00', 'language': 'C++'}]} | wr_348595574440528966 |
| 402 | GitHub--18 | completed | Identify and report the most popular (in terms of stars) open-source image processing tool on GitHub. | https://eval.skyvern.com/wpid_348595604505300054/wr_348595604505300056/overview | The most popular open-source image processing tool on GitHub has been identified based on the number of stars. The repository 'WZMIAOMIAO/deep-learning-for-image-processing' has the highest star count of 23,800 among the extracted repositories. | {'most_popular_repository': {'repository_name': 'WZMIAOMIAO/deep-learning-for-image-processing', 'star_count': 23800}} | wr_348595604505300056 |
| 403 | GitHub--19 | failed | Look up the most recently updated Python repository on GitHub that is tagged with 'web scraping' and has over 100 stars. | https://eval.skyvern.com/wpid_348595634570071144/wr_348595634570071146/overview | nan | nan | wr_348595634570071146 |
| 404 | GitHub--21 | completed | Find the Security topic in GitHub Resources and answer the role of GitHub Advanced Security. | https://eval.skyvern.com/wpid_348595657321410380/wr_348595657321410382/overview | The user successfully located the Security topic in GitHub Resources and extracted detailed information about the role of GitHub Advanced Security. GitHub Advanced Security is designed to help developers and security teams collaborate to eliminate security debt, prevent new vulnerabilities, and secure software supply chains. It provides features such as AI-powered code scanning, secret scanning, security campaigns, and Copilot Autofix for automated remediation. It integrates seamlessly into workflows with support for over 17,000 app integrations and offers tools for managing software supply chains, generating SBOMs, and addressing zero-day vulnerabilities. | {'role_of_github_advanced_security': {'key_features': ['Helps developers and security teams eliminate security debt and prevent new vulnerabilities.', 'Copilot Autofix provides code suggestions for 90% of alert types in supported languages.', 'Code scanning detects vulnerabilities, provides contextual explanations, and suggests fixes.', 'Secret scanning with push protection guards against over 200 token types and patterns.', 'Security campaigns target and generate autofixes for up to 1,000 alerts at a time.', 'Provides one workflow for the entire toolchain with support for over 17,000 app integrations.', 'Enables secure software supply chain management with automated security and SBOM generation.', 'Offers insights and automation to ship secure software on schedule.'], 'use_cases': ['Detecting and fixing vulnerabilities in code before production.', 'Preventing secret leaks and managing sensitive information.', 'Reducing the risk of application vulnerabilities and zero-day attacks.', 'Managing and reporting on software supply chains.'], 'automation_and_integration': {'ai_powered_tools': ['Copilot Autofix for automated code fixes.', 'Code scanning for vulnerability detection and contextual advice.'], 'workflow_integration': 'Supports over 17,000 app integrations and actions templates.', 'supply_chain_security': 'Automates security, version updates, and generates SBOMs.'}, 'url': 'https://github.com/enterprise/advanced-security'}} | wr_348595657321410382 |
| 405 | GitHub--22 | failed | Find an open-source repository on GitHub focused on natural language processing in Ruby, updated within the last week. | https://eval.skyvern.com/wpid_348595687386181470/wr_348595687386181472/overview | nan | nan | wr_348595687386181472 |
| 406 | GitHub--23 | completed | Find the wiki page of ohmyzsh on GitHub and tell me how to change the theme of zsh to agnoster. | https://eval.skyvern.com/wpid_348595711879482508/wr_348595711879482510/overview | The user has successfully navigated to the 'ohmyzsh/ohmyzsh' repository on GitHub and located the instructions for changing the Zsh theme to 'agnoster'. The steps involve editing the Zsh configuration file (~/.zshrc) and setting the ZSH_THEME variable to 'agnoster'. Additional notes include ensuring the installation of a Powerline or Nerd Font for proper rendering of the theme. | {'repository': 'https://github.com/ohmyzsh/ohmyzsh', 'theme_change_instructions': {'file_to_edit': '~/.zshrc', 'variable_to_set': 'ZSH_THEME', 'value': 'agnoster', 'notes': ['Install a Powerline Font or Nerd Font for proper rendering.', 'Themes only control the prompt appearance, not terminal colors or fonts.']}} | wr_348595711879482510 |
| 407 | GitHub--24 | completed | Locate the GitHub repository for the open-source project "angular" and identify the last three issues closed. | https://eval.skyvern.com/wpid_348595733354319006/wr_348595733354319008/overview | The GitHub repository for the open-source project 'angular' was located, and the last three closed issues were identified along with their details, including titles, issue numbers, authors, dates closed, comment counts, statuses, and labels. | {'repository': 'angular', 'last_three_closed_issues': [{'title': 'Import with forwardRef is not used within the template of Component', 'issue_number': 59500, 'author': 'Mawi137', 'date_closed': '2025-01-13T00:00:00Z', 'comments_count': 4, 'status': 'Closed', 'labels': ['area: compiler']}, {'title': 'The feature with the schematic to clean up unused imports duplicates the import array', 'issue_number': 59466, 'author': 'amanplans', 'date_closed': '2025-01-10T00:00:00Z', 'comments_count': 1, 'status': 'Closed', 'labels': ['area: migrations']}, {'title': 'Docs: broken style is doc snippets', 'issue_number': 59442, 'author': 'JeanMeche', 'date_closed': '2025-01-10T00:00:00Z', 'comments_count': 5, 'status': 'Closed', 'labels': ['area: docs-infra']}]} | wr_348595733354319008 |
| 408 | GitHub--25 | completed | Search for a 'virtual reality' related repository on GitHub updated in the last 10 days with at least 200 stars and summarize its main objective. | https://eval.skyvern.com/wpid_348595790465396592/wr_348595790465396594/overview | The search for 'virtual reality' related repositories on GitHub updated in the last 10 days with at least 200 stars has been completed. Two repositories meeting the criteria were identified: 'aframevr/aframe' and 'bjornbytes/lovr'. Their main objectives are summarized below. | {'repositories': [{'name': 'aframevr/aframe', 'description': 'Web framework for building virtual reality experiences.', 'stars': 16800, 'last_updated': '2025-01-11T21:48:00-05:00'}, {'name': 'bjornbytes/lovr', 'description': 'Lua Virtual Reality Framework', 'stars': 2200, 'last_updated': '2025-01-10T18:00:00-05:00'}]} | wr_348595790465396594 |
| 409 | GitHub--26 | completed | Find the Resolve merge conflicts course in GitHub Skills and what actions learners will perform in this course. | https://eval.skyvern.com/wpid_348595837710036866/wr_348595837710036868/overview | The 'Resolve merge conflicts' course in GitHub Skills has been located. This course teaches learners why merge conflicts happen and how to resolve them. | {'course_name': 'Resolve merge conflicts', 'course_description': 'Learn why conflicts happen and how to resolve them.', 'actions_learners_perform': ['Understand why merge conflicts occur', 'Learn techniques to resolve merge conflicts']} | wr_348595837710036868 |
| 410 | GitHub--27 | completed | Find a Ruby repository on GitHub that has been updated in the past 3 days and has at least 1000 stars. | https://eval.skyvern.com/wpid_348595872069775252/wr_348595876364742550/overview | The user goal of finding Ruby repositories on GitHub that have been updated in the past 3 days and have at least 1000 stars has been achieved. A list of repositories meeting the criteria has been extracted, including their names, star counts, and last updated times. | {'repositories': [{'repository_name': 'rails/rails', 'star_count': '56.3k', 'last_updated': '2025-01-15T02:25:00-05:00'}, {'repository_name': 'jekyll/jekyll', 'star_count': '49.5k', 'last_updated': '2025-01-12T10:57:00-05:00'}, {'repository_name': 'mastodon/mastodon', 'star_count': '47.5k', 'last_updated': '2025-01-15T02:58:00-05:00'}, {'repository_name': 'huginn/huginn', 'star_count': '44.3k', 'last_updated': '2025-01-13T23:49:00-05:00'}, {'repository_name': 'discourse/discourse', 'star_count': '42.9k', 'last_updated': '2025-01-15T03:15:00-05:00'}, {'repository_name': 'Homebrew/brew', 'star_count': '42.1k', 'last_updated': '2025-01-14T17:48:00-05:00'}, {'repository_name': 'freeCodeCamp/devdocs', 'star_count': '35.6k', 'last_updated': '2025-01-14T09:05:00-05:00'}, {'repository_name': 'maybe-finance/maybe', 'star_count': '34.9k', 'last_updated': '2025-01-14T18:52:00-05:00'}, {'repository_name': 'rapid7/metasploit-framework', 'star_count': '34.6k', 'last_updated': '2025-01-13T03:37:00-05:00'}, {'repository_name': 'gitlabhq/gitlabhq', 'star_count': '23.9k', 'last_updated': '2025-01-15T01:32:00-05:00'}]} | wr_348595876364742550 |
| 411 | GitHub--28 | failed | Identify the most starred JavaScript repositories on GitHub that were created after 2023-12-29. | https://eval.skyvern.com/wpid_348595935217781938/wr_348595935217781940/overview | nan | nan | wr_348595935217781940 |
| 412 | GitHub--29 | completed | Compare the maximum number of private repositories allowed in the Free and Pro plans in GitHub Pricing. | https://eval.skyvern.com/wpid_348595957969121190/wr_348595957969121192/overview | The user has successfully compared the maximum number of private repositories allowed in the Free and Pro (Team) plans on GitHub Pricing. Both plans allow an unlimited number of private repositories. | {'free_plan': {'max_private_repositories': 'Unlimited'}, 'pro_plan': {'max_private_repositories': 'Unlimited'}} | wr_348595957969121192 |
| 413 | GitHub--30 | failed | Search for an open-source project related to 'blockchain technology' on GitHub updated in the past 15 days and list the top five contributors. | https://eval.skyvern.com/wpid_348595986757389508/wr_348595986757389510/overview | nan | nan | wr_348595986757389510 |
| 414 | GitHub--31 | completed | Find the official GitHub repository for TensorFlow and list the files changed in the last commit. Tell me the name of changed files, total additions and total deletion. | https://eval.skyvern.com/wpid_348596018098663352/wr_348596018098663354/overview | The user successfully located the official GitHub repository for TensorFlow and extracted details about the files changed in the latest commit. The extracted information includes the names of the changed files, the number of additions and deletions for each file, and the total additions and deletions across all files. | {'changed_files': [{'file_name': 'third_party/xla/xla/backends/gpu/codegen/transforms/lower_tensors.cc', 'additions': 91, 'deletions': 62}, {'file_name': 'third_party/xla/xla/backends/gpu/codegen/transforms/passes.td', 'additions': 22, 'deletions': 1}, {'file_name': 'third_party/xla/xla/backends/gpu/codegen/transforms/tests/lower_tensors.mlir', 'additions': 35, 'deletions': 0}, {'file_name': 'third_party/xla/xla/backends/gpu/codegen/transforms/tests/vectorize_loads_stores.mlir', 'additions': 35, 'deletions': 0}, {'file_name': 'third_party/xla/xla/backends/gpu/codegen/transforms/vectorize_loads_stores.cc', 'additions': 155, 'deletions': 12}], 'total_additions': 304, 'total_deletions': 75} | wr_348596018098663354 |
| 415 | GitHub--32 | failed | Discover the latest C# repository on GitHub related to 'game development' and having over 150 stars, and describe its main features. | https://eval.skyvern.com/wpid_348596048163434442/wr_348596048163434444/overview | nan | nan | wr_348596048163434444 |
| 416 | GitHub--33 | completed | Find Customer Stories on the GitHub page and list the 2 stories that appear on the web page. | https://eval.skyvern.com/wpid_348596081246670038/wr_348596081246670040/overview | The user successfully identified and listed the first two customer stories displayed on the GitHub page. The extracted stories include titles, industries (if available), and links to the full stories. | {'customerStories': [{'title': 'Duolingo boosts developer speed by 25% with GitHub Copilot', 'industry': 'Education', 'link': 'https://github.com/customer-stories/duolingo'}, {'title': '2024 Gartner® Magic Quadrant™ for AI Code Assistants', 'industry': None, 'link': 'https://www.gartner.com/doc/reprints?id=1-2IKO4MPE&ct=240819&st=sb'}]} | wr_348596081246670040 |
| 417 | GitHub--34 | completed | Search for an open-source project on GitHub related to 'Protein prediction' and identify the project with the highest number of forks. | https://eval.skyvern.com/wpid_348596108292976604/wr_348596112587943902/overview | The AI assistant successfully identified the open-source project on GitHub related to 'Protein prediction' with the highest number of forks. The project is 'DeepPurpose' with 277 forks. Relevant data about other repositories and their fork counts were also extracted. | {"highest_forked_project":{"repository_url":"https://github.com/kexinhuang12345/DeepPurpose","forks_count":277},"all_projects":[{"repository_url":"https://github.com/dellacortelab/prospr","forks_count":85},{"repository_url":"https://github.com/uw-ipd/RoseTTAFold2NA","forks_count":77},{"repository_url":"https://github.com/nf-core/proteinfold","forks_count":43},{"repository_url":"https://github.com/kexinhuang12345/DeepPurpose","forks_count":277},{"repository_url":"https://github.com/patrickbryant1/Umol","forks_count":18},{"repository_url":"https://github.com/Graylab/DL4Proteins-notebooks","forks_count":37},{"repository_url":"https://github.com/samsinai/VAE_protein_function","forks_count":24},{"repository_url":"https://github.com/biolib/openprotein","forks_count":55},{"repository_url":"https://github.com/wudejian789/2020TIANCHI-ProteinSecondaryStructurePrediction-TOP1","forks_count":21}]} | wr_348596112587943902 |
| 418 | GitHub--35 | completed | Check the latest release version of React and the date it was published on GitHub. | https://eval.skyvern.com/wpid_348596151242649582/wr_348596151242649584/overview | The user successfully identified the latest release version of React and its publication date from the official React GitHub repository. The latest release version is 19.0.0, and it was published on December 5, 2024. | {'latest_release': {'version': '19.0.0', 'publication_date': '2024-12-05'}} | wr_348596151242649584 |
| 419 | GitHub--36 | completed | Identify a new open-source project on GitHub related to 'AI agriculture' that created in 2022, and note its main programming language and description. | https://eval.skyvern.com/wpid_348596202782255106/wr_348596202782255108/overview | The AI assistant successfully identified several open-source GitHub projects related to 'AI agriculture' that were created in 2022. For each project, the repository name, description, and main programming language were extracted. However, due to a network error, further exploration of additional repositories was not possible. | {'repositories': [{'repository_name': 'microsoft/farmvibes-ai', 'description': 'FarmVibes.AI: Multi-Modal GeoSpatial ML Models for Agriculture and Sustainability', 'main_language': 'Jupyter Notebook'}, {'repository_name': 'nesl/agrobot', 'description': 'Neural-Kalman GNSS/INS Navigation for Precision Agriculture', 'main_language': 'Jupyter Notebook'}, {'repository_name': 'innovayio/AI-Plants-Diseases-SmartAgriculture', 'description': 'Developing a real-life use case implementing AI for plants diseases in smart agriculture in localized context and leveraging open-source …', 'main_language': None}, {'repository_name': 'nikhilkeshava/crop-yield-prediction-using-AI-and-ml', 'description': 'India being an agricultural country, its economy predominantly depends on agriculture yield growth and allied agro industry products. In …', 'main_language': 'Jupyter Notebook'}, {'repository_name': 'Souradeep2233/UAV-AIOT_COLLAB', 'description': 'IoT and AI applications in UAVs enable real-time data collection through sensors for weather forecasting, precision agriculture, infrastr…', 'main_language': 'Jupyter Notebook'}, {'repository_name': 'h4zm1/IrrigationIntelligente', 'description': 'AI based agriculture application', 'main_language': 'Dart'}, {'repository_name': 'BishalDali/AIMS', 'description': 'Agriculture Information Management System(CRUD) The system should allow users to add crops(any 10 crops) detail according to the province…', 'main_language': 'JavaScript'}, {'repository_name': 'manab-kb/Crop_Recognition_System', 'description': 'An Agriculture Assistant - Comes with an AI model, along with a website to make the task of monitoring crop produce and quality easier, f…', 'main_language': 'Python'}, {'repository_name': 'YukunXia/Farmland-Exploration', 'description': 'A prototype system for autonomous exploration in an agriculture environment. Course project for 16765-A "Special Topics: Robotics & AI for …', 'main_language': 'Jupyter Notebook'}, {'repository_name': 'Anshul-13J/Agriculture-Monitoring-Using-AI-and-IoT', 'description': 'This is an open-repo of our project for PowerToFly Hackathon', 'main_language': 'Jupyter Notebook'}]} | wr_348596202782255108 |
| 420 | GitHub--37 | completed | List the 3 features mentioned in GitHub's Copilot product page. | https://eval.skyvern.com/wpid_348596254321862676/wr_348596254321862678/overview | The user successfully navigated to GitHub Copilot's product page and extracted the three main features highlighted on the page. These features include customization and contextual AI capabilities, the ability to choose from multiple AI models, and support for multi-file edits directly in the editor. | {'features': [{'title': 'Customizable. Contextual. AI-powerful.', 'description': 'Create tests, docs, and more. Ask GitHub Copilot a question, get the right answer for you, and accept the code with a single click. GitHub Copilot can use your code and Custom Instructions to code the way you prefer.'}, {'title': 'Choose your model', 'description': "Use models like Anthropic's Claude 3.5 Sonnet, OpenAI o1, and GPT 4o to excel at coding tasks one minute — and deeper reasoning and understanding the next."}, {'title': 'Multi-file edits', 'description': 'Tell GitHub Copilot what to do, and watch it update multiple files right in your editor.'}]} | wr_348596254321862678 |
| 421 | GitHub--38 | failed | Identify and report the most popular (by stars) open-source repo related to cybersecurity on GitHub. | https://eval.skyvern.com/wpid_348596284386633766/wr_348596288681601064/overview | The most popular open-source repository related to cybersecurity on GitHub has been identified based on the number of stars. The repository 'amark/gun' with 18,200 stars is the most popular among the extracted repositories. | {'most_popular_repository': {'repository_name': 'amark/gun', 'star_count': 18200}} | wr_348596288681601064 |
| 422 | GitHub--39 | completed | Browse the GitHub Trending and find out which developer is currently ranked first this month and the corresponding repository. | https://eval.skyvern.com/wpid_348596321764838632/wr_348596326059805930/overview | The user successfully identified the first-ranked developer on GitHub Trending for this month and extracted their name, GitHub username, and details of their most popular repository. | {'developer_name': 'Mitchell Hashimoto', 'github_username': 'mitchellh', 'popular_repo': {'repo_name': 'libxev', 'repo_description': 'libxev is a cross-platform, high-performance event loop that provides abstractions for non-blocking IO, timers, events, and more and work…'}} | wr_348596326059805930 |
| 423 | GitHub--40 | failed | Select Sign up on the GitHub homepage to see if email 'test123@gmail.com' already exists. | https://eval.skyvern.com/wpid_348596360419544314/wr_348596360419544316/overview | nan | nan | wr_348596360419544316 |