Home » Microsoft released Data Wrangler, a data preparation tool in VS Code Insiders, which can easily generate data and operate Python code | iThome

Microsoft released Data Wrangler, a data preparation tool in VS Code Insiders, which can easily generate data and operate Python code | iThome

by admin
Microsoft released Data Wrangler, a data preparation tool in VS Code Insiders, which can easily generate data and operate Python code | iThome

For data scientists, Microsoft launched Data Wrangler, a tool that can process tabular data in Python. Data Wrangler is an extension of the VS Code Insiders version. It can perform data preparation, cleaning and visualization, help users identify and repair data errors, and analyze data quality and convert the data into the desired format.

Data Wrangler’s built-in conversion and visualization function library, when the user performs data changes, the expansion package will automatically use the open source Python function library to generate code for the user’s data operation, which means that the user can more quickly and accurately Write data preparation programs.

Since the quality of data will directly affect the quality of model predictions, data scientists usually need to spend a lot of time preparing data. In the process of exploring data, data scientists need to write many small code fragments to delete data rows or remove missing For numerical values, Microsoft mentioned that there is currently a lack of tools to simplify data preparation. Data scientists often need to search for code snippets on Stack Overflow and copy and paste them into the program.

Data Wrangler’s interactive user interface can quickly generate code for users. When users view and visualize Python data analysis module Pandas data frames (Dataframes), Data Wrangler can generate code for target operations. For example, users only need to Right-click on a data row header and delete it, and Data Wrangler can automatically generate Python code to do this.

In addition, when data scientists want to create a new derived data row from the data row of the Pandas data frame, the process of writing custom code is prone to errors, and Data Wrangler allows users to provide an output example and tell Data Wrangler that they want to derive data If the data is in the form of a row, the extension will write Python code through the artificial intelligence synthesis technology PROSE.

See also  Full line of Samsung audio and video mobile products Game Launcher renamed Gaming Hub

Now data scientists want to use Data Wrangler in VS Code Insiders, they can download it directly from the application market, and start Data Wrangler from the Pandas data frame of Jupyter Notebook, or in VS Code Insiders, choose to open CSV or Parquet archives.

You may also like

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More

Privacy & Cookies Policy