Use OSS data as machine learning training samples
background information
Operation steps
Data is uploaded to the bucket. Take the uploaded file Sample_superstore.csv, which is uploaded to the target storage space examplebucket in East China 1 (Hangzhou) as an example. Construct the Sample_superstore.csv file data sample. order_id,order_date,customer_id,item,sales,quantity 1,20240101,1,aa,10,100 Upload the Sample_superstore.csv file to examplebucket. See Simple upload 。
Connect OSS and PAI. Create a new workflow in East China 1 (Hangzhou). See New Custom Workflow 。 Click the new workflow, and then select 。 Double click the read CSV file component, and click the Parameter setting Tab, File Path Set to oss://examplebucket/Sample_superstore.csv , Schema Set to order_id string,order_date string,customer_id string,item string,sales string,quantity string , Open Whether to ignore the first row of data Switch and other parameters remain the default configuration. Right click the Read CSV File component, and then click Execute this node 。 After the execution is completed, right-click the Read CSV File component, and then click 。 Under the component, view the table information. Data preview only supports 1000 records. If you need to view the full table, please follow the page instructions to DataWorks.
Data exploration process
conclusion
The paper and stapler shelves are placed in the middle, and the shelves of other products are placed in a ring around the two, so that no matter which shelf customers enter from, they can quickly find the paper and stapler with a high degree of correlation. Place the paper and stapler shelves at both ends of the stationery store. Customers need to cross the entire stationery store to buy the other one. Passing the shelves of other products halfway can improve the cross purchase rate. Of course, this layout method sacrifices the convenience of users' shopping, and should be cautious in actual operation.