How to copy data from one workbook to another with some rules using spreadsheetlight?

Question

I have a workbook with a worksheet like this

User's image

I want create another workbook using some of the data of the above workbook with the following rules and using spreadsheetlight library :

The data will be copied from only the columns 1,2,3,7,8,9 & 12
The row whose column 10 text is "INVOICE" should be ignored/not copied or used
If a/multiple consecutive row's column 1 value is empty and there is a value present in column 12 of those same row's column 12 value must be added to its previous rows column 12 value (check rows marked for better understanding)

User's image

I've got no idea how to do point 3 and I tried to do at least point 1 & 2 properly with below code but resulting workbook is a mess. Please help!

public static void Main(string[] args)
		{

			DataTable dt = new DataTable();

			dt.Columns.Add("OrderNo");
			dt.Columns.Add("OrderDt");
			dt.Columns.Add("Vendor");
			dt.Columns.Add("OrderAmount");
			dt.Columns.Add("ChallanNo");
			dt.Columns.Add("ChallanDt");
			dt.Columns.Add("ChallanAmount");
			
			using (SLDocument sl= new SLDocument(@"C:\Users\Desktop\TEST_WORKBOOK.xlsx", "Sheet1"))
			{
				
				SLWorksheetStatistics st=sl.GetWorksheetStatistics();

				for (int i = 2; i <= st.EndRowIndex; i++)
				{
					if (sl.GetCellValueAsString(i,10) != "INVOICE")
					{
						DataRow dr = dt.NewRow();

						dt.Rows.Add(
							sl.GetCellValueAsString(i,1),
							sl.GetCellValueAsDateTime(i,2).ToString("dd-MM-yyyy"),
							sl.GetCellValueAsString(i,3),
							sl.GetCellValueAsString(i,7),
							sl.GetCellValueAsString(i,8),
							sl.GetCellValueAsDateTime(i,9).ToString("dd-MM-yyyy"),
							sl.GetCellValueAsString(i,12)
						);
					}
					
					dt.AcceptChanges();
				}
				
				sl.ImportDataTable(1, 1, dt, true);
				sl.SaveAs("ImportDataTable.xlsx");
			}
			
			
			Console.WriteLine("Done");
			Console.ReadLine();
		}

Accepted Answer

The data is a little inconsistent. For example, look at records 10 to 13. If we want to sum the values of column 12, what should happen for the values of columns 8, 9 which are different?

Usually the values of column 8,9 in records 11,12 should be empty (like column 7) or should be exactly the same as values of the record 10.

Anyways, in this answer, to make is simple to understand the rules, I use a smaller table of data:

We have 5 columns, and we only want 1, 4, and 5
Column 1 is the key of grouping. So if value of column 1 is empty, in a row, it will use the value of first row in group.
Column 2, we ignore it's data, and will not be included in result.
Column 3 is our filter column, with GOOD and BAD as value, and we only are interested in GOOD values. We also will not include it in result.
Column 4, is wanted for result, and if it belongs to a row in a group, it will use the value of the column from first row in group(because column 1 of this row is of course empty)
Column 5, is our summary column, so after reducing the results, we calculate sum of the values of this column, in each group

To keep the answer focused on the main issue, like the previous answer, I only focus on processing, filtering, and reducing data, so I start with a DataTable, assuming you have read it from your file, and I'll produce a DataTable assuming you will later write it to a file.

That said, here is the code that does the trick for us:

using System;
using System.Data;
using System.Linq;

namespace SampleConsoleApp
{
    internal class Program
    {
        static void Main(string[] args)
        {
            var input = new DataTable();
            input.Columns.Add("C1", typeof(string)); //Our key column, in output
            input.Columns.Add("C2", typeof(string)); //not in output
            input.Columns.Add("C3", typeof(string)); //Our Filter column, not in output
            input.Columns.Add("C4", typeof(string)); //in output, so follows value of first row in group
            input.Columns.Add("C5", typeof(double)); //in output, The summary column

            input.Rows.Add("A", "AA", "GOOD", "AAAA", 100);
            input.Rows.Add("", "xyz", "GOOD", "xyz", 100);
            input.Rows.Add("", "xyz", "BAD", "xyz", 100);
            input.Rows.Add("", "xyz", "GOOD", "xyz", 100);
            input.Rows.Add("B", "BB", "BAD", "BBBB", 100);
            input.Rows.Add("", "xyz", "GOOD", "xyz", 100);
            input.Rows.Add("", "xyz", "GOOD", "xyz", 100);
            input.Rows.Add("C", "xyz", "GOOD", "CCCC", 100);


            //Step 1 - Fix the data,
            //For all the columns which are not part of filter
            //Or are not in sum, copy value of the key row to next rows of group
            //Here, column 1, 2, 4
            for (int i = 0; i < input.Rows.Count; i++)
            {
                //1st column: Copy value of prevuous row, if the value is empty
                if (string.IsNullOrEmpty(input.Rows[i].Field("C1")))
                {
                    input.Rows[i]["C1"] = input.Rows[i - 1]["C1"];
                    //2nd column, don't touch we do not need
                    //3rd column, don't touch, we will use it in filter later 
                    //4th column, treat it like 1st column
                    input.Rows[i]["C4"] = input.Rows[i - 1]["C4"];
                    //5th column, don't touch, we will sum it later
                }
            }

            //Step 2 - Create data structure of output
            var output = new DataTable();
            output.Columns.Add("C1", typeof(string));
            output.Columns.Add("C4", typeof(string));
            output.Columns.Add("C5", typeof(double));

            //Step 2  - Filter, Group by and Sum

            input.Rows.Cast()
                .Where(row => row.Field("C3") == "GOOD") /
                .GroupBy(
                x => new { C1 = x["C1"], C4 = x["C4"] },
                x => new { C5 = x.Field("C5") })
                .Select(g => new
                {
                    C1 = g.Key.C1,
                    C4 = g.Key.C4,
                    C5 = g.Sum(x => x.C5)
                })
                .ToList()
                .ForEach(x =>
                {
                    output.Rows.Add(x.C1, x.C4, x.C5);
                });

            foreach (DataRow item in output.Rows)
            {
                Console.WriteLine($"C1:{item["C1"]}, C4:{item["C4"]}, C5:{item["C5"]}");
            }
            Console.ReadLine();
        }
    }
}

And here is the result:

C1:A, C4:AAAA, C5:300
C1:B, C4:BBBB, C5:200
C1:C, C4:CCCC, C5:100

How to copy data from one workbook to another with some rules using spreadsheetlight?

0 additional answers