Author: ryan
# powershell script: 06-27-16## 1) gets list of all xlsb files in directory# 2) goes through each and saves worksheets as separate csv##If you want to search through subdirectories also, add ” -Recurse” before “| Foreach-Object”$scriptpath = “B:”#path to searchAdd-Type -AssemblyName Microsoft.Office.Interop.Excel$xlFixedFormat = [Microsoft.Office.Interop.Excel.XlFileFormat]::xlCSV#needed to lookup$excel = new-object -ComObject “Excel.Application”;$excel.DisplayAlerts=$false;$excel.Visible =$false; $csvloc = “b:csv”#output directory Get-ChildItem -LiteralPath b: -Filter *.xlsb | ForEach-Object { $wb = $excel.Workbooks.Open($_.FullName) $lastmod = $_.LastWriteTime.ToString(‘yyyy-MM-ddThh-mmss’) $wbn = $wb.name foreach($ws in $wb.Worksheets) { $n = $lastmod + “_” + $wbn + “_” + $ws.Name $ws.SaveAs($csvLoc + $n + “.csv”, $xlFixedFormat); } $wb.close($False) #End file-specific code } $excel.Quit();[void][System.Runtime.Interopservices.Marshal]::ReleaseComObject($excel);
Problem: Some of my Excel files are XLSB and therefore don’t play well others (aka Python)Solution: Use PowerShell to Loop and Convert Excel Files to CSV
Python Speech_Recognition
Python https://github.com/Uberi/speech_recognition Google Cloud API – need invite https://cloud.google.com/speech/ Microsoft Big Voice API https://www.microsoft.com/cognitive-services/en-us/speech-api/ Amazon IBM Watson http://www.ibm.com/ibmwatson/speech-to-text CMUSphinx http://cmusphinx.sourceforge.net/wiki/gstreamer
Decided to put off learning about Objects in Python in order to see if I could figure out a more efficient solution than taking screenshots of everyslide. After a few searches I came across a python package called OpenCV (which by the way was not the easiest because compatibility issues).
Found some data but uses JavaScript and don’t know how to extract.