-
Notifications
You must be signed in to change notification settings - Fork 444
Description
When I run this script in cmd, there is a pause after the print statement before tabula.convert_into. Then it quits with no output csv file created or error message
I don't really know where to go given the lack of information as to what is going on behinds the scenes... I'm not super familiar with wrappers or java so any help here is appreciated!
Script
import tabula
input_folder="<path to file>"
input_file_name="<file name.pdf>"
input_path_name=input_folder + "/" + input_file_name
output_path_name= input_path_name[:-3] + "csv"
print("before convert_into") stops running after printing this statement
tabula.convert_into(input_path=input_path_name,output_path = output_path_name, output_format="csv", pages="all")
print("finished")
Already Attempted Troubleshooting:
Output: from running tabula.environment_info()
Python version:
3.12.8 (tags/v3.12.8:2dc476b, Dec 3 2024, 19:30:04) [MSC v.1942 64 bit (AMD64)]
Java version:
java version "24" 2025-03-18
Java(TM) SE Runtime Environment (build 24+36-3646)
Java HotSpot(TM) 64-Bit Server VM (build 24+36-3646, mixed mode, sharing)
tabula-py version: 2.10.0
platform: Windows-11-10.0.22631-SP0
JDK has also been installed and is in the Java Folder with JRE
In Environmental Variables (User variables) Java is on my Path as both
C:\Program Files\Java\jre1.8.0_441\bin and C:\Program Files\Java\jdk-24
In Environmental Variables (System variables) Path contains:
C:\Program Files\Common Files\Oracle\Java\javapath; C:\Program Files (x86)\Common Files\Oracle\Java\java8path
where java in cmd outputs:
C:\Program Files\Common Files\Oracle\Java\javapath\java.exe
C:\Program Files (x86)\Common Files\Oracle\Java\java8path\java.exe
C:\Program Files\Java\jre1.8.0_441\bin\java.exe
C:\Program Files\Java\jdk-24\bin\java.exe
Are the first two results the cause of the problems I'm running into? Should they be reordered? Should the last two be moved from path in User Variables to path in System Variables?
Tabula was pip installed as pip install tabula-py NOT pip install tabula
Tabula module is successfully imported after the import statement
Typing java in cmd is recognized and prints a list of options
Command prompt has been restarted multiple times
When running the code in Jupyter, this message is output
The Kernel crashed while executing code in the current cell or a previous cell. Please review the code in the cell(s) to identify a possible cause of the failure.
[error] Disposing session as kernel process died ExitCode: 3221226356, Reason: