When I convert to JSON, None of the table cells have any "content" field. There's no way to extract their text, just their layout info:
{
"type": "t",
"id": 32,
"level": "6",
"bounding box": "[74, 95, 1369, 447]",
"number of rows": 18,
"number of columns": 4,
"next table id": 0,
"rows": [
{
"type": "table row",
"row number": 1,
"cells": [
{
"type": "table cell",
"page number": 1,
"bounding box": [74.991, 422.509908, 331.4606122712785, 447.93136799999996],
"row number": 1,
"column number": 1,
"row span": 1,
"column span": 1,
"kids": [],
},
{
"type": "table cell",
"page number": 1,
"bounding box": [
331.4606122712785,
422.509908,
656.9203648709424,
447.93136799999996,
],
"row number": 1,
"column number": 2,
"row span": 1,
"column span": 1,
"kids": [],
},
{
"type": "table cell",
"page number": 1,
"bounding box": [
656.9203648709424,
422.509908,
948.636221747935,
447.93136799999996,
],
"row number": 1,
"column number": 3,
"row span": 1,
"column span": 1,
"kids": [],
},
[...]
When I convert to JSON, None of the table cells have any "content" field. There's no way to extract their text, just their layout info: