How to extract Geojson Schema with spark










-1















I have a Geojson file and I want to extract the schema(structtype) correspondent with spark. Any help would be appreciated



I use spark 2.3.1



Geojson:
{
"type": "FeatureCollection",
"features": [

"type": "Feature",
"geometry":
"type": "MultiLineString",
"coordinates": [
[
[
7.0847794888,
50.7242091272
],
[
7.0859976701,
50.7239505872
],
...
[
7.0946504307,
50.722884129
]
]
]
,
"properties":
"strecke_id": 3,
"auswertezeit": "2018-11-13T16:10:00",
"geschwindigkeit": 26,
"verkehrsstatus": "erh�hte Verkehrsbelastung"

,.....


Thank you for your help










share|improve this question
























  • Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

    – Shasankar
    Nov 13 '18 at 15:39











  • val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

    – Mak
    Nov 13 '18 at 15:51











  • i try to read geojson data from hdfs(file.json) and to extract the schema.

    – Mak
    Nov 13 '18 at 16:04















-1















I have a Geojson file and I want to extract the schema(structtype) correspondent with spark. Any help would be appreciated



I use spark 2.3.1



Geojson:
{
"type": "FeatureCollection",
"features": [

"type": "Feature",
"geometry":
"type": "MultiLineString",
"coordinates": [
[
[
7.0847794888,
50.7242091272
],
[
7.0859976701,
50.7239505872
],
...
[
7.0946504307,
50.722884129
]
]
]
,
"properties":
"strecke_id": 3,
"auswertezeit": "2018-11-13T16:10:00",
"geschwindigkeit": 26,
"verkehrsstatus": "erh�hte Verkehrsbelastung"

,.....


Thank you for your help










share|improve this question
























  • Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

    – Shasankar
    Nov 13 '18 at 15:39











  • val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

    – Mak
    Nov 13 '18 at 15:51











  • i try to read geojson data from hdfs(file.json) and to extract the schema.

    – Mak
    Nov 13 '18 at 16:04













-1












-1








-1








I have a Geojson file and I want to extract the schema(structtype) correspondent with spark. Any help would be appreciated



I use spark 2.3.1



Geojson:
{
"type": "FeatureCollection",
"features": [

"type": "Feature",
"geometry":
"type": "MultiLineString",
"coordinates": [
[
[
7.0847794888,
50.7242091272
],
[
7.0859976701,
50.7239505872
],
...
[
7.0946504307,
50.722884129
]
]
]
,
"properties":
"strecke_id": 3,
"auswertezeit": "2018-11-13T16:10:00",
"geschwindigkeit": 26,
"verkehrsstatus": "erh�hte Verkehrsbelastung"

,.....


Thank you for your help










share|improve this question
















I have a Geojson file and I want to extract the schema(structtype) correspondent with spark. Any help would be appreciated



I use spark 2.3.1



Geojson:
{
"type": "FeatureCollection",
"features": [

"type": "Feature",
"geometry":
"type": "MultiLineString",
"coordinates": [
[
[
7.0847794888,
50.7242091272
],
[
7.0859976701,
50.7239505872
],
...
[
7.0946504307,
50.722884129
]
]
]
,
"properties":
"strecke_id": 3,
"auswertezeit": "2018-11-13T16:10:00",
"geschwindigkeit": 26,
"verkehrsstatus": "erh�hte Verkehrsbelastung"

,.....


Thank you for your help







scala apache-spark apache-spark-sql geojson






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 13 '18 at 15:26







Mak

















asked Nov 13 '18 at 15:20









MakMak

126




126












  • Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

    – Shasankar
    Nov 13 '18 at 15:39











  • val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

    – Mak
    Nov 13 '18 at 15:51











  • i try to read geojson data from hdfs(file.json) and to extract the schema.

    – Mak
    Nov 13 '18 at 16:04

















  • Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

    – Shasankar
    Nov 13 '18 at 15:39











  • val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

    – Mak
    Nov 13 '18 at 15:51











  • i try to read geojson data from hdfs(file.json) and to extract the schema.

    – Mak
    Nov 13 '18 at 16:04
















Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

– Shasankar
Nov 13 '18 at 15:39





Can you post what you already tried. Did you try reading as spark.read.json("filePath").schema?

– Shasankar
Nov 13 '18 at 15:39













val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

– Mak
Nov 13 '18 at 15:51





val data = spark.read.schema(newSchema).json("hdfs://........./file.json")

– Mak
Nov 13 '18 at 15:51













i try to read geojson data from hdfs(file.json) and to extract the schema.

– Mak
Nov 13 '18 at 16:04





i try to read geojson data from hdfs(file.json) and to extract the schema.

– Mak
Nov 13 '18 at 16:04












1 Answer
1






active

oldest

votes


















0














val data = spark.read.json("hdfs://........./file.json")
val schema = data.schema


This should give you the schema in StructType






share|improve this answer






















    Your Answer






    StackExchange.ifUsing("editor", function ()
    StackExchange.using("externalEditor", function ()
    StackExchange.using("snippets", function ()
    StackExchange.snippets.init();
    );
    );
    , "code-snippets");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "1"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53284140%2fhow-to-extract-geojson-schema-with-spark%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0














    val data = spark.read.json("hdfs://........./file.json")
    val schema = data.schema


    This should give you the schema in StructType






    share|improve this answer



























      0














      val data = spark.read.json("hdfs://........./file.json")
      val schema = data.schema


      This should give you the schema in StructType






      share|improve this answer

























        0












        0








        0







        val data = spark.read.json("hdfs://........./file.json")
        val schema = data.schema


        This should give you the schema in StructType






        share|improve this answer













        val data = spark.read.json("hdfs://........./file.json")
        val schema = data.schema


        This should give you the schema in StructType







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 13 '18 at 16:09









        ShasankarShasankar

        272310




        272310



























            draft saved

            draft discarded
















































            Thanks for contributing an answer to Stack Overflow!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53284140%2fhow-to-extract-geojson-schema-with-spark%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            這個網誌中的熱門文章

            Barbados

            How to read a connectionString WITH PROVIDER in .NET Core?

            Node.js Script on GitHub Pages or Amazon S3