Transform some kind of exponential distribution into normal distribution










0















I have the following exponential distribution, generated with the following code:



import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import random

np.random.seed(1)
tags_ratio = np.random.exponential(1/25, 1000)
plt.hist(tags_ratio, range=(0, 1), bins=100)
plt.show()


enter image description here



I'm trying to transform my data, resides in tags_ratio into normal distribution, but with no success.



Tried with the log function and square functions. it given decent results. But I'm interesting in more ideas. Maybe more sophisticated.










share|improve this question
























  • Have you tried log transformation?

    – Scratch'N'Purr
    Nov 15 '18 at 8:40











  • Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

    – Eran Moshe
    Nov 15 '18 at 8:41











  • have you tried with boxcox?

    – Dejan Marić
    Nov 15 '18 at 8:42











  • boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

    – Eran Moshe
    Nov 15 '18 at 8:45















0















I have the following exponential distribution, generated with the following code:



import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import random

np.random.seed(1)
tags_ratio = np.random.exponential(1/25, 1000)
plt.hist(tags_ratio, range=(0, 1), bins=100)
plt.show()


enter image description here



I'm trying to transform my data, resides in tags_ratio into normal distribution, but with no success.



Tried with the log function and square functions. it given decent results. But I'm interesting in more ideas. Maybe more sophisticated.










share|improve this question
























  • Have you tried log transformation?

    – Scratch'N'Purr
    Nov 15 '18 at 8:40











  • Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

    – Eran Moshe
    Nov 15 '18 at 8:41











  • have you tried with boxcox?

    – Dejan Marić
    Nov 15 '18 at 8:42











  • boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

    – Eran Moshe
    Nov 15 '18 at 8:45













0












0








0


0






I have the following exponential distribution, generated with the following code:



import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import random

np.random.seed(1)
tags_ratio = np.random.exponential(1/25, 1000)
plt.hist(tags_ratio, range=(0, 1), bins=100)
plt.show()


enter image description here



I'm trying to transform my data, resides in tags_ratio into normal distribution, but with no success.



Tried with the log function and square functions. it given decent results. But I'm interesting in more ideas. Maybe more sophisticated.










share|improve this question
















I have the following exponential distribution, generated with the following code:



import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import random

np.random.seed(1)
tags_ratio = np.random.exponential(1/25, 1000)
plt.hist(tags_ratio, range=(0, 1), bins=100)
plt.show()


enter image description here



I'm trying to transform my data, resides in tags_ratio into normal distribution, but with no success.



Tried with the log function and square functions. it given decent results. But I'm interesting in more ideas. Maybe more sophisticated.







python numpy scipy






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 15 '18 at 8:42







Eran Moshe

















asked Nov 15 '18 at 8:31









Eran MosheEran Moshe

1,388723




1,388723












  • Have you tried log transformation?

    – Scratch'N'Purr
    Nov 15 '18 at 8:40











  • Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

    – Eran Moshe
    Nov 15 '18 at 8:41











  • have you tried with boxcox?

    – Dejan Marić
    Nov 15 '18 at 8:42











  • boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

    – Eran Moshe
    Nov 15 '18 at 8:45

















  • Have you tried log transformation?

    – Scratch'N'Purr
    Nov 15 '18 at 8:40











  • Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

    – Eran Moshe
    Nov 15 '18 at 8:41











  • have you tried with boxcox?

    – Dejan Marić
    Nov 15 '18 at 8:42











  • boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

    – Eran Moshe
    Nov 15 '18 at 8:45
















Have you tried log transformation?

– Scratch'N'Purr
Nov 15 '18 at 8:40





Have you tried log transformation?

– Scratch'N'Purr
Nov 15 '18 at 8:40













Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

– Eran Moshe
Nov 15 '18 at 8:41





Yeah I'm currently doing it with the log.. It's not that normal but it might be sufficient. Anyway I'm interested in more ideas

– Eran Moshe
Nov 15 '18 at 8:41













have you tried with boxcox?

– Dejan Marić
Nov 15 '18 at 8:42





have you tried with boxcox?

– Dejan Marić
Nov 15 '18 at 8:42













boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

– Eran Moshe
Nov 15 '18 at 8:45





boxcox results doing something similar the sqrt does. It leaves the 0 value hanged high and totally out of the distribution.

– Eran Moshe
Nov 15 '18 at 8:45












1 Answer
1






active

oldest

votes


















1














You can try to see if this helps:



from scipy.stats import boxcox
tags_ratio = boxcox(tags_ratio, 0.3)
plt.hist(tags_ratio)
plt.show()


result:



enter image description here



for more explanations and theory about Box-Cox click here.






share|improve this answer























  • boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

    – Eran Moshe
    Nov 15 '18 at 11:30










Your Answer






StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53315215%2ftransform-some-kind-of-exponential-distribution-into-normal-distribution%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









1














You can try to see if this helps:



from scipy.stats import boxcox
tags_ratio = boxcox(tags_ratio, 0.3)
plt.hist(tags_ratio)
plt.show()


result:



enter image description here



for more explanations and theory about Box-Cox click here.






share|improve this answer























  • boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

    – Eran Moshe
    Nov 15 '18 at 11:30















1














You can try to see if this helps:



from scipy.stats import boxcox
tags_ratio = boxcox(tags_ratio, 0.3)
plt.hist(tags_ratio)
plt.show()


result:



enter image description here



for more explanations and theory about Box-Cox click here.






share|improve this answer























  • boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

    – Eran Moshe
    Nov 15 '18 at 11:30













1












1








1







You can try to see if this helps:



from scipy.stats import boxcox
tags_ratio = boxcox(tags_ratio, 0.3)
plt.hist(tags_ratio)
plt.show()


result:



enter image description here



for more explanations and theory about Box-Cox click here.






share|improve this answer













You can try to see if this helps:



from scipy.stats import boxcox
tags_ratio = boxcox(tags_ratio, 0.3)
plt.hist(tags_ratio)
plt.show()


result:



enter image description here



for more explanations and theory about Box-Cox click here.







share|improve this answer












share|improve this answer



share|improve this answer










answered Nov 15 '18 at 8:44









Dejan MarićDejan Marić

448213




448213












  • boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

    – Eran Moshe
    Nov 15 '18 at 11:30

















  • boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

    – Eran Moshe
    Nov 15 '18 at 11:30
















boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

– Eran Moshe
Nov 15 '18 at 11:30





boxcox is nice. I went with log10 but might change it to boxcox when the time is right..

– Eran Moshe
Nov 15 '18 at 11:30



















draft saved

draft discarded
















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53315215%2ftransform-some-kind-of-exponential-distribution-into-normal-distribution%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







這個網誌中的熱門文章

How to read a connectionString WITH PROVIDER in .NET Core?

In R, how to develop a multiplot heatmap.2 figure showing key labels successfully

Museum of Modern and Contemporary Art of Trento and Rovereto