webscraping not able to get the source code of page
I am trying to scrape https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN
Below is the code.
import requests
from bs4 import BeautifulSoup as BS
sess = requests.session()
html = sess.get(url,headers='User-Agent': 'Mozilla/5.0',allow_redirects=True)
Soup = BS(html.text,'lxml')
with open('ocswssw.html,'w') as f:
print(Soup.prettify())
if you compare the ocswssw.html
and the website in chrome. they don't match.
but some how the source code I am receiving is not complete. Please let me know what went wrong.
I don't like to use selenium where browser popups.
python web-scraping beautifulsoup request
|
show 4 more comments
I am trying to scrape https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN
Below is the code.
import requests
from bs4 import BeautifulSoup as BS
sess = requests.session()
html = sess.get(url,headers='User-Agent': 'Mozilla/5.0',allow_redirects=True)
Soup = BS(html.text,'lxml')
with open('ocswssw.html,'w') as f:
print(Soup.prettify())
if you compare the ocswssw.html
and the website in chrome. they don't match.
but some how the source code I am receiving is not complete. Please let me know what went wrong.
I don't like to use selenium where browser popups.
python web-scraping beautifulsoup request
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
Of course they will not match: in Chrome you see rendered page with executed JavaScript.requests
returns you page source.... So what is your expected output?
– Andersson
Nov 12 at 6:19
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22
|
show 4 more comments
I am trying to scrape https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN
Below is the code.
import requests
from bs4 import BeautifulSoup as BS
sess = requests.session()
html = sess.get(url,headers='User-Agent': 'Mozilla/5.0',allow_redirects=True)
Soup = BS(html.text,'lxml')
with open('ocswssw.html,'w') as f:
print(Soup.prettify())
if you compare the ocswssw.html
and the website in chrome. they don't match.
but some how the source code I am receiving is not complete. Please let me know what went wrong.
I don't like to use selenium where browser popups.
python web-scraping beautifulsoup request
I am trying to scrape https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN
Below is the code.
import requests
from bs4 import BeautifulSoup as BS
sess = requests.session()
html = sess.get(url,headers='User-Agent': 'Mozilla/5.0',allow_redirects=True)
Soup = BS(html.text,'lxml')
with open('ocswssw.html,'w') as f:
print(Soup.prettify())
if you compare the ocswssw.html
and the website in chrome. they don't match.
but some how the source code I am receiving is not complete. Please let me know what went wrong.
I don't like to use selenium where browser popups.
python web-scraping beautifulsoup request
python web-scraping beautifulsoup request
edited Nov 12 at 6:38
asked Nov 12 at 6:04
Sandhya Thotakura
1107
1107
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
Of course they will not match: in Chrome you see rendered page with executed JavaScript.requests
returns you page source.... So what is your expected output?
– Andersson
Nov 12 at 6:19
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22
|
show 4 more comments
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
Of course they will not match: in Chrome you see rendered page with executed JavaScript.requests
returns you page source.... So what is your expected output?
– Andersson
Nov 12 at 6:19
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
Of course they will not match: in Chrome you see rendered page with executed JavaScript.
requests
returns you page source.... So what is your expected output?– Andersson
Nov 12 at 6:19
Of course they will not match: in Chrome you see rendered page with executed JavaScript.
requests
returns you page source.... So what is your expected output?– Andersson
Nov 12 at 6:19
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22
|
show 4 more comments
3 Answers
3
active
oldest
votes
The page is created by using javascript.
So,you can't get page source by only using requests/bs4
how to resoleve:use HeadlessChrome which create page source created by javascript
add a comment |
I am not completely clear on what you are trying to accomplish ultimately, but when it comes to receiving the source I:
1)
Added the missing apostrophe for your ocswssw.html argument using the open() method and
2)
Ran the code and received pretty much the same source as Google Chrome provides.
Result from BS:
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1" name="viewport"/>
<title>
OCSWSSW | Member Search
</title>
<link href="/Thinclient/favicon.ico" rel="shortcut icon" type="image/x-icon"/>
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css"/>
<link href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css"/>
<link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/icheck/square/blue.css" rel="stylesheet"/>
<link href="/Thinclient/Content/GlobalStyleSheet.css" rel="stylesheet"/>
<script type="text/javascript">
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script src="/Thinclient/Scripts/jquery-1.11.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/icheck.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js">
</script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div class="k-loading-mask" id="loadingMsg" style="width:100%;height:100%">
<span class="k-loading-text">
Loading...
</span>
<div class="k-loading-image">
<div class="k-loading-color">
</div>
</div>
</div>
<input id="hdPollingFrequency" type="hidden" value="32767"/>
<input id="hdPrivateComputerTimeout" type="hidden" value="32767"/>
<input id="hdPublicComputerTimeout" type="hidden" value="32767"/>
<input id="hdWarningDisplayDuration" type="hidden" value="0"/>
<input id="hdWindowsAuthentication" type="hidden" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.
</h1>
<input class="abortButton" type="button" value="Abort"/>
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript">
</script>
</body>
</html>
Result from Browser source
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>OCSWSSW | Member Search</title>
<link href="/Thinclient/favicon.ico" type="image/x-icon" rel="shortcut icon" />
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" />
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css" />
<link rel="stylesheet" href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css">
<link rel="stylesheet" href="/Thinclient/Content/icheck/square/blue.css" />
<link rel="stylesheet" href="/Thinclient/Content/GlobalStyleSheet.css" />
<script type="text/javascript" >
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script type="text/javascript" src="/Thinclient/Scripts/jquery-1.11.1.min.js"></script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/icheck.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js"></script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js"></script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div id="loadingMsg" class="k-loading-mask" style="width:100%;height:100%">
<span class="k-loading-text">Loading...</span>
<div class="k-loading-image">
<div class="k-loading-color"></div>
</div>
</div>
<input type="hidden" ID="hdPollingFrequency" value="32767"/>
<input type="hidden" ID="hdPrivateComputerTimeout" value= "32767"/>
<input type="hidden" ID="hdPublicComputerTimeout" value="32767"/>
<input type="hidden" ID="hdWarningDisplayDuration" value="0"/>
<input type="hidden" id="hdWindowsAuthentication" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.</h1>
<input type="button" value="Abort" class="abortButton" />
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript"></script>
</body>
</html>
<script>
//$( window ).load(
//$(".k-state-default").hover(function ()
// $(this).toggleClass("k-state-hover");
//)
//);
</script>
Is this not what you are looking for from Beautiful Soup?
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
|
show 1 more comment
it dynamic page (Ajax) you can't use bs4
, if you dont like selenium where browser popups you can add --headless
option to hide it. here example
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
options = Options()
options.add_argument('--headless')
#options.add_argument('--disable-gpu') # maybe needed if running on Windows.
driver = webdriver.Chrome(chrome_options=options)
print("Loading Page...")
driver.get('https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN/')
# wait max 20 second until ajax content rendered
print("Wait Ajax finished...")
WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.ID , 'MainForm')))
html = driver.execute_script("return document.documentElement.outerHTML")
Soup = BeautifulSoup(html, 'html.parser')
with open('ocswssw.html', 'w') as f:
sourceCode = Soup.prettify().encode('utf-8')
f.write(sourceCode)
print(sourceCode)
driver.quit()
what makes you pick theMainForm
element for your wait until?
– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53256639%2fwebscraping-not-able-to-get-the-source-code-of-page%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
The page is created by using javascript.
So,you can't get page source by only using requests/bs4
how to resoleve:use HeadlessChrome which create page source created by javascript
add a comment |
The page is created by using javascript.
So,you can't get page source by only using requests/bs4
how to resoleve:use HeadlessChrome which create page source created by javascript
add a comment |
The page is created by using javascript.
So,you can't get page source by only using requests/bs4
how to resoleve:use HeadlessChrome which create page source created by javascript
The page is created by using javascript.
So,you can't get page source by only using requests/bs4
how to resoleve:use HeadlessChrome which create page source created by javascript
answered Nov 12 at 6:35
user7196197
62
62
add a comment |
add a comment |
I am not completely clear on what you are trying to accomplish ultimately, but when it comes to receiving the source I:
1)
Added the missing apostrophe for your ocswssw.html argument using the open() method and
2)
Ran the code and received pretty much the same source as Google Chrome provides.
Result from BS:
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1" name="viewport"/>
<title>
OCSWSSW | Member Search
</title>
<link href="/Thinclient/favicon.ico" rel="shortcut icon" type="image/x-icon"/>
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css"/>
<link href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css"/>
<link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/icheck/square/blue.css" rel="stylesheet"/>
<link href="/Thinclient/Content/GlobalStyleSheet.css" rel="stylesheet"/>
<script type="text/javascript">
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script src="/Thinclient/Scripts/jquery-1.11.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/icheck.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js">
</script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div class="k-loading-mask" id="loadingMsg" style="width:100%;height:100%">
<span class="k-loading-text">
Loading...
</span>
<div class="k-loading-image">
<div class="k-loading-color">
</div>
</div>
</div>
<input id="hdPollingFrequency" type="hidden" value="32767"/>
<input id="hdPrivateComputerTimeout" type="hidden" value="32767"/>
<input id="hdPublicComputerTimeout" type="hidden" value="32767"/>
<input id="hdWarningDisplayDuration" type="hidden" value="0"/>
<input id="hdWindowsAuthentication" type="hidden" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.
</h1>
<input class="abortButton" type="button" value="Abort"/>
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript">
</script>
</body>
</html>
Result from Browser source
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>OCSWSSW | Member Search</title>
<link href="/Thinclient/favicon.ico" type="image/x-icon" rel="shortcut icon" />
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" />
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css" />
<link rel="stylesheet" href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css">
<link rel="stylesheet" href="/Thinclient/Content/icheck/square/blue.css" />
<link rel="stylesheet" href="/Thinclient/Content/GlobalStyleSheet.css" />
<script type="text/javascript" >
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script type="text/javascript" src="/Thinclient/Scripts/jquery-1.11.1.min.js"></script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/icheck.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js"></script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js"></script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div id="loadingMsg" class="k-loading-mask" style="width:100%;height:100%">
<span class="k-loading-text">Loading...</span>
<div class="k-loading-image">
<div class="k-loading-color"></div>
</div>
</div>
<input type="hidden" ID="hdPollingFrequency" value="32767"/>
<input type="hidden" ID="hdPrivateComputerTimeout" value= "32767"/>
<input type="hidden" ID="hdPublicComputerTimeout" value="32767"/>
<input type="hidden" ID="hdWarningDisplayDuration" value="0"/>
<input type="hidden" id="hdWindowsAuthentication" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.</h1>
<input type="button" value="Abort" class="abortButton" />
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript"></script>
</body>
</html>
<script>
//$( window ).load(
//$(".k-state-default").hover(function ()
// $(this).toggleClass("k-state-hover");
//)
//);
</script>
Is this not what you are looking for from Beautiful Soup?
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
|
show 1 more comment
I am not completely clear on what you are trying to accomplish ultimately, but when it comes to receiving the source I:
1)
Added the missing apostrophe for your ocswssw.html argument using the open() method and
2)
Ran the code and received pretty much the same source as Google Chrome provides.
Result from BS:
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1" name="viewport"/>
<title>
OCSWSSW | Member Search
</title>
<link href="/Thinclient/favicon.ico" rel="shortcut icon" type="image/x-icon"/>
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css"/>
<link href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css"/>
<link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/icheck/square/blue.css" rel="stylesheet"/>
<link href="/Thinclient/Content/GlobalStyleSheet.css" rel="stylesheet"/>
<script type="text/javascript">
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script src="/Thinclient/Scripts/jquery-1.11.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/icheck.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js">
</script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div class="k-loading-mask" id="loadingMsg" style="width:100%;height:100%">
<span class="k-loading-text">
Loading...
</span>
<div class="k-loading-image">
<div class="k-loading-color">
</div>
</div>
</div>
<input id="hdPollingFrequency" type="hidden" value="32767"/>
<input id="hdPrivateComputerTimeout" type="hidden" value="32767"/>
<input id="hdPublicComputerTimeout" type="hidden" value="32767"/>
<input id="hdWarningDisplayDuration" type="hidden" value="0"/>
<input id="hdWindowsAuthentication" type="hidden" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.
</h1>
<input class="abortButton" type="button" value="Abort"/>
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript">
</script>
</body>
</html>
Result from Browser source
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>OCSWSSW | Member Search</title>
<link href="/Thinclient/favicon.ico" type="image/x-icon" rel="shortcut icon" />
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" />
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css" />
<link rel="stylesheet" href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css">
<link rel="stylesheet" href="/Thinclient/Content/icheck/square/blue.css" />
<link rel="stylesheet" href="/Thinclient/Content/GlobalStyleSheet.css" />
<script type="text/javascript" >
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script type="text/javascript" src="/Thinclient/Scripts/jquery-1.11.1.min.js"></script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/icheck.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js"></script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js"></script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div id="loadingMsg" class="k-loading-mask" style="width:100%;height:100%">
<span class="k-loading-text">Loading...</span>
<div class="k-loading-image">
<div class="k-loading-color"></div>
</div>
</div>
<input type="hidden" ID="hdPollingFrequency" value="32767"/>
<input type="hidden" ID="hdPrivateComputerTimeout" value= "32767"/>
<input type="hidden" ID="hdPublicComputerTimeout" value="32767"/>
<input type="hidden" ID="hdWarningDisplayDuration" value="0"/>
<input type="hidden" id="hdWindowsAuthentication" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.</h1>
<input type="button" value="Abort" class="abortButton" />
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript"></script>
</body>
</html>
<script>
//$( window ).load(
//$(".k-state-default").hover(function ()
// $(this).toggleClass("k-state-hover");
//)
//);
</script>
Is this not what you are looking for from Beautiful Soup?
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
|
show 1 more comment
I am not completely clear on what you are trying to accomplish ultimately, but when it comes to receiving the source I:
1)
Added the missing apostrophe for your ocswssw.html argument using the open() method and
2)
Ran the code and received pretty much the same source as Google Chrome provides.
Result from BS:
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1" name="viewport"/>
<title>
OCSWSSW | Member Search
</title>
<link href="/Thinclient/favicon.ico" rel="shortcut icon" type="image/x-icon"/>
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css"/>
<link href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css"/>
<link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/icheck/square/blue.css" rel="stylesheet"/>
<link href="/Thinclient/Content/GlobalStyleSheet.css" rel="stylesheet"/>
<script type="text/javascript">
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script src="/Thinclient/Scripts/jquery-1.11.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/icheck.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js">
</script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div class="k-loading-mask" id="loadingMsg" style="width:100%;height:100%">
<span class="k-loading-text">
Loading...
</span>
<div class="k-loading-image">
<div class="k-loading-color">
</div>
</div>
</div>
<input id="hdPollingFrequency" type="hidden" value="32767"/>
<input id="hdPrivateComputerTimeout" type="hidden" value="32767"/>
<input id="hdPublicComputerTimeout" type="hidden" value="32767"/>
<input id="hdWarningDisplayDuration" type="hidden" value="0"/>
<input id="hdWindowsAuthentication" type="hidden" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.
</h1>
<input class="abortButton" type="button" value="Abort"/>
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript">
</script>
</body>
</html>
Result from Browser source
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>OCSWSSW | Member Search</title>
<link href="/Thinclient/favicon.ico" type="image/x-icon" rel="shortcut icon" />
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" />
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css" />
<link rel="stylesheet" href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css">
<link rel="stylesheet" href="/Thinclient/Content/icheck/square/blue.css" />
<link rel="stylesheet" href="/Thinclient/Content/GlobalStyleSheet.css" />
<script type="text/javascript" >
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script type="text/javascript" src="/Thinclient/Scripts/jquery-1.11.1.min.js"></script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/icheck.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js"></script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js"></script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div id="loadingMsg" class="k-loading-mask" style="width:100%;height:100%">
<span class="k-loading-text">Loading...</span>
<div class="k-loading-image">
<div class="k-loading-color"></div>
</div>
</div>
<input type="hidden" ID="hdPollingFrequency" value="32767"/>
<input type="hidden" ID="hdPrivateComputerTimeout" value= "32767"/>
<input type="hidden" ID="hdPublicComputerTimeout" value="32767"/>
<input type="hidden" ID="hdWarningDisplayDuration" value="0"/>
<input type="hidden" id="hdWindowsAuthentication" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.</h1>
<input type="button" value="Abort" class="abortButton" />
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript"></script>
</body>
</html>
<script>
//$( window ).load(
//$(".k-state-default").hover(function ()
// $(this).toggleClass("k-state-hover");
//)
//);
</script>
Is this not what you are looking for from Beautiful Soup?
I am not completely clear on what you are trying to accomplish ultimately, but when it comes to receiving the source I:
1)
Added the missing apostrophe for your ocswssw.html argument using the open() method and
2)
Ran the code and received pretty much the same source as Google Chrome provides.
Result from BS:
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1" name="viewport"/>
<title>
OCSWSSW | Member Search
</title>
<link href="/Thinclient/favicon.ico" rel="shortcut icon" type="image/x-icon"/>
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css"/>
<link href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css"/>
<link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"/>
<link href="/Thinclient/Content/icheck/square/blue.css" rel="stylesheet"/>
<link href="/Thinclient/Content/GlobalStyleSheet.css" rel="stylesheet"/>
<script type="text/javascript">
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script src="/Thinclient/Scripts/jquery-1.11.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/icheck.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js">
</script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js">
</script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div class="k-loading-mask" id="loadingMsg" style="width:100%;height:100%">
<span class="k-loading-text">
Loading...
</span>
<div class="k-loading-image">
<div class="k-loading-color">
</div>
</div>
</div>
<input id="hdPollingFrequency" type="hidden" value="32767"/>
<input id="hdPrivateComputerTimeout" type="hidden" value="32767"/>
<input id="hdPublicComputerTimeout" type="hidden" value="32767"/>
<input id="hdWarningDisplayDuration" type="hidden" value="0"/>
<input id="hdWindowsAuthentication" type="hidden" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.
</h1>
<input class="abortButton" type="button" value="Abort"/>
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript">
</script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript">
</script>
</body>
</html>
Result from Browser source
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>OCSWSSW | Member Search</title>
<link href="/Thinclient/favicon.ico" type="image/x-icon" rel="shortcut icon" />
<link href="/Thinclient/Content/bootstrap.071220161413.css" rel="stylesheet" type="text/css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.common-bootstrap.min.css" />
<link rel="stylesheet" href="/Thinclient/Content/kendo/kendo.bootstrap.min.css" />
<link href="/Thinclient/Content/ThinStyle.110820150951.css" rel="stylesheet" title="Blue" type="text/css" />
<link rel="stylesheet" href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css">
<link rel="stylesheet" href="/Thinclient/Content/icheck/square/blue.css" />
<link rel="stylesheet" href="/Thinclient/Content/GlobalStyleSheet.css" />
<script type="text/javascript" >
HomeURL = "#/forms/new/?table=0x800000000000003D&form=0x800000000000004D&command=0x8000000000000C2D";
AfterLoginData = null
LanguageDictionary = ;
LanguageDictionary.TC_COMMON = "OkButtonTextOK":"Ok","OkButtonTextContinue":"Continue","OkButtonTextYes":"Yes","OkButtonTextDelete":"Clear","CancelButtonTextCancel":"Cancel","CancelButtonTextNo":"No","CancelButtonTextLogout":"Logout","MiddleButtonTextNo":"No","AjaxRequestError":"The Web server does not respond currently. Please try again later.","UserIdleMessage":"You are innactive, do you want to continue or you disconnect?","ErrorTitle":"Error","ErrorHeaderTitle":"Application error","ErrorHeaderText":"An application error has occurred while processing the current request. The error was recorded and sent to the site administrator. Provide your administrator ID error below.","ErrorMessage":"Message:","ErrorIdentifier":"Identify:","ErrorDate":"Date:"
LanguageDictionary.TC_SEARCH = "OperatorNotEqual":"Not =","OperatorIsDefined":"Is Defined","OperatorIsNotDefined":"Is Not Defined","OperatorContains":"Contains","OperatorDoesNotContain":"Does not contain","OperatorBeginsWith":"Begins with","OperatorDoesNotBeginWith":"Does not begin with","OperatorIsEmpty":"Is Empty","OperatorIsNotEmpty":"Is not empty","CustomFiltersNotComplete":"One or more custom filters are not complete. Examine each custom filter and make sure that the valid search criteria are provided.","NavigateAwayFromSearchWithFilterSet":"You are about to leave this page without performing the search filters custom.","NoGlobalSearchPermissions":"Password","SearchDefinitionLostAlert":"The definition of research will be lost if the primary table is changed. Are you sure you want to change the primary table of the research."
LanguageDictionary.TC_FORM = "RequiredFieldsNotSet":"Unable to save the form data. Provide a value for all required fields.","NavigateAwayFromUnsavedForm":"You are about to exit the form without saving it","RefreshFormLosesModifiedData":"The data of the form has changed. The changes you made will be lost when you refresh the form. Do you want to continue?","SaveDataBeforeClose":"The data of the form has changed. Do you want to save them before closing?","DeleteWarning":"The form data will be deleted. Are you sure you want to continue?","DeleteSecondaryWarning":"You are about to delete the form data.","RequiredField":"This is a required field","InvalidFormat":"The format for this field is not valid"
LanguageDictionary.TC_GLOBALSEARCH = "CollapseAllLabel":"Reduce everything","ExpandAllLabel":"About expand"
LanguageDictionary.TC_WIDGETS = "CallListItem":"Appeal","FaxListItem":"Fax","SmsListItem":"SMS"
</script>
<script type="text/javascript" src="/Thinclient/Scripts/jquery-1.11.1.min.js"></script>
<script src="/Thinclient/Scripts/jquery-migrate-1.2.1.min.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/icheck.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.all.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.timezones.min.js"></script>
<script src="/Thinclient/Scripts/kendo/kendo.aspnetmvc.min.js"></script>
<script src="/Thinclient/Scripts/kendo/cultures/kendo.culture.en-US.min.js"></script>
<script>
kendo.culture("en-US");
</script>
</head>
<body class="k-content">
<div id="loadingMsg" class="k-loading-mask" style="width:100%;height:100%">
<span class="k-loading-text">Loading...</span>
<div class="k-loading-image">
<div class="k-loading-color"></div>
</div>
</div>
<input type="hidden" ID="hdPollingFrequency" value="32767"/>
<input type="hidden" ID="hdPrivateComputerTimeout" value= "32767"/>
<input type="hidden" ID="hdPublicComputerTimeout" value="32767"/>
<input type="hidden" ID="hdWarningDisplayDuration" value="0"/>
<input type="hidden" id="hdWindowsAuthentication" value="false"/>
<div class="container">
<div id="content">
</div>
</div>
<div id="loading" style="display: none;">
<h1>
We are processing your request. Please be patient.</h1>
<input type="button" value="Abort" class="abortButton" />
</div>
<script id="taskpadGroupTmpl" type="text/x-jquery-tmpl">
<div class="panelBlock">
<div class="panelTitle"><div class="panelLink"><a class="panelDD-dn" id="$DisplayName" href="#">$DisplayName</a></div><div class = "imgPanel">
<a class="imgPanelDD" href="#"> </a></div>
</div>
<div class="panelContent1" id="panelContent1 + $DisplayName">
<ul>
tmpl(TaskItemCollection) "#taskpadItemTmpl"
</ul>
</div>
</div>
</script>
<script id="KendoTestTemplate" type="text/x-kendo-template">
<h2>#= test #</h2>
<ul>
#= kendo.render(kendo.template($("\#KendoTestLiTemplate").html()), litest) #
</ul>
</script>
<script id="KendoTestLiTemplate" type="text/x-kendo-template">
<li>#= displayName#</li>
</script>
<script id="ErrorTemplate" type="text/x-jquery-tmpl">
<div class="errorMsg k-widget k-notification k-notification-error " data-role="alert" style="display: block; opacity: 1;">
<div class="k-notification-wrap">
<span class="k-icon k-i-note">
error
</span>
$errorMsg
<span class="k-icon k-i-close">
Hide
</span>
</div>
</div>
</script>
<script id="HelpButtonTemplate" type="text/x-jquery-tmpl">
<button class="k-button k-primary helpButton" id="$id" onclick="return false;">?</button>
</script>
<script id="IconTemplate" type="text/x-jquery-tmpl">
<span class="k-icon $icon"></span>
</script>
<script id="trash" type="text/x-kendo-template">
<li style="background: url(./Images/#=item.ImageId#.#=item.ImageHash#.#=item.ImageFileExtension#) no-repeat;"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
<li class="#=GetCssClass(item.ContentType)#"><a href="#=item.ActionCommand#" #if (item.ShowInNewWindow)# target="_blank" ## class="#if (!item.ShowInNewWindow)# ajax-links # if (item.ContentType == 'Email')# mailto-links ## linkTaskItem">#=item.DisplayName#</a></li>
</script>
<script id="taskpadItemTmpl" type="text/x-jquery-tmpl">
if ImageId
<li style="background: url(./Images/$ImageId.$ImageHash.$ImageFileExtension) no-repeat;"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
else
<li class="$GetCssClass(ContentType)"><a href="$ActionCommand" if ShowInNewWindow target="_blank" /if class="if !ShowInNewWindow ajax-links /if if (ContentType == 'Email') mailto-links /if linkTaskItem">$DisplayName</a></li>
/if
</script>
<script id="buttonBarButtonTmpl" type="text/x-jquery-tmpl">
<button value="submit" class="submitBtn k-button k-primary" data-actionCommand="$Action" data-Disabled="$Disabled" data-Visible="$Visible" data-Name="$Name">
<span>$DisplayName</span>
</button>
</script>
<script src="/Thinclient/Scripts/jquery.filedownload.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.tmpl.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/pubsub.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/jquery.form.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/bootstrap.min.150420151637.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/sameheight.min.150420151641.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/Core.141120161617.js" type="text/javascript"></script>
<script src="/Thinclient/Scripts/PivotalThinClient.150420151641.js" type="text/javascript"></script>
</body>
</html>
<script>
//$( window ).load(
//$(".k-state-default").hover(function ()
// $(this).toggleClass("k-state-hover");
//)
//);
</script>
Is this not what you are looking for from Beautiful Soup?
edited Nov 12 at 6:44
answered Nov 12 at 6:35
Jonathan
6619
6619
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
|
show 1 more comment
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
please run in browser and compare to the code you receive
– Sandhya Thotakura
Nov 12 at 6:37
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
I added the source code from Chrome in the answer below the result from BS4. I don't see anything missing or incomplete here..
– Jonathan
Nov 12 at 6:46
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Hi don't use "view page source".. instead use "ctrl+shift+I" and go to "elements" that is the original source code.
– Sandhya Thotakura
Nov 12 at 6:47
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
Are you referring to the drop-down arrows for each parent element's children? What's different/missing from this?
– Jonathan
Nov 12 at 7:01
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
in the website you can see fields first name, last name but the script wont get that
– Sandhya Thotakura
Nov 12 at 7:02
|
show 1 more comment
it dynamic page (Ajax) you can't use bs4
, if you dont like selenium where browser popups you can add --headless
option to hide it. here example
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
options = Options()
options.add_argument('--headless')
#options.add_argument('--disable-gpu') # maybe needed if running on Windows.
driver = webdriver.Chrome(chrome_options=options)
print("Loading Page...")
driver.get('https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN/')
# wait max 20 second until ajax content rendered
print("Wait Ajax finished...")
WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.ID , 'MainForm')))
html = driver.execute_script("return document.documentElement.outerHTML")
Soup = BeautifulSoup(html, 'html.parser')
with open('ocswssw.html', 'w') as f:
sourceCode = Soup.prettify().encode('utf-8')
f.write(sourceCode)
print(sourceCode)
driver.quit()
what makes you pick theMainForm
element for your wait until?
– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
add a comment |
it dynamic page (Ajax) you can't use bs4
, if you dont like selenium where browser popups you can add --headless
option to hide it. here example
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
options = Options()
options.add_argument('--headless')
#options.add_argument('--disable-gpu') # maybe needed if running on Windows.
driver = webdriver.Chrome(chrome_options=options)
print("Loading Page...")
driver.get('https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN/')
# wait max 20 second until ajax content rendered
print("Wait Ajax finished...")
WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.ID , 'MainForm')))
html = driver.execute_script("return document.documentElement.outerHTML")
Soup = BeautifulSoup(html, 'html.parser')
with open('ocswssw.html', 'w') as f:
sourceCode = Soup.prettify().encode('utf-8')
f.write(sourceCode)
print(sourceCode)
driver.quit()
what makes you pick theMainForm
element for your wait until?
– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
add a comment |
it dynamic page (Ajax) you can't use bs4
, if you dont like selenium where browser popups you can add --headless
option to hide it. here example
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
options = Options()
options.add_argument('--headless')
#options.add_argument('--disable-gpu') # maybe needed if running on Windows.
driver = webdriver.Chrome(chrome_options=options)
print("Loading Page...")
driver.get('https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN/')
# wait max 20 second until ajax content rendered
print("Wait Ajax finished...")
WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.ID , 'MainForm')))
html = driver.execute_script("return document.documentElement.outerHTML")
Soup = BeautifulSoup(html, 'html.parser')
with open('ocswssw.html', 'w') as f:
sourceCode = Soup.prettify().encode('utf-8')
f.write(sourceCode)
print(sourceCode)
driver.quit()
it dynamic page (Ajax) you can't use bs4
, if you dont like selenium where browser popups you can add --headless
option to hide it. here example
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
options = Options()
options.add_argument('--headless')
#options.add_argument('--disable-gpu') # maybe needed if running on Windows.
driver = webdriver.Chrome(chrome_options=options)
print("Loading Page...")
driver.get('https://onlineservices.ocswssw.org/Thinclient/Public/PR/EN/')
# wait max 20 second until ajax content rendered
print("Wait Ajax finished...")
WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.ID , 'MainForm')))
html = driver.execute_script("return document.documentElement.outerHTML")
Soup = BeautifulSoup(html, 'html.parser')
with open('ocswssw.html', 'w') as f:
sourceCode = Soup.prettify().encode('utf-8')
f.write(sourceCode)
print(sourceCode)
driver.quit()
answered Nov 12 at 8:35
ewwink
9,37922236
9,37922236
what makes you pick theMainForm
element for your wait until?
– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
add a comment |
what makes you pick theMainForm
element for your wait until?
– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
what makes you pick the
MainForm
element for your wait until?– Kamikaze_goldfish
Nov 12 at 15:56
what makes you pick the
MainForm
element for your wait until?– Kamikaze_goldfish
Nov 12 at 15:56
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
because this is one of element that presence after ajax loaded
– ewwink
Nov 12 at 21:07
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
So was it just a randomly picked element or was it specific to what you wanted to interact with?
– Kamikaze_goldfish
Nov 12 at 23:29
it can be randomly
– ewwink
Nov 13 at 8:06
it can be randomly
– ewwink
Nov 13 at 8:06
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53256639%2fwebscraping-not-able-to-get-the-source-code-of-page%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
What do you mean by "is not complete"? What you expect to find in source code?
– Andersson
Nov 12 at 6:12
if you run the url in chrome and paste the soup in a txt they don't match
– Sandhya Thotakura
Nov 12 at 6:17
Of course they will not match: in Chrome you see rendered page with executed JavaScript.
requests
returns you page source.... So what is your expected output?– Andersson
Nov 12 at 6:19
I want to search social workers in the website. with the output I receive I can't do that.
– Sandhya Thotakura
Nov 12 at 6:21
Do you mean "Corporate names", e.g. " A. Bacchus Social Work Professional Corporation"?
– Andersson
Nov 12 at 6:22