The Web.Contents() function in M is the key to getting data from web pages and web services, and has a number of useful – but badly documented – options that make it easier to construct URLs for your web service calls.
Consider the following URL:
https://data.gov.uk/api/3/action/package_search?q=cows
It is a call to the metadata API (documentation here) for https://data.gov.uk/, the UK government’s open data portal, and returns a JSON document listing all the datasets found for a search on the keyword “cows”. You can make this call using Web.Contents() quite easily like so:
Web.Contents( "https://data.gov.uk/api/3/action/package_search?q=cows" )
However, instead of having one long string for your URL (which will probably need to be constructed in a separate step) you can use the RelativePath and Query options with Web.Contents(). They are given in the second parameter of the function and passed through as fields in a record. RelativePath adds some extra text to the base URL given in the first parameter of the function, while Query allows you to add query parameters to the URL, and is itself a record.
So, taking the example above, if the base URL for the API is https://data.gov.uk/api we can use these options like so:
Web.Contents( "https://data.gov.uk/api", [ RelativePath="3/action/package_search", Query=[q="cows"] ] )
RelativePath is just the string “3/action/package_search” and is added to the base URL. There is just one query parameter “q”, the search query, and the search term is “cows”, so Query takes a record with one field: [q="cows"]. If you want to specify multiple query parameters you just need to add more fields to the Query record; for example:
Web.Contents( "https://data.gov.uk/api", [ RelativePath="3/action/package_search", Query= [ q="cows", rows="20" ] ] )
This generates a call that returns 20 results, rather than the default 10:
https://data.gov.uk/api/3/action/package_search?q=cows&rows=20
Obviously these options make it easier to construct URLs and the code is much clearer, but there are also other benefits to using these options which I’ll cover in another blog post soon.
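To show where the call fits in a complete query, here is a minimal sketch that parses the response and turns the list of datasets into a table. It assumes the response follows the usual CKAN shape, with a "result" record containing a "results" list; the step names and the "Dataset" column name are my own choices for illustration.

```powerquery
let
    // Same call as above: base URL plus the RelativePath and Query options
    Response = Web.Contents(
        "https://data.gov.uk/api",
        [
            RelativePath = "3/action/package_search",
            Query = [q = "cows", rows = "20"]
        ]
    ),
    // Parse the JSON document returned by the API
    Parsed = Json.Document(Response),
    // Assumption: the datasets are in result.results (CKAN-style response)
    Datasets = Parsed[result][results],
    // One row per dataset record; "Dataset" is an illustrative column name
    Output = Table.FromList(Datasets, Splitter.SplitByNothing(), {"Dataset"})
in
    Output
```

Note that the value for rows is passed as text: the fields of the Query record need to be text values (or lists of text values), not numbers.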
Note: at the time of writing there is a bug that causes the value given in RelativePath to be appended twice when the Web.Page() function is also used. Hopefully this will be fixed soon.
UPDATE January 2021: This blog post has more details on using the Query option: https://blog.crossjoin.co.uk/2021/01/10/handling-multiple-url-query-parameters-with-the-same-name-using-web-contents-in-power-query-power-bi/
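In short, the approach described in that post is to give the field in the Query record a list of text values when the same parameter name has to appear more than once. A minimal sketch, assuming a hypothetical endpoint at https://example.com/api that accepts a repeated "properties" parameter:

```powerquery
let
    Source = Web.Contents(
        // Hypothetical base URL, used only for illustration
        "https://example.com/api",
        [
            RelativePath = "items",
            // A list value should produce one properties=... pair per item
            Query = [properties = {"name", "description", "price"}]
        ]
    )
in
    Source
```

This should generate a URL ending in items?properties=name&properties=description&properties=price, although it is worth checking the actual request with a tool like Fiddler.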
2018-05-21 Still not fixed. Thanks for your solution, it saved my day.
Hi Chris, quick question which keeps puzzling us. After finding this brilliant solution of yours we proceeded but ran into a new problem. The API we are calling uses the same query option 4 times.
The URL would be something along the lines of:
https://domain/path?&properties=name&properties=description&properties=price&hapikey=xxxxxxxxxxxxxxxxxxxxxx
In your example this would lead to:
Query =
[
    properties = "value1",
    properties = "value2",
    properties = "value3",
    properties = "value4",
    hapikey = "xxxxxxxxxxxxxxxxxxxxxx"
]
I feared this would pose a problem: knowing how standard arrays work, the last value assigned to properties would overwrite all previous ones. And indeed Power BI M actually warns about this problem.
Any idea how to circumvent this?
sincerely, Wouter
Hmm, interesting question. Can you try something like [ properties = "value1&properties=""value2""&properties=""value3""" ]?
Thank you very much for your reply. Unfortunately Power BI encodes all data between the quotes, and although it skips the quotes, the ampersand and the equals sign both get encoded, rendering the string unusable. I even installed Netmon to observe this happening (thankfully the API was reachable using http instead of https). Again, I am at a loss. Any new ideas?
Hmm, no – I would need to do some tests to see if there’s a workaround in M
If you would do that, that would be great, but that sounds like something I cannot ask of you 🙂
I’ve tried a few things but I can’t get anything to work. Sorry…
Nevertheless, thank you very much for getting back to me. Much appreciated.
Just a little feedback. I pushed everything into the RelativePath option. Problem solved. Custom functions are also not allowed, so I have integrated those into all the table code. 🙂
Interesting, thanks!
Hi Chris,
Thanks for this article.
I have run into an issue where the data from the website does not seem to be returned correctly if I use multiple query string parameters.
For example,
this works fine and the data is returned correctly (not using the Query option):
Source = Json.Document(Web.Contents("http://useragentstring.com/?uas="&userAgent_URLEncoded&"&getJSON=all")),
but this will cause a failure in Scheduled Data Refresh in Power BI, so I have to use the Query option, like below.
Source = Json.Document(Web.Contents("http://useragentstring.com/",[Query=[uas=userAgent_URLEncoded,getJSON="all"]])),
However, it seems that the data is not being parsed correctly, and I get unknown values for most of the records.
Try using
userAgent_URLEncoded = Mozilla%2F5.0%20%28Windows%20NT%2010.0%3B%20Win64%3B%20x64%29%20AppleWebKit%2F537.36%20%28KHTML%20%20like%20Gecko%29%20Chrome%2F69.0.3497.100%20Safari%2F537.36
Do you know why?
Thanks
I don’t know for sure, but I’ve heard of similar problems. I need to do some research to find out what’s going on here…
We have tried using your method, and it works for the PBIX and updates in the workbook, but when we upload to PBIS and try to run the refresh there, the only APIs that will connect are those that do not use your method of breaking the relative path out from the main URL string. All of our data sources (APIs) that are called with the relative path in the main URL string work, and those where we use your method to break them out into their own relative path are unable to connect to the API for data refresh via PBIS. The same queries that work on the desktop do not connect or work in PBIS.
Hmm, that suggests something else is going on here. Have you turned off data privacy checks in Power BI Desktop? If so, does refresh fail when you turn on the data privacy checks?
What about if the query parameter name contains a pipe?
Like this – query_start_date|lt
(not me, but the API I am trying to hit) – is there any way to wrap it so Power BI accepts it? It currently throws an Invalid Identifier error
Something like this should work:
#"query_start_date|lt"
This worked perfectly for me but then I found that it requires a gateway. I’m doing straight web calls, any idea how to skip the gateway?
I can’t get scheduled refresh to work in any way. I have even checked for 404 errors. It works perfectly in Power BI Desktop. It is the Azure DevOps API.
I’m getting: “We reached the end of the buffer.. The exception was raised by the IDataReader interface. Table: Query1”
let
    Capacity = (ProjectSK as text, IterationSK as text, AssignedToUserSK as text) =>
        let
            Url = Web.Contents("https://dev.azure.com", [RelativePath="/[Companyname]/" & ProjectSK & "/" & "_apis/work/teamsettings/iterations/" & IterationSK & "/capacities/" & AssignedToUserSK, ManualStatusHandling={404}, Query=[#"api-version"="5.0"]]),
            Origen = Table.FromColumns({Lines.FromBinary(Url, null, null, 65001)}),
            #"JSON analizado" = Table.TransformColumns(Origen,{},Json.Document),
            #"Se expandió Column1" = Table.ExpandRecordColumn(#"JSON analizado", "Column1", {"teamMember", "activities", "daysOff", "url", "_links"}, {"Column1.teamMember", "Column1.activities", "Column1.daysOff", "Column1.url", "Column1._links"}),
            #"Se expandió Column1.activities" = Table.ExpandListColumn(#"Se expandió Column1", "Column1.activities"),
            #"Se expandió Column1.activities1" = Table.ExpandRecordColumn(#"Se expandió Column1.activities", "Column1.activities", {"capacityPerDay", "name"}, {"Column1.activities.capacityPerDay", "Column1.activities.name"}),
            #"Columnas quitadas" = Table.RemoveColumns(#"Se expandió Column1.activities1",{"Column1.teamMember", "Column1.activities.name", "Column1.daysOff", "Column1.url", "Column1._links"}),
            #"Columnas con nombre cambiado" = Table.RenameColumns(#"Columnas quitadas",{{"Column1.activities.capacityPerDay", "capacityPerDay"}}),
            GetMetadata = Value.Metadata(Url),
            GetResponseStatus = GetMetadata[Response.Status],
            Output = if GetResponseStatus=404 then Table.FromRecords({[capacityPerDay = 0]}) else #"Columnas con nombre cambiado"
        in
            Output
in
    Capacity
It’s hard to say what’s going on here, but it doesn’t look like a problem with your code.
It is curious how the Power BI Service differs from Desktop. I found my solution by taking [CompanyName] out of the relative path and joining it to the URL parameter in Web.Contents. It doesn’t make much sense, but it is actually working.
Since this (albeit very useful) post was written in 2016, it ends with “Note: at the time of writing there is a bug that causes the value given in RelativePath to be appended twice when the Web.Page() function is also used. Hopefully this will be fixed soon.”
This bug seems to be ongoing still today.
I can’t get my function to work with the Query parameter when I use Web.Page() around the Web.Contents("url", [Query=[term=value]])
However it works when I use Web.Page(Web.Contents("url/?term=value")) (No
I really wanted to let my custom connection run through a gateway but stumbled upon the unsupported source error because of Web.Contents() function – hence me reading this post.
Now I seem to have two problems instead of one. 😐
Can you try using Web.BrowserContents instead of Web.Page?
I have an API URL which works like this:
https://jira.company.com/rest/api/2/issue/RC-2345/worklog
to get the data from JIRA. The parameter is RC-2345 and the rest of the URL is static. How can we create a similar function that passes the JIRAID as a parameter? I tried the sample function below but it doesn’t pass it correctly.
= (JIRAID as text) =>
let Source = Json.Document(Web.Contents("https://jira.company.com/rest/api/2/issue", [Query=[JIRAID]], [RelativePath="worklog/"]))
in Source
Why are you using the Query option here? Both JIRAID and worklog should be passed in through RelativePath here I think
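A minimal sketch of that suggestion, assuming the JIRA ID only needs to be placed between the static base URL and "worklog". All the options go into a single record in the second parameter, rather than being passed as separate arguments:

```powerquery
(JIRAID as text) =>
let
    Source = Json.Document(
        Web.Contents(
            // Static base URL taken from the question above
            "https://jira.company.com/rest/api/2/issue",
            // The dynamic part of the path goes in RelativePath
            [RelativePath = JIRAID & "/worklog"]
        )
    )
in
    Source
```

Invoking the function with "RC-2345" should then request https://jira.company.com/rest/api/2/issue/RC-2345/worklog.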
Have you ever seen an issue where it adds a “/” into the url?
The API I’m trying to hit is something like this:
www.website.com/api/measures/history?metrics=metric1,metric2,metric3&project=ProjectName
RelativePath = "?metrics=metric1,metric2,metric3&project="
Query = [project=ProjectName]
The problem is that it shoots out
http://www.website.com/api/measures/history/?metrics=metric1,metric2,metric3&project=ProjectName
because it adds a “/” automatically before the relativepath.
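One way to avoid the extra slash may be to keep RelativePath for path segments only and to move every query parameter into the Query record. This is only a sketch: it assumes the API treats "history" as the last path segment, that "ProjectName" stands for a text value, and that the service accepts URL-encoded commas in the metrics value (values in Query are URL-encoded).

```powerquery
let
    Source = Web.Contents(
        "http://www.website.com/api/measures",
        [
            // Path segment only, with no "?" and no parameters
            RelativePath = "history",
            // All query parameters are passed as fields of the Query record
            Query = [
                metrics = "metric1,metric2,metric3",
                project = "ProjectName"
            ]
        ]
    )
in
    Source
```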
Hi Chris,
I have been exploring using USERNAME() or USERPRINCIPALNAME() to dynamically assign the parameter value in the Web.Contents query. The idea is to extract relevant data based on the Power BI login user. Using RLS is possible; however, this alternative (if possible) would greatly reduce the number of records extracted for each user.
Do you think this is possible ?
No, you can’t pass a value from DAX back to a Power Query function, you will have to use RLS
Chris, first of all, thanks for this incredibly helpful post! I’ve tried to follow what you have done and the code works fine in the desktop, but I’m still having problems with not being able to refresh the data when I have published to the powerbi service.
I’m actually connecting to an ODATA service, and had to figure out how to use params with ‘$’ signs in the name, so the example below might be helpful to others doing the same thing.
I did try and import the data using an ODATA data source, but was having problems, hence ended up going this route:
BaseUri = "https://myurl/v1/",
QueryRecord =
    [
        RelativePath = "tickets/",
        Query =
            [
                token = "xxxxxxxxxx",
                #"$select" = "id,type,origin,baseStatus,createdDate,slaSolutionDate,slaSolutionTime,resolvedIn,urgency,serviceFull,subject",
                #"$filter" = "createdDate gt " & FormattedDate & "-05:00 and baseStatus ne 'Canceled'",
                #"$top" = "1000",
                #"$skip" = "1"
            ]
    ],
Source = Json.Document(
    Web.Contents( BaseUri, QueryRecord )
)
Anything obvious I am doing wrong?
Thanks
Dan
Hi,
I would also like to know the correct way to express “select” parameter from Query.
Thanks,
Razvan
I am also extremely interested in a solution for this problem. We are trying to use the Microsoft Graph API “skiptoken” element, which is preceded by a dollar sign too.
I have been facing this Issue for a week and tried various approaches to fix it with no positive results 🙁
I have set a Parameter that provides the ability to make a Dynamic URL with the Following Code which would later be merged with a list-to-table Query:
let
    Document = (Numero as number) as table =>
        let
            Source = Web.Contents("http://www.sanaa.hn/colonias/dbo_fechas_agua_list.php?masterkey1="&Number.ToText(Numero)&"&mastertable=dbo.barriocolonia",[RelativePath="Numero"]),
            #"Extracted Table From Html" = Html.Table(Source, {{"Column1", "TABLE[id='form_grid_8'] > TR > :nth-child(1), TABLE[id='form_grid_8'] > * > TR > :nth-child(1)"}, {"Column2", "TABLE[id='form_grid_8'] > TR > :nth-child(2), TABLE[id='form_grid_8'] > * > TR > :nth-child(2)"}, {"Column3", "TABLE[id='form_grid_8'] > TR > :nth-child(3), TABLE[id='form_grid_8'] > * > TR > :nth-child(3)"}, {"Column4", "TABLE[id='form_grid_8'] > TR > :nth-child(4), TABLE[id='form_grid_8'] > * > TR > :nth-child(4)"}}, [RowSelector="TABLE[id='form_grid_8'] > TR, TABLE[id='form_grid_8'] > * > TR"]),
            #"Promoted Headers" = Table.PromoteHeaders(#"Extracted Table From Html", [PromoteAllScalars=true]),
            #"Changed Type" = Table.TransformColumnTypes(#"Promoted Headers",{{"Id Colonia", Int64.Type}, {"Barrio o Colonia", type text}, {"Salida de Tanque", type text}, {"Horario", type text}})
        in
            #"Changed Type"
in
    Document
A static example of one of the URLs would be: "http://www.sanaa.hn/colonias/dbo_fechas_agua_list.php?masterkey1=1&mastertable=dbo.barriocolonia"
How would the RelativePath and Query solution be applied in this context to solve the refresh problem in the PBI Service?
Just Fixed it with help of a friend, turns out it was simpler than I thought:
Source = Web.Contents("http://www.sanaa.hn/colonias/dbo_fechas_agua_list.php?masterkey1=1&mastertable=dbo.barriocolonia",[Query=[masterkey1=Number.ToText(Numero),mastertable="dbo.barriocolonia"]]),
I have a link which contains some special characters such as “%24” in the query,
Ex: http://api.*****.com/topics/36261/mentions-trendline?%24date_from=2019-01-01T17%3A00%3A00Z&%24date_to=2019-12-31T17%3A00%3A00Z
How can I change it to use RelativePath in Power Query?
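Since %24 is the URL-encoded form of a dollar sign and %3A is a colon, one option is to pass the decoded names and values through the Query record and let Web.Contents() do the encoding. A sketch under that assumption, using quoted identifiers for the field names that start with "$" and keeping the masked host name exactly as it appears in the question:

```powerquery
let
    Source = Web.Contents(
        // Host name is masked in the question and left as-is here
        "http://api.*****.com",
        [
            RelativePath = "topics/36261/mentions-trendline",
            Query = [
                // #"..." allows field names that are not valid plain identifiers
                #"$date_from" = "2019-01-01T17:00:00Z",
                #"$date_to" = "2019-12-31T17:00:00Z"
            ]
        ]
    )
in
    Source
```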
I have a similar problem with refresh after uploading and I am unable to implement a RelativePath. Either my dimension is too big or my syntax is not correct. How would you do this? The line without the RelativePath is
WebCall = Json.Document(Web.Contents(URL & "/webapp/api/v1/salesOrder/?pageSize=1000&sort=-id&page="&Text.From([Page])&"", [Headers=[AuthenticationToken="" & token & "", #"Content-Type"="application/json", Accept="application/json"]]))
Best regards
Raphael Schmidt
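One possible shape for this call, written as a function so it stands on its own. It assumes the base address can be written as a fixed literal ("https://example.com" is a placeholder for the value held in URL) and that the page number and token are passed in as parameters; RelativePath, Query and Headers can all sit in the same options record.

```powerquery
// Assumption: "https://example.com" stands in for the fixed base address
(Page as number, token as text) =>
let
    WebCall = Json.Document(
        Web.Contents(
            "https://example.com",
            [
                RelativePath = "webapp/api/v1/salesOrder/",
                // Query values must be text, so the page number is converted
                Query = [pageSize = "1000", sort = "-id", page = Text.From(Page)],
                Headers = [
                    AuthenticationToken = token,
                    #"Content-Type" = "application/json",
                    Accept = "application/json"
                ]
            ]
        )
    )
in
    WebCall
```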
Chris Webb you beaut! Saved the day with this blog post.
Hi Chris,
I am having a very similar issue with trying to connect to a SharePoint.Files() source.
I need to be able to access a set of files that over time will move from a Development SharePoint folder to an Operations folder.
My original plan was to connect to an excel file that would contain the url pathways (root, path, file), and then load my data using parameters against that excel file.
It works a treat in desktop, but then I get the “Dynamic data source” error once published to a workspace.
Is it possible to use a similar “relative path” solution using SharePoint.Files() as a source?
Hi Chris,
Dynamic URL with Web.Contents and RelativePath worked well on Power BI Report Server (May 2020).
But since we upgraded to Power BI Report Server (October 2020) the refresh failed with this beautiful message :
[0] -1056505856: COM error: mscorlib, An item with the same key has already been added..
[1] -1056505856: COM error: System.Core, Sequence contains no elements.
[2] -1055784828: The database operation was cancelled because of an earlier failure.
That looks like a bug – can you open a support case?
We’re getting the same error and in our case, it looks like it’s being caused by a Table.Combine statement – it worked fine before the October release and now fails to refresh on PBI Server. Not sure if that’s your issue, but if so, just here to tell you that I’m not aware of a work-around….
Need help with Power Query to schedule dynamic refresh of data and keep old data
Hello,
I need to keep the data for the past 24 hours that I refresh in Power BI Desktop (that’s how I model my report and dataset).
I have a Premium and Pro account.
I have set the RangeStart and RangeEnd parameters.
I want to schedule data refresh and update the data from the first point every 15 minutes in the Power BI workspace (here I get an error saying that my data source is not supported for refresh: “This dataset includes a dynamic data source.”)
I have read this blog post about “Faking Out” Web.Contents (http://blog.datainspirations.com/2018/02/17/dynamic-web-contents-and-power-bi-refresh-errors/), but I cannot seem to get it working.
Below is my query, any help would be much appreciated:
let
    EntitiesPerPage = 500,
    Limit = "&limit=" & Text.From(EntitiesPerPage),
    Url = "https://*******.********.com/api/1/rest/public/runtime/dev_org2?org_wide=true&last_hours=24" & Limit,
    GetJson = (Url) =>
        let
            RawData = Web.Contents(Url),
            Json = Json.Document(RawData)
        in
            Json,
    GetEntityCount = () =>
        let
            Url = Url & "&offset=0",
            Json = GetJson(Url),
            Count = Json[#"response_map"],
            Count1 = Count[#"total"]
        in
            Count1,
    GetPage = (Index) =>
        let
            //(option A) offset equal to previous row count
            offset = "&offset=" & Text.From(Index * EntitiesPerPage),
            //(option B) offset equal to page number
            //offset = "&offset=" & Text.From(Index),
            Url = Url & offset,
            Json = GetJson(Url),
            Value = Json[#"response_map"],
            Value1 = Value[#"entries"]
        in
            Value1,
    EntityCount = GetEntityCount(),
    PageCount = Number.RoundUp(EntityCount / EntitiesPerPage),
    PageIndices = { 0 .. PageCount - 1 },
    Pages = List.Transform(PageIndices, each GetPage(_)),
    Entities = List.Union(Pages),
    Table = Table.FromList(Entities, Splitter.SplitByNothing(), null, null, ExtraValues.Error),
    #"Expanded Column1" = Table.ExpandRecordColumn(Table, "Column1", {"documents", "state_timestamp", "error_documents", "label", "path_id", "state", "create_time", "duration", "cc_label", "runtime_label"}, {"Column1.documents", "Column1.state_timestamp", "Column1.error_documents", "Column1.label", "Column1.path_id", "Column1.state", "Column1.create_time", "Column1.duration", "Column1.cc_label", "Column1.runtime_label"}),
    #"Renamed Columns" = Table.RenameColumns(#"Expanded Column1",{{"Column1.cc_label", "cc_label"}, {"Column1.create_time", "create_time"}, {"Column1.documents", "documents"}, {"Column1.duration", "duration"}, {"Column1.error_documents", "error_documents"}, {"Column1.label", "label"}, {"Column1.path_id", "path_id"}, {"Column1.state", "state"}, {"Column1.runtime_label", "runtime_label"}, {"Column1.state_timestamp", "state_timestamp"}}),
    #"Changed Type" = Table.TransformColumnTypes(#"Renamed Columns",{{"state_timestamp", type datetime}, {"create_time", type datetime}}),
    #"Filtered Rows" = Table.SelectRows(#"Changed Type", each [state_timestamp] >= RangeStart and [state_timestamp] < RangeEnd)
in
    #"Filtered Rows"
Regards,
Aleksandar
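For a query like this, the paging part could be restructured so that Web.Contents() always receives the same literal base URL and every varying value travels through the Query record. The sketch below is untested against this API and makes a few assumptions: the response keeps the response_map, total and entries fields used above, the masked host name is kept exactly as written in the question, and List.Combine is swapped in for List.Union to concatenate the pages without removing duplicates.

```powerquery
let
    EntitiesPerPage = 500,
    // Fetch one page; returns the response_map record for that page
    GetPage = (Index as number) =>
        let
            RawData = Web.Contents(
                // Host name is masked in the question and left as-is here
                "https://*******.********.com",
                [
                    RelativePath = "api/1/rest/public/runtime/dev_org2",
                    Query = [
                        org_wide = "true",
                        last_hours = "24",
                        limit = Text.From(EntitiesPerPage),
                        // Option A from the original: offset is a row count
                        offset = Text.From(Index * EntitiesPerPage)
                    ]
                ]
            ),
            Parsed = Json.Document(RawData)
        in
            Parsed[response_map],
    // The first page is requested once just to read the total
    EntityCount = GetPage(0)[total],
    PageCount = Number.RoundUp(EntityCount / EntitiesPerPage),
    Pages = List.Transform({0 .. PageCount - 1}, each GetPage(_)[entries]),
    // List.Combine concatenates the pages into a single list of records
    Entities = List.Combine(Pages)
in
    Entities
```

The remaining steps (Table.FromList, expanding and renaming the columns, and the RangeStart/RangeEnd filter) could then follow unchanged, starting from Entities.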