Fix for Google News Sources
If you are using Google news for content feeds – even if you are running them through Inoreader – then this is an important update. The rejection filters in the version 10 MTurk workflows do not work with Google News URLs. I explain the reason and provide the workflow additions to fix this problem in this 15 minute video. Here is the script used in the video:
<#assign u=doc.url>
<#if u?starts_with("https://www.google.com/url")>
<#assign uu=LIB.getUrlUtil(u)>
<#assign url=uu.get("url")>
<#assign dumy = doc.setFeedItemUrl(url)>
<#assign newdom = LIB.getSoup(url)>
${newdom.html()}
<#else>
${doc.content}
</#if>
This 9 minute video shows how to fix Google News sourced documents that are in process in a v10 Reference Persona. The script I use in the video is almost identical to the one above… all you have to do is duplicate it and change the first line to:
<#assign u=doc.custom2>
Contact support if you need help with this.
Leave a Reply