-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
copy/paste of queries at the bottom of the query_SDGX.xml pages into Scopus fails #6
Comments
Hi Erik, The orgin for rendering the XML to HTML can be found in the XSL I tried to fix that by adding a non-breaking space after the OR statement in the XSL template. However, then I tried this, the rendering of the HTML breaks completely. (I could not test it on my laptop. Somehow a modern version of a web browser does not render xml to html when opened on localhost but only when accessed via https.) If you have a -tested- solution for the forced space after the OR statement in the XSL. please help me out. Warm regards, |
Hi Maurice, Thanks for taking a look at the issue -- I am not sure how to remedy it I'm afraid, and have changed my approach since I reported this. I am now just parsing the raw XML files and compositing the scopus queries in a Python script. This suits me better anyway, since my goal is to transform the queries to work on Postgres. Regards, |
Nice! that is why we put it in xml, for automation, and human readability. |
Yes, I am an RSEC at Erasmus! Agreed that XML is a nice format for this kind of thing -- although to facilitate translation of the queries to other languages, it might be worthwhile to consider making things a bit finer-grained, and perhaps slightly less Scopus-centric (although I understand those are the origins). Just as an example, <aqd:query-line field="TITLE-ABS-KEY">
("poverty line*") OR ("poverty indicator*")
</aqd:query-line> To write a script to translate this to other query languages, I need to parse first the XML and then the Scopus query (for which, to my knowledge, no explicit grammar is publicly available so I've had to cobble something together myself using Antlr). Suppose, instead, the XML looked something like this: <aqd:query-line field="TITLE-ABS-KEY">
<aqg:query-or>
<aqd:query-parens>
"poverty line*"
</aqd:query-parens>
<aqd:query-parens>
"poverty indicator*"
</aqd:query-parens>
</aqg:query-or>
</aqd:query-line> That way the tree structure of the query is reflected explicitly in XML, and it would be much easier to transform to other query languages. Of course the XSLT to render the Scopus queries would become a bit more complicated. Now that I have a Antlr grammar that appears to correctly parse the Scopus trees, I suppose it would be pretty easy to use that to automatically transform the former to the latter XML, so that wouldn't have to be done manually. As an aside, I'm beginning to regret the choice (not mine) for Postgres. The argument at the time was that it supports something like Scopus' For one thing, the Scopus proximity operator is not directional, i.e. Another complication is that Scopus' Of course, these problems compound very quickly if multiple proximity operators occur in a single query: if I am given Finally, I end up not using the more advanced features of Postgres text search, and in fact I have to force it to "simple" mode in order to make the Scopus wildcards work. Postgres would like to help me with this, stemming words in the documents and queries for me, ignoring stop words, and leveraging a built-in thesaurus for synonyms. But the way the Scopus queries are given here defeats this, for example I can imagine, although it will be a lot of work, enriching the Aurora XML with a few valid expansions of the wild-carded terms. That way I can use those in my Postgres queries and have it do its magic. Anyway, I have to get on with the next phase and unfortunately can't linger on these issues. Just thought I'd mention these observations while they are fresh on my mind. If I have a bit more time, I might revisit this if you are interested and try to come up with some more constructive / concrete proposals. (*) Incidentally, Scopus does also have a directed variant, |
Describe the bug
Copy/pasting the entire query at the bottom of the pages into Scopus advanced search gives syntax errors
To Reproduce
Steps to reproduce the behavior:
Expected behavior
These queries should work, or the "usage" text should be adapted?
Desktop (please complete the following information):
Additional context
The problem seems to be that the concatenation of subqueries with "\nOR\n" is disregarded by the scopus query editor, and it tries to execute queries containing "ORTITLE-ABS-KEY".
The text was updated successfully, but these errors were encountered: