Www doc88 com как скачать бесплатно
На сайте можно просматривать документы — при открытии ссылки на тот или иной документ появляется окно просмотра — файл потихоньку подгружается — после некоторого времени документ доступен к просмотру полностью.
Вопрос собственно в том, как можно скачать документ при таком раскладе? (большинство документов там в .pdf или .doc форматах)
Помогите пожалуйста — очень нужно
Aleks78
Я смотрю Вы меня невзлюбили. Может вместо злостных оценок нужно было подсказать правильную ветку форума?
Extract pdf / image from doc88 Flash viewer
doc88.com uses some kind of encryption to protect pdf files. I used Chrome developer tools and found that it loads .ebt file. I think its encrypted Pdf / Swf file.
I found the following reply Here but i still cannot download the pdf file. Can anyone help ?
Okay. The encryption that docin.com uses is absolutely unknown to me, but I determined that doc88.com probably uses software from cryptbot.com, through I was unable to extract the key: it’s probably buried deep into the flash viewer. – whitequark
2 Answers 2
Try https://www.npmjs.com/package/doc88-download It saves a PNG of each page, which could then be converted to a PDF or other format as a separate step.
This is how to get a pdf file from www.doc88.com:
Go to the website of the document of interest and load every page of the document you want to extract by hovering over them for a few seconds (to do this faster, zoom out). This will save the pdf in the cache of Chrome. By default, not all pages are loaded in the first place.
Right click anywhere on the screen and select «Print. «.
Use a tool to crop the parts of the page that do not belong to the PDF. For instance, in Linux you can use pdfjam. More examples here.
Use an OCR program to reconvert the image to text. Quality is not assured. Some utilities for Linux here.
Где скачать научную статью легко и доступно?
Некоторые издательства журналов предлагают доступ к научным статьям сразу после публикации, а некоторые через какое-то время.
Однако стоимость одной статьи от платных издателей может составлять 150 долларов, что не так уж и мало по сравнению с размером стипендии студента.
Здесь собраны совершенно бесплатные ресурсы, где вы можете найти интересующую Вас научную статью.
Сервисы для скачивания
Удобный бесплатный телеграмм бот для скачивания любых статей. Отправляешь DOI или URL статьи и получаешь pdf. Бот сканирует большинство изложенных ниже ресурсов.
Ori-Pixel/doc88-downloader
This is a POC downloader of documents from doc88.com. It saves pages of a given document as PNGs or JPEGs. It doesn’t have any dependencies — it’s a bit of JavaScript that you paste into Developer Tools’ Console. It was tested in Chrome and Firefox.
The download procedure is a bit of a PITA, but hey… it’s a POC.
Navigate to the desired document in your browser.
Make sure browser’s zoom level is set to 100% — based on some tests it seems that zoom levels lower than 100% can result in lower quality of captured pages.
Scroll through all the pages in the document, one by one, and make sure all of them have loaded. Depending on the document this might be the most arduous part of the process.
Open Developer Tools (e.g. press Ctrl + Shift + I ).
Switch to JavaScript Console.
For PNGs paste this JavaScript in Console and confirm with Enter .
For JPEGs paste this JavaScript in Console and confirm with Enter .
Download pages in batches. Type:
in Console and hit Enter to download pages 1 through 10.
- ℹ It is advised to download 10 pages at a time. After saving a batch of pages simply enter downloadPages(11, 20) to download pages 11 through 20, and so on.
- ℹ In case of Chrome, the first time you download a batch of pages you may see a popup stating that «This site is attempting to download multiple files». You have to allow it as each page is downloaded as a separate file.
Make sure all desired pages were downloaded correctly.
Converting downloaded images back to a PDF
Under Linux you can easily convert downloaded images back to a PDF. You will need ImageMagick package first:
Then — in directory in which the images are — issue the following command which will produce output.pdf PDF file from the images:
If you further want to OCR the PDF (recognize the text in it and make it searchable), install the OCRmyPDF package: