Clarification throughout the abstract and introduction #813
That makes sense and sounds like a good plan! Hope the re-review goes smoothly!
@rasbt thanks again for your suggestions. The manuscript has been provisionally accepted (#820) so we are in somewhat uncharted territory regarding adding new contributors. I propose that we merge these changes and add you to the acknowledgements for your contributions. Before proceeding, I wanted to confirm with you that sounds fair. If so, @cgreene or I will review this pull request to comment on a couple specific lines.
Congratulations, that's great news! And sure, that's totally fine, I didn't expect to be e.g., a co-author for these relatively minor suggestions :)
Some thoughts. I think there are a few revisions needed. But I didn't have good answers in many cases. I'm interested in @agitter's thoughts.
content/01.abstract.md
Outdated
We examine applications of deep learning to a variety of biomedical problems---patient classification, fundamental biological processes, and treatment of patients---and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges.
As a result from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art.
Even though improvements over previous baselines have been modest, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation.
However, deep learning models are still regarded as black box algorithms, and more work is needed to address the common concerns related to interpretability and how to best model each problem.
I'm a bit reluctant to use the terminology "black box" here. I feel it's overused and under-justified, when many other methods that don't get this terminology applied have similar issues. What about:
Though progress has been made in determining the primary factors that lead a specific deep neural network to make a specific prediction in a certain case, understanding how users should interpret these models to make specific mechanistic hypotheses remains an open challenge.
☝️ I'm not a huge fan of that, but I feel like the sentiment is what i'm going for so I'll put it there.
I like the sentiment as well, but "determining the primary factors that lead a specific deep neural network to make a specific prediction in a certain case" is quite verbose.
What about this:
Though progress has been made linking a specific neural networks prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge.
👍 but add the apostrophe in "network's"
content/01.abstract.md
Outdated
We find that deep learning has yet to revolutionize or definitively resolve any of these problems, but promising advances have been made on the prior state of the art.
Even when improvement over a previous baseline has been modest, we have seen signs that deep learning methods may speed or aid human investigation.
More work is needed to address concerns related to interpretability and how to best model each problem.
Deep learning, which describes a class of machine learning algorithms focussing on the training of deep artificial neural networks, has recently shown impressive results across a variety of domains.
I think we specifically left "artificial neural network" algorithms out of our definitions in many cases. What I think is more important is that these methods can work on raw data to produce intermediate features that are then used for some subsequent task. What about:
Deep learning describes a class of machine learning algorithms that are capable of learning to combine relatively raw inputs into layers of intermediate features, and such algorithms now perform impressively across a variety of domains.
@cgreene that version has good ideas, but I think it would work better as 2 sentences. How about:
Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms now perform impressively across a variety of domains.
I'm not sure why, but the longer "recently shown impressive results across a variety of domains" sounds more natural to me.
I agree with "recently shown impressive results across a variety of domains". Another suggestion would be "produces remarkable results across a variety of domains compared to more traditional methods", but I would prefer the former.
Let's go with the more compact "recently shown impressive results across a variety of domains"
content/01.abstract.md
Outdated
Even when improvement over a previous baseline has been modest, we have seen signs that deep learning methods may speed or aid human investigation.
More work is needed to address concerns related to interpretability and how to best model each problem.
Deep learning, which describes a class of machine learning algorithms focussing on the training of deep artificial neural networks, has recently shown impressive results across a variety of domains.
Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well-suited to solve problems of these fields.
I don't really like this (or our initial framing). What about:
Deep learning techniques may be particularly well suited for challenges in biology and medicine, which are data-rich disciplines where there is often a complex biological system between what we can measure and what we wish to know.
☝️ needs revision too, but I vaguely like the direction it's going in
I'm okay with the @rasbt version here but am happy to switch it if you want.
@@ -7,11 +7,11 @@ Automated algorithms that extract meaningful patterns could lead to actionable k

The term _deep learning_ has come to refer to a collection of new techniques that, together, have demonstrated breakthrough gains over existing best-in-class machine learning algorithms across several fields.
For example, over the past five years these methods have revolutionized image classification and speech recognition due to their flexibility and high accuracy [@doi:10.1038/nature14539].
More recently, deep learning algorithms have shown promise in fields as diverse as high-energy physics [@doi:10.1038/ncomms5308], dermatology [@doi:10.1038/nature21056], and translation among written languages [@arxiv:1609.08144].
More recently, deep learning algorithms have shown promise in fields as diverse as high-energy physics [@doi:10.1038/ncomms5308], computational chemistry [@doi:10.1002/jcc.24764], dermatology [@doi:10.1038/nature21056], and translation among written languages [@arxiv:1609.08144].
👍
👍
Also noting that we cite this review later in the intro.
Across fields, "off-the-shelf" implementations of these algorithms have produced comparable or higher accuracy than previous best-in-class methods that required years of extensive customization, and specialized implementations are now being used at industrial scales.

Deep learning approaches grew from research in neural networks, which were first proposed in 1943 [@doi:10.1007/BF02478259] as a model for how our brains process information.
The history of neural networks is interesting in its own right [@doi:10.1103/RevModPhys.34.135].
Deep learning approaches grew from research on artificial neurons, which were first proposed in 1943 [@doi:10.1007/BF02478259] as a model for how the neurons in a biological brain process information.
👍
@@ -26,11 +26,11 @@ In particular, deep learning approaches can be used both in *supervised* applica
Deep learning methods may in fact combine both of these steps.
When sufficient data are available and labeled, these methods construct features tuned to a specific problem and combine those features into a predictor.
In fact, if the dataset is "labeled" with binary classes, a simple neural network with no hidden layers and no cycles between units is equivalent to logistic regression if the output layer is a sigmoid (logistic) function of the input layer.
Similarly, for continuous outcomes, linear regression can be seen as a simple neural network.
Thus, in some ways, supervised deep learning approaches can be seen as a generalization of regression models that allow for greater flexibility.
Similarly, for continuous outcomes, linear regression can be seen as a single-layer neural network.
👍
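For readers following the wording change in this hunk, the equivalence it describes can be sketched in a few lines. This is only an illustration, not code from the manuscript or the repository: the helper names (`dense_layer`, `logistic_regression`) and the example weights are made up for this sketch, and the weights are fixed by hand rather than learned.

```python
import math


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def identity(z):
    """Identity activation, used for a continuous (regression) output."""
    return z


def logistic_regression(x, weights, bias):
    """P(y = 1 | x) under a logistic regression model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def dense_layer(x, weight_rows, biases, activation):
    """One fully connected layer: one output unit per row of weights."""
    return [
        activation(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weight_rows, biases)
    ]


# Hypothetical inputs and parameters, chosen only for illustration.
x = [0.5, -1.2, 3.0]
weights = [0.8, -0.4, 0.1]
bias = -0.3

# No hidden layers: the inputs feed a single sigmoid output unit.
network_output = dense_layer(x, [weights], [bias], sigmoid)[0]
regression_output = logistic_regression(x, weights, bias)

# Same single layer with an identity activation: a linear regression prediction.
linear_output = dense_layer(x, [weights], [bias], identity)[0]

print(network_output == regression_output)
```

With a sigmoid output the single layer computes exactly the logistic regression prediction, and with an identity output it computes the linear regression prediction, which is the sense in which the revised sentence calls linear regression a "single-layer" network.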
content/02.intro.md
Outdated
Recently, hardware improvements and very large training datasets have allowed these deep learning techniques to surpass other machine learning algorithms for many problems.
In a famous and early example, scientists from Google demonstrated that a neural network "discovered" that cats, faces, and pedestrians were important components of online videos [@url:http://research.google.com/archive/unsupervised_icml2012.html] without being told to look for them.
What if, more generally, deep learning could solve the challenges presented by the growth of data in biomedicine? Could these algorithms identify the "cats" hidden in our data---the patterns unknown to the researcher---and suggest ways to act on them? In this review, we examine deep learning's application to biomedical science and discuss the unique challenges that biomedical data pose for deep learning methods.
What if, more generally, deep learning take advantage of the growth of data in biomedicine to tackle challenges in this field? Could these algorithms identify the "cats" hidden in our data---the patterns unknown to the researcher---and suggest ways to act on them? In this review, we examine deep learning's application to biomedical science and discuss the unique challenges that biomedical data pose for deep learning methods.
👍
Good changes overall. We usually wouldn't be so picky, but because this is the final version of the abstract let's continue to work on some of the phrasing together before merging. We also need to keep the total word count <= 200.
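A quick way to check a draft against the 200-word limit mentioned above might look like the sketch below. It assumes that words are simply whitespace-separated tokens, which may not match how the journal counts, and the function names are hypothetical. The sample sentence is one line of the abstract quoted in this review.

```python
def word_count(text):
    """Count whitespace-separated tokens (an assumed definition of a word)."""
    return len(text.split())


def within_limit(text, limit=200):
    """Return True if the text has at most `limit` words."""
    return word_count(text) <= limit


sample = (
    "More work is needed to address concerns related to interpretability "
    "and how to best model each problem."
)
print(word_count(sample), within_limit(sample))  # prints "17 True"
```

Running the same functions on the full text of `content/01.abstract.md` would give a rough count for the whole abstract.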
content/01.abstract.md
Outdated
Deep learning, which describes a class of machine learning algorithms focussing on the training of deep artificial neural networks, has recently shown impressive results across a variety of domains.
Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well-suited to solve problems of these fields.
We examine applications of deep learning to a variety of biomedical problems---patient classification, fundamental biological processes, and treatment of patients---and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges.
As a result from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art.
Could switch "As a result from" to "Following" to help the word count
content/01.abstract.md
Outdated
Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well-suited to solve problems of these fields.
We examine applications of deep learning to a variety of biomedical problems---patient classification, fundamental biological processes, and treatment of patients---and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges.
As a result from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art.
Even though improvements over previous baselines have been modest, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation.
Perhaps "have been modest in general"? There have been a few impressive exceptions since we first drafted the abstract.
content/citation-tags.tsv
Outdated
@@ -80,6 +80,7 @@ Gerstein2016_scaling doi:10.1186/s13059-016-0917-0
Ghandi2014_enhanced doi:10.1371/journal.pcbi.1003711
Ghosh1992_sequence doi:10.1117/12.140112
Glorot2011_domain url:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.231.3442
Goh2017_compchemistry doi:10.1002/jcc.24764
We can remove the tag if the DOI is being used in the reference above. The tag is only needed when we use it directly in the citation from Markdown.
content/02.intro.md
Outdated
Similarly, for continuous outcomes, linear regression can be seen as a simple neural network.
Thus, in some ways, supervised deep learning approaches can be seen as a generalization of regression models that allow for greater flexibility.
Similarly, for continuous outcomes, linear regression can be seen as a single-layer neural network.
Thus, in some ways, supervised deep learning approaches can be seen as a extension of regression models that allow for greater flexibility.
an extension
Maybe as an add-on to that sentence: "and are especially well-suited for modeling non-linear relationships among the input features."
👍
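To make the "non-linear relationships among the input features" add-on concrete, here is a minimal sketch using XOR, a standard textbook case that a plain linear model cannot represent exactly but a network with one small hidden layer can. The weights below are picked by hand for illustration (nothing is trained), and the function names are hypothetical rather than taken from the manuscript.

```python
def relu(z):
    """Rectified linear unit."""
    return max(0.0, z)


def xor_network(x1, x2):
    """Tiny network with one hidden layer and hand-picked weights."""
    # Hidden layer: two ReLU units that each combine both inputs.
    h1 = relu(x1 + x2)
    h2 = relu(x1 + x2 - 1.0)
    # Linear output unit on top of the hidden features.
    return h1 - 2.0 * h2


inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
outputs = [xor_network(x1, x2) for x1, x2 in inputs]
print(outputs)  # prints [0.0, 1.0, 1.0, 0.0]
```

A linear model of the form a*x1 + b*x2 + c that matches the first three cases must have c = 0 and a = b = 1, so it predicts 2 rather than 0 for the input (1, 1). The hidden layer is what lets the network capture the interaction between the two inputs.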
Sure, happy to do that and will get to it later this evening
Just addressed all the comments in the last commit. You probably need to trim the abstract quite a bit if it's a 200 and not 250 word limit -- but I think it's been already a bit longer prior to these changes ;)
Thanks for the revisions. Everything looks great to me now. You're right that we were already over the abstract word count. We haven't worried about going a few (thousand) words over our limits before, so no need to start now.
I pushed a new commit acknowledging @rasbt so that we can merge that at the same time.
@cgreene please merge if you approve these changes.
This build is based on 588cfa6.
This commit was created by the following Travis CI build and job:
https://travis-ci.org/greenelab/deep-review/builds/349261851
https://travis-ci.org/greenelab/deep-review/jobs/349261852
[ci skip]
The full commit message that triggered this build is copied below:
Clarification throughout the abstract and introduction (#813)
* suggested changes to the abstract
* suggested improv in intro sec
* fix line split
* address comments
* Add acknowledgement
Hi, all,
this is a really nice document listing a lot of interesting literature concerning deep learning and biomedicine. While I was reading it, I made some notes about certain wordings that I found a bit awkward and added suggested re-wordings to this PR in the hope that they will be helpful.