查看: 843|回复: 0
打印 上一主题 下一主题

[求助] 词组搜索,词组分类

[复制链接]

该用户从未签到

12

主题

93

回帖

1592

积分

解元

Rank: 5Rank: 5

积分
1592
跳转到指定楼层
1
发表于 2015-11-25 12:21:34 | 只看该作者 回帖奖励 |倒序浏览 |阅读模式
求助:
1.我开发了一个词组检测的程序,里面好多词组太简单了,怎么才能去掉?

比如下面文章,以及检测的词组,抱歉有些长啊
Putting together a probabilistic model-- that is, a model of a random phenomenon or a random experiment-- involves two steps. First step, we describe the possible outcomes of the phenomenon or experiment of interest. Second step, we describe our beliefs about the likelihood of the different possible outcomes by specifying a probability law. Here, we start by just talking about the first step, namely, the description of the possible outcomes of the experiment. So we carry out an experiment. For example, we flip a coin. Or maybe we flip five coins simultaneously. Or maybe we roll a die. Whatever that experiment is, it has a number of possible outcomes, and we start by making a list of the possible outcomes-- or, a better word, instead of the word "list", is to use the word "set", which has a more formal mathematical meaning. So we create a set that we usually denote by capital omega. That set is called the sample space and is the set of all possible outcomes of our experiment.  The elements of that set should have certain properties. Namely, the elements should be mutually exclusive and collectively exhaustive. What does that mean? Mutually exclusive means that, if at the end of the experiment, I tell you that this outcome happened, then it should not be possible that this outcome also happened. At the end of the experiment, there can only be one of the outcomes that has happened. Being collectively exhaustive means something else-- that, together, all of these elements of the set exhaust all the possibilities. So no matter what, at the end, you will be able to point to one of the outcomes and say, that's the one that occurred. To summarize-- this set should be such that, at the end of the experiment, you should be always able to point to one, and exactly one, of the possible outcomes and say that this is the outcome that occurred.  Physically different outcomes should be distinguished in the sample space and correspond to distinct points. But when we say physically different outcomes, what do we mean? We really mean different in all relevant aspects but perhaps not different in irrelevant aspects. Let's make more precise what I mean by that by looking at a very simple, and maybe silly, example, which is the following. Suppose that you flip a coin and you see whether it resulted in heads or tails. So you have a perfectly legitimate sample space for this experiment which consists of just two points-- heads and tails. Together these two outcomes exhaust all possibilities. And the two outcomes are mutually exclusive. So this is a very legitimate sample space for this experiment. Now suppose that while you were flipping the coin, you also looked outside the window to check the weather. And then you could say that my sample space is really, heads, and it's raining.  Another possible outcome is heads and no rain.  Another possible outcome is tails, and it's raining, and, finally, another possible outcome is tails and no rain.  This set, consisting of four elements, is also a perfectly legitimate sample space for the experiment of flipping a coin. The elements of this sample space are mutually exclusive and collectively exhaustive. Exactly one of these outcomes is going to be true, or will have materialized, at the end of the experiment. So which sample space is the correct one? This sample space, the second one, involves some irrelevant details. So the preferred sample space for describing the flipping of a coin, the preferred sample space is the simpler one, the first one, which is sort of at the right granularity, given what we're interested in. But ultimately, the question of which one is the right sample space depends on what kind of questions you want to answer. For example, if you have a theory that the weather affects the behavior of coins, then, in order to play with that theory, or maybe check it out, and so on, then, in such a case, you might want to work with the second sample space. This is a common feature in all of science. Whenever you put together a model, you need to decide how detailed you want your model to be. And the right level of detail is the one that captures those aspects that are relevant and of interest to you.
  1. put sth together
  2. put together
  3. to talk about
  4. talk about ... (spoken)
  5. carry sth out
  6. carry out
  7. to carry out
  8. carry out sth
  9. a number of
  10. make sth of sb/sth
  11. instead of
  12. be the end (BrE, spoken)
  13. can only
  14. all of sth
  15. all of
  16. all the
  17. no matter
  18. be able to
  19. point to sth
  20. in all
  21. to look at
  22. look at
  23. look at sth
  24. look at sb/sth
  25. to look at sb/sth; by the look of sb/sth (informal)
  26. you see
  27. you see (spoken)
  28. to result in
  29. result in sth
  30. to consist of
  31. consist of sth
  32. and then
  33. go to sth
  34. go to sb/sth
  35. to go to
  36. going to
  37. sort of
  38. to be interested in
  39. to the question
  40. to depend on
  41. depend on sth
  42. depend on sb/sth
  43. kind of
  44. in order that...
  45. in order that
  46. in order that (formal)
  47. in order to
  48. in order
  49. and so on
复制代码