; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G09520 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G09520
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionZinc finger, CCHC-type
Genome locationChr2:9357065..9360527
RNA-Seq ExpressionCSPI02G09520
SyntenyCSPI02G09520
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO72413.1 gag-pol polyprotein [Oryza sativa Japonica Group]5.5e-28546.46Show/hide
Query:  GIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQEDAILLS
        G +RG+  G G G GRG     S+ G S   +G RDKSHIKCF C   GHY+++C   K +  EAHL    +  PTL++ V++     +R D    +++ 
Subjt:  GIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQEDAILLS

Query:  KERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------KNE
        +ER+ P +   +      D+WYLDNGASNHM+    KF+ELDE  TG+V+FG  S++QIM KG+++F CKNGDQ    +                  +  
Subjt:  KERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------KNE

Query:  NKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARL
        ++V M  D ++V D++  +L+M V+R+ NRLY+I L+    VCLL SL+DP WLWH RLGHVNFH LKL+ +K++  GVP V  PN+L  AC++ KQ R 
Subjt:  NKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARL

Query:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKE
        PF   + Y AE PLELLH D+C PI+P T  GN+YF+LIVDD + WMW++++K+K  A  AF+KFK L EN     I+TLRTDRGGEFLS EF + C   
Subjt:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKE

Query:  GIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKL
        GIERH T PYSPQQN +VE RNR VMA  RSL K M VP +  GEAVRHAV+LLN LP KA+G RT FEAW G+KPHL HLRVFGC A+ K T PHLKKL
Subjt:  GIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKL

Query:  DDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDA-------------------ETGFEN
        DDRS+P+VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    TEF V +   + +    E A                   E   + 
Subjt:  DDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDA-------------------ETGFEN

Query:  VLPHAT---------EIPAIGETSP----------SPSSTNTLVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKS
          P A+           P +G +S           +  S +  V  RSL++I      V+   D+ + E ++V  EE T Y+EA  E  W  AM  EL++
Subjt:  VLPHAT---------EIPAIGETSP----------SPSSTNTLVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKS

Query:  IEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEV
        IEKN TWSL  LP  HK IGLKWVFKLKK+  G+V+KHKARLVAKGYVQRQG+DF+EVFAPVARLDT+R IL +  ++ W+VHHLD KSAFLNG+LEEEV
Subjt:  IEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEV

Query:  YVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSD
        YV+Q EGF    ++H VY+LSKALY LRQAPRAWN RLDRS+K LGF +C QEQAVYTR  G   ++VGVYVDDLIVTG S   +  FKQQMM +FEMSD
Subjt:  YVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSD

Query:  LGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTM
        LGLL+YYLGIE                                        LHKD +G+ +D TEYR ++GCLRYLL+T P+LSY VG+ASR+MERPT M
Subjt:  LGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTM

Query:  HYKR--------------------------------------------------------DSWNGAKTGNIICGQQICDSSH------------------
        H+K                                                          SW+  K   +      C++                    
Subjt:  HYKR--------------------------------------------------------DSWNGAKTGNIICGQQICDSSH------------------

Query:  -----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN
                               + PVFHGRSKHIDT +HFIRECVE+G+I +EFV   EQRAD LTK L   +LA  R LLGVR+
Subjt:  -----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN

ABB47537.2 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]1.5e-27746.82Show/hide
Query:  KGRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQE
        +G   G  +RGR RG G G   GT+R  S  G+S    G RDKSHIKCF C   GHY+++C   K +  EAHL    +  P L++ V++      R D  
Subjt:  KGRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQE

Query:  DAILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE---------------
          +++ +ER+ P +   +      D+WYLDNGASNHM+  R KF+ELDE  TG+V+FG  S++QIM  G+++F CKNGDQ    +               
Subjt:  DAILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE---------------

Query:  ---KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMI
           +  ++V M  D +KV D++  +L+M V+RT NRLY+I L+   QVCLL SL++P WLWH R+GHVNFH LKL+ +K++  GVP V  PN+L  AC++
Subjt:  ---KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMI

Query:  TKQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFT
         KQ R PF   + YRAE PLELLH D+C PI+P T AGN+YF+LIVDD + WMW++++K K  A   F+KFK L +N     I+TLRTDRGGEFLS +F 
Subjt:  TKQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFT

Query:  QFCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTT
        + C    IERH TAPYSPQQND+VE RNR VMA  RSL K M VP +  GEAVRHA++LLN LP KA+G RT FEAW G+KPHL HLRVFGC A+ K T 
Subjt:  QFCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTT

Query:  PHLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPHATEIPAIGE
        PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+       E+T         S EFE  E          P   E PA+ E
Subjt:  PHLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPHATEIPAIGE

Query:  ----TSPSPSSTNT-LVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKL
             SP  + ++   V  RSL++I      V+   D+ +   ++   EE   Y+EA  E  W  AM  EL++IEKN TWSL  LP GHK IGLKWVFKL
Subjt:  ----TSPSPSSTNT-LVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKL

Query:  KKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRL
        KK+  G+V+KHKARLVA GYVQ+QG+DF+EVFAPVARLDT+R ILA+ A++ W+VHHLD KSAFLNG+LEEEVYV+Q EGF    ++H VY LSKALY L
Subjt:  KKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRL

Query:  RQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYA
        RQAPRAWN RLDRS+K LGF                                S   ++  FKQQMM +FE+SDLGLL+YYLGIEV Q    I +KQ  YA
Subjt:  RQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYA

Query:  KRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHYKR-----------------------
        ++IL+ FGM DCN+T  P+E ++QLHK  EG+ +D TEYR ++GCLRYLL+T P+LSY VG+ S++ME+ T MH+K                        
Subjt:  KRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHYKR-----------------------

Query:  ---------------------------------DSWNGAKTGNIICGQQICDSSH-----------------------------------------EEPV
                                          SW+  K   +      C++                                           + PV
Subjt:  ---------------------------------DSWNGAKTGNIICGQQICDSSH-----------------------------------------EEPV

Query:  FHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVK
        FHGRSKHIDT +HFIRECVE+G+I +EFV T EQRAD LTK L   +
Subjt:  FHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVK

ABF94034.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]4.5e-28746.51Show/hide
Query:  GRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQED
        GRG GG  RGR RG G G G GT R +S  G+     G RDKSHIKCF C   GHY+++C   K +  EAHL    +  P L++ V+++   R  C    
Subjt:  GRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQED

Query:  AILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE----------------
         +++ ++R+ P++   D  G   DVW+LDNGASNHMT  R KF+ELDE  TGRVKFG  ST+QI  KG+++F CKN DQ   Q+                
Subjt:  AILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE----------------

Query:  --KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT
          +  ++V M ED+++V D+S  +L+M  +R                        P        G   F   +  G +    GVP +  PN+L  +C++ 
Subjt:  --KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT

Query:  KQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQ
        KQAR PF   + +RAE+PLELLH D+C PI+P T++GN+YF LIVDD +RW+W+++LK K  + +AF KFK L EN    +++TLR+DRGGEFLS EF Q
Subjt:  KQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQ

Query:  FCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+RH TAPYSPQQN +VE RNR VMA  RSL K M VP +F GEAVRHAV+LLN LP KA+G+RT FEAW GRKP L HL+VFGC A+ KNT P
Subjt:  FCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPH----------
        HLKKLDDRS+P VY GVEEG KAHRL+DP RG++ +SRDV+F+EN+ W W +    G+E T+F + ++      E L    T    V P+          
Subjt:  HLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPH----------

Query:  ------ATEIPA-----------IGETSPSPSSTNTL------------------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEA
              A E+P+            G  +P   STN+                      RSL+D+      V+   DE + E ++  SEE + Y+EA  + 
Subjt:  ------ATEIPA-----------IGETSPSPSSTNTL------------------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEA

Query:  RWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEK
         W +AM+ E+++IEKN TW L  LP GH+ IGLKWV+KLKK+  G+++KHKARLVAKGYVQ+QG+DFEEVFAPVARLDT+RV+LA+ A++ W+VHHLD K
Subjt:  RWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEK

Query:  SAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKF
        SAFLNGELEEEVYV Q EGF    ++H V +L KALY LRQAPRAWNIRLDRSL+ LGF++CTQEQAVYTR  G + ++VGVYVDDLIVTG +  ++  F
Subjt:  SAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKF

Query:  KQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVG
        K+QMM +FEMSDLGLL+YYLGIEV+Q +    LKQ  YAK++LS FGM +CN+   P++P++QL KD EG  +DATEYR I+G LRYLL+T P+LSY VG
Subjt:  KQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVG

Query:  MASRYMERPTTMHYK-------------------------------RDS-------------------------WNGAKTGNIICGQQICDSSH------
        +ASR+MERPT MH+K                                DS                         W+  K   +      C++        
Subjt:  MASRYMERPTTMHYK-------------------------------RDS-------------------------WNGAKTGNIICGQQICDSSH------

Query:  -----------------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLE
                                           + PVFHGRSKHIDT +HFIRECV+ GQI+VEFV T EQRAD LTK L   KL   R LLGVR+L 
Subjt:  -----------------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLE

Query:  SCQD
        S Q+
Subjt:  SCQD

EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group]0.0e+0049.87Show/hide
Query:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI
        R  GG  RGR RG G G GR +  +NS  G S    G RDKSHIKC+ C   GHY+++C   K +  EAHL    +  P L++ V+++      +    +
Subjt:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI

Query:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------
        ++ +ER+ P++   D+     D+W+LDNGASNHMT  R KF++LD   TG VKFG  ST++I  KG+++F CKNGDQ   Q+                  
Subjt:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------

Query:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ
        +  ++V M ED+++V D+S  +L+M V+RT NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+L  AC++ KQ
Subjt:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ

Query:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC
         R PF   + +RAE+PLELLH D+C PI+P T+AGN+YF+LIVDD +RWMW++++K K  A EAF KFK L EN    +I+TLR+DRGGEFLS EF Q C
Subjt:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC

Query:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL
        ++ GI+RH TAPYSPQQN +VE RNR+VMA  RSL K M VP +F GEAVRHAVYLLN LP KA+G+RT FEAW GRKP L HLRVFGC+A+ K TTP+ 
Subjt:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL

Query:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYS------------------------------N
        KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W +VV+  +  TEF V +                                   
Subjt:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYS------------------------------N

Query:  EFENLEDAETGFENVLPHATEIPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENE
           +L  +     +  P +T  PA G   P  SP S+  L      V  RSL DI      V+   DE + + ++   EE + Y+EA  +  W  AM  E
Subjt:  EFENLEDAETGFENVLPHATEIPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENE

Query:  LKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELE
        L++IEKN+TW+LT LP GHKPIGLKWV+KLKK+  G+V+KHKARLVAKGYVQRQG+DFEEVFAPVARLDT+RVILA+ A++ WEVHHLD KSAFLNG+LE
Subjt:  LKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELE

Query:  EEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE
        EEVYV Q EGF    E+H V RLSKALY LRQAPRAWN RLD+ LK LGF +CTQEQAVYTR +G+  V+VGVYVDDLIVTG + +++  FKQQMM +FE
Subjt:  EEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE

Query:  MSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERP
        MSDLGLLSYYLGIEV Q +  I +KQ  YAK+ILS FGM  CN T  PMEP++ LHKD +G  IDATEYR ++GCLRYLL+T P+LSY VG+ASR+MERP
Subjt:  MSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERP

Query:  TTMHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-----------------
        TTMH K                                      D  +   TG +           C Q+        C++                   
Subjt:  TTMHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-----------------

Query:  ------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD
                                + PVFHGRSKHIDT +HFIRECVE+GQI++EFV++ EQRAD +TK L   KLA  R LLGVR+L   QD
Subjt:  ------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa]0.0e+0050.04Show/hide
Query:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI
        R  GG  RGR RG G G GRG+  +NS  G S    G RDKSHIKC+ C   GHY+++C   K +  EAHL    +  P L++ V+++      +    +
Subjt:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI

Query:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------
        ++ +ER+ P++   D+     D+W+LDNGASNHMT  R KF++LD   TG VKFG  ST++I  KG+++F CKNGDQ   Q+                  
Subjt:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------

Query:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ
        +  ++V M ED+++V D+S  +L+M V+RT NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+L  AC++ KQ
Subjt:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ

Query:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC
         R PF   + +RAE+PLELLH D+C PI+P T+AGN+YF+LIVDD +RWMW++++K K  A EAF KFK L EN    +I+TLR+DRGGEFLS EF Q C
Subjt:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC

Query:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL
        ++ GI+RH TAPYSPQQN +VE RNR+VMA  RSL K M VP +F GEAVRHAVYLLN LP KA+G+RT FEAW GRKP L HLRVFGC+A+ K TTP+ 
Subjt:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL

Query:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG
        KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W +VV+  +  TEF V +                          ++   +  G
Subjt:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG

Query:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK
          + LP + +          PA G   P  SP S   L      V  RSL DI      V+   DE + + ++   EE + Y+EA  +  W  AM  EL+
Subjt:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK

Query:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE
        +IEKN+TW+LT LP GHKPIGLKWV+KLKK+  G+V+KHKARLVAKGYVQRQG+DFEEVFAPVARLDT+RVILA+ A++ WEVHHLD KSAFLNG+LEEE
Subjt:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE

Query:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS
        VYV Q EGF    E+H V RLSKALY LRQAPRAWN RLD+ LK LGF +CTQEQAVYTR +G+  V+VGVYVDDLIVTG +  ++  FKQQMM +FEMS
Subjt:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS

Query:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT
        DLGLLSYYLGIEV Q +  I +KQ  YAK+ILS FGM  CN T  PMEP++ LHKD +G  IDATEYR ++GCLRYLL+T P+LSY VG+ASR+MERPTT
Subjt:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT

Query:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------
        MH K                                      D  +   TG +           C Q+        C++                     
Subjt:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------

Query:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD
                              + PVFHGRSKHIDT +HFIRECVE+GQI++EFV + EQRAD +TK L   KLA  R LLGVR+L   QD
Subjt:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD

TrEMBL top hitse value%identityAlignment
A0A0P0XB91 Os08g0125300 protein0.0e+0050.04Show/hide
Query:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI
        R  GG  RGR RG G G GRG+  +NS  G S    G RDKSHIKC+ C   GHY+++C   K +  EAHL    +  P L++ V+++      +    +
Subjt:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI

Query:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------
        ++ +ER+ P++   D+     D+W+LDNGASNHMT  R KF++LD   TG VKFG  ST++I  KG+++F CKNGDQ   Q+                  
Subjt:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------

Query:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ
        +  ++V M ED+++V D+S  +L+M V+RT NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+L  AC++ KQ
Subjt:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ

Query:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC
         R PF   + +RAE+PLELLH D+C PI+P T+AGN+YF+LIVDD +RWMW++++K K  A EAF KFK L EN    +I+TLR+DRGGEFLS EF Q C
Subjt:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC

Query:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL
        ++ GI+RH TAPYSPQQN +VE RNR+VMA  RSL K M VP +F GEAVRHAVYLLN LP KA+G+RT FEAW GRKP L HLRVFGC+A+ K TTP+ 
Subjt:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL

Query:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG
        KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W +VV+  +  TEF V +                          ++   +  G
Subjt:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG

Query:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK
          + LP + +          PA G   P  SP S   L      V  RSL DI      V+   DE + + ++   EE + Y+EA  +  W  AM  EL+
Subjt:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK

Query:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE
        +IEKN+TW+LT LP GHKPIGLKWV+KLKK+  G+V+KHKARLVAKGYVQRQG+DFEEVFAPVARLDT+RVILA+ A++ WEVHHLD KSAFLNG+LEEE
Subjt:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE

Query:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS
        VYV Q EGF    E+H V RLSKALY LRQAPRAWN RLD+ LK LGF +CTQEQAVYTR +G+  V+VGVYVDDLIVTG +  ++  FKQQMM +FEMS
Subjt:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS

Query:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT
        DLGLLSYYLGIEV Q +  I +KQ  YAK+ILS FGM  CN T  PMEP++ LHKD +G  IDATEYR ++GCLRYLL+T P+LSY VG+ASR+MERPTT
Subjt:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT

Query:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------
        MH K                                      D  +   TG +           C Q+        C++                     
Subjt:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------

Query:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD
                              + PVFHGRSKHIDT +HFIRECVE+GQI++EFV + EQRAD +TK L   KLA  R LLGVR+L   QD
Subjt:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD

B8BDZ6 Uncharacterized protein0.0e+0049.87Show/hide
Query:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI
        R  GG  RGR RG G G GR +  +NS  G S    G RDKSHIKC+ C   GHY+++C   K +  EAHL    +  P L++ V+++      +    +
Subjt:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI

Query:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------
        ++ +ER+ P++   D+     D+W+LDNGASNHMT  R KF++LD   TG VKFG  ST++I  KG+++F CKNGDQ   Q+                  
Subjt:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------

Query:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ
        +  ++V M ED+++V D+S  +L+M V+RT NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+L  AC++ KQ
Subjt:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ

Query:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC
         R PF   + +RAE+PLELLH D+C PI+P T+AGN+YF+LIVDD +RWMW++++K K  A EAF KFK L EN    +I+TLR+DRGGEFLS EF Q C
Subjt:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC

Query:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL
        ++ GI+RH TAPYSPQQN +VE RNR+VMA  RSL K M VP +F GEAVRHAVYLLN LP KA+G+RT FEAW GRKP L HLRVFGC+A+ K TTP+ 
Subjt:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL

Query:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYS------------------------------N
        KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W +VV+  +  TEF V +                                   
Subjt:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYS------------------------------N

Query:  EFENLEDAETGFENVLPHATEIPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENE
           +L  +     +  P +T  PA G   P  SP S+  L      V  RSL DI      V+   DE + + ++   EE + Y+EA  +  W  AM  E
Subjt:  EFENLEDAETGFENVLPHATEIPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENE

Query:  LKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELE
        L++IEKN+TW+LT LP GHKPIGLKWV+KLKK+  G+V+KHKARLVAKGYVQRQG+DFEEVFAPVARLDT+RVILA+ A++ WEVHHLD KSAFLNG+LE
Subjt:  LKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELE

Query:  EEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE
        EEVYV Q EGF    E+H V RLSKALY LRQAPRAWN RLD+ LK LGF +CTQEQAVYTR +G+  V+VGVYVDDLIVTG + +++  FKQQMM +FE
Subjt:  EEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE

Query:  MSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERP
        MSDLGLLSYYLGIEV Q +  I +KQ  YAK+ILS FGM  CN T  PMEP++ LHKD +G  IDATEYR ++GCLRYLL+T P+LSY VG+ASR+MERP
Subjt:  MSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERP

Query:  TTMHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-----------------
        TTMH K                                      D  +   TG +           C Q+        C++                   
Subjt:  TTMHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-----------------

Query:  ------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD
                                + PVFHGRSKHIDT +HFIRECVE+GQI++EFV++ EQRAD +TK L   KLA  R LLGVR+L   QD
Subjt:  ------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD

Q0J8A6 Os08g0125300 protein0.0e+0050.04Show/hide
Query:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI
        R  GG  RGR RG G G GRG+  +NS  G S    G RDKSHIKC+ C   GHY+++C   K +  EAHL    +  P L++ V+++      +    +
Subjt:  RGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAI

Query:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------
        ++ +ER+ P++   D+     D+W+LDNGASNHMT  R KF++LD   TG VKFG  ST++I  KG+++F CKNGDQ   Q+                  
Subjt:  LLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------

Query:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ
        +  ++V M ED+++V D+S  +L+M V+RT NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+L  AC++ KQ
Subjt:  KNENKVQMTEDIMKVSDRSG-KLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQ

Query:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC
         R PF   + +RAE+PLELLH D+C PI+P T+AGN+YF+LIVDD +RWMW++++K K  A EAF KFK L EN    +I+TLR+DRGGEFLS EF Q C
Subjt:  ARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFC

Query:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL
        ++ GI+RH TAPYSPQQN +VE RNR+VMA  RSL K M VP +F GEAVRHAVYLLN LP KA+G+RT FEAW GRKP L HLRVFGC+A+ K TTP+ 
Subjt:  KKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHL

Query:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG
        KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W +VV+  +  TEF V +                          ++   +  G
Subjt:  KKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNE-------------------FENLEDAETG

Query:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK
          + LP + +          PA G   P  SP S   L      V  RSL DI      V+   DE + + ++   EE + Y+EA  +  W  AM  EL+
Subjt:  FENVLPHATE---------IPAIGETSP--SPSSTNTL------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELK

Query:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE
        +IEKN+TW+LT LP GHKPIGLKWV+KLKK+  G+V+KHKARLVAKGYVQRQG+DFEEVFAPVARLDT+RVILA+ A++ WEVHHLD KSAFLNG+LEEE
Subjt:  SIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEE

Query:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS
        VYV Q EGF    E+H V RLSKALY LRQAPRAWN RLD+ LK LGF +CTQEQAVYTR +G+  V+VGVYVDDLIVTG +  ++  FKQQMM +FEMS
Subjt:  VYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMS

Query:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT
        DLGLLSYYLGIEV Q +  I +KQ  YAK+ILS FGM  CN T  PMEP++ LHKD +G  IDATEYR ++GCLRYLL+T P+LSY VG+ASR+MERPTT
Subjt:  DLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTT

Query:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------
        MH K                                      D  +   TG +           C Q+        C++                     
Subjt:  MHYKR-------------------------------------DSWNGAKTGNII----------CGQQ-------ICDSSH-------------------

Query:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD
                              + PVFHGRSKHIDT +HFIRECVE+GQI++EFV + EQRAD +TK L   KLA  R LLGVR+L   QD
Subjt:  ----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD

Q10F84 Gag-pol polyprotein2.7e-28546.46Show/hide
Query:  GIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQEDAILLS
        G +RG+  G G G GRG     S+ G S   +G RDKSHIKCF C   GHY+++C   K +  EAHL    +  PTL++ V++     +R D    +++ 
Subjt:  GIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQEDAILLS

Query:  KERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------KNE
        +ER+ P +   +      D+WYLDNGASNHM+    KF+ELDE  TG+V+FG  S++QIM KG+++F CKNGDQ    +                  +  
Subjt:  KERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE------------------KNE

Query:  NKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARL
        ++V M  D ++V D++  +L+M V+R+ NRLY+I L+    VCLL SL+DP WLWH RLGHVNFH LKL+ +K++  GVP V  PN+L  AC++ KQ R 
Subjt:  NKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARL

Query:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKE
        PF   + Y AE PLELLH D+C PI+P T  GN+YF+LIVDD + WMW++++K+K  A  AF+KFK L EN     I+TLRTDRGGEFLS EF + C   
Subjt:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKE

Query:  GIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKL
        GIERH T PYSPQQN +VE RNR VMA  RSL K M VP +  GEAVRHAV+LLN LP KA+G RT FEAW G+KPHL HLRVFGC A+ K T PHLKKL
Subjt:  GIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKL

Query:  DDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDA-------------------ETGFEN
        DDRS+P+VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    TEF V +   + +    E A                   E   + 
Subjt:  DDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDA-------------------ETGFEN

Query:  VLPHAT---------EIPAIGETSP----------SPSSTNTLVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKS
          P A+           P +G +S           +  S +  V  RSL++I      V+   D+ + E ++V  EE T Y+EA  E  W  AM  EL++
Subjt:  VLPHAT---------EIPAIGETSP----------SPSSTNTLVHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKS

Query:  IEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEV
        IEKN TWSL  LP  HK IGLKWVFKLKK+  G+V+KHKARLVAKGYVQRQG+DF+EVFAPVARLDT+R IL +  ++ W+VHHLD KSAFLNG+LEEEV
Subjt:  IEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEV

Query:  YVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSD
        YV+Q EGF    ++H VY+LSKALY LRQAPRAWN RLDRS+K LGF +C QEQAVYTR  G   ++VGVYVDDLIVTG S   +  FKQQMM +FEMSD
Subjt:  YVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSD

Query:  LGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTM
        LGLL+YYLGIE                                        LHKD +G+ +D TEYR ++GCLRYLL+T P+LSY VG+ASR+MERPT M
Subjt:  LGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTM

Query:  HYKR--------------------------------------------------------DSWNGAKTGNIICGQQICDSSH------------------
        H+K                                                          SW+  K   +      C++                    
Subjt:  HYKR--------------------------------------------------------DSWNGAKTGNIICGQQICDSSH------------------

Query:  -----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN
                               + PVFHGRSKHIDT +HFIRECVE+G+I +EFV   EQRAD LTK L   +LA  R LLGVR+
Subjt:  -----------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN

Q10RM4 Retrotransposon protein, putative, unclassified2.2e-28746.51Show/hide
Query:  GRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQED
        GRG GG  RGR RG G G G GT R +S  G+     G RDKSHIKCF C   GHY+++C   K +  EAHL    +  P L++ V+++   R  C    
Subjt:  GRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKC-RGKGRDDEAHLTCAIEEEPTLMMVVSQE-GTRTRCDQED

Query:  AILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE----------------
         +++ ++R+ P++   D  G   DVW+LDNGASNHMT  R KF+ELDE  TGRVKFG  ST+QI  KG+++F CKN DQ   Q+                
Subjt:  AILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQE----------------

Query:  --KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT
          +  ++V M ED+++V D+S  +L+M  +R                        P        G   F   +  G +    GVP +  PN+L  +C++ 
Subjt:  --KNENKVQMTEDIMKVSDRS-GKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT

Query:  KQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQ
        KQAR PF   + +RAE+PLELLH D+C PI+P T++GN+YF LIVDD +RW+W+++LK K  + +AF KFK L EN    +++TLR+DRGGEFLS EF Q
Subjt:  KQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQ

Query:  FCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+RH TAPYSPQQN +VE RNR VMA  RSL K M VP +F GEAVRHAV+LLN LP KA+G+RT FEAW GRKP L HL+VFGC A+ KNT P
Subjt:  FCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPH----------
        HLKKLDDRS+P VY GVEEG KAHRL+DP RG++ +SRDV+F+EN+ W W +    G+E T+F + ++      E L    T    V P+          
Subjt:  HLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLPH----------

Query:  ------ATEIPA-----------IGETSPSPSSTNTL------------------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEA
              A E+P+            G  +P   STN+                      RSL+D+      V+   DE + E ++  SEE + Y+EA  + 
Subjt:  ------ATEIPA-----------IGETSPSPSSTNTL------------------VHLRSLSDIYANTEEVE-GGDEQENEVMMVVSEELTCYQEAVTEA

Query:  RWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEK
         W +AM+ E+++IEKN TW L  LP GH+ IGLKWV+KLKK+  G+++KHKARLVAKGYVQ+QG+DFEEVFAPVARLDT+RV+LA+ A++ W+VHHLD K
Subjt:  RWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEK

Query:  SAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKF
        SAFLNGELEEEVYV Q EGF    ++H V +L KALY LRQAPRAWNIRLDRSL+ LGF++CTQEQAVYTR  G + ++VGVYVDDLIVTG +  ++  F
Subjt:  SAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKF

Query:  KQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVG
        K+QMM +FEMSDLGLL+YYLGIEV+Q +    LKQ  YAK++LS FGM +CN+   P++P++QL KD EG  +DATEYR I+G LRYLL+T P+LSY VG
Subjt:  KQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVG

Query:  MASRYMERPTTMHYK-------------------------------RDS-------------------------WNGAKTGNIICGQQICDSSH------
        +ASR+MERPT MH+K                                DS                         W+  K   +      C++        
Subjt:  MASRYMERPTTMHYK-------------------------------RDS-------------------------WNGAKTGNIICGQQICDSSH------

Query:  -----------------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLE
                                           + PVFHGRSKHIDT +HFIRECV+ GQI+VEFV T EQRAD LTK L   KL   R LLGVR+L 
Subjt:  -----------------------------------EEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLE

Query:  SCQD
        S Q+
Subjt:  SCQD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-10726.65Show/hide
Query:  KSHIKCFTCNTMGHYTSKC-------RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHM
        K  +KC  C   GH    C         K +++E  +  A       M+                    KE        N+ +  +N  + LD+GAS+H+
Subjt:  KSHIKCFTCNTMGHYTSKC-------RGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKERLLPEMYCNDNNGGNNDVWYLDNGASNHM

Query:  TCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQEKNENKVQMTEDIMKVSDRSGKLLMSVKRTQN------------RLYKITLKT
                  +  +T  V+      I + ++G  ++  K G     + +N++++ + ED++   + +G  LMSVKR Q              + K  L  
Subjt:  TCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQEKNENKVQMTEDIMKVSDRSGKLLMSVKRTQN------------RLYKITLKT

Query:  LKQVCLLTSL--------------EDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT------KQARLPFSR-QSTYRAEKPLELL
        +K   +L ++              ++   LWH R GH++   L  +  K +     L+   N L ++C I       KQARLPF + +     ++PL ++
Subjt:  LKQVCLLTSL--------------EDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMIT------KQARLPFSR-QSTYRAEKPLELL

Query:  HADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHHTAPYSPQQNDI
        H+D+C PI+P TL    YF++ VD  T +   Y++K K D F  F+ F    E     K+  L  D G E+LS E  QFC K+GI  H T P++PQ N +
Subjt:  HADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHHTAPYSPQQNDI

Query:  VEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKAL--GERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLK----KLDDRSSPMVYFG
         E   R +    R++     +   F GEAV  A YL+N +P +AL    +T +E W  +KP+L HLRVFG   YV     H+K    K DD+S   ++ G
Subjt:  VEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKAL--GERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLK----KLDDRSSPMVYFG

Query:  VEEGCKAHRLYDPGRGKLQISRDVLFQE------------------NLEWAWNNVVSDGKEI--TEFPVMDQFYSNEFENLEDAETGFENVLPH------
         E      +L+D    K  ++RDV+  E                  + E    N  +D ++I  TEFP   +   N  + L+D++       P+      
Subjt:  VEEGCKAHRLYDPGRGKLQISRDVLFQE------------------NLEWAWNNVVSDGKEI--TEFPVMDQFYSNEFENLEDAETGFENVLPH------

Query:  ATEIP-------------------------------------AIGETSPSPS-STNTLVHLRSLS-DIYANTEEVE---------------GGDEQENEV
         TE P                                     + G  +P+ S  + T  HL+ +  D     + +E                 +E++N +
Subjt:  ATEIP-------------------------------------AIGETSPSPS-STNTLVHLRSLS-DIYANTEEVE---------------GGDEQENEV

Query:  MMVVSEELTCYQEAVT----------EARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFA
          VV    T + +             ++ W +A+  EL + + NNTW++T+ P     +  +WVF +K +  G  +++KARLVA+G+ Q+  ID+EE FA
Subjt:  MMVVSEELTCYQEAVT----------EARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFA

Query:  PVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRR
        PVAR+ + R IL+L    + +VH +D K+AFLNG L+EE+Y+   +G    ++   V +L+KA+Y L+QA R W    +++LK   F   + ++ +Y   
Subjt:  PVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRR

Query:  EG--EECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEG
        +G   E + V +YVDD+++      ++N FK+ +M KF M+DL  + +++GI +E  + +I L Q  Y K+ILS F M +CNA   P+  K         
Subjt:  EG--EECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEG

Query:  AQIDATEYRSIVGCLRY-LLNTTPNLSYVVGMASRYMERPTT---------------------------------MHYKRDSWNGAK------TG-----
           + T  RS++GCL Y +L T P+L+  V + SRY  +  +                                 + Y    W G++      TG     
Subjt:  AQIDATEYRSIVGCLRY-LLNTTPNLSYVVGMASRYMERPTT---------------------------------MHYKRDSWNGAK------TG-----

Query:  ---NIIC--------------------------------------------------GQQICDSSHEEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVN
           N+IC                                                    Q C S    P  H R+KHID  +HF RE V+N  I +E++ 
Subjt:  ---NIIC--------------------------------------------------GQQICDSSHEEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVN

Query:  TGEQRADVLTKALTGVKLAAMRQLLGV
        T  Q AD+ TK L   +   +R  LG+
Subjt:  TGEQRADVLTKALTGVKLAAMRQLLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-13028.42Show/hide
Query:  GHGRGTKRQNSARGTSNT--ENGTRDKSHIK-CFTCNTMGHYTSKC----RGK----GRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKER
        G GR  +R ++  G S    ++  R KS ++ C+ CN  GH+   C    +GK    G+ ++ +    ++    +++ +++E        E+ + LS   
Subjt:  GHGRGTKRQNSARGTSNT--ENGTRDKSHIK-CFTCNTMGHYTSKC----RGK----GRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKER

Query:  LLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQ---KAFQEKNENKVQMTEDIMKVSD----
                    G    W +D  AS+H T  R+ F        G VK G  S  +I   G +  K   G     K  +   + ++ +   I    D    
Subjt:  LLPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQ---KAFQEKNENKVQMTEDIMKVSD----

Query:  ---------RSGKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARLPFSRQS
                   G L+++    +  LY+   +  +        E    LWH R+GH++   L+++ +K L+      T   K    C+  KQ R+ F + S
Subjt:  ---------RSGKLLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARLPFSRQS

Query:  TYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHH
        + R    L+L+++D+C P+   ++ GNKYF+  +DD++R +W+Y+LK K   F+ F+KF  L+E +T  K++ LR+D GGE+ S EF ++C   GI    
Subjt:  TYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHH

Query:  TAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSP
        T P +PQ N + E  NR ++  +RS+ +   +P  F GEAV+ A YL+N  P   L        W  ++   +HL+VFGC A+         KLDD+S P
Subjt:  TAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSP

Query:  MVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQEN----------------------LEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLP
         ++ G  +    +RL+DP + K+  SRDV+F+E+                      +    NN  S      E     +      E  E  + G E V  
Subjt:  MVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQEN----------------------LEWAWNNVVSDGKEITEFPVMDQFYSNEFENLEDAETGFENVLP

Query:  HATEIPAIGETSPSPSSTNTLVHLRSLSDIYANTEEVEGGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKW
           E P  GE    P   +    + S    Y +TE V   D++E E +    +E+  + E   + +  +AM+ E++S++KN T+ L ELP G +P+  KW
Subjt:  HATEIPAIGETSPSPSSTNTLVHLRSLSDIYANTEEVEGGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKW

Query:  VFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKA
        VFKLKKD + K++++KARLV KG+ Q++GIDF+E+F+PV ++ +IR IL+L A+   EV  LD K+AFL+G+LEEE+Y+ Q EGFEV  +KH V +L+K+
Subjt:  VFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKA

Query:  LYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGE-ECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEV--EQHKGRIL
        LY L+QAPR W ++ D  +K   + K   +  VY +R  E   +++ +YVDD+++ G     + K K  +   F+M DLG     LG+++  E+   ++ 
Subjt:  LYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGE-ECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEV--EQHKGRIL

Query:  LKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKD------TEGAQIDATEYRSIVGCLRY-LLNTTPNLSYVVGMASRYMERPTTMHYKRDSW----NG
        L Q  Y +R+L  F M +      P+    +L K        E   +    Y S VG L Y ++ T P++++ VG+ SR++E P   H++   W      
Subjt:  LKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKD------TEGAQIDATEYRSIVGCLRY-LLNTTPNLSYVVGMASRYMERPTTMHYKRDSW----NG

Query:  AKTGNIIC---------------------------------------------------------------GQQI--------------------CDSS-
          TG+ +C                                                               G+++                    CDS  
Subjt:  AKTGNIIC---------------------------------------------------------------GQQI--------------------CDSS-

Query:  ----HEEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN
             +  ++H R+KHID  +H+IRE V++  + V  ++T E  AD+LTK +   K    ++L+G+ +
Subjt:  ----HEEPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRN

P25600 Putative transposon Ty5-1 protein YCL074W5.8e-2730.08Show/hide
Query:  LDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEK
        +D  +AFLN  ++E +YV Q  GF        V+ L   +Y L+QAP  WN  ++ +LK +GF +   E  +Y R   +  + + VYVDDL+V   S + 
Subjt:  LDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEK

Query:  VNKFKQQMMAKFEMSDLGLLSYYLGIEVEQ-HKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNT-TPN
         ++ KQ++   + M DLG +  +LG+ + Q   G I L    Y  +  S   +     T+ P+     L + T     D T Y+SIVG L +  NT  P+
Subjt:  VNKFKQQMMAKFEMSDLGLLSYYLGIEVEQ-HKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNT-TPN

Query:  LSYVVGMASRYMERPTTMHYK------------RDSWNGAKTGNIICGQQICDSSH
        +SY V + SR++  P  +H +            R      ++G+ +     CD+SH
Subjt:  LSYVVGMASRYMERPTTMHYK------------RDSWNGAKTGNIICGQQICDSSH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.4e-9426.64Show/hide
Query:  WYLDNGASNHMTCHREKFQELDEYFTG--RVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQ------------------EKNENKVQMTEDIMKVSDRSGK
        W LD+GA++H+T        L + +TG   V    GSTI I   G+     K+                           N   V+      +V D +  
Subjt:  WYLDNGASNHMTCHREKFQELDEYFTG--RVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQ------------------EKNENKVQMTEDIMKVSDRSGK

Query:  LLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTW--LWHVRLGHVNFHDLKLMGEKKLVVGVPL-VTQPNKLFVA---CMITKQARLPFSRQSTYRAEKP
        + +   +T++ LY+  + + + V L  S         WH RLGH     L       ++    L V  P+  F++   C+I K  ++PFS QST  + +P
Subjt:  LLMSVKRTQNRLYKITLKTLKQVCLLTSLEDPTW--LWHVRLGHVNFHDLKLMGEKKLVVGVPL-VTQPNKLFVA---CMITKQARLPFSRQSTYRAEKP

Query:  LELLHADICEPISPCTLAGN-KYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHHTAPYSP
        LE +++D+    SP     N +Y+++ VD  TR+ WLY LK K    E F  FK L+EN+ + +I T  +D GGEF  V   ++  + GI    + P++P
Subjt:  LELLHADICEPISPCTLAGN-KYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHHTAPYSP

Query:  QQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGV
        + N + E ++R+++ T  +L     +P  +   A   AVYL+N LP   L   + F+   G  P+   LRVFGC  Y      +  KLDD+S   V+ G 
Subjt:  QQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGV

Query:  EEGCKAHRLYDPGRGKLQISRDVLFQENL--------------------EWAW------------------------------------NNVVS------
             A+        +L ISR V F EN                        W                                    N+ VS      
Subjt:  EEGCKAHRLYDPGRGKLQISRDVLFQENL--------------------EWAW------------------------------------NNVVS------

Query:  -----------------DGKEITEFPVMDQFYSNEFENLEDAETGFE--NVLPHATEIPA-IGETSPSP----SSTNT-------LVH-LRSLSDIYANT
                         +G + T  P   Q  ++  +N        E  + L  +   PA    +SPSP    SS++T       L+H    L+ I  N 
Subjt:  -----------------DGKEITEFPVMDQFYSNEFENLEDAETGFE--NVLPHATEIPA-IGETSPSP----SSTNT-------LVH-LRSLSDIYANT

Query:  EE----------------VEGGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPI-GLKWVFKLKKDPNGKVLKHKA
         +                ++   +    V +    E     +A+ + RW  AM +E+ +   N+TW L   P  H  I G +W+F  K + +G + ++KA
Subjt:  EE----------------VEGGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPI-GLKWVFKLKKDPNGKVLKHKA

Query:  RLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDR
        RLVAKGY QR G+D+ E F+PV +  +IR++L +  ++SW +  LD  +AFL G L ++VY++Q  GF   +  + V +L KALY L+QAPRAW + L  
Subjt:  RLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKALYRLRQAPRAWNIRLDR

Query:  SLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCN
         L  +GF     + +++  + G+  V + VYVDD+++TG+    ++     +  +F + D   L Y+LGIE ++    + L Q  Y   +L+   M    
Subjt:  SLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCN

Query:  ATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHY--------------------------------------
            PM P  +L   +     D TEYR IVG L+YL  T P++SY V   S++M  PT  H                                       
Subjt:  ATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHY--------------------------------------

Query:  -KRD----------------SWNGAKTGNIICG----------------QQICDSSHE-----------------------EPVFHGRSKHIDTHFHFIR
         K D                SW+  K   ++                  Q IC    E                        PVFH R KHI   +HFIR
Subjt:  -KRD----------------SWNGAKTGNIICG----------------QQICDSSHE-----------------------EPVFHGRSKHIDTHFHFIR

Query:  ECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGV
          V++G + V  V+T +Q AD LTK L+          +GV
Subjt:  ECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.1e-8925.63Show/hide
Query:  DRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKCRGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKERL
        +RG  R   + + R    Q S+ G S ++N        +C  C+  GH   +C                  P L    S     T   Q+     +  + 
Subjt:  DRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKCRGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKERL

Query:  LPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTG-RVKFGYGSTIQIMEKGTVMFKCKNGD---QKAFQEKNENK---------------V
           +  N     NN  W LD+GA++H+T           Y  G  V    GSTI I   G+      +      K     N +K               V
Subjt:  LPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTG-RVKFGYGSTIQIMEKGTVMFKCKNGD---QKAFQEKNENK---------------V

Query:  QMTEDIMKVSDRSGKLLMSVKRTQNRLYKITLKTLKQVCLLTS--LEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVA--CMITKQARL
        +      +V D +  + +   +T++ LY+  + + + V +  S   +     WH RLGH +   L ++        +P++   +KL     C I K  ++
Subjt:  QMTEDIMKVSDRSGKLLMSVKRTQNRLYKITLKTLKQVCLLTS--LEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVA--CMITKQARL

Query:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGN-KYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKK
        PFS  ST  + KPLE +++D+    SP     N +Y+++ VD  TR+ WLY LK K    + F  FK L+EN+ + +I TL +D GGEF  V    +  +
Subjt:  PFSRQSTYRAEKPLELLHADICEPISPCTLAGN-KYFLLIVDDSTRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKK

Query:  EGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKK
         GI    + P++P+ N + E ++R+++    +L     VP  +   A   AVYL+N LP   L  ++ F+   G+ P+   L+VFGC  Y      +  K
Subjt:  EGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYLLNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKK

Query:  LDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENL----------------------EWAWNNVV----------------------------
        L+D+S    + G      A+       G+L  SR V F E                         W  +  +                            
Subjt:  LDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENL----------------------EWAWNNVV----------------------------

Query:  -----------------------------SDGKEITEFPVMDQ-------FYSNEFENLEDAETGFEN--------VLPH-ATEIPAIGE-TSPSPSSTN
                                      +G + T  P   Q         +N   N     +  +N          PH  T   +I E  SPS SST+
Subjt:  -----------------------------SDGKEITEFPVMDQ-------FYSNEFENLEDAETGFEN--------VLPH-ATEIPAIGE-TSPSPSSTN

Query:  T-------------LVHLRSLSDIYANTEEVEGGDEQENE-----VMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPI-GLKW
        T              V+ ++  + ++     + G  + N+       +  + E     +A+ + RW QAM +E+ +   N+TW L   P     I G +W
Subjt:  T-------------LVHLRSLSDIYANTEEVEGGDEQENE-----VMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPI-GLKW

Query:  VFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKA
        +F  K + +G + ++KARLVAKGY QR G+D+ E F+PV +  +IR++L +  ++SW +  LD  +AFL G L +EVY++Q  GF   +    V RL KA
Subjt:  VFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSKA

Query:  LYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQ
        +Y L+QAPRAW + L   L  +GF     + +++  + G   + + VYVDD+++TG+ T  +      +  +F + +   L Y+LGIE ++    + L Q
Subjt:  LYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQ

Query:  PTYAKRILSHFGMADCNATKYPM--EPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHY---KR--------------
          Y   +L+   M        PM   PK  LH  T+    D TEYR IVG L+YL  T P+LSY V   S+YM  PT  H+   KR              
Subjt:  PTYAKRILSHFGMADCNATKYPM--EPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHY---KR--------------

Query:  --------------------------------------DSWNGAKTGNIICG----------------QQICDSSHE-----------------------
                                               SW+  K   ++                  Q IC    E                       
Subjt:  --------------------------------------DSWNGAKTGNIICG----------------QQICDSSHE-----------------------

Query:  EPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGV
         PVFH R KHI   +HFIR  V++G + V  V+T +Q AD LTK L+ V      + +GV
Subjt:  EPVFHGRSKHIDTHFHFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-5935.12Show/hide
Query:  VMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIR
        V +  ++E + Y EA     W  AM++E+ ++E  +TW +  LP   KPIG KWV+K+K + +G + ++KARLVAKGY Q++GIDF E F+PV +L +++
Subjt:  VMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIR

Query:  VILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPN----EKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEEC
        +ILA++A  ++ +H LD  +AFLNG+L+EE+Y+    G+          + V  L K++Y L+QA R W ++   +L G GF +   +   + +      
Subjt:  VILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPN----EKHRVYRLSKALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEEC

Query:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATE
        + V VYVDD+I+  ++   V++ K Q+ + F++ DLG L Y+LG+E+ +    I + Q  YA  +L   G+  C  +  PM+P       + G  +DA  
Subjt:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATE

Query:  YRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMH
        YR ++G L YL  T  ++S+ V   S++ E P   H
Subjt:  YRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMH

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-2038.76Show/hide
Query:  VYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSI
        +YVDD+++TGSS   +N    Q+ + F M DLG + Y+LGI+++ H   + L Q  YA++IL++ GM DC     P+  K      T     D +++RSI
Subjt:  VYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILSHFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSI

Query:  VGCLRYLLNTTPNLSYVVGMASRYMERPT
        VG L+YL  T P++SY V +  + M  PT
Subjt:  VGCLRYLLNTTPNLSYVVGMASRYMERPT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-1546.07Show/hide
Query:  AVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVIL
        A+ +  W QAM+ EL ++ +N TW L   P+    +G KWVFK K   +G + + KARLVAKG+ Q +GI F E ++PV R  TIR IL
Subjt:  AVTEARWYQAMENELKSIEKNNTWSLTELPLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAAGGGCGCGGATTTGGTGGTATCGATCGAGGAAGATGGCGAGGATGTGGTCATGGTCATGGTCGTGGCACTAAGCGTCAAAATAGTGCGAGAGGCACTAGCAA
CACTGAAAATGGCACTCGTGATAAAAGTCACATTAAGTGTTTCACTTGCAACACGATGGGACATTACACATCGAAATGTCGTGGAAAAGGTCGTGACGACGAAGCTCATC
TAACTTGTGCCATCGAAGAAGAACCGACTTTGATGATGGTCGTGTCCCAGGAGGGGACACGCACTAGATGTGATCAGGAGGATGCCATACTACTTAGCAAAGAGCGGTTG
TTGCCAGAGATGTATTGCAACGACAATAATGGAGGAAATAATGATGTTTGGTATCTTGACAACGGTGCTAGTAACCACATGACATGCCATCGTGAGAAGTTCCAAGAATT
AGATGAATACTTCACTGGGAGGGTGAAGTTTGGCTATGGATCAACCATTCAGATCATGGAAAAAGGAACAGTCATGTTCAAGTGCAAGAACGGTGATCAGAAGGCTTTCC
AAGAGAAAAACGAAAACAAGGTGCAAATGACAGAAGATATCATGAAAGTGTCTGACAGGAGTGGAAAGCTTTTGATGTCGGTGAAGCGAACTCAAAATCGTTTGTACAAG
ATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCAT
GGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATTTGTAGCGTGCATGATTACCAAACAGGCCAGATTGCCCTTCTCCCGTCAATCAA
CATATAGAGCAGAGAAGCCGTTAGAACTACTCCATGCTGATATATGCGAACCGATTTCACCATGTACTCTTGCAGGAAACAAGTATTTTCTGTTGATCGTTGACGATTCC
ACGAGATGGATGTGGTTGTATATGTTGAAGGCAAAAAGGGACGCATTTGAAGCATTCAAGAAATTCAAACTCTTAATGGAGAACAAGACGGAGTACAAGATCAGAACACT
CCGAACGGATCGAGGTGGTGAGTTCTTATCTGTAGAATTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACACCACACCGCTCCATATTCACCACAACAAAATGACA
TTGTAGAGCATCGTAACCGCAACGTAATGGCGACGATGAGATCACTCTTCAAAAGCATGCACGTGCCTGTAAAATTTTTGGGAGAGGCAGTGAGACACGCAGTTTATTTG
TTAAACAGTCTTCCAATGAAGGCCCTTGGAGAACGCACATCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGT
AAAGAACACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGTG
GAAAACTACAAATTAGTAGAGATGTTCTATTTCAAGAGAATCTTGAATGGGCTTGGAACAATGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCCGGTGATGGACCAA
TTTTATTCTAACGAGTTCGAAAACTTGGAGGATGCAGAAACTGGGTTTGAAAATGTCTTACCACATGCAACTGAGATTCCTGCGATTGGAGAGACCAGTCCATCTCCTTC
ATCGACGAACACACTGGTCCATCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGAAGGTGGTGATGAACAAGAGAATGAAGTGATGATGGTAGTGTCTG
AAGAACTGACTTGTTACCAAGAAGCTGTTACAGAGGCACGCTGGTACCAAGCAATGGAGAACGAATTAAAATCTATTGAGAAAAACAACACATGGAGTCTGACCGAGCTT
CCACTAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAATGGAAAAGTTCTCAAGCACAAAGCAAGATTGGTTGCTAAAGGCTATGTACA
AAGACAAGGCATTGACTTTGAAGAAGTTTTTGCACCGGTTGCAAGACTTGACACCATTCGAGTCATTCTTGCACTCACTGCAAACCAAAGTTGGGAGGTACACCATCTAG
ATGAAAAGTCGGCATTTCTCAATGGAGAATTGGAAGAGGAAGTATATGTTACTCAATCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAGGGTGTATAGATTGTCAAAG
GCTCTCTACAGGTTGAGGCAAGCTCCACGAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGGTCTTGGTTTTGAAAAATGCACTCAAGAACAAGCAGTCTATACAAG
AAGAGAAGGAGAAGAATGTGTTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGTAGCACTGAAAAGGTCAATAAGTTCAAGCAACAAATGATGGCAAAAT
TTGAAATGAGCGACTTAGGCCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACATAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCC
CACTTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACGGAAGGAGCACAAATTGATGCTACGGAGTATAGAAGCAT
CGTTGGTTGTCTTAGATACTTACTGAACACAACGCCAAATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGCGAGATA
GTTGGAATGGTGCCAAGACCGGTAACATTATTTGTGGACAACAAATCTGCGATAGCTCTCATGAAGAACCTGTATTTCATGGTCGCAGCAAGCACATAGATACACATTTT
CATTTCATTCGAGAGTGTGTCGAGAATGGACAAATTATCGTTGAATTTGTCAACACTGGAGAACAACGAGCCGATGTTCTGACTAAAGCATTGACAGGAGTAAAGTTAGC
TGCTATGCGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTCAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAAAGGGCGCGGATTTGGTGGTATCGATCGAGGAAGATGGCGAGGATGTGGTCATGGTCATGGTCGTGGCACTAAGCGTCAAAATAGTGCGAGAGGCACTAGCAA
CACTGAAAATGGCACTCGTGATAAAAGTCACATTAAGTGTTTCACTTGCAACACGATGGGACATTACACATCGAAATGTCGTGGAAAAGGTCGTGACGACGAAGCTCATC
TAACTTGTGCCATCGAAGAAGAACCGACTTTGATGATGGTCGTGTCCCAGGAGGGGACACGCACTAGATGTGATCAGGAGGATGCCATACTACTTAGCAAAGAGCGGTTG
TTGCCAGAGATGTATTGCAACGACAATAATGGAGGAAATAATGATGTTTGGTATCTTGACAACGGTGCTAGTAACCACATGACATGCCATCGTGAGAAGTTCCAAGAATT
AGATGAATACTTCACTGGGAGGGTGAAGTTTGGCTATGGATCAACCATTCAGATCATGGAAAAAGGAACAGTCATGTTCAAGTGCAAGAACGGTGATCAGAAGGCTTTCC
AAGAGAAAAACGAAAACAAGGTGCAAATGACAGAAGATATCATGAAAGTGTCTGACAGGAGTGGAAAGCTTTTGATGTCGGTGAAGCGAACTCAAAATCGTTTGTACAAG
ATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCAT
GGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATTTGTAGCGTGCATGATTACCAAACAGGCCAGATTGCCCTTCTCCCGTCAATCAA
CATATAGAGCAGAGAAGCCGTTAGAACTACTCCATGCTGATATATGCGAACCGATTTCACCATGTACTCTTGCAGGAAACAAGTATTTTCTGTTGATCGTTGACGATTCC
ACGAGATGGATGTGGTTGTATATGTTGAAGGCAAAAAGGGACGCATTTGAAGCATTCAAGAAATTCAAACTCTTAATGGAGAACAAGACGGAGTACAAGATCAGAACACT
CCGAACGGATCGAGGTGGTGAGTTCTTATCTGTAGAATTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACACCACACCGCTCCATATTCACCACAACAAAATGACA
TTGTAGAGCATCGTAACCGCAACGTAATGGCGACGATGAGATCACTCTTCAAAAGCATGCACGTGCCTGTAAAATTTTTGGGAGAGGCAGTGAGACACGCAGTTTATTTG
TTAAACAGTCTTCCAATGAAGGCCCTTGGAGAACGCACATCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGT
AAAGAACACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGTG
GAAAACTACAAATTAGTAGAGATGTTCTATTTCAAGAGAATCTTGAATGGGCTTGGAACAATGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCCGGTGATGGACCAA
TTTTATTCTAACGAGTTCGAAAACTTGGAGGATGCAGAAACTGGGTTTGAAAATGTCTTACCACATGCAACTGAGATTCCTGCGATTGGAGAGACCAGTCCATCTCCTTC
ATCGACGAACACACTGGTCCATCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGAAGGTGGTGATGAACAAGAGAATGAAGTGATGATGGTAGTGTCTG
AAGAACTGACTTGTTACCAAGAAGCTGTTACAGAGGCACGCTGGTACCAAGCAATGGAGAACGAATTAAAATCTATTGAGAAAAACAACACATGGAGTCTGACCGAGCTT
CCACTAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAATGGAAAAGTTCTCAAGCACAAAGCAAGATTGGTTGCTAAAGGCTATGTACA
AAGACAAGGCATTGACTTTGAAGAAGTTTTTGCACCGGTTGCAAGACTTGACACCATTCGAGTCATTCTTGCACTCACTGCAAACCAAAGTTGGGAGGTACACCATCTAG
ATGAAAAGTCGGCATTTCTCAATGGAGAATTGGAAGAGGAAGTATATGTTACTCAATCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAGGGTGTATAGATTGTCAAAG
GCTCTCTACAGGTTGAGGCAAGCTCCACGAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGGTCTTGGTTTTGAAAAATGCACTCAAGAACAAGCAGTCTATACAAG
AAGAGAAGGAGAAGAATGTGTTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGTAGCACTGAAAAGGTCAATAAGTTCAAGCAACAAATGATGGCAAAAT
TTGAAATGAGCGACTTAGGCCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACATAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCC
CACTTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACGGAAGGAGCACAAATTGATGCTACGGAGTATAGAAGCAT
CGTTGGTTGTCTTAGATACTTACTGAACACAACGCCAAATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGCGAGATA
GTTGGAATGGTGCCAAGACCGGTAACATTATTTGTGGACAACAAATCTGCGATAGCTCTCATGAAGAACCTGTATTTCATGGTCGCAGCAAGCACATAGATACACATTTT
CATTTCATTCGAGAGTGTGTCGAGAATGGACAAATTATCGTTGAATTTGTCAACACTGGAGAACAACGAGCCGATGTTCTGACTAAAGCATTGACAGGAGTAAAGTTAGC
TGCTATGCGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTCAGGATTAG
Protein sequenceShow/hide protein sequence
MNKGRGFGGIDRGRWRGCGHGHGRGTKRQNSARGTSNTENGTRDKSHIKCFTCNTMGHYTSKCRGKGRDDEAHLTCAIEEEPTLMMVVSQEGTRTRCDQEDAILLSKERL
LPEMYCNDNNGGNNDVWYLDNGASNHMTCHREKFQELDEYFTGRVKFGYGSTIQIMEKGTVMFKCKNGDQKAFQEKNENKVQMTEDIMKVSDRSGKLLMSVKRTQNRLYK
ITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLFVACMITKQARLPFSRQSTYRAEKPLELLHADICEPISPCTLAGNKYFLLIVDDS
TRWMWLYMLKAKRDAFEAFKKFKLLMENKTEYKIRTLRTDRGGEFLSVEFTQFCKKEGIERHHTAPYSPQQNDIVEHRNRNVMATMRSLFKSMHVPVKFLGEAVRHAVYL
LNSLPMKALGERTSFEAWMGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNNVVSDGKEITEFPVMDQ
FYSNEFENLEDAETGFENVLPHATEIPAIGETSPSPSSTNTLVHLRSLSDIYANTEEVEGGDEQENEVMMVVSEELTCYQEAVTEARWYQAMENELKSIEKNNTWSLTEL
PLGHKPIGLKWVFKLKKDPNGKVLKHKARLVAKGYVQRQGIDFEEVFAPVARLDTIRVILALTANQSWEVHHLDEKSAFLNGELEEEVYVTQSEGFEVPNEKHRVYRLSK
ALYRLRQAPRAWNIRLDRSLKGLGFEKCTQEQAVYTRREGEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGLLSYYLGIEVEQHKGRILLKQPTYAKRILS
HFGMADCNATKYPMEPKAQLHKDTEGAQIDATEYRSIVGCLRYLLNTTPNLSYVVGMASRYMERPTTMHYKRDSWNGAKTGNIICGQQICDSSHEEPVFHGRSKHIDTHF
HFIRECVENGQIIVEFVNTGEQRADVLTKALTGVKLAAMRQLLGVRNLESCQD