; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g10150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g10150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr9:8579964..8583482
RNA-Seq ExpressionMoc09g10150
SyntenyMoc09g10150
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN62051.1 hypothetical protein VITISV_016641 [Vitis vinifera]5.4e-9426.04Show/hide
Query:  EKISNFPYLKFGCIYSTKKQ-PSELNMTESLFSDDEEGLNISIS-SQDSPPFLDIHAECATDLAEFSLNGELTNLFDPPLFSPATQD-KSVFCEDPPNIE
        E +   PYL      ST++  PS   +T SL +      N S+S    S P L + A  +    E  +N E         F  A          D PN E
Subjt:  EKISNFPYLKFGCIYSTKKQ-PSELNMTESLFSDDEEGLNISIS-SQDSPPFLDIHAECATDLAEFSLNGELTNLFDPPLFSPATQD-KSVFCEDPPNIE

Query:  GEIQIPSFFDSVGIALSPETKLQQIDKETIKSLWSSKDVAWASIEAEG-SEGRSGGILTLWDETKIQIR--EVLEGSHSVSISISFFNLKEIVITNIYGP
         E+  P+  + V  +++P         E   +L           + EG S  R   +  +     I++R  EV+ GS SVSI  +    + + ++ +YGP
Subjt:  GEIQIPSFFDSVGIALSPETKLQQIDKETIKSLWSSKDVAWASIEAEG-SEGRSGGILTLWDETKIQIR--EVLEGSHSVSISISFFNLKEIVITNIYGP

Query:  TDYKSRKHLWSELRNISGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSN
         +   RK  W EL +I+G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL++ PL +  YTWS M  N     LDRF  S +W+ +F  
Subjt:  TDYKSRKHLWSELRNISGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSN

Query:  SSVNRVERITSDHFPIVLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLD
        S    + R TSDH+PIVL+   F  GP   + E                                                  +  + +  I+A ++  D
Subjt:  SSVNRVERITSDHFPIVLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLD

Query:  IKDENFGLSLDEIERRGSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGI
          ++  GLS + + +R   K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ +G+ L N   I++EI  +F+ LY     
Subjt:  IKDENFGLSLDEIERRGSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGI

Query:  SRFTPRDISWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN----------------------
          +    + W PI  + +S L+  F E E++ A+  + R+KAPGPDGFTI  F   W   K D +R+FAEF R+G +N                      
Subjt:  SRFTPRDISWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -AC-----------------------------------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQ
          C                                                           GNPK   FW P+ ++I ++LD W++  LS GGR+TL Q
Subjt:  -AC-----------------------------------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQ

Query:  AVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCF
        + L  +P Y+LSLF++P  ++ + E+  R+F W     G   HL  WD+  +P + GGLG G +  +N ALL KW WR+  E SALW +V+ SIY +   
Subjt:  AVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCF

Query:  DWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYI
         W+         R PW  I   +++  KF+   V +G R  FW  LW G+  L  ++P +  V ++    I      S   SW    RR+L D E  +  
Subjt:  DWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYI

Query:  ELLDNLNLVLLNDD-EDKLSWNLNKNGTF
         L+ +L+ + ++    DK SW+++ +G F
Subjt:  ELLDNLNLVLLNDD-EDKLSWNLNKNGTF

CAN69126.1 hypothetical protein VITISV_008195 [Vitis vinifera]6.1e-9825.78Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+WS ++  WA++ A G+   SGGIL +WD  K++  EV+ GS SVSI  +    + + ++ +YGP +   RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL++ PL +  YTWS M  N     LDRF  S +W+ +F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + +  I+A ++  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ +G+ L N   I++EI  +F+ LY       +    + W PI  +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +S L+  F E E++ A+  + R+KAPGPDGFTI  F   W   K D +R+FAEF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  -----------------------------------------------------------------------------------------------GNPKL
                                                                                                       GNPK 
Subjt:  -----------------------------------------------------------------------------------------------GNPKL

Query:  EAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNK
          FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R+F W     G   HL  WD+  +P + GGLG G +  +
Subjt:  EAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNK

Query:  NHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNP
        N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF+   V +G R  FW  LW G+  L  ++P +  V ++ 
Subjt:  NHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNP

Query:  YLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
           I      S   SW    RR+L D E  +   L+ +L+ + ++    DK SW+++ +G F
Subjt:  YLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

CAN74319.1 hypothetical protein VITISV_035345 [Vitis vinifera]3.4e-9626.39Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        I +  ETK  + D+  + SLW++++  WA + A G+   SGGIL +WD  K+   EV+ GS SVS+  +    ++  ++ +YGP     RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
         G     WC+GGDFN+ R  S++  GGR+T  MK  +  I E EL++ PL +  +TWS M  +     LDRF  S +W+ LF     + + R TSDH+ I
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-----------------------------QRK--------------------EAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                               RK                    E +  I+ +I+  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-----------------------------QRK--------------------EAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I  +++  G+ L N   I++EI  +F+ LY S     +    + W PIS +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLNAC------------------------------------
         +S L+  F E E+  A+  + R KAPGPDGFTI  F   W   K D +R+F EF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLNAC------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------GNPKLEAFWFPIKDKI
                                                                                            GNPK  +FW P+ ++I
Subjt:  ------------------------------------------------------------------------------------GNPKLEAFWFPIKDKI

Query:  LKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWR
          +LD W++  LS GGR+TL ++ L  +P Y+LSLF++P  ++ + E+  R F W     G   HL  W++  +    GGLG+GM+  +N ALL KW WR
Subjt:  LKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWR

Query:  FSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDST
        +  E SALW +V+ SIY +    W+         R PW  I + ++   KF+   V +G R  FW  LW G+  L  +FP + R   +  + I     ST
Subjt:  FSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDST

Query:  TS-SWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
           SW    RR+L D E  E   L+ +L+ + L+    DK SW+L+ +G F
Subjt:  TS-SWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]9.2e-9424.8Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+WS ++  WA++ A G+   SGGIL +WD  K++  EV+ GS SVSI  +    + + ++ +YGP +   RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL++ PL +  YTWS M  N     LDRF  S +W+ +F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + +  I+A ++  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ +G+ L N   I++EI  +F+ LY       +    + W PI  +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +S L+  F E E++ A+  + R+KAPGPDGFTI  F   W   K D +R+FAEF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR
                                         GNPK   FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R
Subjt:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR

Query:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF
        +F W     G   HL  WD+  +P + GGLG G +  +N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF
Subjt:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF

Query:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
        +   V +G R  FW  LW G+  L  ++P +  V ++    I      S   SW    RR+L D E  +   L+ +L+ + ++    DK SW+++ +G F
Subjt:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.2e-9425Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+W++++  WA++ A G+   SGGIL +WD  K+   EV+ GS SVSI  +    + + ++ +YGP +   RK LW EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL+++PL +  +TWS M  N     LDRF  S +W+  F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + ++ I++ +   D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ NG  + N   I++EI  +F+ LY S     +    + W PIS +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +  L+  F E E+  A+  + R+KAPGPDGFTI  F   W+  K D +++F EF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR
                                         GNPK   FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R
Subjt:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR

Query:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF
         F W     G   HL  WD+  +P + GGLG G +  +N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF
Subjt:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF

Query:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDSTTS-SWRIEVRRSLKDGEFAEYIELLDNLN-LVLLNDDEDKLSWNLNKNGTF
        +   V NG R  FW  LW GE  L  ++P + RV ++    I     ST   SW    RR+L D E  +   L+ + + L + +   DK SW+L+ +G F
Subjt:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDSTTS-SWRIEVRRSLKDGEFAEYIELLDNLN-LVLLNDDEDKLSWNLNKNGTF

TrEMBL top hitse value%identityAlignment
A0A438G038 Transposon TX1 uncharacterized 149 kDa protein4.4e-9424.8Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+WS ++  WA++ A G+   SGGIL +WD  K++  EV+ GS SVSI  +    + + ++ +YGP +   RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL++ PL +  YTWS M  N     LDRF  S +W+ +F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + +  I+A ++  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ +G+ L N   I++EI  +F+ LY       +    + W PI  +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +S L+  F E E++ A+  + R+KAPGPDGFTI  F   W   K D +R+FAEF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR
                                         GNPK   FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R
Subjt:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR

Query:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF
        +F W     G   HL  WD+  +P + GGLG G +  +N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF
Subjt:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF

Query:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
        +   V +G R  FW  LW G+  L  ++P +  V ++    I      S   SW    RR+L D E  +   L+ +L+ + ++    DK SW+++ +G F
Subjt:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein1.5e-9425Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+W++++  WA++ A G+   SGGIL +WD  K+   EV+ GS SVSI  +    + + ++ +YGP +   RK LW EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL+++PL +  +TWS M  N     LDRF  S +W+  F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + ++ I++ +   D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ NG  + N   I++EI  +F+ LY S     +    + W PIS +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +  L+  F E E+  A+  + R+KAPGPDGFTI  F   W+  K D +++F EF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR
                                         GNPK   FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R
Subjt:  ---------------------------------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIR

Query:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF
         F W     G   HL  WD+  +P + GGLG G +  +N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF
Subjt:  KFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKF

Query:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDSTTS-SWRIEVRRSLKDGEFAEYIELLDNLN-LVLLNDDEDKLSWNLNKNGTF
        +   V NG R  FW  LW GE  L  ++P + RV ++    I     ST   SW    RR+L D E  +   L+ + + L + +   DK SW+L+ +G F
Subjt:  SSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDSTTS-SWRIEVRRSLKDGEFAEYIELLDNLN-LVLLNDDEDKLSWNLNKNGTF

A5AK27 Reverse transcriptase domain-containing protein3.0e-9825.78Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        + +  ETK ++ D+  + S+WS ++  WA++ A G+   SGGIL +WD  K++  EV+ GS SVSI  +    + + ++ +YGP +   RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        +G     WC+GGDFN+ R  S++  G R+T  MK F+  I + EL++ PL +  YTWS M  N     LDRF  S +W+ +F  S    + R TSDH+PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                                                  +  + +  I+A ++  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-------------------------------------------------QRKEAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I E+++ +G+ L N   I++EI  +F+ LY       +    + W PI  +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
         +S L+  F E E++ A+  + R+KAPGPDGFTI  F   W   K D +R+FAEF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  -------------------------------------------------------------------------------------AC-------------
                                                                                              C             
Subjt:  -------------------------------------------------------------------------------------AC-------------

Query:  -----------------------------------------------------------------------------------------------GNPKL
                                                                                                       GNPK 
Subjt:  -----------------------------------------------------------------------------------------------GNPKL

Query:  EAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNK
          FW P+ ++I ++LD W++  LS GGR+TL Q+ L  +P Y+LSLF++P  ++ + E+  R+F W     G   HL  WD+  +P + GGLG G +  +
Subjt:  EAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNK

Query:  NHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNP
        N ALL KW WR+  E SALW +V+ SIY +    W+         R PW  I   +++  KF+   V +G R  FW  LW G+  L  ++P +  V ++ 
Subjt:  NHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNP

Query:  YLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
           I      S   SW    RR+L D E  +   L+ +L+ + ++    DK SW+++ +G F
Subjt:  YLSIHEAWD-STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

A5B9F0 Reverse transcriptase domain-containing protein1.6e-9626.39Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        I +  ETK  + D+  + SLW++++  WA + A G+   SGGIL +WD  K+   EV+ GS SVS+  +    ++  ++ +YGP     RK  W EL +I
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
         G     WC+GGDFN+ R  S++  GGR+T  MK  +  I E EL++ PL +  +TWS M  +     LDRF  S +W+ LF     + + R TSDH+ I
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEK-----------------------------QRK--------------------EAEDKIVAEISLLDIKDENFGLSLDEIERR
        VL+   F  GP   + E                               RK                    E +  I+ +I+  D  ++  GLS + + +R
Subjt:  VLDAGGFSLGPLSLQNEK-----------------------------QRK--------------------EAEDKIVAEISLLDIKDENFGLSLDEIERR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
           K +L  L +REE +  QK++V W+K GD NS FFH+    R+ +  I  +++  G+ L N   I++EI  +F+ LY S     +    + W PIS +
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLNAC------------------------------------
         +S L+  F E E+  A+  + R KAPGPDGFTI  F   W   K D +R+F EF R+G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLNAC------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------GNPKLEAFWFPIKDKI
                                                                                            GNPK  +FW P+ ++I
Subjt:  ------------------------------------------------------------------------------------GNPKLEAFWFPIKDKI

Query:  LKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWR
          +LD W++  LS GGR+TL ++ L  +P Y+LSLF++P  ++ + E+  R F W     G   HL  W++  +    GGLG+GM+  +N ALL KW WR
Subjt:  LKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLAKWGWR

Query:  FSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDST
        +  E SALW +V+ SIY +    W+         R PW  I + ++   KF+   V +G R  FW  LW G+  L  +FP + R   +  + I     ST
Subjt:  FSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFRVSSNPYLSIHEAWDST

Query:  TS-SWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF
           SW    RR+L D E  E   L+ +L+ + L+    DK SW+L+ +G F
Subjt:  TS-SWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDD-EDKLSWNLNKNGTF

M5WPQ5 Reverse transcriptase domain-containing protein4.0e-9526.26Show/hide
Query:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI
        I +  ETK + +D++ +  +W S+   W       S GRSGGI  LW+   + + + + G  SVSI I      +  ++ IYGP   + R   W EL ++
Subjt:  IALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNI

Query:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
         G+    WC+GGDFN+ R+ +++S+ GR+TK M+ FN  I+E  L +  L N  +TWS +  N+    LDRF VS  W+  F +     + RITSDH PI
Subjt:  SGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VLDAGGFSLGPLSLQNEKQRKEAED--------------------KIVAEISLLDIK-----DENFG-------------LSLDEIE-----------RR
         LD+     GP   + E       D                    K +  + +L  K      E FG             L LD+ E            R
Subjt:  VLDAGGFSLGPLSLQNEKQRKEAED--------------------KIVAEISLLDIK-----DENFG-------------LSLDEIE-----------RR

Query:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ
         +L   + +L  REE    Q+ KV W + GD N+ FFHR     +++  I +++  +   +     IE+E+  FF  LY       +    ++W PIS  
Subjt:  GSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQ

Query:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------
        ++ WL+R F+  EV  A+ + G++K+PGPDGF++ FF   W+  KGD M++  +FF++G +N                                      
Subjt:  DSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQLN--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------AC----------------------------
                                                                              +C                            
Subjt:  ----------------------------------------------------------------------AC----------------------------

Query:  -------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSA
               GNP+   FW P+ +K+ K+L +WKR  LS+GGRLTL QAVL SIP YY+SLF+MP  ++ + E+ +R F W     G   HL RW+  ++   
Subjt:  -------GNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSA

Query:  NGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKD
         GGLGIG L  +N AL AKW WRF LET++LW R++ S Y  D   W+T     + CR+PW  I K +    +    SV NG++  FW  LWL E  LKD
Subjt:  NGGLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKD

Query:  KFPLMFRVS--SNPYLSIHEAWDSTTSSWRIEVRRSLKDGEFAEYIELLDNL-NLVLLNDDEDKLSWNLNKNGTF
         FP +  +S   N  ++          +W  + RR+L + E AE + LLD L N+ L     D+ SW + + G+F
Subjt:  KFPLMFRVS--SNPYLSIHEAWDSTTSSWRIEVRRSLKDGEFAEYIELLDNL-NLVLLNDDEDKLSWNLNKNGTF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.2e-1022.54Show/hide
Query:  WASI-EAEGSEGRSGGILTLWDETKI---QIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNISGFRDKYWCMGGDFNITRWVSDR
        W  I +A G + ++G  + + D+T     +I+   EG H + +  S    +E+ I NIY P     R  +   L ++    D +  + GDFN    + DR
Subjt:  WASI-EAEGSEGRSGGILTLWDETKI---QIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELRNISGFRDKYWCMGGDFNITRWVSDR

Query:  SSGGRITKGMKRFNSIIEELELMEI-----PLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERIT---SDHFPIVLD----------AG
        S+  ++ K  +  NS + + +L++I     P S  +YT+    ++ T+S +D    S+        S   R E IT   SDH  I L+          + 
Subjt:  SSGGRITKGMKRFNSIIEELELMEI-----PLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERIT---SDHFPIVLD----------AG

Query:  GFSLGPLSLQNEKQRKEAEDKIVAEISLLDIKDENFGLSLDEIERRGSLKTDLLNLYVREEQ------------NLIQKSKVH-----------------
         + L  L L +     E + +I       + KD  +    D  +     K   LN Y R+++             L ++ + H                 
Subjt:  GFSLGPLSLQNEKQRKEAEDKIVAEISLLDIKDENFGLSLDEIERRGSLKTDLLNLYVREEQ------------NLIQKSKVH-----------------

Query:  -------WLKAGDENSNFFH----------RFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCS--EGISRF-TPRDISWRP-ISCQDSSW
                 K  +  S FF           R +  ++ K +I  IK+  G    +  EI+  I  ++  LY +  E +    T  D    P ++ ++   
Subjt:  -------WLKAGDENSNFFH----------RFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCS--EGISRF-TPRDISWRP-ISCQDSSW

Query:  LQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL
        L R    +E+ + + +L   K+PGPDGFT EF+ ++ +      ++LF    + G L
Subjt:  LQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL

P08548 LINE-1 reverse transcriptase homolog6.3e-1322.76Show/hide
Query:  LSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDET----KIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELR
        L P+    Q    T+K  +  K   W+SI     + +  GI  L+ +       +IR+  +G H + +        EI I NIY P ++ + + +   L 
Subjt:  LSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDET----KIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSRKHLWSELR

Query:  NISGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEI----PLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERIT
        ++S        + GDFN    V DRSS  +++K +   NS I+ L+L +I      +  +YT+     + T+S +D     +    L     +  +  I 
Subjt:  NISGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEI----PLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERIT

Query:  SDHFPIVLDAGG----------FSLGPLSLQNEKQRKEAEDKIVAEISLLDIKDENFG--------------------LSLDEIERRGSLKTDLLNL---
        SDH  I ++             + L  L L++     E + +I   +   + +D N+                     L   E E   +L   L  L   
Subjt:  SDHFPIVLDAGG----------FSLGPLSLQNEKQRKEAEDKIVAEISLLDIKDENFG--------------------LSLDEIERRGSLKTDLLNL---

Query:  --------------YVREEQNLIQKSKVHWLKAGDENSNFFHRF---------LAARKR-KARISEIKDGNGVSLVNQREIEKEISSFFDALYCS--EGI
                       +R E N I+  ++   +     S FF +          L  +KR K+ IS I++GN     +  EI+K ++ ++  LY    E +
Subjt:  --------------YVREEQNLIQKSKVHWLKAGDENSNFFHRF---------LAARKR-KARISEIKDGNGVSLVNQREIEKEISSFFDALYCS--EGI

Query:  SRFTP--RDISWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL
                      +S ++   L R    +E+ S ++NL + K+PGPDGFT EF+  F +      + LF    + G L
Subjt:  SRFTP--RDISWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL

P0C2F6 Putative ribonuclease H protein At1g657506.5e-1826.22Show/hide
Query:  IKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLA
        I +++  ++  W+   LS  GRLTL +AVL S+P++ +S   +P  I  + ++  R F WG  +    +HL +W     P   GGLG+    + N AL++
Subjt:  IKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANGGLGIGMLLNKNHALLA

Query:  KWGWRFSLETSALWRRVVASIYET-DCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVK-NGKRTLFWHHLWL-GESFLK----DKFPLMFRVSSN
        K GWR   E ++LW  V+   Y   +  D   L  K     S W +I    + V       +  +G++  FW   W+ G+  L+    ++      V + 
Subjt:  KWGWRFSLETSALWRRVVASIYET-DCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVK-NGKRTLFWHHLWL-GESFLK----DKFPLMFRVSSN

Query:  PYLSIHEAWD------STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDDEDKLSWNLNKNGTF
                WD       TT++ R+E+R  + D                L+    D+LSW  +++G F
Subjt:  PYLSIHEAWD------STTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDDEDKLSWNLNKNGTF

P14381 Transposon TX1 uncharacterized 149 kDa protein4.2e-0926.7Show/hide
Query:  ENFGLSLDEIERRGSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRF
        E+  L  + +ER+ +L+    N+  R+ +    +S++  L   D  S FF+     +  + +I+ +   +G  L +   I     SF+  L+  + IS  
Subjt:  ENFGLSLDEIERRGSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRF

Query:  TPRDI--SWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL
           ++      +S +    L+      E+  AL+ +  NK+PG DG TIEFF  FW     DF R+  E F+ G+L
Subjt:  TPRDI--SWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.9e-1824.25Show/hide
Query:  DKYWCMGGDFNITRWVSDRSSGGRIT---KGMKRFNSIIEELELMEIPLSNGKYTWSK-MGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI
        D+   + GDF+     SD  S  + +   +G++ F + + + +L++IP     YTWS    +N     LDR   + DW + F ++         SDH P 
Subjt:  DKYWCMGGDFNITRWVSDRSSGGRIT---KGMKRFNSIIEELELMEIPLSNGKYTWSK-MGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITSDHFPI

Query:  VL---------------------------------------DAGGFSLGP--------LSLQNEKQRKEAEDKIVAEI-SLLDIKDENFGLSLDEIERRG
        ++                                        +  FSLG           L N +     + K    + SL  I+ +      D + R  
Subjt:  VL---------------------------------------DAGGFSLGP--------LSLQNEKQRKEAEDKIVAEI-SLLDIKDENFGLSLDEIERRG

Query:  SLKTDLLNLYVREEQNLI-QKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDI----SWRP
         +     N +    ++   QKS++ WL+ GD N+ FFH+ + A + K  I  ++  + V + N  ++++ I +++  L  S+     TP  +       P
Subjt:  SLKTDLLNLYVREEQNLI-QKSKVHWLKAGDENSNFFHRFLAARKRKARISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDI----SWRP

Query:  ISCQD--SSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL
          C D  +S L     + E+ +A+  + RNKAPGPD FT EFF + W   K   +    EFFR G L
Subjt:  ISCQD--SSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAEFFRNGQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTGAGCAGGATCTGGACAAGAAAGAGAAAATTAGTAATTTTCCATACCTAAAATTCGGCTGCATTTATTCAACCAAAAAGCAGCCATCTGAATTAAAT
ATGACTGAATCTTTATTCTCAGATGATGAGGAAGGTCTAAACATTAGCATCAGTAGTCAGGATTCTCCACCTTTTTTGGATATTCACGCAGAATGTGCAACGGAT
CTTGCTGAATTTTCCTTGAATGGTGAGTTGACTAATCTTTTCGACCCTCCCCTATTTTCTCCAGCTACACAAGACAAGTCAGTTTTTTGTGAGGACCCTCCGAAT
ATTGAGGGTGAAATTCAGATTCCTTCCTTCTTTGACTCGGTTGGAATTGCTCTCTCACCTGAGACAAAGTTACAACAGATTGACAAAGAGACAATTAAATCGTTA
TGGAGCTCTAAAGATGTAGCGTGGGCCAGCATTGAAGCAGAAGGTTCAGAAGGTCGATCGGGGGGGATTCTTACCCTCTGGGATGAGACTAAAATTCAGATTCGG
GAAGTATTAGAGGGCAGCCATTCAGTGTCAATCTCCATCTCATTCTTTAATTTAAAGGAAATTGTTATAACTAACATCTATGGTCCAACTGATTACAAAAGTCGG
AAGCATCTGTGGTCAGAATTACGAAATATATCCGGCTTTAGAGACAAATACTGGTGTATGGGGGGAGATTTTAACATCACAAGGTGGGTATCAGACAGGTCTTCA
GGAGGTAGAATAACAAAAGGAATGAAGAGATTTAATAGTATAATTGAAGAATTAGAATTGATGGAAATTCCACTTTCTAATGGCAAGTACACTTGGTCGAAAATG
GGCAACAATAGTACTCATTCATTACTGGACAGGTTCTTTGTTTCGCAAGATTGGGACACTCTCTTCAGCAACTCTAGTGTAAATAGAGTGGAACGAATAACGTCT
GACCATTTTCCTATTGTTCTTGATGCTGGGGGATTTTCATTGGGGCCCCTCTCCCTTCAGAATGAAAAGCAGCGAAAGGAAGCGGAGGATAAAATAGTAGCTGAA
ATATCCTTATTGGATATTAAGGATGAGAATTTTGGTCTCTCCTTGGATGAAATCGAACGAAGAGGTTCTTTGAAGACAGACTTACTGAATTTGTATGTAAGGGAG
GAGCAAAACTTGATACAAAAAAGTAAGGTACATTGGCTGAAGGCAGGGGATGAAAATAGTAATTTTTTTCATAGATTCTTAGCTGCCCGCAAAAGGAAGGCCCGG
ATTTCTGAGATTAAGGATGGAAATGGTGTATCTCTTGTTAACCAAAGAGAAATTGAGAAGGAAATCTCCAGCTTTTTCGACGCCTTGTACTGTAGTGAAGGAATT
TCACGTTTTACTCCAAGAGACATCTCTTGGAGGCCTATCTCTTGTCAAGACAGTAGCTGGTTACAAAGGAATTTTGAGGAAGCTGAGGTTTGGTCTGCTCTAAAA
AATTTGGGAAGAAATAAGGCGCCAGGGCCAGACGGATTTACCATTGAATTCTTCATCAAATTTTGGCAGCATTGGAAAGGAGATTTTATGCGACTTTTTGCTGAG
TTCTTTCGAAATGGCCAATTGAATGCTTGTGGAAATCCAAAGCTTGAAGCCTTTTGGTTCCCTATTAAGGACAAGATCCTAAAGAAACTCGATAGATGGAAGCGA
TATCAATTATCTAGAGGAGGTAGACTGACTTTGTGCCAAGCGGTGTTGGGTAGCATCCCTTTGTATTACTTATCGCTTTTTCAAATGCCTGGACATATCAGCGAG
CAATTTGAAAAGTATATCAGAAAATTTTTTTGGGGAGATGGTTCAAATGGGTCTTTAAAGCATTTGGCTCGATGGGATTTAGCTTCTCGCCCTAGTGCAAATGGT
GGCTTGGGCATTGGTATGTTATTGAATAAGAACCATGCTCTTCTTGCGAAATGGGGTTGGAGGTTTAGTTTGGAAACTTCTGCGCTATGGCGCCGGGTTGTGGCC
AGCATTTATGAAACAGATTGTTTTGATTGGAACACTCTAGAAAAGAAGAATTTGGGGTGTCGTAGCCCTTGGAACAACATTCAAAAACAGTGGAAAAAGGTACAG
AAATTCTCTTCATTATCAGTAAAAAATGGCAAACGTACTTTGTTTTGGCATCACTTGTGGTTGGGCGAGTCTTTTTTAAAGGACAAGTTTCCACTTATGTTTCGG
GTTTCCTCGAACCCCTACCTTTCGATTCATGAAGCTTGGGATTCAACCACCTCCTCTTGGAGGATTGAAGTTAGAAGGTCGCTAAAAGATGGAGAATTTGCTGAA
TATATTGAATTATTGGACAATCTGAATTTGGTTTTGTTAAATGATGATGAAGACAAACTTTCTTGGAATCTGAATAAAAATGGGACTTTTCTGTCAGCTCCTTAT
GTAACTCTAAGGCTCCCTTTGCTACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTGAGCAGGATCTGGACAAGAAAGAGAAAATTAGTAATTTTCCATACCTAAAATTCGGCTGCATTTATTCAACCAAAAAGCAGCCATCTGAATTAAAT
ATGACTGAATCTTTATTCTCAGATGATGAGGAAGGTCTAAACATTAGCATCAGTAGTCAGGATTCTCCACCTTTTTTGGATATTCACGCAGAATGTGCAACGGAT
CTTGCTGAATTTTCCTTGAATGGTGAGTTGACTAATCTTTTCGACCCTCCCCTATTTTCTCCAGCTACACAAGACAAGTCAGTTTTTTGTGAGGACCCTCCGAAT
ATTGAGGGTGAAATTCAGATTCCTTCCTTCTTTGACTCGGTTGGAATTGCTCTCTCACCTGAGACAAAGTTACAACAGATTGACAAAGAGACAATTAAATCGTTA
TGGAGCTCTAAAGATGTAGCGTGGGCCAGCATTGAAGCAGAAGGTTCAGAAGGTCGATCGGGGGGGATTCTTACCCTCTGGGATGAGACTAAAATTCAGATTCGG
GAAGTATTAGAGGGCAGCCATTCAGTGTCAATCTCCATCTCATTCTTTAATTTAAAGGAAATTGTTATAACTAACATCTATGGTCCAACTGATTACAAAAGTCGG
AAGCATCTGTGGTCAGAATTACGAAATATATCCGGCTTTAGAGACAAATACTGGTGTATGGGGGGAGATTTTAACATCACAAGGTGGGTATCAGACAGGTCTTCA
GGAGGTAGAATAACAAAAGGAATGAAGAGATTTAATAGTATAATTGAAGAATTAGAATTGATGGAAATTCCACTTTCTAATGGCAAGTACACTTGGTCGAAAATG
GGCAACAATAGTACTCATTCATTACTGGACAGGTTCTTTGTTTCGCAAGATTGGGACACTCTCTTCAGCAACTCTAGTGTAAATAGAGTGGAACGAATAACGTCT
GACCATTTTCCTATTGTTCTTGATGCTGGGGGATTTTCATTGGGGCCCCTCTCCCTTCAGAATGAAAAGCAGCGAAAGGAAGCGGAGGATAAAATAGTAGCTGAA
ATATCCTTATTGGATATTAAGGATGAGAATTTTGGTCTCTCCTTGGATGAAATCGAACGAAGAGGTTCTTTGAAGACAGACTTACTGAATTTGTATGTAAGGGAG
GAGCAAAACTTGATACAAAAAAGTAAGGTACATTGGCTGAAGGCAGGGGATGAAAATAGTAATTTTTTTCATAGATTCTTAGCTGCCCGCAAAAGGAAGGCCCGG
ATTTCTGAGATTAAGGATGGAAATGGTGTATCTCTTGTTAACCAAAGAGAAATTGAGAAGGAAATCTCCAGCTTTTTCGACGCCTTGTACTGTAGTGAAGGAATT
TCACGTTTTACTCCAAGAGACATCTCTTGGAGGCCTATCTCTTGTCAAGACAGTAGCTGGTTACAAAGGAATTTTGAGGAAGCTGAGGTTTGGTCTGCTCTAAAA
AATTTGGGAAGAAATAAGGCGCCAGGGCCAGACGGATTTACCATTGAATTCTTCATCAAATTTTGGCAGCATTGGAAAGGAGATTTTATGCGACTTTTTGCTGAG
TTCTTTCGAAATGGCCAATTGAATGCTTGTGGAAATCCAAAGCTTGAAGCCTTTTGGTTCCCTATTAAGGACAAGATCCTAAAGAAACTCGATAGATGGAAGCGA
TATCAATTATCTAGAGGAGGTAGACTGACTTTGTGCCAAGCGGTGTTGGGTAGCATCCCTTTGTATTACTTATCGCTTTTTCAAATGCCTGGACATATCAGCGAG
CAATTTGAAAAGTATATCAGAAAATTTTTTTGGGGAGATGGTTCAAATGGGTCTTTAAAGCATTTGGCTCGATGGGATTTAGCTTCTCGCCCTAGTGCAAATGGT
GGCTTGGGCATTGGTATGTTATTGAATAAGAACCATGCTCTTCTTGCGAAATGGGGTTGGAGGTTTAGTTTGGAAACTTCTGCGCTATGGCGCCGGGTTGTGGCC
AGCATTTATGAAACAGATTGTTTTGATTGGAACACTCTAGAAAAGAAGAATTTGGGGTGTCGTAGCCCTTGGAACAACATTCAAAAACAGTGGAAAAAGGTACAG
AAATTCTCTTCATTATCAGTAAAAAATGGCAAACGTACTTTGTTTTGGCATCACTTGTGGTTGGGCGAGTCTTTTTTAAAGGACAAGTTTCCACTTATGTTTCGG
GTTTCCTCGAACCCCTACCTTTCGATTCATGAAGCTTGGGATTCAACCACCTCCTCTTGGAGGATTGAAGTTAGAAGGTCGCTAAAAGATGGAGAATTTGCTGAA
TATATTGAATTATTGGACAATCTGAATTTGGTTTTGTTAAATGATGATGAAGACAAACTTTCTTGGAATCTGAATAAAAATGGGACTTTTCTGTCAGCTCCTTAT
GTAACTCTAAGGCTCCCTTTGCTACATTAA
Protein sequenceShow/hide protein sequence
MGSEQDLDKKEKISNFPYLKFGCIYSTKKQPSELNMTESLFSDDEEGLNISISSQDSPPFLDIHAECATDLAEFSLNGELTNLFDPPLFSPATQDKSVFCEDPPN
IEGEIQIPSFFDSVGIALSPETKLQQIDKETIKSLWSSKDVAWASIEAEGSEGRSGGILTLWDETKIQIREVLEGSHSVSISISFFNLKEIVITNIYGPTDYKSR
KHLWSELRNISGFRDKYWCMGGDFNITRWVSDRSSGGRITKGMKRFNSIIEELELMEIPLSNGKYTWSKMGNNSTHSLLDRFFVSQDWDTLFSNSSVNRVERITS
DHFPIVLDAGGFSLGPLSLQNEKQRKEAEDKIVAEISLLDIKDENFGLSLDEIERRGSLKTDLLNLYVREEQNLIQKSKVHWLKAGDENSNFFHRFLAARKRKAR
ISEIKDGNGVSLVNQREIEKEISSFFDALYCSEGISRFTPRDISWRPISCQDSSWLQRNFEEAEVWSALKNLGRNKAPGPDGFTIEFFIKFWQHWKGDFMRLFAE
FFRNGQLNACGNPKLEAFWFPIKDKILKKLDRWKRYQLSRGGRLTLCQAVLGSIPLYYLSLFQMPGHISEQFEKYIRKFFWGDGSNGSLKHLARWDLASRPSANG
GLGIGMLLNKNHALLAKWGWRFSLETSALWRRVVASIYETDCFDWNTLEKKNLGCRSPWNNIQKQWKKVQKFSSLSVKNGKRTLFWHHLWLGESFLKDKFPLMFR
VSSNPYLSIHEAWDSTTSSWRIEVRRSLKDGEFAEYIELLDNLNLVLLNDDEDKLSWNLNKNGTFLSAPYVTLRLPLLH