CuGenDBv2

Moc04g05950 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g05950
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr4:4054809..4060186
RNA-Seq Expression: Moc04g05950
Synteny: Moc04g05950
Gene Ontology terms:
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR000477 - Reverse transcriptase domain
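
The genome location above is written as chromosome:start..end. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the locus. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. Adjust if the database uses another convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    chrom, start, end, span = parse_location("chr4:4054809..4060186")
    print(chrom, start, end, span)
```

The span covers the whole gene model on the chromosome, so it need not equal the length of the CDS listed further down the page.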


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
CAN70431.1 hypothetical protein VITISV_030910 [Vitis vinifera] | e value: 7.4e-80 | % identity: 43.01
Query:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK--
        +SDYRPISL T++YKI+AKVL+ RL+ VL  TI+ +Q AFV+GR I D+ L+ANE++D  R + ++G++ K+D EKA+D +DW FL  VL  KGF+ K  
Subjt:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK--

Query:  ------------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLIL
                                                         + L  A   GL EGF +  + T +S L FADD +  S       +NL +IL
Subjt:  ------------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLIL

Query:  KSFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY
          F Q SGL INL KS+ISGIN   + L  LA ++ C +   P SYLG+PLGGNP++  FW  ++ER+ R+L+GW  +++S GGR+TL+QS LS +P+Y+
Subjt:  KSFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY

Query:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        LS+FK PA V+  +E+ +RNFLW G GE    HL+RW++ + PKELGGLG G+I+  N++LL KW
Subjt:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

KAA0045262.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.3e-81 | % identity: 48.5
Query:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-------
        D+RPISLTT+IYK++AK L++RLK  LP+TI+  Q AF++ RQITD+ LMANE +DYW+  K KG ILKLD+EKAFD L+W F+  VL +K F       
Subjt:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-------

Query:  --------------NSKCQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQASGLNINLNKSSISGINVSHDELHYL
                      N + Q    A N+GL +G  + +N  +ISH+LFADDILL    ++    NL + L  FE+ASGL INL+KS++  +NVS +     
Subjt:  --------------NSKCQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQASGLNINLNKSSISGINVSHDELHYL

Query:  AGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGF
        A  W     +LP +YLG+PLGGNP+S  FW  I +++ +KL  W Y+ ISKGGRLTL+++ LSSLP Y LS+F+ P+   K +E+  RNFLW+G      
Subjt:  AGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGF

Query:  SHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        SHLI W   T PKE GGLGI R+   N +LLTKW
Subjt:  SHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.1e-80 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YK++AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE ID W+  K KG +LKLDLEKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW+  IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.5e-80 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YKI+AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE ID W+  K KG +LKLD+EKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW   IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.5e-80 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YKI+AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE+ID W+  K KG +LKLD+EKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW   IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein | e value: 1.2e-80 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YKI+AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE ID W+  K KG +LKLD+EKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW   IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein | e value: 5.5e-81 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YK++AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE ID W+  K KG +LKLDLEKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW+  IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein | e value: 7.2e-81 | % identity: 46.43
Query:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---
        SDYRPISLTT++YKI+AK LA+RLK+ LP+TIA  Q AF++GRQI D+ L+ANE+ID W+  K KG +LKLD+EKAFDK+ W F+  +L+ K F  K   
Subjt:  SDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK---

Query:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK
                                                        + L    +KG I+G    +N  +ISHLLFADD+L+    +ER   NL + L 
Subjt:  -----------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILK

Query:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL
         FE+ASGL  N +KS+IS IN+S      +A  +    + LP +YLG+PLGGNPRS SFW   IE + +KL GW YS ISKGGRLTL+++ LSSLPTY L
Subjt:  SFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYL

Query:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        S FK P  V K +E+  R+FLW G  +K  +HLI W+I T PKELGGLGI ++   N +LL KW
Subjt:  SIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein | e value: 6.5e-82 | % identity: 48.5
Query:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-------
        D+RPISLTT+IYK++AK L++RLK  LP+TI+  Q AF++ RQITD+ LMANE +DYW+  K KG ILKLD+EKAFD L+W F+  VL +K F       
Subjt:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-------

Query:  --------------NSKCQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQASGLNINLNKSSISGINVSHDELHYL
                      N + Q    A N+GL +G  + +N  +ISH+LFADDILL    ++    NL + L  FE+ASGL INL+KS++  +NVS +     
Subjt:  --------------NSKCQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQASGLNINLNKSSISGINVSHDELHYL

Query:  AGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGF
        A  W     +LP +YLG+PLGGNP+S  FW  I +++ +KL  W Y+ ISKGGRLTL+++ LSSLP Y LS+F+ P+   K +E+  RNFLW+G      
Subjt:  AGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGF

Query:  SHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        SHLI W   T PKE GGLGI R+   N +LLTKW
Subjt:  SHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

A5BQI4 Reverse transcriptase domain-containing protein | e value: 3.6e-80 | % identity: 43.01
Query:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK--
        +SDYRPISL T++YKI+AKVL+ RL+ VL  TI+ +Q AFV+GR I D+ L+ANE++D  R + ++G++ K+D EKA+D +DW FL  VL  KGF+ K  
Subjt:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSK--

Query:  ------------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLIL
                                                         + L  A   GL EGF +  + T +S L FADD +  S       +NL +IL
Subjt:  ------------------------------------------------CQTLDAAYNKGLIEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLIL

Query:  KSFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY
          F Q SGL INL KS+ISGIN   + L  LA ++ C +   P SYLG+PLGGNP++  FW  ++ER+ R+L+GW  +++S GGR+TL+QS LS +P+Y+
Subjt:  KSFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY

Query:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW
        LS+FK PA V+  +E+ +RNFLW G GE    HL+RW++ + PKELGGLG G+I+  N++LL KW
Subjt:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKW

SwissProt top hits (columns: hit, e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 2.0e-11 | % identity: 23.68
Query:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-CQ
        ++RPISL     KIL K+LA+R++  +   I   Q  F+ G Q   +   +  +I +   AK K  VI+ +D EKAFDK+   F+LK L+  G +    +
Subjt:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-CQ

Query:  TLDAAYNKGL---------IEGFRIRD---NGTHISHL--------------------------------LFADDILLLSSPDERKFRNLHLILKSFEQA
         + A Y+K           +E F ++     G  +S L                                LFADD+++         +NL  ++ +F + 
Subjt:  TLDAAYNKGL---------IEGFRIRD---NGTHISHL--------------------------------LFADDILLLSSPDERKFRNLHLILKSFEQA

Query:  SGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSI-
        SG  IN+ KS     N +      + G     I +    YLG+ L  + +   +  +  +++ +      W     S  GR+ +V+  +     Y  +  
Subjt:  SGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSI-

Query:  -FKPPAHVSKILERKRRNFLW
          K P      LE+    F+W
Subjt:  -FKPPAHVSKILERKRRNFLW

P08548 LINE-1 reverse transcriptase homolog | e value: 2.4e-09 | % identity: 21.98
Query:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-CQ
        +YRPISL     KIL K+L +R++  +   I   Q  F+ G Q   +   +  +I +    K K  +IL +D EKAFD +   F+++ L   G      +
Subjt:  DYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-CQ

Query:  TLDAAYNKG----LIEGFRIRD--------NGTHISHLLF--ADDILLLSSPDERKFRNLHL------------------------------ILKSFEQA
         ++A Y+K     ++ G +++          G  +S LLF    ++L ++  +E+  + +H+                              ++K +   
Subjt:  TLDAAYNKG----LIEGFRIRD--------NGTHISHLLF--ADDILLLSSPDERKFRNLHL------------------------------ILKSFEQA

Query:  SGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPS--SYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLS
        SG  IN +K S++ I  ++++        S P   +P    YLG+ L  + +   +  +  + + +   +  W     S  GR+ +V+  +     Y  +
Subjt:  SGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPS--SYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLS

Query:  I--FKPPAHVSKILERKRRNFLW
            K P    K LE+   +F+W
Subjt:  I--FKPPAHVSKILERKRRNFLW

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 7.8e-16 | % identity: 34.48
Query:  MPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGG
        MP+     ++  +  I+ERV  ++ GW    +S  GRLTL ++VLSS+P + +S    P  +   L++  R FLW    EK   HL++W     PK+ GG
Subjt:  MPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGG

Query:  LGIGRINTINMSLLTK
        LG+    ++N +L++K
Subjt:  LGIGRINTINMSLLTK

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 2.4e-09 | % identity: 23.38
Query:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-
        + ++RPISL     KIL K+LA+R++  +   I   Q  F+ G Q   +   +  +I Y    K K  +I+ LD EKAFDK+   F++KVL   G     
Subjt:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKG-VILKLDLEKAFDKLDWHFLLKVLSLKGFNSK-

Query:  CQTLDAAYNKGL----------------------------------------------IEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKS
           + A Y+K +                                              I+G +I      IS  L ADD+++  S  +   R L  ++ S
Subjt:  CQTLDAAYNKGL----------------------------------------------IEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKS

Query:  FEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY
        F +  G  IN NKS       +      +       I      YLG+ L    +   +  + ++ + +   L  W     S  GR+ +V+  +     Y 
Subjt:  FEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRS--ESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYY

Query:  LSI--FKPPAHVSKILERKRRNFLW
         +    K P      LE     F+W
Subjt:  LSI--FKPPAHVSKILERKRRNFLW

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 5.0e-15 | % identity: 23.93
Query:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-----
        + ++RP+SL +T YKI+AK ++ RLK+VL   I   QS  V GR I D+  +  +++ + R        L LD EKAFD++D  +L+  L    F     
Subjt:  MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGF-----

Query:  --------NSKC-------QTLDAAYNKGL-------------------------IEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQ
                +++C        T   A+ +G+                         + G  +++    +    +ADD++L++  D           + +  
Subjt:  --------NSKC-------QTLDAAYNKGL-------------------------IEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQ

Query:  ASGLNINLNKSSISGINVSHDELHYLAGIW-SCPIQALPSSYLGMPLGGN--PRSESFWAAIIERVDRKLEGWN--YSHISKGGRLTLVQSVLSSLPTYY
        AS   IN +KS  SG+     ++ +L   +     ++    YLG+ L     P S++F   + E V  +L  W      +S  GR  ++  +++S   Y 
Subjt:  ASGLNINLNKSSISGINVSHDELHYLAGIW-SCPIQALPSSYLGMPLGGN--PRSESFWAAIIERVDRKLEGWN--YSHISKGGRLTLVQSVLSSLPTYY

Query:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGI
        L    P       ++R+  +FLW G       H +   +++LP + GG G+
Subjt:  LSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGI

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.7e-16 | % identity: 29.49
Query:  ALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDIT
        ALP  YLG+PL     + S +  ++E++  ++  W   H+S  GRL L+ SV+ SL  +++S F+ P+   K ++    +FLW G         + W   
Subjt:  ALPSSYLGMPLGGNPRSESFWAAIIERVDRKLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDIT

Query:  TLPKELGGLGIGRINTINMSLLTKWK-RGEIFLGTYIPEGLYSPEKKSSGILCHPI
          PK+ GGLGI  +   N    + W   G   LG+++ + +      +SG + H I
Subjt:  TLPKELGGLGIGRINTINMSLLTKWK-RGEIFLGTYIPEGLYSPEKKSSGILCHPI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 2.8e-08 | % identity: 40.51
Query:  LADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGV----ILKLDLEKAFDKLDWHFLLKVLSLKGF
        + +RLK ++ N I  AQ++F+ GR  TD+ +   E +   R  +KKGV    +LKLDLEKA+D++ W +L   L   GF
Subjt:  LADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGV----ILKLDLEKAFDKLDWHFLLKVLSLKGF
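
Each alignment block above has a Query row, an unlabeled middle row, and a Subjt row. Assuming the middle row follows the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), the sketch below tallies the columns of one block. Since the Subjt rows on this page repeat the Query text, it reads only the Query and middle rows. The function name `midline_stats` is hypothetical, and the counts for a single displayed fragment are not expected to reproduce the % identity values listed for each hit.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Classify aligned columns from a query row and its midline row.

    Assumed convention (BLAST-style): letter = identical residue,
    '+' = conservative substitution, ' ' = mismatch. A '-' in the
    query row is counted as a gap column.
    """
    # Trailing spaces of the midline are often trimmed; pad them back.
    midline = midline.ljust(len(query))
    counts = {"identical": 0, "positive": 0, "mismatch": 0, "gap": 0}
    for q_char, m_char in zip(query, midline):
        if q_char == "-":
            counts["gap"] += 1
        elif m_char == "+":
            counts["positive"] += 1
        elif m_char == " ":
            counts["mismatch"] += 1
        else:
            counts["identical"] += 1
    return counts


if __name__ == "__main__":
    # First 29 columns of the first GenBank alignment on this page.
    query = "MSDYRPISLTTTIYKILAKVLADRLKTVL"
    midline = "+SDYRPISL T++YKI+AKVL+ RL+ VL"
    print(midline_stats(query, midline))
```

Counting over the query row only means gap columns on the subject side are reported as mismatches, which is a limitation of working from this display.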


Sequences
CDS sequence
ATGTCTGATTACCGACCGATCAGCTTGACAACCACAATATATAAAATATTAGCGAAAGTCCTTGCAGACCGTCTCAAAACTGTGCTACCAAACACCATTGCATCCGCTCA
GTCTGCATTTGTCCAAGGCCGTCAAATCACTGACTCTACACTAATGGCTAATGAGATGATCGATTATTGGCGTTGTGCTAAAAAGAAGGGTGTTATCCTCAAACTAGATC
TTGAAAAAGCCTTTGACAAGCTTGATTGGCATTTCCTTCTTAAGGTCCTATCCCTTAAGGGATTCAATAGCAAATGCCAAACGCTTGATGCCGCCTATAATAAAGGATTG
ATCGAAGGTTTTCGCATTCGGGACAATGGAACTCACATTTCGCATTTATTATTTGCCGATGACATTTTGCTCCTTTCTAGTCCTGATGAGCGAAAGTTCCGAAACCTACA
TCTTATACTGAAGTCTTTTGAACAAGCTTCTGGGCTCAATATTAATCTTAACAAATCCTCCATATCTGGGATAAATGTCTCTCATGATGAACTTCACTACTTGGCTGGTA
TATGGAGTTGCCCCATTCAAGCTTTACCTTCCTCTTATCTGGGCATGCCTCTCGGTGGTAATCCAAGATCAGAATCTTTTTGGGCCGCTATAATTGAAAGAGTTGATAGG
AAGCTTGAGGGATGGAATTACTCACACATTTCCAAAGGCGGTCGTCTCACTTTGGTGCAGTCGGTTTTGAGTAGCCTTCCGACTTATTACCTCTCGATTTTCAAACCTCC
AGCCCACGTATCAAAAATTTTGGAGCGCAAAAGGAGAAATTTCCTTTGGGAAGGCATGGGTGAAAAAGGTTTTTCTCATCTCATTCGGTGGGATATCACAACCCTCCCCA
AAGAGCTTGGAGGATTGGGCATTGGCCGGATCAATACCATTAATATGTCCCTCCTTACTAAATGGAAGCGTGGCGAAATTTTTCTTGGGACATATATCCCCGAAGGCCTC
TATTCACCCGAGAAGAAGAGCTCTGGAATTCTCTGTCATCCAATTTTGCTACCCCAGGTGACGACCGCCATATGGAATCTATCCGACAATGGCATCTTTTCGCTTGCCGA
AGAAGTGCAAATTCTTCATCTGGTGTGTTATTCACTGAGGGATATCAATACTCATGAGAAGCTCCAAGCTAGAATGCAGAATATGTATCTTAATCCTAGCATTTGCCACC
TTTGCCGCTGTGATTCTGAAACCTTGGAACTTTGGCACCGTCTTTTTGCTGCGTTCCATCTAATCCTGCCTCTCCAAGAACATATAGAAGCCTTCATTACTGAAGCCTTC
TTCTATCCCTGTAGCAGTAAACGAAATACTCTTTGGTGTAATGCGGTGGGCTCGATCCTTTGGCTCGTCTGGTTGGAGCGCACAGCAGGTGCTTCACGGATTCCTCAAAG
TCAGGCCTCTATTTTGTGGGATGATATTATGTGCGCCATGGCGTTGATTGGGGTAACTTCGATGAAATTTTGGATCCATCTGAGAAACTTGGTGGAGGAACGCTCTGATA
TTGCTGGTTTTTGTAATGCTCTAGATGATTGTAGTCTTCTCAATTTAGTTGATCACATGGATTATGGCCGGTGGCCCTCTTTGTCGCTCAACAGTTGTGGCTACAAGGCT
AAGATATGCGACGTTACATCTCGTGTTAAGATGACTATTGCAAACCTCCATTATCAATCAACTAGGGATGATCCCTTTGCAGTGGAGGCCGACCTGGAAAAGGTCTTACT
GGAAGAGGAGGAGTATTGGAAGCAGTTGGAAGCAGCGGTCGAGAGATTACCGGTTAAAATGGAGTGA
mRNA sequence
ATGTCTGATTACCGACCGATCAGCTTGACAACCACAATATATAAAATATTAGCGAAAGTCCTTGCAGACCGTCTCAAAACTGTGCTACCAAACACCATTGCATCCGCTCA
GTCTGCATTTGTCCAAGGCCGTCAAATCACTGACTCTACACTAATGGCTAATGAGATGATCGATTATTGGCGTTGTGCTAAAAAGAAGGGTGTTATCCTCAAACTAGATC
TTGAAAAAGCCTTTGACAAGCTTGATTGGCATTTCCTTCTTAAGGTCCTATCCCTTAAGGGATTCAATAGCAAATGCCAAACGCTTGATGCCGCCTATAATAAAGGATTG
ATCGAAGGTTTTCGCATTCGGGACAATGGAACTCACATTTCGCATTTATTATTTGCCGATGACATTTTGCTCCTTTCTAGTCCTGATGAGCGAAAGTTCCGAAACCTACA
TCTTATACTGAAGTCTTTTGAACAAGCTTCTGGGCTCAATATTAATCTTAACAAATCCTCCATATCTGGGATAAATGTCTCTCATGATGAACTTCACTACTTGGCTGGTA
TATGGAGTTGCCCCATTCAAGCTTTACCTTCCTCTTATCTGGGCATGCCTCTCGGTGGTAATCCAAGATCAGAATCTTTTTGGGCCGCTATAATTGAAAGAGTTGATAGG
AAGCTTGAGGGATGGAATTACTCACACATTTCCAAAGGCGGTCGTCTCACTTTGGTGCAGTCGGTTTTGAGTAGCCTTCCGACTTATTACCTCTCGATTTTCAAACCTCC
AGCCCACGTATCAAAAATTTTGGAGCGCAAAAGGAGAAATTTCCTTTGGGAAGGCATGGGTGAAAAAGGTTTTTCTCATCTCATTCGGTGGGATATCACAACCCTCCCCA
AAGAGCTTGGAGGATTGGGCATTGGCCGGATCAATACCATTAATATGTCCCTCCTTACTAAATGGAAGCGTGGCGAAATTTTTCTTGGGACATATATCCCCGAAGGCCTC
TATTCACCCGAGAAGAAGAGCTCTGGAATTCTCTGTCATCCAATTTTGCTACCCCAGGTGACGACCGCCATATGGAATCTATCCGACAATGGCATCTTTTCGCTTGCCGA
AGAAGTGCAAATTCTTCATCTGGTGTGTTATTCACTGAGGGATATCAATACTCATGAGAAGCTCCAAGCTAGAATGCAGAATATGTATCTTAATCCTAGCATTTGCCACC
TTTGCCGCTGTGATTCTGAAACCTTGGAACTTTGGCACCGTCTTTTTGCTGCGTTCCATCTAATCCTGCCTCTCCAAGAACATATAGAAGCCTTCATTACTGAAGCCTTC
TTCTATCCCTGTAGCAGTAAACGAAATACTCTTTGGTGTAATGCGGTGGGCTCGATCCTTTGGCTCGTCTGGTTGGAGCGCACAGCAGGTGCTTCACGGATTCCTCAAAG
TCAGGCCTCTATTTTGTGGGATGATATTATGTGCGCCATGGCGTTGATTGGGGTAACTTCGATGAAATTTTGGATCCATCTGAGAAACTTGGTGGAGGAACGCTCTGATA
TTGCTGGTTTTTGTAATGCTCTAGATGATTGTAGTCTTCTCAATTTAGTTGATCACATGGATTATGGCCGGTGGCCCTCTTTGTCGCTCAACAGTTGTGGCTACAAGGCT
AAGATATGCGACGTTACATCTCGTGTTAAGATGACTATTGCAAACCTCCATTATCAATCAACTAGGGATGATCCCTTTGCAGTGGAGGCCGACCTGGAAAAGGTCTTACT
GGAAGAGGAGGAGTATTGGAAGCAGTTGGAAGCAGCGGTCGAGAGATTACCGGTTAAAATGGAGTGA
Protein sequence
MSDYRPISLTTTIYKILAKVLADRLKTVLPNTIASAQSAFVQGRQITDSTLMANEMIDYWRCAKKKGVILKLDLEKAFDKLDWHFLLKVLSLKGFNSKCQTLDAAYNKGL
IEGFRIRDNGTHISHLLFADDILLLSSPDERKFRNLHLILKSFEQASGLNINLNKSSISGINVSHDELHYLAGIWSCPIQALPSSYLGMPLGGNPRSESFWAAIIERVDR
KLEGWNYSHISKGGRLTLVQSVLSSLPTYYLSIFKPPAHVSKILERKRRNFLWEGMGEKGFSHLIRWDITTLPKELGGLGIGRINTINMSLLTKWKRGEIFLGTYIPEGL
YSPEKKSSGILCHPILLPQVTTAIWNLSDNGIFSLAEEVQILHLVCYSLRDINTHEKLQARMQNMYLNPSICHLCRCDSETLELWHRLFAAFHLILPLQEHIEAFITEAF
FYPCSSKRNTLWCNAVGSILWLVWLERTAGASRIPQSQASILWDDIMCAMALIGVTSMKFWIHLRNLVEERSDIAGFCNALDDCSLLNLVDHMDYGRWPSLSLNSCGYKA
KICDVTSRVKMTIANLHYQSTRDDPFAVEADLEKVLLEEEEYWKQLEAAVERLPVKME
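
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical and are not part of any CuGenDB tool.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block can be pasted in directly. Ambiguous bases such
    as N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First eight codons and last five codons of the CDS listed above.
    print(translate("ATGTCTGATTACCGACCGATCAGC"))
    print(translate("GTTAAAATGGAGTGA"))
```

Applying `translate` to the full CDS and comparing the result with the protein sequence block is a quick consistency check on a downloaded record.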