; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035169 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035169
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:16048246..16054776
RNA-Seq ExpressionLag0035169
SyntenyLag0035169
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.8e-12534.45Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        + FS  WI  IMSCI+T  +S+LIN +P GL KP RGLRQG PLSPYLF+ CAE FS LL++ E    I G +  +   ++TH+ FADDSL+F K    D
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
         + L  +   Y + SGQ  N EKS+   S   + + +S  + I  +K       YL +P   GRNK   F+ VK +V   +  W  KLFS GGKE+LIKA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        VAQA+P Y MS F+LP  +C+DI +  ARFWWG  K K   HW  W  M K K  GGLGFR L     A++AK  WRLV+ PNSL+ + ++ RY+K+  F
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
          A +G  P   WRSI WG  + K+G RWR+G+G+ + +  D WI R    +P++  +      V  L+D +N+W+ D + Q+F   D+E IL I   S 
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
        + +DE++W+ D KG +S+KS Y LA++  +++      +N  +  WK  W      + KI +W+ +++ILPT  N+ K+     P+C  C  ++E+ +H+
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNK-VANFQNTGAEALYRSIDLSIKEIEAAYLKSRPSGRI
        + ECK +RK W    P            +       W + +  E E+   +V  W IW+ RNK +   + + +  L    D  +K  +         G  
Subjt:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNK-VANFQNTGAEALYRSIDLSIKEIEAAYLKSRPSGRI

Query:  GNHSSQAPKHNPSPISWQLKSDASWNVYLGCGGVGWMM--LEAKAMLDGLKQIFDTFKRRSIAIEA--------------------QRDALEIINIVSEK
             Q     PS    +L  DA+ +      G+G ++   E K +  G+KQ    F+ R    EA                    + D  E++ +++  
Subjt:  GNHSSQAPKHNPSPISWQLKSDASWNVYLGCGGVGWMM--LEAKAMLDGLKQIFDTFKRRSIAIEA--------------------QRDALEIINIVSEK

Query:  TEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSS
            +E+  +   ++  + + + V FS   R  NT AH +A+ A+  SS
Subjt:  TEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]8.8e-12841.09Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F E WI+ +M CI++VSYSIL+N    G   P+RGLRQGDP+SPY+FL CA+GFS+LL+       I+G  I + CP +TH+FFADDSL+F K + ++
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
         Q+L+ +L+ YE+ SGQ IN++KS+   S N  ++      R+L   + T    YL +P+  G++K E F  VK+RVE+ L GWKEKL S GG+E+LIKA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        VAQAIPTYTMSCF++P ++C++I+ M  RFWWG+   + K  W+SW+K+CK K+ GG+GFR     +LAMLAK  WRL+ NPNSL+ +  + RY+   + 
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGY-RVDHLLD-DDNQWKEDLIRQNFHSPDVEDILNIPTG
         +A LG +P  TWRSI  G ++ +RG RWRVGNG  I I  D W+      + ++       Y RV  L+D +  +WK+D++R  F   +   IL+IP  
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGY-RVDHLLD-DDNQWKEDLIRQNFHSPDVEDILNIPTG

Query:  SKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHY-VESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLEST
            +D+IIW  + KG FS+KSAY++A+ +I++  V   S  +  +  W+ +W     P+ +I  WK+  + LPT  N+++KGV+I  +C  CG + ES 
Subjt:  SKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHY-VESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLEST

Query:  THLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV
         H+  +C+ +++ W  ++    +L ++     +  D    + D     ++    V+ W IW  RNK+
Subjt:  THLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV

XP_024156142.1 uncharacterized protein LOC112164137 [Rosa chinensis]1.3e-12333.02Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F + WI  IM C++TVSYS L+N +P+G   P+RGLRQGD +SPYLFL CAEG S +LS EE  H + G  I    PS+ H+FFADDS +F+K  R++
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
           +  +LKWYE+ SGQ +N +KS    SKN++         +  +++      YL +P +   +K EAF+ + ++ +  ++ WK+K  S  GKEV+IK+
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q++PTY MSCF LP  +C ++ R  A FWWG+++  RK HW++W KMC  KE GGLGFR++     A+LAK  WR++++P+SLL KTL+ +YF + +F
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLA-VHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIPTG
        + A + +    TWRS+  G+ L ++G R++VG G  IS+  DPWI R  + RP + V   L    V  L+D D++ W  D + + F + +V+ I  IP  
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLA-VHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIPTG

Query:  SKEAKDEIIWNLDTKGNFSIKSAYHLA---MDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLE
         +  +D +IW+ D +G +S+KS YH+A     L  H   S S  +K+   W+ +W  +  P+ +  VW+++++I+PTK N+ ++      +C  C  + E
Subjt:  SKEAKDEIIWNLDTKGNFSIKSAYHLA---MDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLE

Query:  STTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRP
        +T H+  EC      W   +  +  L +      + K+W   + D L + ++    +++W IW+ RNK+    N G      ++  S+  + + Y +  P
Subjt:  STTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRP

Query:  SGRIGNHSSQAPKHNPSPISWQLK--SDASWNVYLGCGGVGWMMLEAKAMLDGLKQ-----IFDTFKRRSIA---------------IEAQRDALEIINI
                  A      P   +LK   D ++    GCGG+G ++ +   +  G +      ++  F   + A               +E + D   +   
Subjt:  SGRIGNHSSQAPKHNPSPISWQLK--SDASWNVYLGCGGVGWMMLEAKAMLDGLKQ-----IFDTFKRRSIA---------------IEAQRDALEIINI

Query:  VSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA
        ++++ ED SEV  + D  K        +   H  R  N+ A+ +A  A
Subjt:  VSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]3.5e-12433.02Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F + WI+ IM C++TVSYS L+N +P+G   P+RGLRQGD +SPYLFL CAEG S +LS EE  H + G  I    PS+ H+FFADDS +F+K  R++
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
           +  +LKWYE+ SGQ +N +KS    SKN++         +  +++      YL +P +   +K EAF+ + ++    ++ WK+K  S  GKEV+IK+
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q++PTY MSCF LP  +C ++ R  A FWWG+++  RK HW++W KMC  KE GGLGFR++     A+LAK  WR++++P+SLL KTL+ +YF + +F
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLA-VHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIPTG
        + A + +    TWRS+  G+ L ++G R++VG+G  IS+  DPWI R  + RP + V   L    V  L+D D++ W  D + + F + +V+ I  IP  
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLA-VHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIPTG

Query:  SKEAKDEIIWNLDTKGNFSIKSAYHLA---MDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLE
         +  +D +IW+ D +G +S+KS YH+A     L  H   S S  +K+   W+ +W  +  P+ +  VW+++++I+PTK N+ ++      +C  C  + E
Subjt:  SKEAKDEIIWNLDTKGNFSIKSAYHLA---MDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLE

Query:  STTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRP
        +T H+  EC      W   +  +  L +      + K+W   + D L + ++    +++W IW+ RNK+    N G      ++  S+  + + Y +  P
Subjt:  STTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRP

Query:  SGRIGNHSSQAPKHNPSPISWQLK--SDASWNVYLGCGGVGWMMLEAKAMLDGLKQ-----IFDTFKRRSIA---------------IEAQRDALEIINI
                  A      P   +LK   D ++    GCGG+G ++ +   +  G +      +   F   + A               +E + D   +   
Subjt:  SGRIGNHSSQAPKHNPSPISWQLK--SDASWNVYLGCGGVGWMMLEAKAMLDGLKQ-----IFDTFKRRSIA---------------IEAQRDALEIINI

Query:  VSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA
        ++++ ED SEV  + D  K        +   H  R  N+ A+ +A  A
Subjt:  VSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]4.1e-12533.68Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F   WI  IMSC+S+ S+S ++N +  G  KP+RGLRQGDPLSPYLFL C+EG S LL  EES  ++ G ++ ++ PS++H+ FADDSL+F +     
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
          +L RVL+ Y   SGQ +N  KS    S N ++       R L +  +     YL +PA + R+K E F  VK+R+ + L  W +KLFS GGKEVL+KA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q+IPTY MSCFRLP++ C  ++ M A FWWG  K   K HW SW+ +CK K  GG+GFR     + A+LAK +WR+++ PNSLL + L+ RYF + NF
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
        LEA LG +P LTW+ I WGR+L   G R+++GNG  +S   D WI    + +P++ +       V H + D  +W   L+ Q F S DV+ I+ IP    
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
        +  D +IW+  T G +++ S +HLA +L E+  +    ++  + +WK+ W  +   + KI  W+++Q+ LP    ++++ V  +  C +C    ES  H 
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGW--VRFI---PKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV------------ANFQNTGAEALYRSIDLSI
        ++ C T+RK W   +F      T+N+++         D+   L     ++++   +  MW IW+ RNKV            A+F +T      R+   + 
Subjt:  IWECKTSRKGW--VRFI---PKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV------------ANFQNTGAEALYRSIDLSI

Query:  KEIE--AAYLKSRPSGRIGNHSSQAPKHNPSPISW--------QLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQI
        ++     A+  S  S      +S + + + S  SW        ++  DA+ N      GVG ++                       +EAKA+   L   
Subjt:  KEIE--AAYLKSRPSGRIGNHSSQAPKHNPSPISW--------QLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQI

Query:  FDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSSVV
            + +      + DAL + + ++    +LS    L   +  + S    V  SH  R  N  AH +A+ A++L   V
Subjt:  FDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSSVV

TrEMBL top hitse value%identityAlignment
A0A2N9GI95 Reverse transcriptase domain-containing protein3.7e-12435.06Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M FS  W+  +M CISTVSYSIL+N +P G  KPSRGLRQGDPLSPYLFL CAEGF +LL +E+    + G  I++  P +TH+FFADDSL+F K    D
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
        ++ +  +L  YE  SGQ IN +K+    SK+  +   +  + +L +        YL +P+  GR K  +F ++K+RV   L+GWKEKL SQ G+E+LIK+
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        VAQAIP Y MSCFRLPN +  +I+ +  RFWWG++  K K HW+SW  +CK K +GG+GFR L     A+LAK  WRL+ NP+SL +K  + +YF   + 
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGN---ARPLAVHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIP
        LEA+        W+SI   RDL K+G  WRVGNG  I I  D W+    N   + P    S+L    V HL+D  ++ WKE+LIR+ F   D   I+ IP
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGN---ARPLAVHSHLRGYRVDHLLDDDNQ-WKEDLIRQNFHSPDVEDILNIP

Query:  TGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLES
          S    D ++W     G ++++S YHL +           D  +    W SIW  +  P+T+ C+W+   + LPT+ N+  + +  +P C  C  ++E+
Subjt:  TGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLES

Query:  TTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRPS
        T H +W CKT +  W + +P    L  +  A +   D W      L   E+    ++ W IW +RN+V   Q T  + + R +  + +E+   +   +  
Subjt:  TTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRPS

Query:  GRIGNHSSQAPKHNPSPISWQLKSDASWNV-YLGC-------GGVGWMMLEAKAMLDG-----------LKQIFDTFKRRSI---------AIEAQRDAL
          +    S+A     + I WQ  ++  + V Y G         G+G ++  A+  + G           ++ +  +  R +I          IE + D+ 
Subjt:  GRIGNHSSQAPKHNPSPISWQLKSDASWNV-YLGC-------GGVGWMMLEAKAMLDG-----------LKQIFDTFKRRSI---------AIEAQRDAL

Query:  EIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA
         +++ +       +    + + IK IA  + +V F H  R  N  AH +A+ A
Subjt:  EIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSA

A0A803PQ30 Uncharacterized protein1.4e-12333.98Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F+E WI  IMSC++T S+S LIN +  G   PSRGLRQG PLSPYLFL C+EGFS LL  ++   N+ GF++ ++ P +TH+FFADDSL+F +   + 
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
          ++  VL+ Y + SGQ +NL KS    S N          + L++        YL +P+ +GR+K E F  +K+++ K +  W EK F  GGKEVL+KA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q+IPTY MSCFRLP   C  ++ M   FWWG  +   K HW SW  +CK K  GG+GFR     + A+LAK +WR+  NP+SLL + L+ RYF +  F
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
        LEA LG +P LTW+ I W R+L  +G RW+VG+GR I  + DPWI       P        G  V +L+ D+ QW   ++ Q F   DV+ IL IP    
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
          +D +IW+  + G +++ S YHL  DL    ++  S +   + +WK  W  +  P+ KI  WK + D LP    ++K+ V  +  C +C +  ES  H 
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPK-DWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQN-------TGAEALYRSIDLSIKEIEAAYLK
        ++ C+ +R  W      +   F  R A    K D+   L   L + E+ +    +W IWN RN+  + Q            A Y S   + ++       
Subjt:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPK-DWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQN-------TGAEALYRSIDLSIKEIEAAYLK

Query:  SRPSGRIGNHSSQA-----PKHNPSPI----------------SWQLKSDASWNVYLGCGGVGWMMLEAKAMLDGL--KQIFDTFKRRSI----------
        +  +     H + A     P+ NP P                  ++L  DA+ +   G  GVG ++ ++  ++     KQ+   F+   +          
Subjt:  SRPSGRIGNHSSQA-----PKHNPSPI----------------SWQLKSDASWNVYLGCGGVGWMMLEAKAMLDGL--KQIFDTFKRRSI----------

Query:  -AIE-------AQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM
         AI+       A+ DAL ++N +      +S    L   I  + S + NV  SH     N  AH +A+ A+
Subjt:  -AIE-------AQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM

A0A803Q8E0 Uncharacterized protein2.6e-12533.42Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F   WI  IMSC+S+  +S ++N +  G  KP+RGLRQGDPLSPYLFL C+EG S LL  EES   + G ++ ++ PS++H+ FADDSL+F +     
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
          +L RVL+ Y   SGQ +N  KS    S N ++         L +  +     YL +PA + R+K E F  VK+R+ + L  W +KLFS GGKEVL+KA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q+IPTY MSCFRLP++ C  ++ M A FWWG  K   K HW SW+ +CK K  GG+GFR     + A+LAK +WR+ + PNSLL + L+ RYF + NF
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
        LEA LG +P L W+ I WGR+L   G R+++GNG  +S   D WI    + +P++ +       V H + D  +W   L+ Q F S DV+ I+ IP    
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
         + D +IW+  T G +++ S +HLA +L E+  +    ++  + +WK+ W      + KI  W+++Q+ LP    ++++ V  +  C +C    ES  H 
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGW--VRFI---PKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV------------ANFQNTGAEALYRSIDLSI
        ++ C T+RK W   +FI     T+N+++         D+  +L     ++++   +  MW IW+ RNKV            A+F +T     YR+   + 
Subjt:  IWECKTSRKGW--VRFI---PKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKV------------ANFQNTGAEALYRSIDLSI

Query:  KEIE--AAYLKSRPSGRIGNHSSQAPKHNPSPISW--------QLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQI
        ++      +  S  S  + N ++   +   S  SW        ++  DA+ N      GVG ++                       +EAKA+   L   
Subjt:  KEIE--AAYLKSRPSGRIGNHSSQAPKHNPSPISW--------QLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQI

Query:  FDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSSVV
            + +      + DAL + + ++  + +LS    L   +  + S    V  SH  R  N  AH +A+ A++L   V
Subjt:  FDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSSVV

A0A803QC75 Uncharacterized protein3.9e-12934.41Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F+E WI  IMSC++T ++S +IN +  G   PSRGLRQG PLSPYLFL C+EGFS LL  E+  +N+ GF++ ++ P +TH+FFADDSL+F + + + 
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
          ++ RVL  Y + SGQ +NL+KS    S N          + LS+        YL +P+ +GR+K E F  +K+R+ K +  W EK+FS GGKE+L+KA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q+IPTY MSCFRLP   C  ++ M A FWWG  +   + HW SW  +CK K  GG+GFR     + A+LAK +WR+ + P+SLL + L+ RYF + NF
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
        LEA LG +P LTW+ I W R+L  +G RW+VG+GR I   SDPWI       P        G  V +L+ D+ QW   L++Q F + DVE IL++P    
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
         ++D +IW+  + G +++KS YHLA D+    ++  S +N+++ +WK  W  +  P+ KI  W+ I D LP   +++K+ V  +  C +C +  ES  H 
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGWVRFIPKTYNL-FSIR-RAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVAN------FQNTGAEAL-----YRSIDLSIKEIE
        ++ CK ++  W     +  NL F  R  A     D+   L     + E+ +    +W IW  RN++ +       ++  + A+     YR+    ++  +
Subjt:  IWECKTSRKGWVRFIPKTYNL-FSIR-RAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVAN------FQNTGAEAL-----YRSIDLSIKEIE

Query:  AAYLKSRPSGRI-----GNHSSQAPK----------HNPSPISWQLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQ
             SRP   I        +S +PK            P   S++L  DA+ +V+    G+G ++                       +EA A+   L  
Subjt:  AAYLKSRPSGRI-----GNHSSQAPK----------HNPSPISWQLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQ

Query:  IFDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM
        +    ++  I++  + DAL ++N +      +S    L   +  + S + +V  +H  R  N  AHC+A+ A+
Subjt:  IFDTFKRRSIAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM

A0A803QEG9 Uncharacterized protein1.5e-12835.17Show/hide
Query:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD
        M F+  WI  IMSC++T  +S L+N +  G   P+RGLRQG PLSPYLFL CAEG S LL  E+   N+ GF++ +  P ++H+ FADDSL+F +     
Subjt:  MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKD

Query:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA
          ++ RVL  Y + SGQ IN +KS    S N          R LS+  +     YL +P+ +GR+K E F  +K+R+ K +Q W EKLFS GG+EVL+KA
Subjt:  LQSLVRVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKA

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF
        V Q+IPTY MSCFRLP   C+ ++ M A FWWG  +   K HW SW  +CK K  GG+GFR     + A+LAK +WR+V+ PNSLL   L+ +YF   +F
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNF

Query:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK
        LEA +G +P LTW+ I WGR+L   G RW++G GR +    DPW+    +  P+       G  V +L+ D+ QW   L++Q F   D+E IL +P    
Subjt:  LEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSK

Query:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL
          +D++IW+  + GNF+++SAYHLA  L    + S S +  +  +WK  W  +   + KI  W++I D LP   +++++ +  +  C IC +  EST H 
Subjt:  EAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHL

Query:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNT-GAEALYRSIDLSIKEIEAAYLK-----SR
        ++ CK ++  W  F    +N            D+  +L     + E+ +   IMW IW  RNKV + +N   A  L     + ++   AA  K     + 
Subjt:  IWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNKVANFQNT-GAEALYRSIDLSIKEIEAAYLK-----SR

Query:  PSGRIGNHSSQAPKHNPSPI----SW--------QLKSDASWNVYLGCGGVGWMMLEAKAMLDG--LKQIFDTFKRRSIAIEA-----------------
        P+       SQ P   P+      SW        +L +DA+        GVG ++ +A   + G     I   FK   +  +A                 
Subjt:  PSGRIGNHSSQAPKHNPSPI----SW--------QLKSDASWNVYLGCGGVGWMMLEAKAMLDG--LKQIFDTFKRRSIAIEA-----------------

Query:  -QRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDL
         + DAL ++N +       SE   L   + ++ S   NV  SH  R  N  AH +A+ A+ L
Subjt:  -QRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDL

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog8.3e-1224.28Show/hide
Query:  TWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKDLQSLV
        T+++ I +  S  + +I++N      F    G RQG PLSP LF    E  +  +  E++   I G  I      L+   FADD +++++  R     L+
Subjt:  TWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKDLQSLV

Query:  RVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERI---LSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIK--A
         V+K Y  VSG  IN  KS      N N+   +  + I   +  KK   LG+YL    +      E +  ++  + + +  WK    S  G+  ++K   
Subjt:  RVLKWYEEVSGQTINLEKSAFMASKNLNEDGVSGCERI---LSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIK--A

Query:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHLAMLAKLSWRLVKN
        + +AI  +     + P S   D++++   F W + K +     +S +          L   + +++ K +W   KN
Subjt:  VAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHLAMLAKLSWRLVKN

P0C2F6 Putative ribonuclease H protein At1g657503.8e-4128.71Show/hide
Query:  MPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKAVAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGG
        MP    R   + F  + +RV   + GW+EK  S  G+  L KAV  ++P ++MS   LP SI + +D++   F WG    K+K H + W K+C  K+ GG
Subjt:  MPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKAVAQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGG

Query:  LGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRY----FKDRNFLEAHLGRAPFLTWRSICWG-RDLFKRGFRWRVGNGRLISIESDPWINRKGNAR
        LG R     + A+++K+ WRL++  NSL    L+ +Y     +D  +L      +   TWRSI  G RD+   G  W  G+G+ I   +D W++ K    
Subjt:  LGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRY----FKDRNFLEAHLGRAPFLTWRSICWG-RDLFKRGFRWRVGNGRLISIESDPWINRKGNAR

Query:  PLAVHSHLRGYRVDHLLDDD-----NQW---KEDLIRQNFHSPDVED-ILNIPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEA
         L + +  R    D ++  D       W   K D    N    ++   +L++ TG   A+D + W     G FS++SAY       E     +      A
Subjt:  PLAVHSHLRGYRVDHLLDDD-----NQW---KEDLIRQNFHSPDVED-ILNIPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEA

Query:  GFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLE
         F+  +WK +   R K  +W +    + T+    ++ +  + +C +C   +ES  H++ +C      WVR +P+       R+ G+  K  + WL DNL 
Subjt:  GFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLE

Query:  E----EEI---TKGVVIMWQIWNYR
        +    E+I   T   VI+W  W +R
Subjt:  E----EEI---TKGVVIMWQIWNYR

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-1023.55Show/hide
Query:  WIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKDLQSLVR
        ++  I +  S    +I +N +         G RQG PLSPYLF    E  +  + +++    I G QI K    ++    ADD +++I + +   + L+ 
Subjt:  WIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKDLQSLVR

Query:  VLKWYEEVSGQTINLEKS-AFMASKNLNEDGVSGCERILSI--KKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIK--AV
        ++  + EV G  IN  KS AF+ +KN   +         SI       LG+ L    +   +K   F+ +K  +++ L+ WK+   S  G+  ++K   +
Subjt:  VLKWYEEVSGQTINLEKS-AFMASKNLNEDGVSGCERILSI--KKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIK--AV

Query:  AQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGG------LGFRHLAMLAKLSW
         +AI  +     ++P    ++++    +F W   K +        + + K K T G      L   + A++ K +W
Subjt:  AQAIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGG------LGFRHLAMLAKLSW

P92555 Uncharacterized mitochondrial protein AtMg012505.7e-1348.53Show/hide
Query:  LINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDS
        +IN  P+GL  PSRGLRQGDPLSPYLF+ C E  S L  R +    + G +++   P + H+ FADD+
Subjt:  LINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003102.8e-2841.45Show/hide
Query:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKE-TGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLE
        A+P Y MSCFRL   +C  +      FWW   + KRK  W++W+K+CK KE  GGLGFR L     A+LAK S+R++  P++LL + LR RYF   + +E
Subjt:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKE-TGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLE

Query:  AHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPL
          +G  P   WRSI  GR+L  RG    +G+G    +  D WI  +    PL
Subjt:  AHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.5e-1921.69Show/hide
Query:  LRGRYFKDRNFLEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQ---WKEDLIRQNFHS
        ++ RYFKD + L+A + +     W S+  G  L K+G R  +G+G+ I I  D  ++     RPL      +   +++L +       W +  I Q    
Subjt:  LRGRYFKDRNFLEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQ---WKEDLIRQNFHS

Query:  PDVEDILNIPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPL
         D   I  I     +  D+IIWN +T G ++++S Y L        + + +  +        IW     P+ K  +W+ +   L T   +  +G+ I+P 
Subjt:  PDVEDILNIPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGFWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPL

Query:  CCICGKKLESTTHLIWECKTSRKGW-VRFIPKTYNLFSIRRAGWNPKDWWCWLKD-NLEEEEITKGVVIMWQIWNYRNKVA--NFQNTGAEALYRSIDLS
        C  C ++ ES  H ++ C  +   W +       N         N  +   +++D  + +      V ++W+IW  RN V    F+ + ++ +  +   +
Subjt:  CCICGKKLESTTHLIWECKTSRKGW-VRFIPKTYNLFSIRRAGWNPKDWWCWLKD-NLEEEEITKGVVIMWQIWNYRNKVA--NFQNTGAEALYRSIDLS

Query:  IKEIEAAYL-KSRPSGRIGNHSSQAPKHNPSPISWQLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQIFDTFKRRS
           + A    K  PS       ++    NP     +   DA ++V       GW++                        E KA+L  L+Q   T+ R  
Subjt:  IKEIEAAYL-KSRPSGRIGNHSSQAPKHNPSPISWQLKSDASWNVYLGCGGVGWMM-----------------------LEAKAMLDGLKQIFDTFKRRS

Query:  IAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVAR
          +  + D   +IN+++  +   S    L D I   A+   ++ F    R  N  AH +A+
Subjt:  IAIEAQRDALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVAR

AT3G25270.1 Ribonuclease H-like superfamily protein9.7e-0823.71Show/hide
Query:  IWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHLIWECKTSRKGW-VRFIPKTYNLFSIRRAGWNPKDWWCWLKD----NLE
        IWK KT P+ K  +WK++   L T  N+ ++ +  +P C  C ++ E++ HL ++C  +++ W    IP       +R  G   +     L      N +
Subjt:  IWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHLIWECKTSRKGW-VRFIPKTYNLFSIRRAGWNPKDWWCWLKD----NLE

Query:  EEEITKGVVIMWQIWNYRN------KVANFQNTGAEALYRSIDLSIKEIEAAYLKSR----PSGRIGNHSSQAPKHNPSPISW-QLKSDASWNVYLGCGG
         +     + I+W++W  RN      K  ++QNT   A     D+   E    Y++S      S R    +    K    P +W +   D ++N       
Subjt:  EEEITKGVVIMWQIWNYRN------KVANFQNTGAEALYRSIDLSIKEIEAAYLKSR----PSGRIGNHSSQAPKHNPSPISW-QLKSDASWNVYLGCGG

Query:  VGWMMLEAKAMLDGLKQIFDTFKRRSIAIEAQ
         GW+M +   +  G  Q   +    S+  E Q
Subjt:  VGWMMLEAKAMLDGLKQIFDTFKRRSIAIEAQ

AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-4724.96Show/hide
Query:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLEA
        A+PTYTM+CF LP ++C  I  + A FWW   +  +  HW +W  +   K  GG+GF+     +LA+L K  WR++  P SL+ K  + RYF   + L A
Subjt:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFR-----HLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLEA

Query:  HLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVH--------SHLRGYRVDHLLDDD-NQWKEDLIRQNFHSPDVEDILN
         LG  P   W+SI   +++ ++G R  VGNG  I I    W++ K  +  L +         S     +V  L+D+   +W++D+I   F   + + I  
Subjt:  HLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPLAVH--------SHLRGYRVDHLLDDD-NQWKEDLIRQNFHSPDVEDILN

Query:  IPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAG-FWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKK
        +  G +   D   W+  + G++++KS Y +   +I      Q  +       ++ IWK++T P+ +  +WK + + LP    +  + +     C  C   
Subjt:  IPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAG-FWKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKK

Query:  LESTTHLIWECKTSRKGW-VRFIPKT----------YNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNK-VANFQNTGAEALYRSIDL
         E+  HL+++C  +R  W +  IP             NL+ +   G     W         E+       ++W++W  RN+ V   +   A+ + R  + 
Subjt:  LESTTHLIWECKTSRKGW-VRFIPKT----------YNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMWQIWNYRNK-VANFQNTGAEALYRSIDL

Query:  SIKE----IEAAYLKSRPSGRIGNHSSQAPKHNPSPISW-QLKSDASWNVYLGCGGVGWMMLEAKA--------MLDGLKQIFDT--------------F
         ++E     EA    ++P      + S   +  P P  W +  +DA+WN      G+GW++   K          L  LK + +               F
Subjt:  SIKE----IEAAYLKSRPSGRIGNHSSQAPKHNPSPISW-QLKSDASWNVYLGCGGVGWMMLEAKA--------MLDGLKQIFDT--------------F

Query:  KRRSIAIEAQRDAL-EIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM
        +   +  E+    L EI+N      E    +K     ++ + S    V F    R  NT A  VAR ++
Subjt:  KRRSIAIEAQRDAL-EIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAM

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.0e-2941.45Show/hide
Query:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKE-TGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLE
        A+P Y MSCFRL   +C  +      FWW   + KRK  W++W+K+CK KE  GGLGFR L     A+LAK S+R++  P++LL + LR RYF   + +E
Subjt:  AIPTYTMSCFRLPNSICDDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKE-TGGLGFRHL-----AMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLE

Query:  AHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPL
          +G  P   WRSI  GR+L  RG    +G+G    +  D WI  +    PL
Subjt:  AHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGRLISIESDPWINRKGNARPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.1e-1448.53Show/hide
Query:  LINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDS
        +IN  P+GL  PSRGLRQGDPLSPYLF+ C E  S L  R +    + G +++   P + H+ FADD+
Subjt:  LINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTCAGTGAAACGTGGATCAGAAAGATCATGAGTTGTATATCCACAGTTAGCTATTCCATCCTTATAAACGAAGATCCGAAAGGGCTTTTCAAACCTAGCAGAGG
ATTAAGACAAGGGGATCCTCTTTCTCCCTACCTCTTTCTTTTCTGTGCAGAAGGCTTCTCGACTTTACTAAGCAGGGAGGAATCTCATCATAACATTACAGGTTTCCAGA
TTAATAAATATTGTCCTTCCCTAACTCATATTTTCTTTGCAGATGACAGTCTTATTTTCATAAAAGAGCATAGAAAAGACCTTCAATCCCTAGTAAGGGTTCTGAAGTGG
TATGAGGAAGTTTCGGGGCAAACAATCAATCTGGAAAAATCAGCTTTTATGGCTAGTAAGAACTTGAATGAGGATGGGGTTTCAGGCTGTGAAAGAATCCTGAGCATTAA
AAAAACTACATCCCTAGGCATTTACTTACGAATGCCTGCCCAAACAGGCAGGAACAAAGGGGAGGCCTTCAGAAGGGTCAAAGATAGAGTGGAAAAAACCTTGCAAGGAT
GGAAAGAGAAGCTTTTCTCCCAAGGAGGGAAGGAAGTCCTTATCAAGGCCGTGGCGCAGGCGATCCCCACATACACCATGAGCTGTTTTAGACTCCCGAACAGCATTTGT
GACGACATTGATAGAATGTGTGCCAGATTCTGGTGGGGTGAAGCGAAAGGCAAAAGAAAAAGCCATTGGATGAGCTGGAGGAAAATGTGTAAAAGGAAAGAAACAGGTGG
GTTAGGGTTCAGACACCTTGCAATGCTCGCAAAGCTAAGTTGGAGGTTGGTAAAAAACCCTAACAGTCTGCTTTTTAAAACCCTCAGAGGGCGGTACTTCAAAGACCGAA
ATTTCCTAGAAGCCCATTTGGGTAGAGCTCCATTTCTAACCTGGCGTAGCATTTGTTGGGGGCGTGATCTGTTTAAAAGAGGCTTCAGATGGAGAGTTGGCAACGGGAGG
TTAATAAGTATCGAGTCCGATCCTTGGATTAATAGAAAGGGTAATGCGAGGCCTTTGGCTGTTCATAGCCACCTGAGAGGCTATAGAGTTGATCACCTGCTGGATGACGA
CAACCAATGGAAAGAGGATCTAATTAGACAAAACTTCCACAGCCCTGATGTGGAAGACATTCTAAATATCCCGACAGGAAGCAAAGAAGCCAAGGACGAAATTATATGGA
ACTTAGACACAAAAGGAAACTTCTCGATTAAGAGCGCATATCACCTGGCGATGGACCTCATAGAACATTATGTGGAATCTCAATCTGACAACAACAAAGAGGCTGGCTTT
TGGAAGAGTATTTGGAAAACAAAAACTCATCCCCGAACAAAGATCTGTGTTTGGAAGATTATCCAAGACATCCTCCCAACAAAGGCCAACATCATCAAGAAGGGAGTAGA
CATCAACCCTCTGTGTTGTATTTGTGGTAAAAAATTGGAATCTACCACCCACCTAATCTGGGAGTGTAAAACCTCTAGAAAGGGTTGGGTGAGATTTATTCCTAAAACTT
ATAATTTGTTTTCTATACGCAGGGCGGGCTGGAATCCTAAGGACTGGTGGTGTTGGCTGAAGGATAACTTGGAAGAAGAAGAAATCACAAAAGGTGTTGTCATAATGTGG
CAAATTTGGAATTACCGCAACAAAGTAGCCAATTTCCAGAACACAGGAGCTGAAGCTCTCTATCGAAGTATTGATCTCAGCATAAAAGAGATTGAAGCGGCTTACCTCAA
GAGTCGGCCTTCTGGCAGAATTGGGAACCATTCGAGTCAAGCTCCAAAGCATAATCCTTCCCCGATCAGTTGGCAGCTAAAATCCGACGCCTCCTGGAATGTTTATTTAG
GCTGTGGGGGTGTGGGTTGGATGATGTTGGAAGCAAAAGCTATGCTAGATGGGTTAAAGCAAATTTTCGATACCTTCAAACGTCGATCCATCGCCATTGAAGCTCAGAGG
GACGCCCTAGAAATCATCAACATCGTCTCCGAAAAAACAGAGGACCTTTCAGAGGTGAAATCTCTCACCGATCAAATCAAAGCCATCGCCTCAGATGTGCAAAATGTGGG
TTTTAGTCATTGTAGTAGAGTTTTGAACACAGAAGCGCACTGTGTTGCGAGATCGGCCATGGATCTTTCGTCTGTTGTAGGTGATCTCCATGGATGCAGATCTTCGCAGG
AAAAGGGGCAGGTAGCTAAAGCCTTAGATGCCCCAGAAATCTTACCCGAAAATCTTACATCAGAACAGAAAGCTGAAATGGATGAGAGTTATAAAGAGATAAAAGCTGCC
ATTAAATATGGGAGAGACTCACTGTCTTTAGAAATGGTTTTAGATGCTTTAAGATCCAAGGACATTGAGATTAAGATAGAAAAGAAAGATGTGACGCCCGAGCTAAAGGC
TAAAAATAAGCAACAAAATTCTGATAAAAAGGAAGAAGGAAATAATGCTAACATTCATGAAGGCTATGAATCAGTAGAATTATTGGTCGTAACTGATCAGAGATGTGGGG
AAGAATGGGCTTTATTGAACAAGATGGGGGCACTGTCCTTAAAACTGATAGATGGAACAGAGGGATATCTCACTGAAGTGAGATATATTCCAGACTTAAAGAGAAATTTA
ATATCTCTTGGTACCCTTGATAAACTAAGGAATTTGGAGGCGTTTCAGACTGAACCAGGTGGAACCGGGGCGGCCAGAGACAACAGGGACCGAATGGAGACGGAAGAGCT
CGACCCCCACAGACGGGCTGACCATATGGGTCGAGTCGACCCTCTAGTCTGTTCATCCTCTAGGGTCGATTTTTTATATTCTGCTCAGTTGTCCTCTTCAGCTCTGAGTA
CATCGGAGTGGTCCAAAATTGCTTATAACACCTTGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACTTCAGTGAAACGTGGATCAGAAAGATCATGAGTTGTATATCCACAGTTAGCTATTCCATCCTTATAAACGAAGATCCGAAAGGGCTTTTCAAACCTAGCAGAGG
ATTAAGACAAGGGGATCCTCTTTCTCCCTACCTCTTTCTTTTCTGTGCAGAAGGCTTCTCGACTTTACTAAGCAGGGAGGAATCTCATCATAACATTACAGGTTTCCAGA
TTAATAAATATTGTCCTTCCCTAACTCATATTTTCTTTGCAGATGACAGTCTTATTTTCATAAAAGAGCATAGAAAAGACCTTCAATCCCTAGTAAGGGTTCTGAAGTGG
TATGAGGAAGTTTCGGGGCAAACAATCAATCTGGAAAAATCAGCTTTTATGGCTAGTAAGAACTTGAATGAGGATGGGGTTTCAGGCTGTGAAAGAATCCTGAGCATTAA
AAAAACTACATCCCTAGGCATTTACTTACGAATGCCTGCCCAAACAGGCAGGAACAAAGGGGAGGCCTTCAGAAGGGTCAAAGATAGAGTGGAAAAAACCTTGCAAGGAT
GGAAAGAGAAGCTTTTCTCCCAAGGAGGGAAGGAAGTCCTTATCAAGGCCGTGGCGCAGGCGATCCCCACATACACCATGAGCTGTTTTAGACTCCCGAACAGCATTTGT
GACGACATTGATAGAATGTGTGCCAGATTCTGGTGGGGTGAAGCGAAAGGCAAAAGAAAAAGCCATTGGATGAGCTGGAGGAAAATGTGTAAAAGGAAAGAAACAGGTGG
GTTAGGGTTCAGACACCTTGCAATGCTCGCAAAGCTAAGTTGGAGGTTGGTAAAAAACCCTAACAGTCTGCTTTTTAAAACCCTCAGAGGGCGGTACTTCAAAGACCGAA
ATTTCCTAGAAGCCCATTTGGGTAGAGCTCCATTTCTAACCTGGCGTAGCATTTGTTGGGGGCGTGATCTGTTTAAAAGAGGCTTCAGATGGAGAGTTGGCAACGGGAGG
TTAATAAGTATCGAGTCCGATCCTTGGATTAATAGAAAGGGTAATGCGAGGCCTTTGGCTGTTCATAGCCACCTGAGAGGCTATAGAGTTGATCACCTGCTGGATGACGA
CAACCAATGGAAAGAGGATCTAATTAGACAAAACTTCCACAGCCCTGATGTGGAAGACATTCTAAATATCCCGACAGGAAGCAAAGAAGCCAAGGACGAAATTATATGGA
ACTTAGACACAAAAGGAAACTTCTCGATTAAGAGCGCATATCACCTGGCGATGGACCTCATAGAACATTATGTGGAATCTCAATCTGACAACAACAAAGAGGCTGGCTTT
TGGAAGAGTATTTGGAAAACAAAAACTCATCCCCGAACAAAGATCTGTGTTTGGAAGATTATCCAAGACATCCTCCCAACAAAGGCCAACATCATCAAGAAGGGAGTAGA
CATCAACCCTCTGTGTTGTATTTGTGGTAAAAAATTGGAATCTACCACCCACCTAATCTGGGAGTGTAAAACCTCTAGAAAGGGTTGGGTGAGATTTATTCCTAAAACTT
ATAATTTGTTTTCTATACGCAGGGCGGGCTGGAATCCTAAGGACTGGTGGTGTTGGCTGAAGGATAACTTGGAAGAAGAAGAAATCACAAAAGGTGTTGTCATAATGTGG
CAAATTTGGAATTACCGCAACAAAGTAGCCAATTTCCAGAACACAGGAGCTGAAGCTCTCTATCGAAGTATTGATCTCAGCATAAAAGAGATTGAAGCGGCTTACCTCAA
GAGTCGGCCTTCTGGCAGAATTGGGAACCATTCGAGTCAAGCTCCAAAGCATAATCCTTCCCCGATCAGTTGGCAGCTAAAATCCGACGCCTCCTGGAATGTTTATTTAG
GCTGTGGGGGTGTGGGTTGGATGATGTTGGAAGCAAAAGCTATGCTAGATGGGTTAAAGCAAATTTTCGATACCTTCAAACGTCGATCCATCGCCATTGAAGCTCAGAGG
GACGCCCTAGAAATCATCAACATCGTCTCCGAAAAAACAGAGGACCTTTCAGAGGTGAAATCTCTCACCGATCAAATCAAAGCCATCGCCTCAGATGTGCAAAATGTGGG
TTTTAGTCATTGTAGTAGAGTTTTGAACACAGAAGCGCACTGTGTTGCGAGATCGGCCATGGATCTTTCGTCTGTTGTAGGTGATCTCCATGGATGCAGATCTTCGCAGG
AAAAGGGGCAGGTAGCTAAAGCCTTAGATGCCCCAGAAATCTTACCCGAAAATCTTACATCAGAACAGAAAGCTGAAATGGATGAGAGTTATAAAGAGATAAAAGCTGCC
ATTAAATATGGGAGAGACTCACTGTCTTTAGAAATGGTTTTAGATGCTTTAAGATCCAAGGACATTGAGATTAAGATAGAAAAGAAAGATGTGACGCCCGAGCTAAAGGC
TAAAAATAAGCAACAAAATTCTGATAAAAAGGAAGAAGGAAATAATGCTAACATTCATGAAGGCTATGAATCAGTAGAATTATTGGTCGTAACTGATCAGAGATGTGGGG
AAGAATGGGCTTTATTGAACAAGATGGGGGCACTGTCCTTAAAACTGATAGATGGAACAGAGGGATATCTCACTGAAGTGAGATATATTCCAGACTTAAAGAGAAATTTA
ATATCTCTTGGTACCCTTGATAAACTAAGGAATTTGGAGGCGTTTCAGACTGAACCAGGTGGAACCGGGGCGGCCAGAGACAACAGGGACCGAATGGAGACGGAAGAGCT
CGACCCCCACAGACGGGCTGACCATATGGGTCGAGTCGACCCTCTAGTCTGTTCATCCTCTAGGGTCGATTTTTTATATTCTGCTCAGTTGTCCTCTTCAGCTCTGAGTA
CATCGGAGTGGTCCAAAATTGCTTATAACACCTTGGATTGA
Protein sequenceShow/hide protein sequence
MNFSETWIRKIMSCISTVSYSILINEDPKGLFKPSRGLRQGDPLSPYLFLFCAEGFSTLLSREESHHNITGFQINKYCPSLTHIFFADDSLIFIKEHRKDLQSLVRVLKW
YEEVSGQTINLEKSAFMASKNLNEDGVSGCERILSIKKTTSLGIYLRMPAQTGRNKGEAFRRVKDRVEKTLQGWKEKLFSQGGKEVLIKAVAQAIPTYTMSCFRLPNSIC
DDIDRMCARFWWGEAKGKRKSHWMSWRKMCKRKETGGLGFRHLAMLAKLSWRLVKNPNSLLFKTLRGRYFKDRNFLEAHLGRAPFLTWRSICWGRDLFKRGFRWRVGNGR
LISIESDPWINRKGNARPLAVHSHLRGYRVDHLLDDDNQWKEDLIRQNFHSPDVEDILNIPTGSKEAKDEIIWNLDTKGNFSIKSAYHLAMDLIEHYVESQSDNNKEAGF
WKSIWKTKTHPRTKICVWKIIQDILPTKANIIKKGVDINPLCCICGKKLESTTHLIWECKTSRKGWVRFIPKTYNLFSIRRAGWNPKDWWCWLKDNLEEEEITKGVVIMW
QIWNYRNKVANFQNTGAEALYRSIDLSIKEIEAAYLKSRPSGRIGNHSSQAPKHNPSPISWQLKSDASWNVYLGCGGVGWMMLEAKAMLDGLKQIFDTFKRRSIAIEAQR
DALEIINIVSEKTEDLSEVKSLTDQIKAIASDVQNVGFSHCSRVLNTEAHCVARSAMDLSSVVGDLHGCRSSQEKGQVAKALDAPEILPENLTSEQKAEMDESYKEIKAA
IKYGRDSLSLEMVLDALRSKDIEIKIEKKDVTPELKAKNKQQNSDKKEEGNNANIHEGYESVELLVVTDQRCGEEWALLNKMGALSLKLIDGTEGYLTEVRYIPDLKRNL
ISLGTLDKLRNLEAFQTEPGGTGAARDNRDRMETEELDPHRRADHMGRVDPLVCSSSRVDFLYSAQLSSSALSTSEWSKIAYNTLD