CuGenDBv2

Lag0003997 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0003997
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr6:257155..259608
RNA-Seq Expression: Lag0003997
Synteny: Lag0003997
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, %identity, alignment)
XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] (e value: 2.5e-176; %identity: 42.13)
Query:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFE-APAKN
        +F +E K GG  R+  QM+ FR +LD C L ++   G  FTW R        +ERLD   +N    DN    K+ HL+Y+ SDHR +LAEI F   P   
Subjt:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFE-APAKN

Query:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELES
        + RK R  +FE  W K +E  +II+N+W     ++ ASR +    +C N LH W+  +  G ++  I    K +  L        + S  V  AE  L+ 
Subjt:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELES

Query:  LL-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLD
        LL  EE+ W                  FHSKA++R   N I+ L +D G  V  +E +  V  DYF ++F AS  +  A S +L ++ + +S +Q   L 
Subjt:  LL-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLD

Query:  KPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKL
        + FTA+E+  A++ +   K+PG DG  A+F+   W++VG    +  L+ LN G   +S NKT+I+LIPK + P  M D RPIS C+V YKII+K+LA + 
Subjt:  KPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKL

Query:  RRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGN
        + VL S+IS  QSAF+  R I+DN+L+ FE ++S+  R++G +G+ ALKLDMSKA+DRVEW+FL A+M KMG     IS +M C+ +  +S L+NG +  
Subjt:  RRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGN

Query:  AITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSK
        ++ P RG++QGDP+SPYLFLIC+EGLSRLL  EE     + L ++ H P I+HL FADDSL+FC+A+++ C +IK+    Y  ASGQ +N DKS+M  S 
Subjt:  AITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSK

Query:  NVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARF
        N  +    +    LG+ +      YLG+P+ + R+K++LF  +++R+WK +  W   IFS GGKE+L+KA+ Q IPTY M+CFRL    C++I    ARF
Subjt:  NVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARF

Query:  WWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW
        WWGSS + +KIH+ NWK LC SK  GG+GFR    FNQA LAKQ+WRI + P SLLSRVL+ +Y+ ++DFM   +    SL      W
Subjt:  WWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa] (e value: 2.2e-172; %identity: 40.53)
Query:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNS
        +F +E K GG  R+  QM+ FR +LD C L ++   G  FTW +   K    +ERLD   +N    D +   K+ HL+Y+ SDHR +LA I F       
Subjt:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNS

Query:  HRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELESL
          +    +FE  W K +E  +II+++W     S+ A++ +   S C N L  W+ +R  G ++  I    + +  L   V    +    +  AE  L+ L
Subjt:  HRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELESL

Query:  L-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDK
        L  EE+ W                  FHSKA++R   N I+ L +D G  V  +E +  +  DYF Q+F AS  +  A + +L ++ + +S +Q   L K
Subjt:  L-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDK

Query:  PFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLR
         FTA+++   ++ +   K+PG DG  A+F+   W +VG    +  L+ LN G   ++ NKT+I+LIPK + P  M D RPIS C+V YKII+K+LA + +
Subjt:  PFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLR

Query:  RVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNA
         VL S+IS  QSAF+  R I+DN+L+ FE ++S+  R++G +G+VA KLDMSKA+DRVEW+F+ A+M KMG +  WIS +M C+ +  +S L+NG +  +
Subjt:  RVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNA

Query:  ITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKN
        + P RG++QGDP+SPYLFLIC+EGLSRLL  EE     + L ++ H P I+HL FADDSL+FCRA+++ C +IK+    Y  ASGQ +N DKS+M  S N
Subjt:  ITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKN

Query:  VGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFW
               +    LG+ + D    YLG+P+ + R+K++LF  +++R+WK +  W   IFS GGKE+L+KA+ Q IPTY M+CFRL    C++I    ARFW
Subjt:  VGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFW

Query:  WGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLR
        WGSS + +KIH+  WK LC SK  G +GFR    FNQA LAKQ+WR+ +NP SLLSRVL+ +Y+   DF+        SL      W   L   GLR
Subjt:  WGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLR

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e value: 4.5e-178; %identity: 43.24)
Query:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEI-CFEAPAKNSHRKD
        EK    +    Q+E FR++L  C L D+GFKG  +TW+     +  T+ RLDR   N++  D   + +V+HL+ H SDH P+L  +  F  P ++  R  
Subjt:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEI-CFEAPAKNSHRKD

Query:  RLLKFEGGWTKFEEAKDIINNAWREVRGSE--ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKEL-ECVVGGSTEESRMVAELELESLLEEE
           KFE  W   +E   +I  AW    G+    +   ++   C   L AW  + +      AI +  K++  L E  +  +++   +    +++ LL+++
Subjt:  RLLKFEGGWTKFEEAKDIINNAWREVRGSE--ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKEL-ECVVGGSTEESRMVAELELESLLEEE

Query:  EMW-------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTA
        E++                   FH+KA+ RR++N I G+ N +G WV+  E +G VA DYF  +FQA   +       LD+VD++++ D    L   FTA
Subjt:  EMW-------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTA

Query:  AEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLK
         E++ A+  M PTKAPG DG +ALF+QK+W +VG   +   L+FLN G  +  IN T I LIPK QNP +M++ RPIS C+V+YKII+KVLAN+L++VL 
Subjt:  AEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLK

Query:  SIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPS
         IIS  QSAF+PGR I+DNVL+ +E ++++H+R KGK+G VALKLD+SKAYDRVEW FL+++M+KMG    WI +VM+CV +  +S+LVNG     I PS
Subjt:  SIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPS

Query:  RGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKE
        RGI+QGDP+SPYLFL+CAEGL+ LL + E       + I    P I++L FADDSL+FC+A+  + ETI +I + YE ASGQSINL+KS    S N  + 
Subjt:  RGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKE

Query:  KAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSS
        +   +   LGV+  D F  YLG+P+  GR K   F +++DRVWK L+GWKG + S  GKEILIKA+AQ IPTYTM+ F++P  +C E+    ARFWWG  
Subjt:  KAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSS

Query:  NEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLM
          +RKIH+ +W KL   K++GGMGFRD+  FN AMLAKQ WR+V+  +SLL R  + +YF +  F++     N S +
Subjt:  NEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLM

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata] (e value: 6.5e-177; %identity: 40.88)
Query:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR
        EK GG  R   QM+ FRD+LDFCG  D+GF G  FTW       ++T  RLDR             ++V H+   +SDH P+   +C +      ++K R
Subjt:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR

Query:  LLKFEGGWTKFEEAKDIINNAWRE-VRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELEC--VVGGSTEESRMVAELELESLLEEEE
          +FE  W K E  + II  AW +   G    + I++   C + L  W++    G++R  +++K K++ + E   + G   ++ R++     E +++EE 
Subjt:  LLKFEGGWTKFEEAKDIINNAWRE-VRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELEC--VVGGSTEESRMVAELELESLLEEEE

Query:  MW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAE
        +W                  FHS+AT R KRN I  L  + G+ V  E+ +G    DYF QIF ++ P++    +IL  +D++++     DL + FTA E
Subjt:  MW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAE

Query:  IEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSI
        +E A++ M P  APG DG   +F++  W+ +G + I   L  LN G    S+N T ISLIPK ++P K TD RPIS C+V+YKI++K +AN+L+++L  +
Subjt:  IEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSI

Query:  ISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRG
        +S  QSAF+  R ISDN+L+ FE ++ + +++KGK G++A+KLDMSKAYDRVEWAFL+ +M+K+G    WI+ V +C+ S+ +SVLVNG      TP+RG
Subjt:  ISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRG

Query:  IKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKA
        ++QGDP+SPYLFL+CAEGL  L+ + E   + K + + S  P +SHLFFADDSL+FCRA+ ++  +I +I K YE ASGQ IN +K+ +  S N      
Subjt:  IKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKA

Query:  KALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNE
        + +   LGV  T ++  YLG+PS  GR K + F  +R+R+W  ++GWK  + S GG+E+LIKA+ Q +PT+TM CF++PKS+C++I     +FWWG   E
Subjt:  KALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNE

Query:  KRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS
         RKIH++ WKKLCKSK  GG+GF+DI LFN AML KQ WR++ N +SL  +V + K+F     +   +  N S
Subjt:  KRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata] (e value: 2.0e-178; %identity: 41.4)
Query:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR
        EK GG  R   QM+ FRD+LDFCG  D+GF G  FTW       ++T  RLDR             ++V H+   +SDH P+   +C +      ++K R
Subjt:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR

Query:  LLKFEGGWTKFEEAKDIINNAW-REVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELEC--VVGGSTEESRMVAELELESLLEEEE
          +FE  W K E  + +I NAW     G    + I++   C + L  W++    G++R  + +K K++   E   + G   ++ R++     E +++EE 
Subjt:  LLKFEGGWTKFEEAKDIINNAW-REVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELEC--VVGGSTEESRMVAELELESLLEEEE

Query:  MW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAE
        +W                  FHS+AT R KRN I  L  + G  V GE+ +G    +YF QIF ++ P++    +IL  +D++++     DL + FTA E
Subjt:  MW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAE

Query:  IEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSI
        +E A++ M P  APG DG   +F++  W+ +G + I   L  LN G    S+N T ISLIPK ++P K TD RPIS C+V+YKI++K +AN+L+++L  +
Subjt:  IEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSI

Query:  ISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRG
        +S  QSAF+  R ISDN+L+ FE ++ + +++KGK G++A+KLDMSKAYDRVEWAFL+ +M+K+G    WI+ V +C+ S+ +SVLVNG      TP+RG
Subjt:  ISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRG

Query:  IKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKA
        ++QGDP+SPYLFL+CAEGL  L+ + E   S K + + S  P +SHLFFADDSL+FCRA+ ++  +I +I K YE ASGQ IN +K+ +  S N      
Subjt:  IKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKA

Query:  KALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNE
        + +   LGV  T ++  YLG+PS  GR K + F  +R+RVW+ ++GWK  + S GG+E+LIKA+ Q +PT+TM CF+LPKS+C++I     +FWWG   E
Subjt:  KALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNE

Query:  KRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS
         RKIH++ WKKLCKSK +GG+GF+DI LFN AML KQ WR++ N +SL  +V + KYF     +   +  N S
Subjt:  KRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS

TrEMBL top hits (e value, %identity, alignment)
A0A2N9EWI8 Uncharacterized protein (e value: 1.2e-176; %identity: 41.69)
Query:  MFGHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPIL--AEICFEAPAKN
        M   EK G + R   QM  FR++L+ C LLD+GF+G  FTWT   +  +   ERLDR    ++ +D     ++ H+ + +SDH  ++  +E   + P +N
Subjt:  MFGHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPIL--AEICFEAPAKN

Query:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVR-GSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGG-----STEESRMVAELE
        S +  R   FE  W + E  ++ I+ AW   + G+   R  Q+   C   L +W+KA L+   +  +DK+    K L+ + GG     +  E R +   E
Subjt:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVR-GSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGG-----STEESRMVAELE

Query:  LESLLEEEEM-------------------WFHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVR
        L SLL++EE+                   +FHS A+ R+K N I G+ + + VW   E  +  V   YFH+I+  + P   A   ++  VD  +S+D  +
Subjt:  LESLLEEEEM-------------------WFHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVR

Query:  DLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLA
        +L KPFT  E+ +A+  M+P+KAPG DG  ALFFQK+W +VG +     L+FLN+G  +KS+N T I+LIPK ++P  MT  RPIS C+V+YKII+KVL 
Subjt:  DLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLA

Query:  NKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGL
        N+++ +L  ++S  QSAF+PGR ISDN++I FE I+ + ++  GK   +A KLDMSKAY+RVEW +LK +M K+G  + W++ +M CV S+ YS+LVNG 
Subjt:  NKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGL

Query:  IGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMM
            + PSRG++QGDP+SPYLFLICAEGLS LL + E + + + + ++   P +SHLFFADDSLIFCRA+E DC+ +++I   YE ASGQ IN DK+ + 
Subjt:  IGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMM

Query:  SSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAY
         S+N        +    G   T  F  YLG+P   GR+K + F +++DR+W+ L+GWK    S  GKEILIKA+ Q IPTY M+CF+LP  +C+EI+   
Subjt:  SSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAY

Query:  ARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPWLGS
         RFWWG    +RKIH+++ KKLC++K +GGMGFRD+  FNQA+LA+Q WR+++NP SL+ RV   K    D  ++  +G+  ++ + +D W+ S
Subjt:  ARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPWLGS

A0A7N2LIH6 Uncharacterized protein (e value: 2.5e-182; %identity: 43.78)
Query:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR
        EK G  +R++ QM+ FR+ L  CGL+D+GF G RFTW  G      T  RLDR   N+         KV H++   SDH  +LA   F     N  R  +
Subjt:  EKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDR

Query:  LLKFEGGWTKFEEAKDIINNAWREVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECV-VGGSTEESRMVAELELESL-LEEEEM
           FE  WT+ EE K+I+  AW   R   A    +R   C   L  WN+    G++   I +K   +++LE + +   T E     + E+  L   EE M
Subjt:  LLKFEGGWTKFEEAKDIINNAWREVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECV-VGGSTEESRMVAELELESL-LEEEEM

Query:  W------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAEI
        W                  FH+ A+ RR++N I GL +D GVW + +E    +  DYF  I+ ++ P S   S  L+++D R++ +   +L K F A E+
Subjt:  W------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAEI

Query:  EVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSII
          A++ M+PTKAPG DG   +F+QKYW +VG       L  LN G   K INKT I LIPKT+NP K+T+ RPIS C+V+YKII+KVLAN+L++VL  +I
Subjt:  EVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSII

Query:  SPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGI
           QSAF+PGR I+DNV++ FE ++SI+ R KGKEG +A+KLDMSKAYDRVEWA+L++MM KMG    WIS +M CV S+ +SVL+NG    + TPSRG+
Subjt:  SPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGI

Query:  KQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKAK
        +QGDP+SPYLFL+C EGLS ++ ++E +   + +      P ISHLFFADDS+IFCRA+  +CE + K+ + YE  SGQ +N DK+ +  S+N   E  +
Subjt:  KQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKAK

Query:  ALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEK
              G Q+      YLG+P   GR K K F +++D+V + + GWKG + S  G+E+LIKA+AQ  PTYTM  F+LP S+C E+N     FWWG    +
Subjt:  ALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEK

Query:  RKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS
        +K+ +++WK LCK K  GGMGF+D+  FN A+LAKQ WR+ +NP SL  RVL+ KYFA   FM+  LG  PS
Subjt:  RKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPS

A0A803PIB6 Uncharacterized protein (e value: 1.2e-176; %identity: 42.13)
Query:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFE-APAKN
        +F +E K GG  R+  QM+ FR +LD C L ++   G  FTW R        +ERLD   +N    DN    K+ HL+Y+ SDHR +LAEI F   P   
Subjt:  MFGHE-KAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFE-APAKN

Query:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELES
        + RK R  +FE  W K +E  +II+N+W     ++ ASR +    +C N LH W+  +  G ++  I    K +  L        + S  V  AE  L+ 
Subjt:  SHRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSE-ASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMV--AELELES

Query:  LL-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLD
        LL  EE+ W                  FHSKA++R   N I+ L +D G  V  +E +  V  DYF ++F AS  +  A S +L ++ + +S +Q   L 
Subjt:  LL-EEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLD

Query:  KPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKL
        + FTA+E+  A++ +   K+PG DG  A+F+   W++VG    +  L+ LN G   +S NKT+I+LIPK + P  M D RPIS C+V YKII+K+LA + 
Subjt:  KPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKL

Query:  RRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGN
        + VL S+IS  QSAF+  R I+DN+L+ FE ++S+  R++G +G+ ALKLDMSKA+DRVEW+FL A+M KMG     IS +M C+ +  +S L+NG +  
Subjt:  RRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGN

Query:  AITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSK
        ++ P RG++QGDP+SPYLFLIC+EGLSRLL  EE     + L ++ H P I+HL FADDSL+FC+A+++ C +IK+    Y  ASGQ +N DKS+M  S 
Subjt:  AITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSK

Query:  NVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARF
        N  +    +    LG+ +      YLG+P+ + R+K++LF  +++R+WK +  W   IFS GGKE+L+KA+ Q IPTY M+CFRL    C++I    ARF
Subjt:  NVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARF

Query:  WWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW
        WWGSS + +KIH+ NWK LC SK  GG+GFR    FNQA LAKQ+WRI + P SLLSRVL+ +Y+ ++DFM   +    SL      W
Subjt:  WWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW

A0A803Q8W7 Uncharacterized protein (e value: 1.8e-177; %identity: 40.75)
Query:  GHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRK
        G +  GG  RN  QM+ FR  LD C L +  F+G  FTW +G  K D  +ERLD   VN    D    +   HL+Y+ SDHR I  ++   +      R+
Subjt:  GHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRK

Query:  DRLLKFEGGWTKFEEAKDIINNAWREV-RGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELE--CVVGGSTEESRMVAELELESLLEE
            +FE  W   EEA D+I   W+ V  G  A+ F+Q  S C + L  W++ +  G+ +  I +  K++ +L    +   +        E  L+ LL +
Subjt:  DRLLKFEGGWTKFEEAKDIINNAWREV-RGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELE--CVVGGSTEESRMVAELELESLLEE

Query:  EEMW-------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFT
        EE +                   FH+ A+SRR+ N I+ L +D G  V  ++ +  V   YF  +F A+  N+ A   +L ++ + ++A+    L KPFT
Subjt:  EEMW-------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFT

Query:  AAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVL
        + EI  A++ +NP K+PG DG  A+F+  YWS+VG    +  L  LNEG  ++SINK++I+LIPK ++P  M D RPI+ C+V+YKII+K LA + + VL
Subjt:  AAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVL

Query:  KSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITP
         ++IS  QSAF+  R I+DN+L+ FE I+ +  +++G  G+ ALKLDMSKA+DRVEW +++ +M KMG    W+S +M C+ S  +S ++NG     + P
Subjt:  KSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITP

Query:  SRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGK
        +RG++QGDP+SPYLFLIC+EGLSRLL  EE   + + L++  H P ISHL FADDSL+FC A+      IKK+   Y  ASGQ +N  KS+M  S N  +
Subjt:  SRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGK

Query:  EKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGS
                TLG+ ++D    YLG+P+ + R+K ++F  V++R+W+ L  W   +FS GGKE+L+KA+ Q IPTY M+CFRLP + C ++    A FWWGS
Subjt:  EKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGS

Query:  SNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLRF
        + +  KIH+ +WK LCKSK +GGMGFR    FN+A+LAKQ+WRI   P SLLSR+L+ +YF+ + F++  LG +PSL      W   L  +GLR+
Subjt:  SNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLRF

A0A803QH07 Uncharacterized protein (e value: 7.5e-179; %identity: 40.6)
Query:  MFGHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEIC-FEAPAKNS
        ++   K GG  R   QME FR  LDFC L ++ F G  FTWT+  N+ +  QERLD    N   + +   +   HL+++ SDHR I   I       + +
Subjt:  MFGHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEIC-FEAPAKNS

Query:  HRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVV----GGSTEESRMVAELELES
         RK R  +FE  W K EEA  +I + W+ V   + + F    S C + L  W+  +  G ++  I K  K + +L  +      G T+  +    L+ E 
Subjt:  HRKDRLLKFEGGWTKFEEAKDIINNAWREVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVV----GGSTEESRMVAELELES

Query:  LLEEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDK
        L  EEE W                  FH+KA+SR+  N I+ L ND G+ V G+ENL  V   Y+ ++F +   +S +   +++++ S + +   + L  
Subjt:  LLEEEEMW------------------FHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDK

Query:  PFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLR
        PF+ AE+  A++ M+P K+PG DG  A+F+Q YW +VG    +  L  LN+G E+  +N ++I+LIPK  NP  M D RPIS C+V+YK+I+K +  + +
Subjt:  PFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLR

Query:  RVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNA
        +VL  +IS  QSAF+  R I+DN+L+ FE I+ +  +++G+ G+ ALKLDMSKA+DRVEW +L+A+M KMG +  W++ +M+C+ +  +S  +NG +   
Subjt:  RVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNA

Query:  ITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKN
        + P RG++QGDP+SPYLFLIC+EGLSRLL  EE++   K LR+  + P +SHL FADDSL+FC+A+E+    +K+    Y  ASGQ +N DKS+M  S N
Subjt:  ITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKN

Query:  VGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFW
                 + TL + +TD    YLG+PS +GR+K +LF  ++++VWK L  W   IFS GGKE+L+KA+ Q IPTY M+CF+L K  C ++    A FW
Subjt:  VGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFW

Query:  WGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLRF
        WG++    KIH+  WK LCKSK +GGMGFR    FNQA+LAKQ+WRI   P+SLLSR+L+ +YF+   F+  S+G +PS       W   L   GLRF
Subjt:  WGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPW---LGSDGLRF

SwissProt top hits (e value, %identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.8e-41; %identity: 24.64)
Query:  KDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDRLLKFEGGWTKFE---EAKDIINNAWREVRGSEASRFIQRSSIC
        K++TQ R   +++N  LL+          +Y V +      ++ FE     +  KD    ++  W  F+     K I  NA++  R  E S+ I   +  
Subjt:  KDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDRLLKFEGGWTKFE---EAKDIINNAWREVRGSEASRFIQRSSIC

Query:  MNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMVAELELESLLEEEEMWFHSKAT-----------SRRKRNCIEGLFNDRGVWVDGEEN
        +  L    +   K S R  I K   E+KE+E      T+++           + E   WF  +              +R++N I+ + ND+G        
Subjt:  MNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMVAELELESLLEEEEMWFHSKAT-----------SRRKRNCIEGLFNDRGVWVDGEEN

Query:  LGAVAWDYFHQIFQASPPNSGATSRILDSVD-SRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEI
        +     +Y+  ++     N       LD+    RL+ ++V  L++P T +EI   I  +   K+PG DGF A F+Q+Y   +    ++   +   EG   
Subjt:  LGAVAWDYFHQIFQASPPNSGATSRILDSVD-SRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGGEI

Query:  KSINKTVISLIPKT-QNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKA
         S  +  I LIPK  ++  K  + RPIS  ++  KI+ K+LAN++++ +K +I   Q  FIPG Q   N+      I  I +R+K K   V + +D  KA
Subjt:  KSINKTVISLIPKT-QNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKA

Query:  YDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLF
        +D+++  F+   ++K+G+   ++  + A  +    ++++NG    A     G +QG P+SP LF I  E L+R  IR+E ++  K +++      +S   
Subjt:  YDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLF

Query:  FADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALR---
        FADD +++        + + K+  ++   SG  IN+ KS      N  + +++ +   L   +      YLG+  Q  R+   LF +    + K ++   
Subjt:  FADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSMMMSSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALR---

Query:  -GWKGNIFSAGGKEILIK--AIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIV
          WK    S  G+  ++K   + + I  +     +LP +   E+ K   +F W   N+KR    +    L +  + GG+   D  L+ +A + K +W   
Subjt:  -GWKGNIFSAGGKEILIK--AIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIV

Query:  RN
        +N
Subjt:  RN

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 3.6e-21; %identity: 28.92)
Query:  MPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGG
        MP    R     F ++ +RV   + GW+    S  G+  L KA+   +P ++M+   LP+SI   +++    F WGS+ EK+K H + W K+C  K++GG
Subjt:  MPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGG

Query:  MGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFA---KDDFMKVSLGSNPS-----------LMLNEDPWLGSDGLRFPSFVPEDLKGRKVRELM
        +G R     N+A+++K  WR+++   SL + VL+ KY     +D    +  GS  S           ++ +   W+  DG +   +    + G+ + E  
Subjt:  MGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFA---KDDFMKVSLGSNPS-----------LMLNEDPWLGSDGLRFPSFVPEDLKGRKVRELM

Query:  LDNG
        LDNG
Subjt:  LDNG

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 2.2e-42; %identity: 26.75)
Query:  KARLKGSLRA-AIDKKDKE----------IKELECVVGGSTEESRMVAELELESLLEEEE------------MWFHSK-----------ATSRRKRNCIE
        KA L+G L A +  KK +E          +K LE     S + SR    ++L   + + E             WF  K               R +  I 
Subjt:  KARLKGSLRA-AIDKKDKE----------IKELECVVGGSTEESRMVAELELESLLEEEE------------MWFHSK-----------ATSRRKRNCIE

Query:  GLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVD-SRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGE
         + N++G      E +      ++ +++     N     + LD     +L+ DQV  L+ P +  EIE  I  +   K+PG DGF A F+Q +   +   
Subjt:  GLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVD-SRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGE

Query:  TIRTCLNFLNEGGEIKSINKTVISLIPKTQ-NPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSK
          +       EG    S  +  I+LIPK Q +P K+ + RPIS  ++  KI+ K+LAN+++  +K+II P Q  FIPG Q   N+    + IN IH  +K
Subjt:  TIRTCLNFLNEGGEIKSINKTVISLIPKTQ-NPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSK

Query:  GKE-GWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSF
         K+   + + LD  KA+D+++  F+  ++++ G+   +++ + A       ++ VNG    AI    G +QG P+SPYLF I  E L+R  IR++ ++  
Subjt:  GKE-GWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSF

Query:  KSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSM-MMSSKNVGKEKAKALSITLGVQLTDSFGHYLG--MPSQNGRNK
        K ++I      IS L  ADD +++    +     +  +   +    G  IN +KSM  + +KN  K+  K +  T    +  +   YLG  +  +     
Subjt:  KSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKSM-MMSSKNVGKEKAKALSITLGVQLTDSFGHYLG--MPSQNGRNK

Query:  NKLFCKVRDRVWKALRGWKGNIFSAGGKEILIK--AIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDI
        +K F  ++  + + LR WK    S  G+  ++K   + + I  +     ++P     E+  A  +F W  +N+K +I     K L K K   GG+   D+
Subjt:  NKLFCKVRDRVWKALRGWKGNIFSAGGKEILIK--AIAQGIPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDI

Query:  SLFNQAMLAKQSW
         L+ +A++ K +W
Subjt:  SLFNQAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein    4.4e-35    23.58
Query:  MTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAP----AKNSHRKDRLLKFEG----------GWTKFEEAKDIINNAWREVRGSEAS
        ++Q R+DR  ++  L+       +    +  SDH  +   +   AP    A   H  + LL+ EG          GW  F++    +N  W   +     
Subjt:  MTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAP----AKNSHRKDRLLKFEG----------GWTKFEEAKDIINNAWREVRGSEAS

Query:  RFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMVAELELESLLEEEEM-----------------------WFHSKATSRR
          +    +C      + K+ + G   A I+  + E+ +LE  + GS +++     LE +  L   E                        +F++    + 
Subjt:  RFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMVAELELESLLEEEEM-----------------------WFHSKATSRR

Query:  KRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWS
         R  I  LF + G  ++  E +   A  ++  +F   P +  A   + D +   +S  +   L+ P T  E+  A+R M   K+PG DG    FFQ +W 
Subjt:  KRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWS

Query:  VVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIH
         +G +  R       +G    S  + V+SL+PK  +   + + RP+S  S  YKI+AK ++ +L+ VL  +I P QS  +PGR I DNV   F   + +H
Subjt:  VVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIH

Query:  SRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESK
           +       L LD  KA+DRV+  +L   +        ++  +     S E  V +N  +   +   RG++QG P+S  L+ +  E    LL +  + 
Subjt:  SRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESK

Query:  LSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKS--MMMSSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGR
        L  K   +      +    +ADD +I       D E  ++  + Y AAS   IN  KS  ++  S  V         I+   ++    G YL   S    
Subjt:  LSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETIKKIFKDYEAASGQSINLDKS--MMMSSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGR

Query:  NKNKLFCKVRDRVWKALRGWKG--NIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWW
          ++ F ++ + V   L  WKG   + S  G+ ++I  +      Y + C    +    +I +    F W
Subjt:  NKNKLFCKVRDRVWKALRGWKG--NIFSAGGKEILIKAIAQGIPTYTMACFRLPKSICEEINKAYARFWW

P93295 Uncharacterized mitochondrial protein AtMg00310    1.1e-28    53.27
Query:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKV
        +P Y M+CFRL K +C+++  A   FWW S   KRKI ++ W+KLCKSKE  GG+GFRD+  FNQA+LAKQS+RI+  P +LLSR+LR +YF     M+ 
Subjt:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKV

Query:  SLGSNPS
        S+G+ PS
Subjt:  SLGSNPS

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    3.7e-13    30.95
Query:  WFHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPP--NSGATSRILDSVDSRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFD
        +FH    + + +N I+ L  D  V V+    +  +   Y+  +  +        +  RI D    R +      L    +  EI  A+  M   KAPG D
Subjt:  WFHSKATSRRKRNCIEGLFNDRGVWVDGEENLGAVAWDYFHQIFQASPP--NSGATSRILDSVDSRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFD

Query:  GFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKII
         F A FF + W VV   TI     F   G  +K  N T I+LIPK     +++  RP+S C+VVYKII
Subjt:  GFHALFFQKYWSVVGGETIRTCLNFLNEGGEIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    2.2e-13    36.05
Query:  LANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKV
        +  +L+ ++ ++I P Q++FIPGR  +DN++   E ++S+  R KG +GW+ LKLD+ KAYDR+ W +L+  +   G  + W+ ++
Subjt:  LANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFLKAMMDKMGLSQAWISKV

AT4G29090.1 Ribonuclease H-like superfamily protein    1.7e-26    46.3
Query:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVS
        +PTYTMACF LPK++C++I    A FWW +  E + +H+  W  L   K +GG+GF+DI  FN A+L KQ WR++  PESL+++V + +YF K D +   
Subjt:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVS

Query:  LGSNPSLM
        LGS PS +
Subjt:  LGSNPSLM

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    7.5e-30    53.27
Query:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKV
        +P Y M+CFRL K +C+++  A   FWW S   KRKI ++ W+KLCKSKE  GG+GFRD+  FNQA+LAKQS+RI+  P +LLSR+LR +YF     M+ 
Subjt:  IPTYTMACFRLPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKE-QGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKV

Query:  SLGSNPS
        S+G+ PS
Subjt:  SLGSNPS

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    2.9e-13    45.59
Query:  LVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDS
        ++NG     +TPSRG++QGDP+SPYLF++C E LS L  R + +     +R++++ P I+HL FADD+
Subjt:  LVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDS


Sequences
CDS sequence
ATGTTTGGGCATGAGAAAGCTGGTGGTGTTGAGCGGAATAGTGGCCAGATGGAGTGTTTTCGTGACTCGCTGGACTTCTGTGGTTTGCTTGATGTCGGTTTCAAGGGCGG
TAGATTTACGTGGACTAGAGGATCCAATAAGAAAGACATGACTCAGGAGAGGCTTGACAGATTCCGTGTTAACCAAAAGCTGCTAGATAATGTACATCTCATTAAAGTTC
TCCACCTCAACTATCATGTGTCGGATCACAGGCCAATATTGGCTGAAATCTGTTTTGAAGCCCCTGCCAAGAATTCCCATAGAAAGGATAGGTTGCTTAAGTTTGAGGGT
GGCTGGACTAAATTCGAAGAAGCTAAAGACATCATTAACAATGCTTGGAGAGAGGTGAGGGGCAGTGAAGCTAGCAGATTTATTCAGAGGTCCTCCATTTGTATGAATAG
GTTGCATGCTTGGAACAAAGCTAGGCTGAAGGGTTCCTTGAGGGCTGCTATTGACAAGAAAGATAAAGAGATTAAGGAGCTTGAGTGTGTTGTTGGTGGTTCGACGGAGG
AGAGTCGTATGGTGGCTGAATTAGAGCTCGAATCGTTGCTTGAAGAAGAAGAGATGTGGTTCCATTCCAAGGCTACATCGAGAAGGAAGAGAAACTGCATAGAGGGCTTA
TTTAATGATAGAGGCGTTTGGGTTGATGGTGAGGAGAATTTGGGGGCAGTGGCCTGGGATTATTTCCATCAAATTTTCCAAGCCTCTCCCCCAAATTCGGGGGCCACTAG
CCGAATTTTGGACTCTGTGGATTCTAGGTTGTCGGCTGACCAAGTAAGGGATCTGGACAAGCCTTTTACTGCTGCGGAAATTGAGGTTGCAATCAGAGGTATGAATCCCA
CTAAGGCTCCGGGGTTCGATGGATTCCATGCTCTCTTCTTCCAAAAGTATTGGAGTGTTGTTGGGGGAGAAACTATCCGAACTTGCCTGAATTTTCTAAATGAGGGAGGG
GAGATAAAGTCGATCAATAAAACGGTTATATCTCTTATCCCCAAGACCCAAAACCCGGTGAAGATGACCGATCTTAGGCCAATCAGCTCGTGTTCGGTGGTGTACAAGAT
CATAGCGAAGGTTCTGGCCAACAAGTTGCGAAGGGTGCTTAAGTCAATCATTTCCCCTTTCCAATCTGCTTTTATCCCGGGTAGACAGATTTCTGATAACGTTTTGATAG
GTTTTGAATGTATTAATTCTATTCATAGCAGATCAAAAGGCAAGGAGGGTTGGGTTGCCTTAAAACTGGACATGAGTAAAGCCTACGACAGGGTGGAGTGGGCTTTTCTC
AAGGCAATGATGGATAAGATGGGTTTAAGCCAAGCCTGGATTAGCAAGGTGATGGCTTGTGTAGAATCGATTGAATACTCGGTGCTAGTGAATGGTCTTATTGGAAATGC
TATCACTCCCTCCAGAGGCATCAAACAAGGGGACCCCATGTCTCCCTACCTTTTCCTCATCTGCGCGGAGGGCCTGTCTCGGCTTCTTATCAGGGAAGAGTCTAAGCTTA
GCTTTAAAAGTCTTAGAATAAACTCTCACTGCCCGCCTATATCTCATTTATTTTTTGCTGACGATAGTTTAATTTTTTGTAGGGCCTCTGAGAAGGATTGTGAGACTATA
AAGAAGATTTTTAAGGACTATGAAGCTGCCTCAGGCCAATCCATCAACCTAGACAAGTCCATGATGATGTCCAGTAAGAACGTGGGAAAAGAGAAAGCAAAGGCCCTTAG
CATCACCCTTGGAGTGCAGCTAACTGACTCCTTTGGGCATTACCTTGGGATGCCGTCTCAGAATGGGCGTAACAAGAACAAACTCTTCTGCAAGGTTAGAGACAGAGTTT
GGAAGGCGCTCCGAGGGTGGAAAGGTAACATTTTTTCGGCAGGGGGCAAAGAGATACTTATCAAAGCCATTGCCCAGGGCATTCCCACCTATACTATGGCCTGTTTCAGG
CTCCCGAAAAGTATTTGTGAGGAGATTAACAAGGCTTATGCCCGCTTTTGGTGGGGCTCTAGTAATGAGAAAAGGAAAATCCACTATATGAACTGGAAGAAGCTATGTAA
GAGCAAGGAACAAGGGGGTATGGGCTTTAGGGACATTAGCTTGTTCAATCAAGCTATGCTTGCCAAGCAAAGCTGGAGGATTGTGAGAAACCCAGAGAGCCTTCTTTCTA
GAGTTCTTAGGGACAAGTACTTTGCTAAAGACGACTTTATGAAGGTTTCCCTAGGCTCCAACCCCTCGTTAATGCTCAACGAGGATCCGTGGCTTGGTTCGGATGGCTTG
AGGTTCCCCTCTTTTGTCCCCGAAGATCTCAAGGGGAGAAAGGTTCGGGAGCTGATGCTTGATAATGGGGGTGGAATGAAGGACTAA
mRNA sequence
ATGTTTGGGCATGAGAAAGCTGGTGGTGTTGAGCGGAATAGTGGCCAGATGGAGTGTTTTCGTGACTCGCTGGACTTCTGTGGTTTGCTTGATGTCGGTTTCAAGGGCGG
TAGATTTACGTGGACTAGAGGATCCAATAAGAAAGACATGACTCAGGAGAGGCTTGACAGATTCCGTGTTAACCAAAAGCTGCTAGATAATGTACATCTCATTAAAGTTC
TCCACCTCAACTATCATGTGTCGGATCACAGGCCAATATTGGCTGAAATCTGTTTTGAAGCCCCTGCCAAGAATTCCCATAGAAAGGATAGGTTGCTTAAGTTTGAGGGT
GGCTGGACTAAATTCGAAGAAGCTAAAGACATCATTAACAATGCTTGGAGAGAGGTGAGGGGCAGTGAAGCTAGCAGATTTATTCAGAGGTCCTCCATTTGTATGAATAG
GTTGCATGCTTGGAACAAAGCTAGGCTGAAGGGTTCCTTGAGGGCTGCTATTGACAAGAAAGATAAAGAGATTAAGGAGCTTGAGTGTGTTGTTGGTGGTTCGACGGAGG
AGAGTCGTATGGTGGCTGAATTAGAGCTCGAATCGTTGCTTGAAGAAGAAGAGATGTGGTTCCATTCCAAGGCTACATCGAGAAGGAAGAGAAACTGCATAGAGGGCTTA
TTTAATGATAGAGGCGTTTGGGTTGATGGTGAGGAGAATTTGGGGGCAGTGGCCTGGGATTATTTCCATCAAATTTTCCAAGCCTCTCCCCCAAATTCGGGGGCCACTAG
CCGAATTTTGGACTCTGTGGATTCTAGGTTGTCGGCTGACCAAGTAAGGGATCTGGACAAGCCTTTTACTGCTGCGGAAATTGAGGTTGCAATCAGAGGTATGAATCCCA
CTAAGGCTCCGGGGTTCGATGGATTCCATGCTCTCTTCTTCCAAAAGTATTGGAGTGTTGTTGGGGGAGAAACTATCCGAACTTGCCTGAATTTTCTAAATGAGGGAGGG
GAGATAAAGTCGATCAATAAAACGGTTATATCTCTTATCCCCAAGACCCAAAACCCGGTGAAGATGACCGATCTTAGGCCAATCAGCTCGTGTTCGGTGGTGTACAAGAT
CATAGCGAAGGTTCTGGCCAACAAGTTGCGAAGGGTGCTTAAGTCAATCATTTCCCCTTTCCAATCTGCTTTTATCCCGGGTAGACAGATTTCTGATAACGTTTTGATAG
GTTTTGAATGTATTAATTCTATTCATAGCAGATCAAAAGGCAAGGAGGGTTGGGTTGCCTTAAAACTGGACATGAGTAAAGCCTACGACAGGGTGGAGTGGGCTTTTCTC
AAGGCAATGATGGATAAGATGGGTTTAAGCCAAGCCTGGATTAGCAAGGTGATGGCTTGTGTAGAATCGATTGAATACTCGGTGCTAGTGAATGGTCTTATTGGAAATGC
TATCACTCCCTCCAGAGGCATCAAACAAGGGGACCCCATGTCTCCCTACCTTTTCCTCATCTGCGCGGAGGGCCTGTCTCGGCTTCTTATCAGGGAAGAGTCTAAGCTTA
GCTTTAAAAGTCTTAGAATAAACTCTCACTGCCCGCCTATATCTCATTTATTTTTTGCTGACGATAGTTTAATTTTTTGTAGGGCCTCTGAGAAGGATTGTGAGACTATA
AAGAAGATTTTTAAGGACTATGAAGCTGCCTCAGGCCAATCCATCAACCTAGACAAGTCCATGATGATGTCCAGTAAGAACGTGGGAAAAGAGAAAGCAAAGGCCCTTAG
CATCACCCTTGGAGTGCAGCTAACTGACTCCTTTGGGCATTACCTTGGGATGCCGTCTCAGAATGGGCGTAACAAGAACAAACTCTTCTGCAAGGTTAGAGACAGAGTTT
GGAAGGCGCTCCGAGGGTGGAAAGGTAACATTTTTTCGGCAGGGGGCAAAGAGATACTTATCAAAGCCATTGCCCAGGGCATTCCCACCTATACTATGGCCTGTTTCAGG
CTCCCGAAAAGTATTTGTGAGGAGATTAACAAGGCTTATGCCCGCTTTTGGTGGGGCTCTAGTAATGAGAAAAGGAAAATCCACTATATGAACTGGAAGAAGCTATGTAA
GAGCAAGGAACAAGGGGGTATGGGCTTTAGGGACATTAGCTTGTTCAATCAAGCTATGCTTGCCAAGCAAAGCTGGAGGATTGTGAGAAACCCAGAGAGCCTTCTTTCTA
GAGTTCTTAGGGACAAGTACTTTGCTAAAGACGACTTTATGAAGGTTTCCCTAGGCTCCAACCCCTCGTTAATGCTCAACGAGGATCCGTGGCTTGGTTCGGATGGCTTG
AGGTTCCCCTCTTTTGTCCCCGAAGATCTCAAGGGGAGAAAGGTTCGGGAGCTGATGCTTGATAATGGGGGTGGAATGAAGGACTAA
Protein sequence
MFGHEKAGGVERNSGQMECFRDSLDFCGLLDVGFKGGRFTWTRGSNKKDMTQERLDRFRVNQKLLDNVHLIKVLHLNYHVSDHRPILAEICFEAPAKNSHRKDRLLKFEG
GWTKFEEAKDIINNAWREVRGSEASRFIQRSSICMNRLHAWNKARLKGSLRAAIDKKDKEIKELECVVGGSTEESRMVAELELESLLEEEEMWFHSKATSRRKRNCIEGL
FNDRGVWVDGEENLGAVAWDYFHQIFQASPPNSGATSRILDSVDSRLSADQVRDLDKPFTAAEIEVAIRGMNPTKAPGFDGFHALFFQKYWSVVGGETIRTCLNFLNEGG
EIKSINKTVISLIPKTQNPVKMTDLRPISSCSVVYKIIAKVLANKLRRVLKSIISPFQSAFIPGRQISDNVLIGFECINSIHSRSKGKEGWVALKLDMSKAYDRVEWAFL
KAMMDKMGLSQAWISKVMACVESIEYSVLVNGLIGNAITPSRGIKQGDPMSPYLFLICAEGLSRLLIREESKLSFKSLRINSHCPPISHLFFADDSLIFCRASEKDCETI
KKIFKDYEAASGQSINLDKSMMMSSKNVGKEKAKALSITLGVQLTDSFGHYLGMPSQNGRNKNKLFCKVRDRVWKALRGWKGNIFSAGGKEILIKAIAQGIPTYTMACFR
LPKSICEEINKAYARFWWGSSNEKRKIHYMNWKKLCKSKEQGGMGFRDISLFNQAMLAKQSWRIVRNPESLLSRVLRDKYFAKDDFMKVSLGSNPSLMLNEDPWLGSDGL
RFPSFVPEDLKGRKVRELMLDNGGGMKD