; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039486 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039486
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:44721541..44724928
RNA-Seq ExpressionLag0039486
SyntenyLag0039486
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CDH30699.1 putative Ty1-copia-like retrotransposon [Cercis chinensis]3.4e-7940.5Show/hide
Query:  TSSNPFSTLTPPFG----IKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPP----KFLDTAQTQ-----VSPHYLIWQKFNRTLMGWIYSSLNEDKLG
        T +   ++  P FG    IKL  +N+L+W+NQLLN ++A   E  + GT + P       DTA T      ++P Y++WQ+ NR +M WIYSSL E  + 
Subjt:  TSSNPFSTLTPPFG----IKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPP----KFLDTAQTQ-----VSPHYLIWQKFNRTLMGWIYSSLNEDKLG

Query:  EIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLAD
        +I+  N+A EIW  LR  Y S S ARIM LR Q+Q  RK GL+V  Y+ +I+ I D   AIGE +S  D +  +L GLG++YNP V SI +R +  S+  
Subjt:  EIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLAD

Query:  VLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNF-SGQSVLGRPQSSPRPPRWPSSNSSRPQCQI
        + S L  YE RLE Q++V+    +QAN A  + SN  +R P   N      P      F+   S Q  F  G S  GR           ++  SRPQCQI
Subjt:  VLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNF-SGQSVLGRPQSSPRPPRWPSSNSSRPQCQI

Query:  CNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNG
        C K GH A  C++R +  +Q P + HS     NQFS ++   Q L +          P       W++D GATHH+T D+ NL    P+ G ++V+VGNG
Subjt:  CNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNG

Query:  KSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKL
        K + VLH G S+I +S   + LK+VLH P I+ NL+S+ KLC +N A+VEF+PS+F VKD  +++ILL G L+ GLY +
Subjt:  KSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKL

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]1.8e-8537.98Show/hide
Query:  SSTQPSIPQFYPPRYNPSVTAGFQFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPP
        SST P+ P   PP  NP  ++    P          PQ   + PL + P        ++  P  +KL D NY++WK QLLN +IA  +E F++G+   PP
Subjt:  SSTQPSIPQFYPPRYNPSVTAGFQFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPP

Query:  KFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIG
        +FLD  Q Q +P +  WQ++NR +M WIY+S+NE  LG+I+G  +A +IWE L  +Y + S A +  LR+ +Q I+K+GLT   Y+ + + + +  ++IG
Subjt:  KFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIG

Query:  EPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFP
        EP++Y DHL Y L GLG DYNPFVTSI ++  +PS+ +  S                                ++ R P+  NPS  S P+   +++S P
Subjt:  EPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFP

Query:  QSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPH
        +    N         P  SP     PSS   RP+CQIC K GHTA  CY+  N  YQ PP    P   FN +   +P S   SS  P     S P +   
Subjt:  QSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPH

Query:  EAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLS
         +WYMD GA+HH TPD+N L  ++PY G +QV VGNGK+ P+ ++G +++ + +  ++L  V HSP ++ NL+S+++LC +N AF+EF+P+FFLVK  ++
Subjt:  EAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLS

Query:  KSILLTGQLEGGLYKL
        K +LL GQL+ GLY++
Subjt:  KSILLTGQLEGGLYKL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.7e-8438.95Show/hide
Query:  PSVPLFSSPQTSSN-------------PFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFIN-GTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGW
        P  P  +S  T++N             P  +L+    IKL ++N LL K+QLLN IIA  +E FI+    +PPK+LD A  QV+P ++ W + N+ +M W
Subjt:  PSVPLFSSPQTSSN-------------PFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFIN-GTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGW

Query:  IYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSI
        IYSSL    +G+I+  +TA +IW  L   YES S A +M L SQ+Q+I+K  + +S+YL+++K + D+F+ IGEPLSY D L  ILEGL  +Y+ FVTSI
Subjt:  IYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSI

Query:  HNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPS
        HNR+++PSL +V SLL  YE RL ++S   +LN  QAN                                                         PR P 
Subjt:  HNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPS

Query:  SNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQT-SPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPY
         N+S PQCQIC K GH AL  Y+R N  Y  P   ++  F  N   QT SP+S  L++++  T  +SQ   +   +WYMD GATHH TP+  ++  +  Y
Subjt:  SNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQT-SPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPY

Query:  YGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRVKLTAS
           +  +VGN K I + H+G + + SS  PI L  VLH+P IS  LIS+ +L  +N+AFVEF+P+FFLVKD  +K +LL G LE GLYKL+ +   LT+S
Subjt:  YGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRVKLTAS

Query:  RRCDRKRPDATVFPY-SERAYKRQRRDAVLTASR
               P    FP    +A+  Q+   VL  +R
Subjt:  RRCDRKRPDATVFPY-SERAYKRQRRDAVLTASR

RVW78327.1 7-deoxyloganetin glucosyltransferase [Vitis vinifera]4.8e-7837.68Show/hide
Query:  LFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCN
        + SS  T +   S L  P  IKL   N++LW  Q+ N I A   E  I G  P PPK + T   +V+P +L+W++++R ++ WIYSSL  + +G+I+G  
Subjt:  LFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCN

Query:  TAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLL
        T++E W  L+  + + + AR M LR   Q  +K  LT+ +Y+ ++K I+D  +AIGEP+   D +  +L GLG +YNP V S+  R +   L  V S+LL
Subjt:  TAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLL

Query:  AYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP---RPPRWPSSNS---SRPQCQIC
         +E RL  Q+T    +L+   +AN    N  RRS            S  PS    PQSF P     ++    Q  P   RP R+ ++ S   +RPQCQ+C
Subjt:  AYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP---RPPRWPSSNS---SRPQCQIC

Query:  NKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGK
         KFGH  L+CY+R +  YQ P             + TS  SQ +  T P     + P +   ++W++D GATHH++    N+   TPY G++ V+VGNGK
Subjt:  NKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGK

Query:  SIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRV
        S+P+  VG S + + + P  L +VLH P ++ NLIS++K C +N   +EFHPS F VKD  +K  LL GQLE GLYK     +
Subjt:  SIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRV

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.1e-10150.95Show/hide
Query:  QFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRT
        QFPP + + L   P  NP          S+NPF TL  P  +KL+D+N+LLWKNQLLN +IA  +  +++GT   PP+FLD  Q Q +P Y  W+++NR 
Subjt:  QFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRT

Query:  LMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPF
        LM WIYSSL+E+K+GE++   T ++IW  L  VY+S +TARIMGL++++Q +RKDG +VSQYLA+IK+IADKF+A+GEPLSY DHL ++L+GLG++YN F
Subjt:  LMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPF

Query:  VTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRP---PNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP
        VTSIHNR + PSL DV SLLLAYEARL+KQ+TVD LN+ QANL NLS+ +N +R P     PN  K S P+ P S              QS+LG+PQS  
Subjt:  VTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRP---PNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP

Query:  RPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNL
        + P  P  +SS+ QCQIC K GH+A  CY+R N AY       SPQ  ++         QP S T P +    Q   HP E+W+MD GATHHMTPD + L
Subjt:  RPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNL

Query:  QQSTPYYGSEQVVVGNGKSI
           TPY G EQV VGNG S+
Subjt:  QQSTPYYGSEQVVVGNGKSI

TrEMBL top hitse value%identityAlignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE13.7e-8438.95Show/hide
Query:  PSVPLFSSPQTSSN-------------PFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFIN-GTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGW
        P  P  +S  T++N             P  +L+    IKL ++N LL K+QLLN IIA  +E FI+    +PPK+LD A  QV+P ++ W + N+ +M W
Subjt:  PSVPLFSSPQTSSN-------------PFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFIN-GTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGW

Query:  IYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSI
        IYSSL    +G+I+  +TA +IW  L   YES S A +M L SQ+Q+I+K  + +S+YL+++K + D+F+ IGEPLSY D L  ILEGL  +Y+ FVTSI
Subjt:  IYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSI

Query:  HNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPS
        HNR+++PSL +V SLL  YE RL ++S   +LN  QAN                                                         PR P 
Subjt:  HNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPS

Query:  SNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQT-SPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPY
         N+S PQCQIC K GH AL  Y+R N  Y  P   ++  F  N   QT SP+S  L++++  T  +SQ   +   +WYMD GATHH TP+  ++  +  Y
Subjt:  SNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQT-SPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPY

Query:  YGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRVKLTAS
           +  +VGN K I + H+G + + SS  PI L  VLH+P IS  LIS+ +L  +N+AFVEF+P+FFLVKD  +K +LL G LE GLYKL+ +   LT+S
Subjt:  YGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRVKLTAS

Query:  RRCDRKRPDATVFPY-SERAYKRQRRDAVLTASR
               P    FP    +A+  Q+   VL  +R
Subjt:  RRCDRKRPDATVFPY-SERAYKRQRRDAVLTASR

A0A438H229 7-deoxyloganetin glucosyltransferase2.3e-7837.68Show/hide
Query:  LFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCN
        + SS  T +   S L  P  IKL   N++LW  Q+ N I A   E  I G  P PPK + T   +V+P +L+W++++R ++ WIYSSL  + +G+I+G  
Subjt:  LFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCN

Query:  TAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLL
        T++E W  L+  + + + AR M LR   Q  +K  LT+ +Y+ ++K I+D  +AIGEP+   D +  +L GLG +YNP V S+  R +   L  V S+LL
Subjt:  TAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLL

Query:  AYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP---RPPRWPSSNS---SRPQCQIC
         +E RL  Q+T    +L+   +AN    N  RRS            S  PS    PQSF P     ++    Q  P   RP R+ ++ S   +RPQCQ+C
Subjt:  AYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP---RPPRWPSSNS---SRPQCQIC

Query:  NKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGK
         KFGH  L+CY+R +  YQ P             + TS  SQ +  T P     + P +   ++W++D GATHH++    N+   TPY G++ V+VGNGK
Subjt:  NKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGK

Query:  SIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRV
        S+P+  VG S + + + P  L +VLH P ++ NLIS++K C +N   +EFHPS F VKD  +K  LL GQLE GLYK     +
Subjt:  SIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKLSVKRV

A0A6J1DQX7 uncharacterized protein LOC1110223151.5e-10150.95Show/hide
Query:  QFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRT
        QFPP + + L   P  NP          S+NPF TL  P  +KL+D+N+LLWKNQLLN +IA  +  +++GT   PP+FLD  Q Q +P Y  W+++NR 
Subjt:  QFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRT

Query:  LMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPF
        LM WIYSSL+E+K+GE++   T ++IW  L  VY+S +TARIMGL++++Q +RKDG +VSQYLA+IK+IADKF+A+GEPLSY DHL ++L+GLG++YN F
Subjt:  LMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPF

Query:  VTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRP---PNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP
        VTSIHNR + PSL DV SLLLAYEARL+KQ+TVD LN+ QANL NLS+ +N +R P     PN  K S P+ P S              QS+LG+PQS  
Subjt:  VTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRP---PNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSP

Query:  RPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNL
        + P  P  +SS+ QCQIC K GH+A  CY+R N AY       SPQ  ++         QP S T P +    Q   HP E+W+MD GATHHMTPD + L
Subjt:  RPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNL

Query:  QQSTPYYGSEQVVVGNGKSI
           TPY G EQV VGNG S+
Subjt:  QQSTPYYGSEQVVVGNGKSI

A0A7J0GPN0 UBX domain-containing protein8.9e-8637.98Show/hide
Query:  SSTQPSIPQFYPPRYNPSVTAGFQFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPP
        SST P+ P   PP  NP  ++    P          PQ   + PL + P        ++  P  +KL D NY++WK QLLN +IA  +E F++G+   PP
Subjt:  SSTQPSIPQFYPPRYNPSVTAGFQFPPQSTSSLPFFPQYNPSVPLFSSPQTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGT-PAPP

Query:  KFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIG
        +FLD  Q Q +P +  WQ++NR +M WIY+S+NE  LG+I+G  +A +IWE L  +Y + S A +  LR+ +Q I+K+GLT   Y+ + + + +  ++IG
Subjt:  KFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIG

Query:  EPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFP
        EP++Y DHL Y L GLG DYNPFVTSI ++  +PS+ +  S                                ++ R P+  NPS  S P+   +++S P
Subjt:  EPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFP

Query:  QSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPH
        +    N         P  SP     PSS   RP+CQIC K GHTA  CY+  N  YQ PP    P   FN +   +P S   SS  P     S P +   
Subjt:  QSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPH

Query:  EAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLS
         +WYMD GA+HH TPD+N L  ++PY G +QV VGNGK+ P+ ++G +++ + +  ++L  V HSP ++ NL+S+++LC +N AF+EF+P+FFLVK  ++
Subjt:  EAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLS

Query:  KSILLTGQLEGGLYKL
        K +LL GQL+ GLY++
Subjt:  KSILLTGQLEGGLYKL

U6EFK2 Putative Ty1-copia-like retrotransposon1.6e-7940.5Show/hide
Query:  TSSNPFSTLTPPFG----IKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPP----KFLDTAQTQ-----VSPHYLIWQKFNRTLMGWIYSSLNEDKLG
        T +   ++  P FG    IKL  +N+L+W+NQLLN ++A   E  + GT + P       DTA T      ++P Y++WQ+ NR +M WIYSSL E  + 
Subjt:  TSSNPFSTLTPPFG----IKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPP----KFLDTAQTQ-----VSPHYLIWQKFNRTLMGWIYSSLNEDKLG

Query:  EIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLAD
        +I+  N+A EIW  LR  Y S S ARIM LR Q+Q  RK GL+V  Y+ +I+ I D   AIGE +S  D +  +L GLG++YNP V SI +R +  S+  
Subjt:  EIIGCNTAYEIWEHLRIVYESLSTARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLAD

Query:  VLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNF-SGQSVLGRPQSSPRPPRWPSSNSSRPQCQI
        + S L  YE RLE Q++V+    +QAN A  + SN  +R P   N      P      F+   S Q  F  G S  GR           ++  SRPQCQI
Subjt:  VLSLLLAYEARLEKQSTVDSLNLVQANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNF-SGQSVLGRPQSSPRPPRWPSSNSSRPQCQI

Query:  CNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNG
        C K GH A  C++R +  +Q P + HS     NQFS ++   Q L +          P       W++D GATHH+T D+ NL    P+ G ++V+VGNG
Subjt:  CNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNG

Query:  KSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKL
        K + VLH G S+I +S   + LK+VLH P I+ NL+S+ KLC +N A+VEF+PS+F VKD  +++ILL G L+ GLY +
Subjt:  KSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYKL

SwissProt top hitse value%identityAlignment
F4I3Z5 Tetratricopeptide repeat protein SKI31.5e-2655.45Show/hide
Query:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY
        + ++SL  GL S   +DF +AE++ AQACSL + + CLLLCHG  CMELA+Q   S FL +AV SL K Q  S+ P+P V  +LAQA GSLG KE WE  
Subjt:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY

Query:  LRFEWFSWPP
        LR EWF WPP
Subjt:  LRFEWFSWPP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.6e-4528.32Show/hide
Query:  KLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDT-AQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTAR
        KL+ +NYL+W  Q+      +++  F++G T  PP  + T A  +V+P Y  W++ ++ +   +  +++      +    TA +IWE LR +Y + S   
Subjt:  KLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDT-AQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTAR

Query:  IMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLV--
        +  LR+Q+++  K   T+  Y+  +    D+ + +G+P+ + + +  +LE L  +Y P +  I  +   P+L ++   LL +E+++   S+   + +   
Subjt:  IMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLV--

Query:  ---QANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRP---QCQICNKFGHTALACYNRHNPAY
             N    + +NN  R+ R  N +  +  SKP       Q    NF                  P++N S+P   +CQIC   GH+A  C        
Subjt:  ---QANLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRP---QCQICNKFGHTALACYNRHNPAY

Query:  QAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFP
                 Q + +  +   P S P +   P  N     P +    W +D GATHH+T D NNL    PY G + V+V +G +IP+ H GS+++S+ S P
Subjt:  QAPPSGHSPQFYFNQFSQTSPLSQPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFP

Query:  IQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYK
        + L ++L+ P+I  NLIS+ +LC  N   VEF P+ F VKDL +   LL G+ +  LY+
Subjt:  IQLKSVLHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-4228.26Show/hide
Query:  KLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDT-AQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTAR
        KL+ +NYL+W  Q+      +++  F++G TP PP  + T A  +V+P Y  W++ ++ +   I  +++      +    TA +IWE LR +Y + S   
Subjt:  KLSDSNYLLWKNQLLNHIIAFDMESFING-TPAPPKFLDT-AQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTAR

Query:  IMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQA
        +  LR               ++ +     D+ + +G+P+ + + +  +LE L  DY P +  I  +   PSL ++   L+  E++L   ++ + + +   
Subjt:  IMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQA

Query:  NLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHS
         + + + + N  ++ R  N +  +  ++         S+QP+ SG       +S  R P+         +CQIC+  GH+A  C           P  H 
Subjt:  NLANLSVSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHS

Query:  PQFYFNQFSQTSPLS--QPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSV
         Q   NQ   TSP +  QP ++ + ++  N+         W +D GATHH+T D NNL    PY G + V++ +G +IP+ H GS+++ +SS  + L  V
Subjt:  PQFYFNQFSQTSPLS--QPLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSV

Query:  LHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYK
        L+ P+I  NLIS+ +LC  NR  VEF P+ F VKDL +   LL G+ +  LY+
Subjt:  LHSPHISHNLISIAKLCLENRAFVEFHPSFFLVKDLLSKSILLTGQLEGGLYK

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.6e-1027.42Show/hide
Query:  DSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGL
        + NY+ WK +  + +       FI+GT P P  F        SP Y  W++ N  +M W+ +S+ +  L  ++   TA+++WE LR V+      +I  L
Subjt:  DSNYLLWKNQLLNHIIAFDMESFINGT-PAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESLSTARIMGL

Query:  RSQVQKIRKDGLTVSQYLAQIKDI
        R ++  +R+ G +V +Y  ++  +
Subjt:  RSQVQKIRKDGLTVSQYLAQIKDI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.2e-1324.08Show/hide
Query:  PFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKL-GEIIGCNTAYEIWEHLRIVYESLS
        P  + + +SNY  W+   L H ++FD+   I+GT  P    D          + WQK +  +   +Y +L   +  G  +  +T+ +IW  ++  + +  
Subjt:  PFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKL-GEIIGCNTAYEIWEHLRIVYESLS

Query:  TARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEK
         AR + L S+++      + V+ Y  ++K +AD    +  P++  + + Y+L GL   ++  +  I +R   PS  D  ++L   E RL++
Subjt:  TARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEK

AT1G76630.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-2755.45Show/hide
Query:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY
        + ++SL  GL S   +DF +AE++ AQACSL + + CLLLCHG  CMELA+Q   S FL +AV SL K Q  S+ P+P V  +LAQA GSLG KE WE  
Subjt:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY

Query:  LRFEWFSWPP
        LR EWF WPP
Subjt:  LRFEWFSWPP

AT1G76630.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-2755.45Show/hide
Query:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY
        + ++SL  GL S   +DF +AE++ AQACSL + + CLLLCHG  CMELA+Q   S FL +AV SL K Q  S+ P+P V  +LAQA GSLG KE WE  
Subjt:  LPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESY

Query:  LRFEWFSWPP
        LR EWF WPP
Subjt:  LRFEWFSWPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACTACCCATGTTTAGTCTGGTGGATGGTTTGATATCTTTTTGGAGCCAAGATTTTATGGCTGCTGAGAAATATTTTGCACAAGCATGTTCTTTGGGACATGATGA
TGTCTGTCTCCTCCTCTGTCATGGTGTAGCCTGCATGGAACTTGCAAAGCAGCTTTGCAGTTCTCATTTCTTGATGATGGCTGTGAACAGTCTCCTTAAAGCTCAAGTAA
TTTCTGTTGTTCCAATACCAACTGTCTCGATCATACTGGCTCAAGCAGAAGGGAGCCTTGGTTTGAAAGAAAATTGGGAGTCATATCTTCGTTTTGAATGGTTCTCGTGG
CCCCCAGGTTTATCTAGCACCTCGTCAACTGCCGATGATCGATTGTTGGTTCTTCCTCTTGTCTCTTCCTCTGTAATCACCACTTCGGTGAGCTCACCAATTTCTTCCCA
ACGTGTTCAACAAAATCCCCTCTCCACTGGTACCCATACTCGCCCATTAAATCCTACCGTCCCTCCTTTCCAATCTTCTACTCAGCCCTCGATTCCTCAGTTTTATCCCC
CACGATACAATCCCTCGGTTACGGCTGGTTTTCAGTTTCCCCCGCAATCCACCTCTTCTCTGCCCTTCTTTCCTCAATATAATCCCTCTGTTCCACTTTTTTCATCTCCT
CAGACATCGTCCAACCCTTTCTCTACCCTAACCCCACCCTTTGGTATCAAACTTTCTGATTCGAACTACTTGCTTTGGAAGAACCAGCTTCTCAATCACATAATTGCTTT
CGACATGGAGAGCTTTATCAACGGCACTCCAGCCCCGCCAAAATTCTTGGACACTGCTCAAACTCAAGTAAGTCCTCACTACCTTATTTGGCAGAAGTTTAATCGCACTC
TCATGGGATGGATTTACTCTTCCTTAAATGAAGATAAATTAGGGGAAATTATTGGTTGTAATACTGCTTATGAAATATGGGAGCATCTGCGTATTGTGTATGAATCTTTA
TCGACTGCTAGGATAATGGGCCTTAGGTCTCAGGTACAGAAAATCAGAAAGGATGGCCTCACCGTATCTCAATATCTTGCTCAGATAAAGGACATTGCCGATAAGTTTTC
AGCTATAGGGGAACCACTCTCATACTGTGACCATTTGGGGTATATTTTAGAAGGCTTAGGAACCGATTACAATCCCTTTGTCACCTCTATTCACAACAGAACAGAAAAAC
CTTCCTTGGCCGATGTCCTCAGTTTGTTGCTTGCTTACGAGGCACGTCTTGAGAAACAGTCGACTGTTGATAGCCTTAACCTTGTGCAAGCCAATCTTGCGAATTTATCT
GTGTCAAACAATGTCAGGCGTTCCCCTCGCCCTCCGAATCCCTCAAAGCAATCCATCCCCTCTAAACCTCCTTCCACATTTTCCTTTCCCCAATCCTTTCAACCTAATTT
TTCAGGTCAGAGTGTCCTTGGTCGCCCACAGTCCTCCCCACGTCCACCTCGCTGGCCTTCCTCAAATTCATCACGCCCACAATGCCAAATATGTAACAAATTTGGGCATA
CTGCCCTTGCTTGTTATAATCGACATAATCCAGCTTATCAAGCTCCTCCTTCTGGTCATTCCCCTCAATTTTATTTTAACCAGTTTTCCCAAACTTCACCTCTCTCCCAA
CCTCTGTCATCTACCTCTCCTGATACAAACACCAATTCTCAGCCCCCAACTCACCCGCATGAGGCTTGGTATATGGATTTCGGAGCAACTCACCATATGACTCCGGATAT
CAACAATCTTCAGCAGTCAACCCCCTACTATGGTAGCGAGCAAGTTGTTGTTGGCAACGGTAAGTCCATTCCAGTCCTTCATGTTGGGTCCTCTACTATTTCATCGTCTT
CTTTTCCAATTCAGTTGAAGTCTGTATTGCATTCTCCTCATATATCCCATAATCTTATTAGCATTGCTAAGTTGTGTCTTGAAAATAGGGCTTTTGTTGAGTTTCATCCA
TCCTTTTTTCTTGTGAAGGATCTTCTATCCAAGAGCATTCTACTCACGGGTCAGCTTGAAGGGGGCCTTTATAAGCTTTCAGTCAAGCGCGTAAAGCTCACAGCGTCGAG
ACGCTGTGATAGGAAGCGTCCTGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGCAACGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTACTACCCATGTTTAGTCTGGTGGATGGTTTGATATCTTTTTGGAGCCAAGATTTTATGGCTGCTGAGAAATATTTTGCACAAGCATGTTCTTTGGGACATGATGA
TGTCTGTCTCCTCCTCTGTCATGGTGTAGCCTGCATGGAACTTGCAAAGCAGCTTTGCAGTTCTCATTTCTTGATGATGGCTGTGAACAGTCTCCTTAAAGCTCAAGTAA
TTTCTGTTGTTCCAATACCAACTGTCTCGATCATACTGGCTCAAGCAGAAGGGAGCCTTGGTTTGAAAGAAAATTGGGAGTCATATCTTCGTTTTGAATGGTTCTCGTGG
CCCCCAGGTTTATCTAGCACCTCGTCAACTGCCGATGATCGATTGTTGGTTCTTCCTCTTGTCTCTTCCTCTGTAATCACCACTTCGGTGAGCTCACCAATTTCTTCCCA
ACGTGTTCAACAAAATCCCCTCTCCACTGGTACCCATACTCGCCCATTAAATCCTACCGTCCCTCCTTTCCAATCTTCTACTCAGCCCTCGATTCCTCAGTTTTATCCCC
CACGATACAATCCCTCGGTTACGGCTGGTTTTCAGTTTCCCCCGCAATCCACCTCTTCTCTGCCCTTCTTTCCTCAATATAATCCCTCTGTTCCACTTTTTTCATCTCCT
CAGACATCGTCCAACCCTTTCTCTACCCTAACCCCACCCTTTGGTATCAAACTTTCTGATTCGAACTACTTGCTTTGGAAGAACCAGCTTCTCAATCACATAATTGCTTT
CGACATGGAGAGCTTTATCAACGGCACTCCAGCCCCGCCAAAATTCTTGGACACTGCTCAAACTCAAGTAAGTCCTCACTACCTTATTTGGCAGAAGTTTAATCGCACTC
TCATGGGATGGATTTACTCTTCCTTAAATGAAGATAAATTAGGGGAAATTATTGGTTGTAATACTGCTTATGAAATATGGGAGCATCTGCGTATTGTGTATGAATCTTTA
TCGACTGCTAGGATAATGGGCCTTAGGTCTCAGGTACAGAAAATCAGAAAGGATGGCCTCACCGTATCTCAATATCTTGCTCAGATAAAGGACATTGCCGATAAGTTTTC
AGCTATAGGGGAACCACTCTCATACTGTGACCATTTGGGGTATATTTTAGAAGGCTTAGGAACCGATTACAATCCCTTTGTCACCTCTATTCACAACAGAACAGAAAAAC
CTTCCTTGGCCGATGTCCTCAGTTTGTTGCTTGCTTACGAGGCACGTCTTGAGAAACAGTCGACTGTTGATAGCCTTAACCTTGTGCAAGCCAATCTTGCGAATTTATCT
GTGTCAAACAATGTCAGGCGTTCCCCTCGCCCTCCGAATCCCTCAAAGCAATCCATCCCCTCTAAACCTCCTTCCACATTTTCCTTTCCCCAATCCTTTCAACCTAATTT
TTCAGGTCAGAGTGTCCTTGGTCGCCCACAGTCCTCCCCACGTCCACCTCGCTGGCCTTCCTCAAATTCATCACGCCCACAATGCCAAATATGTAACAAATTTGGGCATA
CTGCCCTTGCTTGTTATAATCGACATAATCCAGCTTATCAAGCTCCTCCTTCTGGTCATTCCCCTCAATTTTATTTTAACCAGTTTTCCCAAACTTCACCTCTCTCCCAA
CCTCTGTCATCTACCTCTCCTGATACAAACACCAATTCTCAGCCCCCAACTCACCCGCATGAGGCTTGGTATATGGATTTCGGAGCAACTCACCATATGACTCCGGATAT
CAACAATCTTCAGCAGTCAACCCCCTACTATGGTAGCGAGCAAGTTGTTGTTGGCAACGGTAAGTCCATTCCAGTCCTTCATGTTGGGTCCTCTACTATTTCATCGTCTT
CTTTTCCAATTCAGTTGAAGTCTGTATTGCATTCTCCTCATATATCCCATAATCTTATTAGCATTGCTAAGTTGTGTCTTGAAAATAGGGCTTTTGTTGAGTTTCATCCA
TCCTTTTTTCTTGTGAAGGATCTTCTATCCAAGAGCATTCTACTCACGGGTCAGCTTGAAGGGGGCCTTTATAAGCTTTCAGTCAAGCGCGTAAAGCTCACAGCGTCGAG
ACGCTGTGATAGGAAGCGTCCTGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGCAACGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTAA
Protein sequenceShow/hide protein sequence
MLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDVCLLLCHGVACMELAKQLCSSHFLMMAVNSLLKAQVISVVPIPTVSIILAQAEGSLGLKENWESYLRFEWFSW
PPGLSSTSSTADDRLLVLPLVSSSVITTSVSSPISSQRVQQNPLSTGTHTRPLNPTVPPFQSSTQPSIPQFYPPRYNPSVTAGFQFPPQSTSSLPFFPQYNPSVPLFSSP
QTSSNPFSTLTPPFGIKLSDSNYLLWKNQLLNHIIAFDMESFINGTPAPPKFLDTAQTQVSPHYLIWQKFNRTLMGWIYSSLNEDKLGEIIGCNTAYEIWEHLRIVYESL
STARIMGLRSQVQKIRKDGLTVSQYLAQIKDIADKFSAIGEPLSYCDHLGYILEGLGTDYNPFVTSIHNRTEKPSLADVLSLLLAYEARLEKQSTVDSLNLVQANLANLS
VSNNVRRSPRPPNPSKQSIPSKPPSTFSFPQSFQPNFSGQSVLGRPQSSPRPPRWPSSNSSRPQCQICNKFGHTALACYNRHNPAYQAPPSGHSPQFYFNQFSQTSPLSQ
PLSSTSPDTNTNSQPPTHPHEAWYMDFGATHHMTPDINNLQQSTPYYGSEQVVVGNGKSIPVLHVGSSTISSSSFPIQLKSVLHSPHISHNLISIAKLCLENRAFVEFHP
SFFLVKDLLSKSILLTGQLEGGLYKLSVKRVKLTASRRCDRKRPDATVFPYSERAYKRQRRDAVLTASRR