CuGenDBv2

Lag0005706 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005706
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr6:26733533..26735403
RNA-Seq Expression: Lag0005706
Synteny: Lag0005706
Gene Ontology terms: NA
InterPro domains: NA
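The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; the format is inferred from
    the 'Genome location' field on this page.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError("unrecognised location string: %r" % loc)
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr6:26733533..26735403")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1871 bp on chr6.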


Homology
GenBank top hits (e value, % identity, alignment)
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera]; e value: 3.7e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

RVW27268.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 3.7e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e value: 1.0e-81; % identity: 40.16
Query:  NNPQQYFPSAPSSTF--PTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGE
        NN Q   P     T   P+L+  L+IKL++ N LL K+QLLN IIA  +E FI+ D  +P KYLD    QVNP+F+ W + N+++MSW+YSSL    +G+
Subjt:  NNPQQYFPSAPSSTF--PTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGE

Query:  IIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADV
        I+  +TA DIW  L   YES S A +M+L SQLQ+I+K ++ +S+YL+++K + D+F  IGEPLSYRD L  ILEGL  EY+  VTSI NR+DRPSL +V
Subjt:  IIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADV

Query:  CSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILG
         SLL  Y+ RL ++S   NLN  QAN              PR  G      N +P+          S         R +    P  +P+     P    G
Subjt:  CSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILG

Query:  PPPQAMFNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSI
        P       Q S P S  ++   +S  PT  S+Q       D++W+MDSGATHH T +   +  +  Y   +  ++GN   I + ++G + L S +  + I
Subjt:  PPPQAMFNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSI

Query:  KFKSVLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPA----------AFVSSVQDINLWNQR
            VLH P IS  LIS+ +L  DN  FVEF+ + FL+KD  +K +LLQGHL+ GLYKL++P   + SS SS P+          AF+S    + LW+ R
Subjt:  KFKSVLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPA----------AFVSSVQDINLWNQR

Query:  LGHPAFPIVQHVL
        LGHPA  +V  VL
Subjt:  LGHPAFPIVQHVL

RVW95765.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 4.9e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e value: 8.9e-86; % identity: 48.29
Query:  QYPPQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILM
        Q+PP   PT  F     NP   F + P   FPTL  PLN+KLNDNN+LLWKNQLLN +IA  +  +++ T   P ++LD  Q Q NP +  W++YNR+LM
Subjt:  QYPPQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILM

Query:  SWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVT
         W+YSSL+E+K+GE++   T +DIW  L  VY+S +TARIM L+++LQ +RKD  SVSQYLA+IK+IADKF A+GEPLSYRDHLA++L+GLGSEYNA VT
Subjt:  SWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVT

Query:  SIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPN-FTAPFPFS--SNTSGPGVLGRPHAPSR
        SI NR D PSL DV SLLLAY++RL+KQ++VD LN+ QANL NLS  +++ + PP+FS            PN +   FP S  S      +LG+P +   
Subjt:  SIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPN-FTAPFPFS--SNTSGPGVLGRPHAPSR

Query:  PSKWPSN-NQQRPQC----ILGPPPQAMFNQFSQPFSQSISAPQS-----STLPTDNSA--QFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNE
          KWP   +  + QC     LG      +++ +  +    ++PQ+        PT  S+  +FQ P   D++WFMDSGATHHMT D + L   TPY G E
Subjt:  PSKWPSN-NQQRPQC----ILGPPPQAMFNQFSQPFSQSISAPQS-----STLPTDNSA--QFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNE

Query:  QVIIGNGTSI
        QV +GNG+S+
Subjt:  QVIIGNGTSI

TrEMBL top hits (e value, % identity, alignment)
A0A438CVN7 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.8e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1; e value: 4.9e-82; % identity: 40.16
Query:  NNPQQYFPSAPSSTF--PTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGE
        NN Q   P     T   P+L+  L+IKL++ N LL K+QLLN IIA  +E FI+ D  +P KYLD    QVNP+F+ W + N+++MSW+YSSL    +G+
Subjt:  NNPQQYFPSAPSSTF--PTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGE

Query:  IIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADV
        I+  +TA DIW  L   YES S A +M+L SQLQ+I+K ++ +S+YL+++K + D+F  IGEPLSYRD L  ILEGL  EY+  VTSI NR+DRPSL +V
Subjt:  IIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADV

Query:  CSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILG
         SLL  Y+ RL ++S   NLN  QAN              PR  G      N +P+          S         R +    P  +P+     P    G
Subjt:  CSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILG

Query:  PPPQAMFNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSI
        P       Q S P S  ++   +S  PT  S+Q       D++W+MDSGATHH T +   +  +  Y   +  ++GN   I + ++G + L S +  + I
Subjt:  PPPQAMFNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSI

Query:  KFKSVLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPA----------AFVSSVQDINLWNQR
            VLH P IS  LIS+ +L  DN  FVEF+ + FL+KD  +K +LLQGHL+ GLYKL++P   + SS SS P+          AF+S    + LW+ R
Subjt:  KFKSVLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPA----------AFVSSVQDINLWNQR

Query:  LGHPAFPIVQHVL
        LGHPA  +V  VL
Subjt:  LGHPAFPIVQHVL

A0A438IG92 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.4e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

A0A6J1DQX7 uncharacterized protein LOC111022315; e value: 4.3e-86; % identity: 48.29
Query:  QYPPQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILM
        Q+PP   PT  F     NP   F + P   FPTL  PLN+KLNDNN+LLWKNQLLN +IA  +  +++ T   P ++LD  Q Q NP +  W++YNR+LM
Subjt:  QYPPQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILM

Query:  SWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVT
         W+YSSL+E+K+GE++   T +DIW  L  VY+S +TARIM L+++LQ +RKD  SVSQYLA+IK+IADKF A+GEPLSYRDHLA++L+GLGSEYNA VT
Subjt:  SWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVT

Query:  SIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPN-FTAPFPFS--SNTSGPGVLGRPHAPSR
        SI NR D PSL DV SLLLAY++RL+KQ++VD LN+ QANL NLS  +++ + PP+FS            PN +   FP S  S      +LG+P +   
Subjt:  SIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPN-FTAPFPFS--SNTSGPGVLGRPHAPSR

Query:  PSKWPSN-NQQRPQC----ILGPPPQAMFNQFSQPFSQSISAPQS-----STLPTDNSA--QFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNE
          KWP   +  + QC     LG      +++ +  +    ++PQ+        PT  S+  +FQ P   D++WFMDSGATHHMT D + L   TPY G E
Subjt:  PSKWPSN-NQQRPQC----ILGPPPQAMFNQFSQPFSQSISAPQS-----STLPTDNSA--QFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNE

Query:  QVIIGNGTSI
        QV +GNG+S+
Subjt:  QVIIGNGTSI

A5BFR8 Integrase catalytic domain-containing protein; e value: 1.8e-76; % identity: 37.1
Query:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR
        ++  L   L +KL+  NY+LW++Q+ N I A   E FI+ T   P K L P    +NP F+ W++ +R ++SW+YSSL    + +IIG  T++  W  L 
Subjt:  TFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTP-APVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLR

Query:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS
         ++ SSS ARIM LR +LQ  +K ++S+  Y+ +IK  AD   AIGEP+S +D +  +L GLGS+YNA VT+I  R D+ SL  + S+LLA++ RLE+QS
Subjt:  IVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQS

Query:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS
        S++ ++   AN A+ S +   G+   +F+G +   ++P    N+T          G G  GR     R +  PS   ++PQC L    G   Q  +++F 
Subjt:  SVDNLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCIL----GPPPQAMFNQFS

Query:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS
          F       S S++    + +P   ++    P+  D++W++DSGA+HH+T ++  L  ++PY G ++V IGNG  + + N+GS  L S   + S + K 
Subjt:  QPF-------SQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKS

Query:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV
        V H P IS NLIS+AK C +NN  +EFH++ F +KDL +K +L QG L++GLYK     +  P  S+++ S+  + F S+V++   LW+ RLGH +F IV
Subjt:  VLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKL----SKPPNRSLSSFSSQPAAFVSSVQD-INLWNQRLGHPAFPIV

Query:  QHVL
          V+
Subjt:  QHVL

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 7.9e-45; % identity: 27.77
Query:  TSPLNI------KLNDNNYLLWKNQLLNHIIAFDIEVFI--NDTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEH
        TS LN+      KL   NYL+W  Q+      +++  F+  + T  P         +VNP +  W++ ++++ S +  +++      +    TA  IWE 
Subjt:  TSPLNI------KLNDNNYLLWKNQLLNHIIAFDIEVFI--NDTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEH

Query:  LRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEK
        LR +Y + S   +  LR+QL++  K   ++  Y+  +    D+   +G+P+ + + +  +LE L  EY   +  I  +   P+L ++   LL ++S++  
Subjt:  LRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEK

Query:  QSSVD----NLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPH---APSRPSKWPSNNQQRPQCILGPPPQAM
         SS        N +       + +N+NG R  R+  N+    N  P    +  F  ++N S P  LG+          +K  S  Q     +    P + 
Subjt:  QSSVD----NLNLIQANLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPH---APSRPSKWPSNNQQRPQCILGPPPQAM

Query:  FNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKSVL
        F  +    + ++ +P SS                 + W +DSGATHH+TSD N L    PY G + V++ +G++IP+ + GS+SL +   S+ +   ++L
Subjt:  FNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKSVL

Query:  HAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPAAFVSSVQDINLWNQRLGHPAFPIVQHVL
        + P+I  NLIS+ +LC  N V VEF  + F +KDL +   LLQG  KD LY+     ++ +S F+S      SS    + W+ RLGHPA  I+  V+
Subjt:  HAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPAAFVSSVQDINLWNQRLGHPAFPIVQHVL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.7e-37; % identity: 28.39
Query:  KLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYL-DPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTAR
        KL   NYL+W  Q+      +++  F++  TP P   +      +VNP +  W++ ++++ S +  +++      +    TA  IWE LR +Y + S   
Subjt:  KLNDNNYLLWKNQLLNHIIAFDIEVFIN-DTPAPVKYL-DPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTAR

Query:  IMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQA
        +  LR               ++ +     D+   +G+P+ + + +  +LE L  +Y   +  I  +   PSL ++   L+  +S+L   +S + +  I A
Subjt:  IMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQA

Query:  NLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILGPPPQAMFNQFSQPFSQSIS-APQSS
        N+     +N+N  +  R  G+ R   N   R N     P SS +       +P+   R            +C     PQ   +QF    +Q  S +P + 
Subjt:  NLANLSTSNSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILGPPPQAMFNQFSQPFSQSIS-APQSS

Query:  TLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKSVLHAPHISHNLISIAKLCLD
          P  N A    P   ++ W +DSGATHH+TSD N L    PY G + V+I +G++IP+ + GS+SL +   S+S+    VL+ P+I  NLIS+ +LC  
Subjt:  TLPTDNSAQFQQPSLLDDAWFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKSVLHAPHISHNLISIAKLCLD

Query:  NNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPAAFVSSVQDINLWNQRLGHPAFPIVQHVL
        N V VEF  + F +KDL +   LLQG  KD LY+     ++++S F+S  +    S      W+ RLGHP+  I+  V+
Subjt:  NNVFVEFHASHFLLKDLLSKTILLQGHLKDGLYKLSKPPNRSLSSFSSQPAAFVSSVQDINLWNQRLGHPAFPIVQHVL

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). e value: 2.3e-07; % identity: 25.61
Query:  PQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILMSWL
        P S P  P+         Y P  P    P+  S   +  +++NY+ WK +  + +       FI+ T P P    DP     +P +  W++ N ++M WL
Subjt:  PQSSPTLPFFPAFNNPQQYFPSAPSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDT-PAPVKYLDPVQTQVNPQFLIWQKYNRILMSWL

Query:  YSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDI
         +S+ +  +  ++   TA+ +WE LR V+      +I  LR +L  +R+   SV +Y  ++  +
Subjt:  YSSLNEDKIGEIIGCATAYDIWEHLRIVYESSSTARIMALRSQLQKIRKDNLSVSQYLAQIKDI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value: 3.0e-15; % identity: 24.78
Query:  PLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKI-GEIIGCATAYDIWEHLRIVYESSS
        P+ + + ++NY  W+   L H ++FD+   I+ T  P           N   + WQK + I+   LY +L   +  G  +  +T+ DIW  ++  + ++ 
Subjt:  PLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKI-GEIIGCATAYDIWEHLRIVYESSS

Query:  TARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNL
         AR + L S+L+     ++ V+ Y  ++K +AD    +  P++ R+ + Y+L GL  +++  +  I++R   PS  D  ++L   + RL++       N 
Subjt:  TARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNL

Query:  IQANLANLSTSNSNGKRPP-----RFSGNQ
           + ++ ST  +  + PP     R  GNQ
Subjt:  IQANLANLSTSNSNGKRPP-----RFSGNQ
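In the BLAST-style alignments above, the unlabeled middle line marks each column: a letter means the two residues are identical, `+` means a conservative substitution, and a space means a mismatch or gap. The sketch below shows how a percent identity could be derived from such a midline. The function name is hypothetical, and the input is only the final four-column block of the first GenBank hit, so the result is a toy figure rather than the whole-alignment % identity reported for that hit.

```python
def identity_from_midline(query, midline):
    """Percent identity for one alignment block.

    query   -- the aligned query row (gap characters included)
    midline -- the BLAST-style match line for the same columns
    Hypothetical helper for illustration only.
    """
    # Scraped midlines often lose trailing spaces; pad to the block width.
    midline = midline.ljust(len(query))
    identical = sum(1 for c in midline if c.isalpha())
    return 100.0 * identical / len(query)

# Last block of the CAN61322.1 alignment: only the 'V' column is identical.
block_identity = identity_from_midline("QHVL", "  V+")
print(block_identity)
```

To reproduce a reported % identity, the identical columns and the alignment length would need to be summed over every block of the hit before dividing.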


Sequences
CDS sequence
ATGGCGTCGTCTTCAAGTTCCTCTTCTATCGATAGCACCACTTTACCACCAATAGCCTCTGTACGTTCATCTCCGATTCCCACCCCAATCATTTCCTCTACAGTTTCTTC
TTCCCCAACACCCTCTGTTTTGCCTCAGCGATTTCAGCCAAATACTCCAACCAGACCCCTCAATGCTAATGTTGCTCCGTTTCAAACCCTCCCTCAACAATCCACTTTCC
CTCATAACTCTTTCGCCCCTCCTACTGGTTTTCAGTATCCGCCGCAATCATCCCCCACCCTACCTTTCTTTCCAGCCTTCAACAATCCTCAACAATATTTTCCTTCGGCT
CCTTCTTCTACTTTCCCAACTTTGACGTCTCCGCTAAATATCAAGTTGAACGACAACAACTACTTGTTGTGGAAAAATCAGCTGCTTAATCATATCATTGCCTTTGATAT
AGAAGTCTTCATCAACGATACTCCTGCTCCGGTGAAATATTTGGATCCCGTTCAAACACAGGTAAACCCTCAGTTTTTAATTTGGCAAAAGTATAACAGAATCCTTATGA
GCTGGTTGTATTCTTCGTTAAATGAGGACAAAATCGGTGAGATAATTGGTTGTGCTACTGCATATGATATCTGGGAACACCTTCGTATTGTGTATGAGTCGTCATCTACT
GCACGCATTATGGCTTTAAGGTCTCAATTGCAAAAGATTAGAAAGGATAATCTGTCTGTTTCTCAGTACTTAGCTCAGATAAAGGATATAGCTGATAAGTTTTGTGCTAT
TGGTGAACCTCTGTCCTACCGAGATCACTTGGCTTATATTCTTGAGGGTTTGGGATCTGAATATAATGCGTCTGTCACCTCAATCCAAAACCGAACCGATCGTCCCTCAT
TAGCTGACGTTTGTAGTTTACTACTGGCTTATGACTCTAGGTTGGAGAAGCAATCGTCAGTTGATAATTTGAACCTCATACAGGCTAATCTTGCCAACTTATCCACTTCT
AATAGCAATGGTAAACGTCCTCCTCGGTTTTCTGGTAATCAAAGACCTCCCTTCAATCCAGTGCCTCGGCCTAATTTTACAGCTCCATTTCCCTTTTCTTCCAATACCTC
TGGTCCTGGTGTGCTCGGTCGCCCTCATGCTCCTTCGCGTCCATCAAAATGGCCCTCCAACAACCAACAACGCCCTCAATGCATCCTCGGTCCCCCTCCTCAAGCCATGT
TTAACCAATTCTCTCAGCCTTTCTCTCAATCAATTTCAGCTCCTCAATCTTCCACCTTGCCTACTGATAACTCTGCTCAGTTTCAACAGCCTTCCCTTCTCGATGATGCG
TGGTTCATGGACTCTGGGGCCACCCATCACATGACATCAGATGTGAATAAACTACAGCAGTCCACTCCGTACTTAGGAAATGAGCAAGTGATAATCGGCAATGGTACGTC
TATCCCTGTCCTTAATCTTGGCTCATCTTCTTTACAATCGTGTTTGCCCTCTCAATCTATTAAATTTAAGTCTGTTCTTCATGCTCCTCACATATCACACAATCTCATCA
GCATTGCTAAGTTGTGTCTTGATAATAATGTGTTTGTCGAATTTCATGCTAGTCATTTTCTTCTGAAGGATCTTCTTTCCAAGACAATCCTTCTTCAGGGTCATCTTAAA
GACGGTCTCTACAAGCTTTCCAAGCCTCCCAATCGGTCTCTGAGTAGTTTCTCATCACAGCCAGCTGCATTCGTCTCTTCTGTCCAGGATATCAACCTATGGAATCAGCG
GCTTGGACATCCAGCTTTCCCCATTGTTCAGCATGTCCTTTGA
mRNA sequence
ATGGCGTCGTCTTCAAGTTCCTCTTCTATCGATAGCACCACTTTACCACCAATAGCCTCTGTACGTTCATCTCCGATTCCCACCCCAATCATTTCCTCTACAGTTTCTTC
TTCCCCAACACCCTCTGTTTTGCCTCAGCGATTTCAGCCAAATACTCCAACCAGACCCCTCAATGCTAATGTTGCTCCGTTTCAAACCCTCCCTCAACAATCCACTTTCC
CTCATAACTCTTTCGCCCCTCCTACTGGTTTTCAGTATCCGCCGCAATCATCCCCCACCCTACCTTTCTTTCCAGCCTTCAACAATCCTCAACAATATTTTCCTTCGGCT
CCTTCTTCTACTTTCCCAACTTTGACGTCTCCGCTAAATATCAAGTTGAACGACAACAACTACTTGTTGTGGAAAAATCAGCTGCTTAATCATATCATTGCCTTTGATAT
AGAAGTCTTCATCAACGATACTCCTGCTCCGGTGAAATATTTGGATCCCGTTCAAACACAGGTAAACCCTCAGTTTTTAATTTGGCAAAAGTATAACAGAATCCTTATGA
GCTGGTTGTATTCTTCGTTAAATGAGGACAAAATCGGTGAGATAATTGGTTGTGCTACTGCATATGATATCTGGGAACACCTTCGTATTGTGTATGAGTCGTCATCTACT
GCACGCATTATGGCTTTAAGGTCTCAATTGCAAAAGATTAGAAAGGATAATCTGTCTGTTTCTCAGTACTTAGCTCAGATAAAGGATATAGCTGATAAGTTTTGTGCTAT
TGGTGAACCTCTGTCCTACCGAGATCACTTGGCTTATATTCTTGAGGGTTTGGGATCTGAATATAATGCGTCTGTCACCTCAATCCAAAACCGAACCGATCGTCCCTCAT
TAGCTGACGTTTGTAGTTTACTACTGGCTTATGACTCTAGGTTGGAGAAGCAATCGTCAGTTGATAATTTGAACCTCATACAGGCTAATCTTGCCAACTTATCCACTTCT
AATAGCAATGGTAAACGTCCTCCTCGGTTTTCTGGTAATCAAAGACCTCCCTTCAATCCAGTGCCTCGGCCTAATTTTACAGCTCCATTTCCCTTTTCTTCCAATACCTC
TGGTCCTGGTGTGCTCGGTCGCCCTCATGCTCCTTCGCGTCCATCAAAATGGCCCTCCAACAACCAACAACGCCCTCAATGCATCCTCGGTCCCCCTCCTCAAGCCATGT
TTAACCAATTCTCTCAGCCTTTCTCTCAATCAATTTCAGCTCCTCAATCTTCCACCTTGCCTACTGATAACTCTGCTCAGTTTCAACAGCCTTCCCTTCTCGATGATGCG
TGGTTCATGGACTCTGGGGCCACCCATCACATGACATCAGATGTGAATAAACTACAGCAGTCCACTCCGTACTTAGGAAATGAGCAAGTGATAATCGGCAATGGTACGTC
TATCCCTGTCCTTAATCTTGGCTCATCTTCTTTACAATCGTGTTTGCCCTCTCAATCTATTAAATTTAAGTCTGTTCTTCATGCTCCTCACATATCACACAATCTCATCA
GCATTGCTAAGTTGTGTCTTGATAATAATGTGTTTGTCGAATTTCATGCTAGTCATTTTCTTCTGAAGGATCTTCTTTCCAAGACAATCCTTCTTCAGGGTCATCTTAAA
GACGGTCTCTACAAGCTTTCCAAGCCTCCCAATCGGTCTCTGAGTAGTTTCTCATCACAGCCAGCTGCATTCGTCTCTTCTGTCCAGGATATCAACCTATGGAATCAGCG
GCTTGGACATCCAGCTTTCCCCATTGTTCAGCATGTCCTTTGA
Protein sequence
MASSSSSSSIDSTTLPPIASVRSSPIPTPIISSTVSSSPTPSVLPQRFQPNTPTRPLNANVAPFQTLPQQSTFPHNSFAPPTGFQYPPQSSPTLPFFPAFNNPQQYFPSA
PSSTFPTLTSPLNIKLNDNNYLLWKNQLLNHIIAFDIEVFINDTPAPVKYLDPVQTQVNPQFLIWQKYNRILMSWLYSSLNEDKIGEIIGCATAYDIWEHLRIVYESSST
ARIMALRSQLQKIRKDNLSVSQYLAQIKDIADKFCAIGEPLSYRDHLAYILEGLGSEYNASVTSIQNRTDRPSLADVCSLLLAYDSRLEKQSSVDNLNLIQANLANLSTS
NSNGKRPPRFSGNQRPPFNPVPRPNFTAPFPFSSNTSGPGVLGRPHAPSRPSKWPSNNQQRPQCILGPPPQAMFNQFSQPFSQSISAPQSSTLPTDNSAQFQQPSLLDDA
WFMDSGATHHMTSDVNKLQQSTPYLGNEQVIIGNGTSIPVLNLGSSSLQSCLPSQSIKFKSVLHAPHISHNLISIAKLCLDNNVFVEFHASHFLLKDLLSKTILLQGHLK
DGLYKLSKPPNRSLSSFSSQPAAFVSSVQDINLWNQRLGHPAFPIVQHVL
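
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the `translate` helper is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the compact 64-letter layout
# (codon positions each cycle through T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Hypothetical helper: trailing bases that do not fill a whole
    codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt of the CDS sequence listed above.
fragment = "ATGGCGTCGTCTTCAAGTTCCTCTTCTATC"
peptide = translate(fragment)
print(peptide)
```

Passing the full CDS with its line breaks removed to the same function would give a string to compare with the listed protein sequence; the final `TGA` codon translates to `*`.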