; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024820 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024820
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:6112344..6113353
RNA-Seq ExpressionLag0024820
SyntenyLag0024820
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7809961.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]3.4e-3335.52Show/hide
Query:  NNNNKGWNFN----NNNQRGGQ--YQGSNQRGSGNFNGGRGRGRG---------REFSPNVSQNRTG--------------GQSVVQSNGNMQTTTAFMA
        N NN+G+N N     NN RGG+  +  + + G  NF GG GRGRG         R F      +RTG                S  Q + N        +
Subjt:  NNNNKGWNFN----NNNQRGGQ--YQGSNQRGSGNFNGGRGRGRG---------REFSPNVSQNRTG--------------GQSVVQSNGNMQTTTAFMA

Query:  NTNNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNG---SNMITLNNILFVPEIAKNLKSVSKLAQDN
            + +A  E V D  W+ DSGA++HVT   +NL+N  EY G EQ+ +GNGK L IS  G S++ +    S  + LNN+L VP I KNL SVSK A+DN
Subjt:  NTNNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNG---SNMITLNNILFVPEIAKNLKSVSKLAQDN

Query:  NIFFEFHGDFCLVKAKDSRQVVLKG-ILKDGLYQLE-----------NVSKVSKEDALL-------LNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVW
        ++FFEFH D C VK++ ++ ++LKG I  DGLY  E           N+S+ ++    L       +   ST+  T++N V+ N  +  P        +W
Subjt:  NIFFEFHGDFCLVKAKDSRQVVLKG-ILKDGLYQLE-----------NVSKVSKEDALL-------LNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVW

Query:  HCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESC
        HCRLGH N   +  ++K CN S+     ++F E+C
Subjt:  HCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESC

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]4.5e-3336.01Show/hide
Query:  KGGSPQGN--NNNKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQ-------------NRTGGQSVVQSNGNMQTTTAFMANTNNSFL
        K    +GN  N+N  W  +NNN RG  ++G        + GGRGRGR  + +  V               ++T  +S   +N + Q        ++N+FL
Subjt:  KGGSPQGN--NNNKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQ-------------NRTGGQSVVQSNGNMQTTTAFMANTNNSFL

Query:  ANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDF
        A+  ++ D +WY DSGAS+HVT   +   N  E+ G   +I+GNG+KL I  TG+S L +    + L++IL+VP+I KNL SVSKLA DNNI  EF  + 
Subjt:  ANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDF

Query:  CLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKTNEVY
        C VK K + + +L+GILKDGLYQL      S++D+             +  VS+             K  WH +LGHPN KVL+ ++K CN+ +  ++ +
Subjt:  CLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKTNEVY

Query:  QFYESCQFGNL
         F E+CQ+G +
Subjt:  QFYESCQFGNL

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]4.5e-3340.28Show/hide
Query:  NNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFE
        N++F+A+     D  WY DSGAS+HVT   +   +   + G   +++GNG+KL I  +G++ L N    + L ++L+VPEI KNL SVSKL  DNNI  E
Subjt:  NNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFE

Query:  FHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVK
        F  D C VK K + + +LKG LK+GLYQ+ NVS  S +DA                +SV             K  WH +LGHPN KVL+ ++K CN+   
Subjt:  FHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVK

Query:  TNEVYQFYESCQFGNL
        +++ ++F E+CQFG L
Subjt:  TNEVYQFYESCQFGNL

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]4.5e-3340.74Show/hide
Query:  NSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEF
        ++F+A+     D  WY DSGAS+HVT     L +  E  G   +++GNGK+L I  +G++ L+N    + L+N+L+VPEI KNL SVSKL  DNN   EF
Subjt:  NSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEF

Query:  HGDFCLVKAKDSRQVVLKGILKDGLYQLENV-SKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVK
          + C VK K + + +LKG L+DGLYQL +V S+V+K+    +                          SV +N WH +LGHPN KVLE ++K CN+   
Subjt:  HGDFCLVKAKDSRQVVLKGILKDGLYQLENV-SKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVK

Query:  TNEVYQFYESCQFGNL
        +N+ + F E+CQFG L
Subjt:  TNEVYQFYESCQFGNL

XP_031282138.1 uncharacterized protein LOC116140680 [Pistacia vera]5.8e-3338.31Show/hide
Query:  TAFMANTNNSF--LANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSN--MITLNNILFVPEIAKNLKSVS
        T  ++++  S+  +A  +TV D +WY+D GAS+H+T   +NL++   Y G  +V +GNG  +PI+ TG   + + ++  ++ L NIL VP+IAKNL S+S
Subjt:  TAFMANTNNSF--LANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSN--MITLNNILFVPEIAKNLKSVS

Query:  KLAQDNNIFFEFHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVS-------------KNV-
        ++ +DNN+  EFH D CLVK K S+ V+L+G +K GLYQLE  S VS        + S++ P+ ++ V+ NVV    V+TS S              NV 
Subjt:  KLAQDNNIFFEFHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVS-------------KNV-

Query:  ------WHCRLGHPNLKVLESIIKVCNLSVKT-NEVYQFYESCQFGNL
              WH RLGHPN+K L+ ++   ++   T +    F E+CQFG L
Subjt:  ------WHCRLGHPNLKVLESIIKVCNLSVKT-NEVYQFYESCQFGNL

TrEMBL top hitse value%identityAlignment
A0A803NU85 Uncharacterized protein1.1e-3437.58Show/hide
Query:  NKGGSPQGNNN-------NKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGR-GRGREFSPNVSQ-NRTGGQSVVQSN-------GNMQTTTAFMANTNNS
        N    P G+NN        +G++  +NN RG      N RG G+F GGRGR GRG    P      + G  + +  N       G      A      ++
Subjt:  NKGGSPQGNNN-------NKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGR-GRGREFSPNVSQ-NRTGGQSVVQSN-------GNMQTTTAFMANTNNS

Query:  FLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHG
         +A  E + D +WY DSGAS+H+T     + N  EYGG EQ+ IG+G KLPI   G  +L + ++ + L+N+L VP I+KNL SVSKL  DNN+  EF  
Subjt:  FLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHG

Query:  DFCLVKAKDSRQVVLKGILKDGLYQL--ENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKT
        D C+VK + + +VVL+G LKDGLYQL     S  SK  +  L F S+  PT ++           V  +  K++WH +LGHP+  VL  ++K+ N+ V  
Subjt:  DFCLVKAKDSRQVVLKGILKDGLYQL--ENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKT

Query:  NEVYQFYESCQFGN
        NE   F ++CQ+ N
Subjt:  NEVYQFYESCQFGN

A0A803P4G6 Uncharacterized protein6.1e-3636.15Show/hide
Query:  RGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQ------NRTGGQSVVQSN-------GNMQTTTAFMANTNNSFLANSETVLDPNWYVDSGASSHVTG
        RG  +   NQ  +    G RGR RGR    N S+       + G  + V  N       G+           +N+F+A  + +    W+VDSGAS+H+T 
Subjt:  RGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQ------NRTGGQSVVQSN-------GNMQTTTAFMANTNNSFLANSETVLDPNWYVDSGASSHVTG

Query:  GYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSRQVVLKGILKDGLY
          N++    EYGG E + +G+G KL IS  G  +L +N   ++ L  +L VP+IAKNL SV KL  DNN+  EF+ D CLVK K +++V+L+G+LKDGLY
Subjt:  GYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSRQVVLKGILKDGLY

Query:  QLENVSKVSKEDALLLNFGSTLQPTLNN---QVSVNVVSCYPVNTSVSK------NVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG
        Q+++     +  + L++F S    T+ +    VS  V + +     VS+      +VWH RLGHP+ KVL+ +++  N+ V  NEV  F ++CQ+G
Subjt:  QLENVSKVSKEDALLLNFGSTLQPTLNN---QVSVNVVSCYPVNTSVSK------NVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG

A0A803PEH4 Uncharacterized protein2.6e-3435.74Show/hide
Query:  NNNNKGWNFNNNN---QRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVV--------------QSNGNMQTTTAFMANTNNSFLANSET
        NNN +G  F + N     GG +  SN RG+ N   GRGRG G    P        G +                 +N + Q       N +++F+A  E 
Subjt:  NNNNKGWNFNNNN---QRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVV--------------QSNGNMQTTTAFMANTNNSFLANSET

Query:  VLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLS-NGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVK
        +    W+ DSGAS+H+T    NL   ++Y G E V++GNG KL I+  GN  L+    N + L ++L VP+IAKNL SVSKLA DNN+  EF+ +FCLVK
Subjt:  VLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLS-NGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVK

Query:  AKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYE
         K +++V+L G+LKD LYQL++    S       NF S    ++++ V+ +      ++     +V H RLGHP++KVL  +++  N+SV  N +    +
Subjt:  AKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYE

Query:  SCQFG
        +CQ+G
Subjt:  SCQFG

A0A803PPY9 Uncharacterized protein7.9e-3637.94Show/hide
Query:  SPQGN------NNNKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVVQSNGNMQTTTAFMANTN-NSFLANSETVLDPNW
        +PQ N      N  +G     N+  G ++ GS     G  NGGR RGRGR                  SNG  + T       N N+F+A  E V    W
Subjt:  SPQGN------NNNKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVVQSNGNMQTTTAFMANTN-NSFLANSETVLDPNW

Query:  YVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSRQ
        + DSGAS+H+T   + +   +EYGG E V +GNG +L IS  G+ YL +NG   + L  IL VP+IAKNL S+SKLA  N++  EF+ D CL+K K + +
Subjt:  YVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSRQ

Query:  VVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSK-NVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG
         +L+G LK+GLYQ+ + S  +   + L     T    L    SVNV       +  SK +VWH RLGHP+ KVL  +++  N+ V  NE+  F ++CQ+G
Subjt:  VVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSK-NVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG

Query:  NLTI--FHSQI
         ++   F SQI
Subjt:  NLTI--FHSQI

A0A803QD60 Uncharacterized protein4.4e-3437.77Show/hide
Query:  NTNNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNI
        N +++F+A  E +    W+ DSGAS+H+T    ++   +EYGG E+V +GNG +L IS   N  L ++    + L  +L VPEIAKNL SVSKL  DNN+
Subjt:  NTNNSFLANSETVLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYL-SNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNI

Query:  FFEFHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTL---NNQVSVNVVSCYPVNTSVSKN-------------VWHCRLG
          EF+ D C+VK K +++V+L+G+L+DGLYQL+              F +T Q T    +N+V  +  S   V +  S+N             VW  RLG
Subjt:  FFEFHGDFCLVKAKDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTL---NNQVSVNVVSCYPVNTSVSKN-------------VWHCRLG

Query:  HPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG
        HP+ +VL  ++  C + +  NE+  F ++CQFG
Subjt:  HPNLKVLESIIKVCNLSVKTNEVYQFYESCQFG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-1631.47Show/hide
Query:  NKGGSPQGNNNNKGW-----NFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVVQSNGNMQTTTAFMANTNNSFLANSETVLDP
        N+  +   NNN+K W     NF+ NN +   Y G  Q       G +G    R      SQ     Q  + S  + Q  + F      + LA        
Subjt:  NKGGSPQGNNNNKGW-----NFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVVQSNGNMQTTTAFMANTNNSFLANSETVLDP

Query:  NWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSR
        NW +DSGA+ H+T  +NNL   + Y G + V++ +G  +PIS TG++ LS  S  + L+NIL+VP I KNL SV +L   N +  EF      VK  ++ 
Subjt:  NWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSR

Query:  QVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTS-VSKNVWHCRLGHPNLKVLESIIKVCNLSV
          +L+G  KD LY+    S                QP          VS +   +S  + + WH RLGHP   +L S+I   +LSV
Subjt:  QVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTS-VSKNVWHCRLGHPNLKVLESIIKVCNLSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-1631.49Show/hide
Query:  MVGNKGGSPQGNNNNKG--WNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNV---SQNRTGGQSVVQSNGN-MQTTTAFMANTNNSFLANSET
        +V ++  +   N NN+G   N+NNNN R   +Q S+  GS + N       GR    +V   S  R       QS  N  Q+T+ F      + LA +  
Subjt:  MVGNKGGSPQGNNNNKG--WNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNV---SQNRTGGQSVVQSNGN-MQTTTAFMANTNNSFLANSET

Query:  VLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKA
            NW +DSGA+ H+T  +NNL   + Y G + V+I +G  +PI+ TG++ L   S  + LN +L+VP I KNL SV +L   N +  EF      VK 
Subjt:  VLDPNWYVDSGASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKA

Query:  KDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSV
         ++   +L+G  KD LY+    S                    +  VS+    C    +  + + WH RLGHP+L +L S+I   +L V
Subjt:  KDSRQVVLKGILKDGLYQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGGAAACAAAGGGGGATCACCTCAAGGGAACAATAATAACAAAGGCTGGAATTTCAATAATAACAATCAAAGAGGTGGTCAATATCAAGGCAGCAATCAA
AGAGGAAGTGGAAACTTTAATGGAGGAAGGGGTCGTGGACGTGGCCGAGAATTTTCTCCAAATGTTAGTCAGAACAGGACTGGTGGACAAAGTGTTGTGCAGTCA
AATGGGAATATGCAGACGACAACTGCTTTCATGGCTAATACTAACAATTCGTTCTTGGCCAATTCTGAAACTGTTCTAGACCCAAATTGGTATGTAGATAGCGGA
GCGTCCAGTCATGTTACGGGCGGTTACAACAACCTCATCAATCCAAAGGAATATGGAGGTAATGAACAAGTCATTATTGGAAATGGGAAGAAATTGCCCATTTCG
TTCACTGGTAATTCTTACTTATCCAATGGCTCAAATATGATTACGCTTAACAATATTTTATTTGTGCCTGAGATTGCAAAGAACTTGAAAAGTGTATCGAAATTA
GCTCAAGATAACAATATTTTCTTTGAGTTTCACGGTGATTTTTGTCTCGTAAAGGCCAAGGACTCGAGGCAGGTGGTGTTGAAAGGAATACTTAAAGACGGACTC
TATCAGCTTGAGAATGTGTCGAAGGTGTCAAAGGAAGATGCGTTGCTGCTCAATTTTGGTTCTACGTTGCAGCCCACCCTAAATAATCAAGTGTCAGTCAATGTT
GTGTCTTGTTATCCTGTGAACACTAGTGTCTCAAAAAATGTGTGGCATTGTCGCCTTGGCCACCCGAATTTAAAAGTGTTAGAATCTATTATCAAAGTTTGTAAT
CTTTCAGTTAAAACTAATGAAGTTTATCAGTTTTACGAATCATGTCAATTTGGAAATCTCACAATCTTCCATTCTCAAATAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAGGAAACAAAGGGGGATCACCTCAAGGGAACAATAATAACAAAGGCTGGAATTTCAATAATAACAATCAAAGAGGTGGTCAATATCAAGGCAGCAATCAA
AGAGGAAGTGGAAACTTTAATGGAGGAAGGGGTCGTGGACGTGGCCGAGAATTTTCTCCAAATGTTAGTCAGAACAGGACTGGTGGACAAAGTGTTGTGCAGTCA
AATGGGAATATGCAGACGACAACTGCTTTCATGGCTAATACTAACAATTCGTTCTTGGCCAATTCTGAAACTGTTCTAGACCCAAATTGGTATGTAGATAGCGGA
GCGTCCAGTCATGTTACGGGCGGTTACAACAACCTCATCAATCCAAAGGAATATGGAGGTAATGAACAAGTCATTATTGGAAATGGGAAGAAATTGCCCATTTCG
TTCACTGGTAATTCTTACTTATCCAATGGCTCAAATATGATTACGCTTAACAATATTTTATTTGTGCCTGAGATTGCAAAGAACTTGAAAAGTGTATCGAAATTA
GCTCAAGATAACAATATTTTCTTTGAGTTTCACGGTGATTTTTGTCTCGTAAAGGCCAAGGACTCGAGGCAGGTGGTGTTGAAAGGAATACTTAAAGACGGACTC
TATCAGCTTGAGAATGTGTCGAAGGTGTCAAAGGAAGATGCGTTGCTGCTCAATTTTGGTTCTACGTTGCAGCCCACCCTAAATAATCAAGTGTCAGTCAATGTT
GTGTCTTGTTATCCTGTGAACACTAGTGTCTCAAAAAATGTGTGGCATTGTCGCCTTGGCCACCCGAATTTAAAAGTGTTAGAATCTATTATCAAAGTTTGTAAT
CTTTCAGTTAAAACTAATGAAGTTTATCAGTTTTACGAATCATGTCAATTTGGAAATCTCACAATCTTCCATTCTCAAATAGAATAG
Protein sequenceShow/hide protein sequence
MVGNKGGSPQGNNNNKGWNFNNNNQRGGQYQGSNQRGSGNFNGGRGRGRGREFSPNVSQNRTGGQSVVQSNGNMQTTTAFMANTNNSFLANSETVLDPNWYVDSG
ASSHVTGGYNNLINPKEYGGNEQVIIGNGKKLPISFTGNSYLSNGSNMITLNNILFVPEIAKNLKSVSKLAQDNNIFFEFHGDFCLVKAKDSRQVVLKGILKDGL
YQLENVSKVSKEDALLLNFGSTLQPTLNNQVSVNVVSCYPVNTSVSKNVWHCRLGHPNLKVLESIIKVCNLSVKTNEVYQFYESCQFGNLTIFHSQIE