; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012438 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012438
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:41056261..41057601
RNA-Seq ExpressionLag0012438
SyntenyLag0012438
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN71553.1 hypothetical protein VITISV_034738 [Vitis vinifera]1.4e-3537.36Show/hide
Query:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS
        C PK +     +VNP F VW++Y+R ++SWIYSSLT + +G+I+G  T+ E W  L+  + +S+ AR M LR   Q   K  LT+ +++ K+K I D  +
Subjt:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS

Query:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ
        AIGEP+  +D             NP V S+  R +   L  V S+L+ +E RL  QT+  + +++   +ANF   N  RR+QS  PS         +PPQ
Subjt:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ

Query:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS
        SF + ++ P + P+   +     RP R+ +N      RPQCQ+CGK GH  L CY+R +  YQ     ASSTS
Subjt:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS

CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera]8.2e-3638.91Show/hide
Query:  KFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIG
        KFLD A+ Q+NPLF  W++ NR +MSWIYSSLT                                        KI K+G+T+S++LA+IK++ D++SA+G
Subjt:  KFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIG

Query:  EPLLYRDHFGY--------INPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSN----NQRRNQSTHPSNQSKSSPVPRPP
        EPL YRD   Y         + FVTSI NR+++SSL +V SLL  Y   LE++ +  QL   Q NL  +S  N    NQ+ N   +P N   S P  +P 
Subjt:  EPLLYRDHFGY--------INPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSN----NQRRNQSTHPSNQSKSSPVPRPP

Query:  QSFPILSMIPSSNPNVLGRPQSSPR-PYRWPHNNQT-----RPQCQICGKLGHTALVCYNRHNPLYQASSTSAQP
          +   S  P+S P++LG+PQ  P+   +W     T     RPQCQICGK GH AL CY+R N  YQ    ++QP
Subjt:  QSFPILSMIPSSNPNVLGRPQSSPR-PYRWPHNNQT-----RPQCQICGKLGHTALVCYNRHNPLYQASSTSAQP

RVW25398.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.6e-3737.73Show/hide
Query:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS
        C PK +     +VNP F VW++Y+R ++SWIYSSLT + +G+I+G  T+ E W  L+  + +S+ AR M LR   Q   K  LT+ +++ K+K I D  +
Subjt:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS

Query:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ
        AIGEP+L +D             NP V S+  R +   L  V S+L+ +E RL  QT+  + +++   +ANF+  N  RR+QS  PS         +PPQ
Subjt:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ

Query:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS
        SF + ++ P + P+   +     RP R+ +N      RPQCQ+CGK GH  L CY+R +  YQ     ASSTS
Subjt:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.8e-6048.68Show/hide
Query:  PKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAI
        P+FLD  + Q NP +  W++YNR LM WIYSSL+E+K+GE++   T  +IW  L  VY+S +TAR+MGL+++LQ + KDG +VSQ+LAKIK+I D+F+A+
Subjt:  PKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAI

Query:  GEPLLYRDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSF
        GEPL YRDH  ++        N FVTSI NR +  SL DVRSLL+AYEARL+KQ +VDQLN+ QANL N S  +N +R          K S       SF
Subjt:  GEPLLYRDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSF

Query:  PILSMIPSSNPNVLGRPQSSPRPYRWPHN-NQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSA
        P   +  + + ++LG+PQS    ++WP   + ++ QCQICGKLGH+A VCY+R N  Y  +S  A
Subjt:  PILSMIPSSNPNVLGRPQSSPRPYRWPHN-NQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSA

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]3.0e-3854.8Show/hide
Query:  IGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLYRDHFGYI--------NPFVTSIQNRTNRSSL
        +GEI+G  +AF+IW+ LR VYESSS A +MG  SQLQKI KDGLTVSQ+LA+IKD++D F+AIGEPL YRDH  YI        NPFV+SI NRTNR S+
Subjt:  IGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLYRDHFGYI--------NPFVTSIQNRTNRSSL

Query:  ADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPILSMIPSSNPNVL
        ADVR+LLI Y++RLEKQT+ D L ++QAN+A+ S  N+Q R+      N+S          S P +   PS  PN L
Subjt:  ADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPILSMIPSSNPNVL

TrEMBL top hitse value%identityAlignment
A0A438CQD7 Retrovirus-related Pol polyprotein from transposon RE14.7e-3737.73Show/hide
Query:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS
        C PK +     +VNP F VW++Y+R ++SWIYSSLT + +G+I+G  T+ E W  L+  + +S+ AR M LR   Q   K  LT+ +++ K+K I D  +
Subjt:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS

Query:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ
        AIGEP+L +D             NP V S+  R +   L  V S+L+ +E RL  QT+  + +++   +ANF+  N  RR+QS  PS         +PPQ
Subjt:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ

Query:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS
        SF + ++ P + P+   +     RP R+ +N      RPQCQ+CGK GH  L CY+R +  YQ     ASSTS
Subjt:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS

A0A438H229 7-deoxyloganetin glucosyltransferase2.0e-3537Show/hide
Query:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS
        C PK +     +VNP F VW++Y+R ++SWIYSSLT + +G+I+G  T+ E W  L+  + +S+ AR M LR   Q   K  LT+ +++ K+K I D  +
Subjt:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS

Query:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ
        AIGEP+  +D             NP V S+  R +   L  V S+L+ +E RL  QT+  + +++   +ANF   N  RR+QS  PS         +PPQ
Subjt:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ

Query:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS
        SF + ++ P + P+   +     RP R+ +N      RPQCQ+CGK GH  L CY+R +  YQ      SSTS
Subjt:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS

A0A6J1DQX7 uncharacterized protein LOC1110223151.3e-6048.68Show/hide
Query:  PKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAI
        P+FLD  + Q NP +  W++YNR LM WIYSSL+E+K+GE++   T  +IW  L  VY+S +TAR+MGL+++LQ + KDG +VSQ+LAKIK+I D+F+A+
Subjt:  PKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAI

Query:  GEPLLYRDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSF
        GEPL YRDH  ++        N FVTSI NR +  SL DVRSLL+AYEARL+KQ +VDQLN+ QANL N S  +N +R          K S       SF
Subjt:  GEPLLYRDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSF

Query:  PILSMIPSSNPNVLGRPQSSPRPYRWPHN-NQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSA
        P   +  + + ++LG+PQS    ++WP   + ++ QCQICGKLGH+A VCY+R N  Y  +S  A
Subjt:  PILSMIPSSNPNVLGRPQSSPRPYRWPHN-NQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSA

A5AQ04 Integrase catalytic domain-containing protein6.7e-3637.36Show/hide
Query:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS
        C PK +     +VNP F VW++Y+R ++SWIYSSLT + +G+I+G  T+ E W  L+  + +S+ AR M LR   Q   K  LT+ +++ K+K I D  +
Subjt:  CSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFS

Query:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ
        AIGEP+  +D             NP V S+  R +   L  V S+L+ +E RL  QT+  + +++   +ANF   N  RR+QS  PS         +PPQ
Subjt:  AIGEPLLYRDHF--------GYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQ

Query:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS
        SF + ++ P + P+   +     RP R+ +N      RPQCQ+CGK GH  L CY+R +  YQ     ASSTS
Subjt:  SFPILSMIPSSNPNVLGRPQSSPRPYRWPHN---NQTRPQCQICGKLGHTALVCYNRHNPLYQ-----ASSTS

A5BPS3 Uncharacterized protein3.9e-3638.91Show/hide
Query:  KFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIG
        KFLD A+ Q+NPLF  W++ NR +MSWIYSSLT                                        KI K+G+T+S++LA+IK++ D++SA+G
Subjt:  KFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIG

Query:  EPLLYRDHFGY--------INPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSN----NQRRNQSTHPSNQSKSSPVPRPP
        EPL YRD   Y         + FVTSI NR+++SSL +V SLL  Y   LE++ +  QL   Q NL  +S  N    NQ+ N   +P N   S P  +P 
Subjt:  EPLLYRDHFGY--------INPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSN----NQRRNQSTHPSNQSKSSPVPRPP

Query:  QSFPILSMIPSSNPNVLGRPQSSPR-PYRWPHNNQT-----RPQCQICGKLGHTALVCYNRHNPLYQASSTSAQP
          +   S  P+S P++LG+PQ  P+   +W     T     RPQCQICGK GH AL CY+R N  YQ    ++QP
Subjt:  QSFPILSMIPSSNPNVLGRPQSSPR-PYRWPHNNQT-----RPQCQICGKLGHTALVCYNRHNPLYQASSTSAQP

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-1121.25Show/hide
Query:  AKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLY
        A  +VNP +  W++ ++ + S +  +++      +   +TA +IW+ LR +Y + S   V  LR+QL++  K   T+  ++  +    DQ + +G+P+ +
Subjt:  AKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLY

Query:  RDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPILSMI
         +    +         P +  I  +    +L ++   L+ +E+++   +S   +  + AN  +   +     N + + +N+                   
Subjt:  RDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPILSMI

Query:  PSSNPNVLGRPQSSPRPYRWPHNNQTRP---QCQICGKLGHTALVCYNRHNPLYQASSTSAQPLCLFIDLAPQ
         + N N   +P         P+NNQ++P   +CQICG  GH+A  C    +  + +S  S QP   F    P+
Subjt:  PSSNPNVLGRPQSSPRPYRWPHNNQTRP---QCQICGKLGHTALVCYNRHNPLYQASSTSAQPLCLFIDLAPQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-0521.9Show/hide
Query:  AKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLY
        A  +VNP +  W++ ++ + S I  +++      +   +TA +IW+ LR +Y + S   V  LR               F+ +     DQ + +G+P+ +
Subjt:  AKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLY

Query:  RDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQL----NMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPI
         +    +         P +  I  +    SL ++   LI  E++L    S + +    N+V     N + + N R +   + +N ++S+           
Subjt:  RDHFGYI--------NPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQL----NMVQANLANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPI

Query:  LSMIPSSNPNVLGRPQSSPRPYRWPHNNQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSAQPLCLFIDLAPQ
         S  PSS+ +     Q  P+PY          +CQIC   GH+A  C   H   +Q+++   Q    F    P+
Subjt:  LSMIPSSNPNVLGRPQSSPRPYRWPHNNQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSAQPLCLFIDLAPQ

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.4e-0624.39Show/hide
Query:  NPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDI
        +PL+  W++ N  +M W+ +S+T+  +  ++   TA ++W+ LR V+      ++  LR +L  + + G +V ++  K+  +
Subjt:  NPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCAACCTCAAGCTCGTCCTCCTCCGGCCATGACATTGTTCCGTCTCTTCCCACTATTTCGACATTTATATCAGTTCCCTCCCTTACCTCTACCTCTATCACCAC
CACCCCTGTTTCGACGCCTGTCTCTTCCCATCGAACTCAGGTCAGAGGTTTCACTCCCTTAACTACTGTTCATACAAGACCCCTAAACCCTAATACTCTCCCCTTTCAAT
CCCAGTTTCAGCCTCCCCCAATCTCTTCTACGTTTCCCTTTCCAAATAATACGACTCGACCTGGATTTCAGTATCCACCCCCCGCTACTCCGGCTTATCCTTTTATGTCG
GCTTACCCTGCTTCAACCCCTTTTTTTCCCAGCTCCCCACCCTTCACCAAATCTTTTTCTCATTCTCACCCCGCCTCTCAATATCAAGCTCACAGACTCAAATTATCTCC
TCTGGAAGAACCAACTGCTCAACCACATTCTTGCATTCGACATGGAAAGTCTTATAAATGGTACGCCTGCTCCCCTAAATTTCTGGATGCCGCGAAAACCCAAGTAAATC
CTCTCTTTCCCGTTTGGCAGAAATATAATCGTACGTTAATGAGTTGGATTTATTCTTCTCTGACTGAGGATAAGATAGGTGAAATAATTGGTTGTTCTACTGCCTTTGAG
ATTTGGGATCACCTTAGAATCGTTTATGAATCTTCTTCTACTGCTCGTGTTATGGGGTTACGGTCTCAATTACAGAAAATCCACAAAGATGGTCTCACTGTGTCTCAGTT
TCTTGCCAAAATTAAGGATATCGTTGATCAGTTTTCCGCCATTGGGGAACCATTATTATATAGAGACCATTTCGGTTATATTAACCCATTTGTTACGTCCATTCAGAATC
GCACTAACCGTTCTTCTCTTGCTGATGTCCGTAGTCTGTTGATTGCATATGAAGCTAGGCTTGAGAAGCAAACTTCTGTAGACCAGTTGAACATGGTACAAGCTAATTTA
GCTAATTTTTCTCCTTCCAATAATCAGAGACGAAACCAATCCACCCACCCCTCTAACCAGTCAAAATCCTCCCCTGTTCCCAGACCACCTCAGTCGTTTCCTATCCTTTC
AATGATTCCCTCATCTAATCCCAATGTTCTTGGTCGCCCCCAATCATCGCCACGCCCATATCGTTGGCCTCATAACAATCAAACCCGTCCACAGTGTCAAATTTGTGGGA
AACTAGGCCATACCGCTCTTGTCTGCTATAATAGACACAACCCTCTCTATCAAGCTTCATCTACTTCAGCTCAACCTCTATGTTTATTTATCGATTTGGCTCCACAATGG
TTATTTTTCTGGTTTACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCATCAACCTCAAGCTCGTCCTCCTCCGGCCATGACATTGTTCCGTCTCTTCCCACTATTTCGACATTTATATCAGTTCCCTCCCTTACCTCTACCTCTATCACCAC
CACCCCTGTTTCGACGCCTGTCTCTTCCCATCGAACTCAGGTCAGAGGTTTCACTCCCTTAACTACTGTTCATACAAGACCCCTAAACCCTAATACTCTCCCCTTTCAAT
CCCAGTTTCAGCCTCCCCCAATCTCTTCTACGTTTCCCTTTCCAAATAATACGACTCGACCTGGATTTCAGTATCCACCCCCCGCTACTCCGGCTTATCCTTTTATGTCG
GCTTACCCTGCTTCAACCCCTTTTTTTCCCAGCTCCCCACCCTTCACCAAATCTTTTTCTCATTCTCACCCCGCCTCTCAATATCAAGCTCACAGACTCAAATTATCTCC
TCTGGAAGAACCAACTGCTCAACCACATTCTTGCATTCGACATGGAAAGTCTTATAAATGGTACGCCTGCTCCCCTAAATTTCTGGATGCCGCGAAAACCCAAGTAAATC
CTCTCTTTCCCGTTTGGCAGAAATATAATCGTACGTTAATGAGTTGGATTTATTCTTCTCTGACTGAGGATAAGATAGGTGAAATAATTGGTTGTTCTACTGCCTTTGAG
ATTTGGGATCACCTTAGAATCGTTTATGAATCTTCTTCTACTGCTCGTGTTATGGGGTTACGGTCTCAATTACAGAAAATCCACAAAGATGGTCTCACTGTGTCTCAGTT
TCTTGCCAAAATTAAGGATATCGTTGATCAGTTTTCCGCCATTGGGGAACCATTATTATATAGAGACCATTTCGGTTATATTAACCCATTTGTTACGTCCATTCAGAATC
GCACTAACCGTTCTTCTCTTGCTGATGTCCGTAGTCTGTTGATTGCATATGAAGCTAGGCTTGAGAAGCAAACTTCTGTAGACCAGTTGAACATGGTACAAGCTAATTTA
GCTAATTTTTCTCCTTCCAATAATCAGAGACGAAACCAATCCACCCACCCCTCTAACCAGTCAAAATCCTCCCCTGTTCCCAGACCACCTCAGTCGTTTCCTATCCTTTC
AATGATTCCCTCATCTAATCCCAATGTTCTTGGTCGCCCCCAATCATCGCCACGCCCATATCGTTGGCCTCATAACAATCAAACCCGTCCACAGTGTCAAATTTGTGGGA
AACTAGGCCATACCGCTCTTGTCTGCTATAATAGACACAACCCTCTCTATCAAGCTTCATCTACTTCAGCTCAACCTCTATGTTTATTTATCGATTTGGCTCCACAATGG
TTATTTTTCTGGTTTACGTAG
Protein sequenceShow/hide protein sequence
MASTSSSSSSGHDIVPSLPTISTFISVPSLTSTSITTTPVSTPVSSHRTQVRGFTPLTTVHTRPLNPNTLPFQSQFQPPPISSTFPFPNNTTRPGFQYPPPATPAYPFMS
AYPASTPFFPSSPPFTKSFSHSHPASQYQAHRLKLSPLEEPTAQPHSCIRHGKSYKWYACSPKFLDAAKTQVNPLFPVWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAFE
IWDHLRIVYESSSTARVMGLRSQLQKIHKDGLTVSQFLAKIKDIVDQFSAIGEPLLYRDHFGYINPFVTSIQNRTNRSSLADVRSLLIAYEARLEKQTSVDQLNMVQANL
ANFSPSNNQRRNQSTHPSNQSKSSPVPRPPQSFPILSMIPSSNPNVLGRPQSSPRPYRWPHNNQTRPQCQICGKLGHTALVCYNRHNPLYQASSTSAQPLCLFIDLAPQW
LFFWFT