CuGenDBv2

Tan0012423 (gene) of Snake gourd v1 genome

Gene ID: Tan0012423
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG11:27131093..27135527
RNA-Seq Expression: Tan0012423
Synteny: Tan0012423
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
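The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for strings such as "LG11:27131093..27135527".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG11:27131093..27135527")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span works out to 4435 bp.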


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.1e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.1e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.1e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.1e-93; % identity: 62.89)
Query:  LRMSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSL
        L M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL KECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SL
Subjt:  LRMSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSL

Query:  QDMFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKN
        Q+MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK 
Subjt:  QDMFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKN

Query:  KGQADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        KGQ  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  KGQADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.1e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 1.0e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

A0A5A7TU93 Gag/pol protein (e value: 1.0e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

A0A5A7TWB9 Gag/pol protein (e value: 1.0e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

A0A5A7V4M1 Gag/pol protein (e value: 3.4e-93; % identity: 62.89)
Query:  LRMSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSL
        L M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL KECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SL
Subjt:  LRMSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSL

Query:  QDMFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKN
        Q+MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK 
Subjt:  QDMFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKN

Query:  KGQADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        KGQ  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  KGQADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR

A0A5D3CPJ6 Gag/pol protein (e value: 1.0e-92; % identity: 62.63)
Query:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL +ECPQVPA N  ++V+E Y+ W KAN+KA+ YI+A++SEVLAKKHE M++AREIM SLQ+
Subjt:  MSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYIVANVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG
        MFGQ+S Q  H++LKY+YN+RM EG++VREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+S +QFRSN VMNKI Y LTTLLN+LQTF+SLMK KG
Subjt:  MFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLTTLLNKLQTFQSLMKNKG

Query:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
        Q  GEA +   +R+F +GS+SGTKS  SSS  KK +KKK G   KA  A     K   A KG CFHCN +GHWKRNC KYL E K+ K+
Subjt:  QADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
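In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, a % identity figure can be recomputed from that line alone. The function name `percent_identity` is hypothetical, and the sketch assumes the midline blocks have been concatenated with their leading indentation removed and their internal and trailing spaces preserved.

```python
def percent_identity(midline):
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: `midline` is the full BLAST middle line for one hit, with
    one character per alignment column (spaces included).
    """
    if not midline:
        raise ValueError("midline is empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Toy example: five columns, two of them identical (M and S).
print(round(percent_identity("M+S+ "), 2))
```

Counting only letters gives identity; counting letters plus `+` characters would instead give the "positives" figure.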

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCTACCTTCTTGGGGACAAGACCGAGTGAGCGGCTGGGAACTCAACTTGGCATGATGGAATTCACTCCTTCCCGCTTACTAGGAAGAAAGCTGCTGCCGAAGCAGCA
GCGTTGTAACGTCGTTCATCAGCGTTGCAACGCTGAGGAACGGCATCACAATGCCAGTCTCCCTATCGACGCGGGAGCTCCACGGTCGCGTGGCGCGCGGCAACGCCACC
CCGATAGAAGCGCGCTGACGACGGCGAAGCGCTGCGTAACGCGCGAGCATCAAGATAGCAACGCCGCCGTGATGGCATTCAGAAGAGTTGCGAGACCGGTTCAACGGAGC
TTCAGTGGTAGAAAACGAGATGTTCGCATCGAGTTTACTCTCACGTCCTCAAGAGTTCACACCGTGAGTTTCATGCTCGGCTACGTGGCGTCAATAGGGCGTCCCCTACG
CATGTCGTCCTCAATTATAACCTTGTTAAAAAGCGATCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACCTGAATACGATTCTCGTTGTTGACGACCTTCGAT
TTGTACTGACTAAGGAATGTCCTCAGGTCCCTGCTCGAAACGTCCCTCAATCTGTTAAGGAGGTATACGACTGCTGGATCAAGGCCAATGATAAAGCCAAGGTCTACATT
GTTGCTAATGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCCCGTGAGATCATGAGTTCATTGCAGGATATGTTTGGACAATCGTCTGGACAACATCT
GCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGATCAACGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAATGTGGCTGAGATGAACGGAG
CGGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTGGAATCTCTTCCGAAGAGTGTCATACAATTCCGCAGCAATGTAGTGATGAACAAGATAGAGTATAACCTGACT
ACTCTCCTTAATAAACTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCCGATGGAGAGGCAACTCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATC
CTCTGGGACTAAGTCCTGTGGTTCATCTTCTTGGCTTAAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAAGGAAGGCACCTGCTGCTGATAATGCCAAGGGAAAGGCCA
AGGTTGCAGACAAAGGAAAATGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGTAACTGCCTAAAATACCTTGTTGAGCTCAAAGAGAAAAAAAGGTAA
mRNA sequence
ATGCCTACCTTCTTGGGGACAAGACCGAGTGAGCGGCTGGGAACTCAACTTGGCATGATGGAATTCACTCCTTCCCGCTTACTAGGAAGAAAGCTGCTGCCGAAGCAGCA
GCGTTGTAACGTCGTTCATCAGCGTTGCAACGCTGAGGAACGGCATCACAATGCCAGTCTCCCTATCGACGCGGGAGCTCCACGGTCGCGTGGCGCGCGGCAACGCCACC
CCGATAGAAGCGCGCTGACGACGGCGAAGCGCTGCGTAACGCGCGAGCATCAAGATAGCAACGCCGCCGTGATGGCATTCAGAAGAGTTGCGAGACCGGTTCAACGGAGC
TTCAGTGGTAGAAAACGAGATGTTCGCATCGAGTTTACTCTCACGTCCTCAAGAGTTCACACCGTGAGTTTCATGCTCGGCTACGTGGCGTCAATAGGGCGTCCCCTACG
CATGTCGTCCTCAATTATAACCTTGTTAAAAAGCGATCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACCTGAATACGATTCTCGTTGTTGACGACCTTCGAT
TTGTACTGACTAAGGAATGTCCTCAGGTCCCTGCTCGAAACGTCCCTCAATCTGTTAAGGAGGTATACGACTGCTGGATCAAGGCCAATGATAAAGCCAAGGTCTACATT
GTTGCTAATGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCCCGTGAGATCATGAGTTCATTGCAGGATATGTTTGGACAATCGTCTGGACAACATCT
GCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGATCAACGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAATGTGGCTGAGATGAACGGAG
CGGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTGGAATCTCTTCCGAAGAGTGTCATACAATTCCGCAGCAATGTAGTGATGAACAAGATAGAGTATAACCTGACT
ACTCTCCTTAATAAACTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCCGATGGAGAGGCAACTCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATC
CTCTGGGACTAAGTCCTGTGGTTCATCTTCTTGGCTTAAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAAGGAAGGCACCTGCTGCTGATAATGCCAAGGGAAAGGCCA
AGGTTGCAGACAAAGGAAAATGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGTAACTGCCTAAAATACCTTGTTGAGCTCAAAGAGAAAAAAAGGTAA
Protein sequence
MPTFLGTRPSERLGTQLGMMEFTPSRLLGRKLLPKQQRCNVVHQRCNAEERHHNASLPIDAGAPRSRGARQRHPDRSALTTAKRCVTREHQDSNAAVMAFRRVARPVQRS
FSGRKRDVRIEFTLTSSRVHTVSFMLGYVASIGRPLRMSSSIITLLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTKECPQVPARNVPQSVKEVYDCWIKANDKAKVYI
VANVSEVLAKKHEGMVSAREIMSSLQDMFGQSSGQHLHESLKYVYNSRMKEGSTVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSVIQFRSNVVMNKIEYNLT
TLLNKLQTFQSLMKNKGQADGEATLFAHSRRFQKGSSSGTKSCGSSSWLKKTQKKKIGGKRKAPAADNAKGKAKVADKGKCFHCNVDGHWKRNCLKYLVELKEKKR
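
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (the page does not state which table was used). The names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGCCTACCTTCTTGGGGACAAGA"))
```

Joining the wrapped CDS lines into one string and passing it to `translate` lets the result be compared against the protein sequence listed here.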