CuGenDBv2

Tan0006362 (gene) of Snake gourd v1 genome

Gene ID: Tan0006362
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG07:42177378..42180128
RNA-Seq Expression: Tan0006362
Synteny: Tan0006362
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains: NA
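
The genome location field packs a sequence name and a coordinate range into one string. As a minimal sketch, the Python below splits it into its parts and derives the span of the gene on the linkage group. The function name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG07:42177378..42180128' into (sequence id, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


seqid, start, end = parse_location("LG07:42177378..42180128")
# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)  # LG07 42177378 42180128 2751
```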


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 5.7e-84 | % identity: 63.12
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  AA K   KTK A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.7e-83 | % identity: 61.92
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.7e-83 | % identity: 61.92
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.0e-85 | % identity: 58.9
Query:  SHREFHARLRGVNRASPYGMCLHGSIPSISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVY
        +H E +AR RG     P  M     I  ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L +ECPQVP  NA ++++E Y+RW KAN+KA+ Y
Subjt:  SHREFHARLRGVNRASPYGMCLHGSIPSISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVY

Query:  ILASVSEVLAKKHEGMVSAREIMSSLQDMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSN
        ILAS+SEVLAKKHE M++AREIM SLQ+MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSN
Subjt:  ILASVSEVLAKKHEGMVSAREIMSSLQDMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSN

Query:  AVMNKIECNLTTLLNELQTFQSLMKNKGQANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGH
        AVMNKI   LTTLLNELQTF+SLMK KGQ  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH
Subjt:  AVMNKIECNLTTLLNELQTFQSLMKNKGQANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGH

Query:  RSEIAQKYL
              KYL
Subjt:  RSEIAQKYL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 5.7e-84 | % identity: 63.12
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  AA K   KTK A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 8.1e-84 | % identity: 61.92
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

A0A5A7TU93 Gag/pol protein | e value: 8.1e-84 | % identity: 61.92
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

A0A5A7V4M1 Gag/pol protein | e value: 1.5e-85 | % identity: 58.9
Query:  SHREFHARLRGVNRASPYGMCLHGSIPSISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVY
        +H E +AR RG     P  M     I  ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L +ECPQVP  NA ++++E Y+RW KAN+KA+ Y
Subjt:  SHREFHARLRGVNRASPYGMCLHGSIPSISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVY

Query:  ILASVSEVLAKKHEGMVSAREIMSSLQDMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSN
        ILAS+SEVLAKKHE M++AREIM SLQ+MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSN
Subjt:  ILASVSEVLAKKHEGMVSAREIMSSLQDMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSN

Query:  AVMNKIECNLTTLLNELQTFQSLMKNKGQANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGH
        AVMNKI   LTTLLNELQTF+SLMK KGQ  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  A     K   A KG CFH   +GH
Subjt:  AVMNKIECNLTTLLNELQTFQSLMKNKGQANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGH

Query:  RSEIAQKYL
              KYL
Subjt:  RSEIAQKYL

A0A5D3CPJ6 Gag/pol protein | e value: 2.8e-84 | % identity: 63.12
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  AA K   KTK A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

A0A5D3CSZ6 Gag/pol protein | e value: 2.8e-84 | % identity: 63.12
Query:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD
        ++S+ + +L +D+L   N+ +WK+ +NT+LI+DDLRF L EECPQVP  NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ+
Subjt:  ISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQD

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN+AEMN AVIDE SQV FILESL +SFLQFRSNAVMNKI   LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKNKG

Query:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL
        Q  GEAN+   +R  H+GS+S TKS  SSSG KK +KKK G   KA  AA K   KTK A KG CFH   +GH      KYL
Subjt:  QANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAP-AADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYL

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
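
Each alignment block above carries an unlabeled middle row in which, following the usual BLAST pairwise convention, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the Python below tallies identities and positives for a single block. The function name `midline_stats` is hypothetical, and the per-hit % identity reported on the page covers the whole alignment, so a per-block figure will generally differ from it.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one alignment block.

    The aligned length is taken from the query row, because trailing spaces
    of the midline are often lost when a page is copied as text.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, 100.0 * identities / len(query)


# First 16 columns of the block that starts with MFGQPSGQ in the alignments above.
print(midline_stats("MFGQPSGQIRHESLKY", "MFGQ S QI+H++LKY"))  # (11, 14, 68.75)
```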

Sequences
CDS sequence
ATGTTCGTGGAATTTTTCCACTGCCGAAGCGAGCGTTGCAACGCCGTTCATCGGCGTTGCAACGCTATAGAACGGCGTCGCAACGCCAACGCTCCATATCGACGCGGGCG
CTCAAGGAGTCAGCGGCGTTGCAACGCTGCTCCTCAGCGTTGCAACGCCACCCCGATAGCAGGCGCGTGCGAGGATGAAGCGGGCGTTGCAACGCCGTTACGCCTGTCCA
GCAGATGCTGGCCGGTTCAGCCGGTTCAACTGGAGCTTCAGTGGGAGAAAACGAGATGTTCACATCGAGTTTACTCTCACGTCTTCAAGAGTTCACACCGTGAGTTCCAT
GCTCGGCTACGTGGCGTCAATAGGGCGTCCCCCTACGGTATGTGTTTGCATGGTTCTATACCAAGCATATCGTCCTCAATAATAACCTTACTTAAAAGCGATCGTTTAAC
TTGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCATTGTTGACGACCTACGGTTTGAACTGACTGAGGAATGTCCTCAGGTCCCTACTCGTAACGCTC
CTCAATCTATTAAGGAGGCGTACGACCGCTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATG
GTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGATATGTTTGGACAACCATCTGGACAGATTCGACATGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGG
GTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACATGGCTGAGATGAACGAAGCGGTCATTGACGAGCAAAGTCAGGTCTTGTTCATCCTGGAATCTC
TTATGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTGTAACCTGACTACTCTCCTAAATGAACTACAAACTTTCCAGTCTCTTATGAAGAAT
AAGGGTCAGGCTAATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGTCTCCATAAGGGTTCATCCTCTAGGACTAAGTCCTGTGGTTCATCTTCTGGGCTTAAGAAGAC
CCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCTGCTGACAAAGGCAAGGGAAAGACCAAGGTTGCGGATAAAGGAAAATGTTTCCACACTGGCGTGGATGGGC
ACCGAAGCGAAATCGCCCAAAAATACCTTCTTTGA
mRNA sequence
ATGTTCGTGGAATTTTTCCACTGCCGAAGCGAGCGTTGCAACGCCGTTCATCGGCGTTGCAACGCTATAGAACGGCGTCGCAACGCCAACGCTCCATATCGACGCGGGCG
CTCAAGGAGTCAGCGGCGTTGCAACGCTGCTCCTCAGCGTTGCAACGCCACCCCGATAGCAGGCGCGTGCGAGGATGAAGCGGGCGTTGCAACGCCGTTACGCCTGTCCA
GCAGATGCTGGCCGGTTCAGCCGGTTCAACTGGAGCTTCAGTGGGAGAAAACGAGATGTTCACATCGAGTTTACTCTCACGTCTTCAAGAGTTCACACCGTGAGTTCCAT
GCTCGGCTACGTGGCGTCAATAGGGCGTCCCCCTACGGTATGTGTTTGCATGGTTCTATACCAAGCATATCGTCCTCAATAATAACCTTACTTAAAAGCGATCGTTTAAC
TTGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCATTGTTGACGACCTACGGTTTGAACTGACTGAGGAATGTCCTCAGGTCCCTACTCGTAACGCTC
CTCAATCTATTAAGGAGGCGTACGACCGCTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATG
GTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGATATGTTTGGACAACCATCTGGACAGATTCGACATGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGG
GTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACATGGCTGAGATGAACGAAGCGGTCATTGACGAGCAAAGTCAGGTCTTGTTCATCCTGGAATCTC
TTATGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTGTAACCTGACTACTCTCCTAAATGAACTACAAACTTTCCAGTCTCTTATGAAGAAT
AAGGGTCAGGCTAATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGTCTCCATAAGGGTTCATCCTCTAGGACTAAGTCCTGTGGTTCATCTTCTGGGCTTAAGAAGAC
CCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCTGCTGACAAAGGCAAGGGAAAGACCAAGGTTGCGGATAAAGGAAAATGTTTCCACACTGGCGTGGATGGGC
ACCGAAGCGAAATCGCCCAAAAATACCTTCTTTGA
Protein sequence
MFVEFFHCRSERCNAVHRRCNAIERRRNANAPYRRGRSRSQRRCNAAPQRCNATPIAGACEDEAGVATPLRLSSRCWPVQPVQLELQWEKTRCSHRVYSHVFKSSHREFH
ARLRGVNRASPYGMCLHGSIPSISSSIITLLKSDRLTCENFTTWKSNLNTILIVDDLRFELTEECPQVPTRNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGM
VSAREIMSSLQDMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNMAEMNEAVIDEQSQVLFILESLMKSFLQFRSNAVMNKIECNLTTLLNELQTFQSLMKN
KGQANGEANLFAHSRSLHKGSSSRTKSCGSSSGLKKTQKKKIGGKGKAPAADKGKGKTKVADKGKCFHTGVDGHRSEIAQKYLL
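
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The Python below is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. It is exercised here only on the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


print(translate("ATGTTCGTGGAATTTTTCCACTGC"))  # MFVEFFHC, the start of the protein
print(translate("AAATACCTTCTTTGA"))           # KYLL, the end of the protein
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the joined protein lines gives a quick consistency check between the two records.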