CuGenDBv2

Tan0007771 (gene) of Snake gourd v1 genome

Gene ID: Tan0007771
Organism: Trichosanthes anguina (Snake gourd v1)
Description: zinc finger homeobox protein 4-like isoform X1
Genome location: LG02:90253029..90254819
RNA-Seq Expression: Tan0007771
Synteny: Tan0007771
Gene Ontology terms: NA
InterPro domains: NA
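The genome location field uses a `scaffold:start..end` layout. As a minimal sketch, the snippet below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string into its three parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(scaffold, start, end)


if __name__ == "__main__":
    loc = parse_location("LG02:90253029..90254819")
    print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus on LG02 spans 1791 bp.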


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.5e-89 | identity: 68.38%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        PHC+     F+P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKKP++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESS H       
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q APMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.6e-88 | identity: 68.29%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        PHC+     F+P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKKP++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESS H       
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPPN
             Q APMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PPN
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPPN

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata] | e value: 3.8e-90 | identity: 68.73%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        PHC+S D  F+P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKKP++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESS H       
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q  PMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima] | e value: 2.9e-90 | identity: 69.07%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        P C+S D  F P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKK ++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESSNHQ      
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q APMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo] | e value: 8.5e-90 | identity: 68.73%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        PHC+S D  F P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K +KKKP+ DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESS H       
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q APMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LRP1 Uncharacterized protein | e value: 1.2e-73 | identity: 58.04%
Query:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSA------PAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTN-AKL
        F+P+E +VAQIL +LPLL+Q+S+FSLGL P+W IRRKRS +DSPPD++      P P P  +   SS++ KESSPT+PL+L+S P SRSESDENT  AK+
Subjt:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSA------PAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTN-AKL

Query:  SKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPS
        SKKK  VDKKSQY+E I++LT Q Q L+G+ +AM++H+ +LKTINS LKAKKQE++ G  N S  P+ G  +S AM+  K TV+SS+     +H E +PS
Subjt:  SKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPS

Query:  INNQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK-----SNGAAKLQS
        + NQT P+AEQS N+ QN+QIP+G IP YDPS+GP+GIPDLN+SLE+I  ++Y+K++AA+ARQNRIQI KNK     +NGA KLQS
Subjt:  INNQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK-----SNGAAKLQS

A0A1S3BAR4 uncharacterized protein LOC103488049 | e value: 6.8e-77 | identity: 61.35%
Query:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDS----APAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDEN-TNAKLSK
        F+P+EL+VAQIL +LPLL+QKSNFSLGL P+W IRRKRS +DSPPD+       P P      SS++ KESSPT+PL+LNS P SRSESDEN T AK+SK
Subjt:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDS----APAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDEN-TNAKLSK

Query:  KKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPSIN
        KK  VDKKSQY+E ID+LT Q Q L+G+ +AM++H+ +LKTINS LKAKKQE++ G  N S  PEIG  SS AM+  K TV+SS      +H E +PS+ 
Subjt:  KKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPSIN

Query:  NQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK---SNGAAKLQS
        NQT P AEQ  N+++N+QIP+G IP YDPS+GP+GIPDLN+SLE+I  +SY+K++AARARQNRIQI KNK   +NGA KLQS
Subjt:  NQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK---SNGAAKLQS

A0A5A7VHE1 Uncharacterized protein | e value: 6.8e-77 | identity: 61.35%
Query:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDS----APAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDEN-TNAKLSK
        F+P+EL+VAQIL +LPLL+QKSNFSLGL P+W IRRKRS +DSPPD+       P P      SS++ KESSPT+PL+LNS P SRSESDEN T AK+SK
Subjt:  FNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDS----APAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDEN-TNAKLSK

Query:  KKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPSIN
        KK  VDKKSQY+E ID+LT Q Q L+G+ +AM++H+ +LKTINS LKAKKQE++ G  N S  PEIG  SS AM+  K TV+SS      +H E +PS+ 
Subjt:  KKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSS-AMKAVKFTVESSN---HQHHHEFQPSIN

Query:  NQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK---SNGAAKLQS
        NQT P AEQ  N+++N+QIP+G IP YDPS+GP+GIPDLN+SLE+I  +SY+K++AARARQNRIQI KNK   +NGA KLQS
Subjt:  NQTAPMAEQSTNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNK---SNGAAKLQS

A0A6J1GI34 uncharacterized protein LOC111454352 | e value: 1.8e-90 | identity: 68.73%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        PHC+S D  F+P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKKP++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESS H       
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q  PMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

A0A6J1KP15 uncharacterized protein LOC111496295 | e value: 1.4e-90 | identity: 69.07%
Query:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
        P C+S D  F P E  VAQIL+E     +KS   LG IP W++RRKRS + SPP+S+     A+V S  SKKVKESSPTSPL LNS P SRSESDE+TNA
Subjt:  PHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA

Query:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ
        K SKKK ++DKKSQ+VEAIDELTKQNQ LKGEF+AM+QHYN LK INS LKAKKQEMILG   SKNES IPEIG  SSAM+ VK  TVESSNHQ      
Subjt:  KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILG---SKNESGIPEIGALSSAMKAVK-FTVESSNHQHHHEFQ

Query:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM
             Q APMAEQS N SQNFQIP+G IPFYDP S+ P+GIPDLNISLEEINQR+YS+FMAARAR+NRIQICKNK+NG  KLQ+PP NPCM
Subjt:  PSINNQTAPMAEQSTNNSQNFQIPVGAIPFYDP-SMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPP-NPCM

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGGCGTCTTCTTCTTATCCTCCTCCTCCTCCTCATTGTTCCTCCCCCGATCCCCCCTTCAACCCTGACGAACTCTACGTCGCTCAAATCCTCATCGAATTGCCTCTTCT
CGTTCAGAAATCCAACTTTTCTCTCGGCTTAATCCCCGCCTGGTCCATCCGACGCAAGAGATCCGTCATTGATTCGCCGCCGGACTCCGCCCCCGCCCCCGCCCCCGCCG
CTGTCTCCTCTCTGTCGTCCAAGAAGGTCAAAGAGTCCAGCCCTACTTCTCCTCTTGCCCTCAACTCCACGCCCTTTTCCCGGAGTGAATCCGATGAGAATACCAACGCC
AAACTCTCCAAGAAGAAGCCCGCTGTCGATAAGAAATCTCAGTATGTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGATTTGAAAGGGGAATTTCAGGCTATGCA
GCAACATTATAATAGTCTCAAAACTATCAATTCGAACTTGAAGGCAAAAAAGCAAGAGATGATTCTGGGTTCTAAGAACGAATCAGGAATTCCAGAAATAGGAGCCTTAA
GTTCGGCCATGAAAGCCGTTAAGTTCACTGTCGAGTCCTCAAATCATCAACATCATCATGAATTTCAACCGTCGATCAACAATCAGACGGCTCCCATGGCGGAACAGAGT
ACTAACAACAGTCAGAATTTTCAAATCCCAGTTGGGGCAATTCCTTTCTATGATCCATCAATGGGTCCAATTGGTATTCCTGATTTGAACATATCTCTTGAAGAAATTAA
TCAGAGGAGTTACTCCAAATTCATGGCGGCTCGAGCAAGACAGAACAGGATTCAGATCTGCAAGAACAAGAGCAACGGAGCCGCCAAATTGCAGAGTCCTCCTAATCCCT
GTATGTGA

mRNA sequence
CCTCTCACTGCTCTCTTTTCTCTTCTTCTATTCTTCCTCTTCTATAAATCCCCCAATCCCTTTTCTTCACTGAATAATTCTCTCTCTCTTTTTCATGGAATTTCGCTTGC
CCTAGACCTCCGTACATCCACTCCCATGGCGTCTTCTTCTTATCCTCCTCCTCCTCCTCATTGTTCCTCCCCCGATCCCCCCTTCAACCCTGACGAACTCTACGTCGCTC
AAATCCTCATCGAATTGCCTCTTCTCGTTCAGAAATCCAACTTTTCTCTCGGCTTAATCCCCGCCTGGTCCATCCGACGCAAGAGATCCGTCATTGATTCGCCGCCGGAC
TCCGCCCCCGCCCCCGCCCCCGCCGCTGTCTCCTCTCTGTCGTCCAAGAAGGTCAAAGAGTCCAGCCCTACTTCTCCTCTTGCCCTCAACTCCACGCCCTTTTCCCGGAG
TGAATCCGATGAGAATACCAACGCCAAACTCTCCAAGAAGAAGCCCGCTGTCGATAAGAAATCTCAGTATGTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGATT
TGAAAGGGGAATTTCAGGCTATGCAGCAACATTATAATAGTCTCAAAACTATCAATTCGAACTTGAAGGCAAAAAAGCAAGAGATGATTCTGGGTTCTAAGAACGAATCA
GGAATTCCAGAAATAGGAGCCTTAAGTTCGGCCATGAAAGCCGTTAAGTTCACTGTCGAGTCCTCAAATCATCAACATCATCATGAATTTCAACCGTCGATCAACAATCA
GACGGCTCCCATGGCGGAACAGAGTACTAACAACAGTCAGAATTTTCAAATCCCAGTTGGGGCAATTCCTTTCTATGATCCATCAATGGGTCCAATTGGTATTCCTGATT
TGAACATATCTCTTGAAGAAATTAATCAGAGGAGTTACTCCAAATTCATGGCGGCTCGAGCAAGACAGAACAGGATTCAGATCTGCAAGAACAAGAGCAACGGAGCCGCC
AAATTGCAGAGTCCTCCTAATCCCTGTATGTGATCGCAGAGTTCAACAAATTCGACAATTCCACATTTTTACATTCTTTTTTTATCTTAGTATTCAATTTCATCAATTTG
ATGATGGGGGGCCTATTTTGATTGTGGAATTTGGGGTAGGTTTTTAATTTTTATTTTTATTTTAATTTTGTCCTTTTTGTAGATTTTTGAATTGGGGGTTACTCCAATTG
TAAAGTTAGATAGAATCCTAAGAGCTGCCCCCTCTAGTGAGCCATTCTATTTTTT

Protein sequence
MASSSYPPPPPHCSSPDPPFNPDELYVAQILIELPLLVQKSNFSLGLIPAWSIRRKRSVIDSPPDSAPAPAPAAVSSLSSKKVKESSPTSPLALNSTPFSRSESDENTNA
KLSKKKPAVDKKSQYVEAIDELTKQNQDLKGEFQAMQQHYNSLKTINSNLKAKKQEMILGSKNESGIPEIGALSSAMKAVKFTVESSNHQHHHEFQPSINNQTAPMAEQS
TNNSQNFQIPVGAIPFYDPSMGPIGIPDLNISLEEINQRSYSKFMAARARQNRIQICKNKSNGAAKLQSPPNPCM
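
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the record itself does not name a table. The `translate` helper is hypothetical and written here only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a multi-line sequence can be
    pasted in directly. Codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons and last six codons of the CDS listed above.
    print(translate("ATGGCGTCTTCTTCTTATCCT"))
    print(translate("CCTAATCCCTGTATGTGA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (after removing line breaks) gives a quick consistency check on the record.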