; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020219 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020219
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr04:29944839..29947663
RNA-Seq ExpressionHG10020219
SyntenyHG10020219
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580620.1 hypothetical protein SDJN03_20622, partial [Cucurbita argyrosperma subsp. sororia]2.2e-11191.45Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSMAD SLSLCFSS SS FCISRSLHLS          SPRFS+ HHRPSRLLRFSVKSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRIS SDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

KAG7017377.1 hypothetical protein SDJN02_19242 [Cucurbita argyrosperma subsp. argyrosperma]5.0e-11191.03Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSMAD SLSLCFSS SS FCI+RSLHLS          SPRFS+ HHRPSRLLRFSVKSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRIS SDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

XP_022934142.1 uncharacterized protein LOC111441404 isoform X1 [Cucurbita moschata]2.2e-11191.45Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSMAD SLSLCFSS SS FCISRSLHLS          SPRFS+ HHRPSRLLRFSVKSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRIS SDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

XP_022983093.1 uncharacterized protein LOC111481743 [Cucurbita maxima]2.2e-11191.03Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSM D SLSLCFSS SS FCISRSLHLS          SPRFS+ HHRPSRLLRFS+KSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

XP_038905101.1 uncharacterized protein LOC120091234 [Benincasa hispida]6.7e-11694.87Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSMAD SLSLCFSSFS    ISRSLHLSPSFLLHPFL+SPRFSV HHRPSRLLRFS+K SSSGSF GDDSFGLFPW+DGDSEIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCI+GFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

TrEMBL top hitse value%identityAlignment
A0A1S3B735 uncharacterized protein LOC1034865017.7e-11091.06Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGD-DSFGLFPWADGDSEIHWVPEERVTLFTPD
        MLSSMAD SLS  FSSFSS      SLHLSPSFL HPFLFSP+F + HHRPS LLRFS+KSSSSG F GD DSFGLFPWADGDSEIHWVPEERVTLFTPD
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGD-DSFGLFPWADGDSEIHWVPEERVTLFTPD

Query:  GLVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK
        GLVQIGGSIVPRRISSSDKKQGKSK  QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK
Subjt:  GLVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK

Query:  GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

A0A5D3DPB2 Uncharacterized protein7.7e-11091.06Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGD-DSFGLFPWADGDSEIHWVPEERVTLFTPD
        MLSSMAD SLS  FSSFSS      SLHLSPSFL HPFLFSP+F + HHRPS LLRFS+KSSSSG F GD DSFGLFPWADGDSEIHWVPEERVTLFTPD
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGD-DSFGLFPWADGDSEIHWVPEERVTLFTPD

Query:  GLVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK
        GLVQIGGSIVPRRISSSDKKQGKSK  QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK
Subjt:  GLVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASK

Query:  GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GGLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

A0A6J1CTU7 uncharacterized protein LOC111014232 isoform X12.8e-10790.87Show/hide
Query:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLVQI
        MA+ S +LCFSSFSS  CISRSL LSPSFL  P  FS  FSV HHRPSRLLRFSV+SS SGSF GDDS GLFPWADG SEIHWVPEERVTLFTPDGLVQI
Subjt:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLVQI

Query:  GGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQE
        GGSIVPRRISSSDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQE
Subjt:  GGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQE

Query:  KLTMTVAVPLLWGVPPASETLHLAVQSGGG
        KLTMTVAVPLLWGVPPASETLH AVQSGGG
Subjt:  KLTMTVAVPLLWGVPPASETLHLAVQSGGG

A0A6J1F1V4 uncharacterized protein LOC111441404 isoform X11.1e-11191.45Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSMAD SLSLCFSS SS FCISRSLHLS          SPRFS+ HHRPSRLLRFSVKSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRIS SDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

A0A6J1J6S7 uncharacterized protein LOC1114817431.1e-11191.03Show/hide
Query:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG
        MLSSM D SLSLCFSS SS FCISRSLHLS          SPRFS+ HHRPSRLLRFS+KSS+SGSF GDDSFGLFPW DGD+EIHWVPEERVTLFTPDG
Subjt:  MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDG

Query:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
        LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG
Subjt:  LVQIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKG

Query:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
Subjt:  GLQEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36895.1 unknown protein3.6e-7564.66Show/hide
Query:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFS--VPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLV
        MA+ S +L FS+FSS   IS      PS        + RFS  +   RPS   RF+VK+S  G+F+ DD+F  FPW+D ++EI WVPEER+TLFT DGLV
Subjt:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFS--VPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLV

Query:  QIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
        QIGG++VPRRI SS+KK G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GL
Subjt:  QIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL

Query:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        QEKLTMTVAVP LWGVPPA+E LHLAV++GGG
Subjt:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGG

AT2G36895.2 unknown protein3.4e-7364.22Show/hide
Query:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFS--VPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLV
        MA+ S +L FS+FSS   IS      PS        + RFS  +   RPS   RF+VK+S  G+F+ DD+F  FPW+D ++EI WVPEER+TLFT DGLV
Subjt:  MADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFS--VPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLV

Query:  QIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
        QIGG++VPRRI SS+ K G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GL
Subjt:  QIGGSIVPRRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL

Query:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGG
        QEKLTMTVAVP LWGVPPA+E LHLAV++GGG
Subjt:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCATCCATGGCTGACCTTTCTCTTTCCTTATGTTTCTCTTCCTTCTCTTCCTCCTTCTGCATTTCCCGCTCCCTCCACCTTTCCCCTTCTTTCCTTTTACACCC
TTTTCTTTTTTCTCCTAGATTCTCGGTCCCCCATCATCGCCCATCTCGTCTCCTTCGTTTCTCCGTCAAATCCTCTTCCTCTGGAAGCTTCACAGGGGACGATTCCTTCG
GATTGTTTCCTTGGGCTGATGGTGATAGCGAAATCCATTGGGTTCCCGAGGAGAGAGTCACATTGTTCACCCCTGATGGGCTTGTTCAGATTGGAGGCTCCATCGTCCCT
AGACGAATTTCTTCTTCAGATAAAAAACAAGGGAAATCAAAGGCTTACCAAAGATTCCAACGGTTTCAAGAGAGTGATTACATGGATCCAAAACAGAGCATATGTCTTGG
TGCTCTATTTGATATTGCAGCTACCAATGGACTTGACATGGGAAGAAGACTTTGTATCTTTGGTTTTTGCCGTTCTGTTGAGATGCTAAGTGATGTTGTGGAGGACATTG
TTTTGGAGCAAGGTGGAGAGGTTGTAGCAGCAGAGAAGGCAAGTAAAGGGGGTTTGCAGGAAAAACTAACCATGACAGTTGCTGTGCCACTTCTATGGGGGGTTCCTCCT
GCTTCTGAAACTCTTCATTTAGCTGTTCAGAGTGGTGGAGGGGGAAAGAAGAAAGCACCTAAAGGAGCGAAGGTTGCTTGTCAAGAACGTTGGCTCTCGTTGGGTGGTGG
CTTAGGTGGACGTATACGTCCTGTCCTATGCCAAAGACCCCTGCATCATGACCCACAAGGTCTTATGGGCCCAAGTGCCAAAGGTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCATCCATGGCTGACCTTTCTCTTTCCTTATGTTTCTCTTCCTTCTCTTCCTCCTTCTGCATTTCCCGCTCCCTCCACCTTTCCCCTTCTTTCCTTTTACACCC
TTTTCTTTTTTCTCCTAGATTCTCGGTCCCCCATCATCGCCCATCTCGTCTCCTTCGTTTCTCCGTCAAATCCTCTTCCTCTGGAAGCTTCACAGGGGACGATTCCTTCG
GATTGTTTCCTTGGGCTGATGGTGATAGCGAAATCCATTGGGTTCCCGAGGAGAGAGTCACATTGTTCACCCCTGATGGGCTTGTTCAGATTGGAGGCTCCATCGTCCCT
AGACGAATTTCTTCTTCAGATAAAAAACAAGGGAAATCAAAGGCTTACCAAAGATTCCAACGGTTTCAAGAGAGTGATTACATGGATCCAAAACAGAGCATATGTCTTGG
TGCTCTATTTGATATTGCAGCTACCAATGGACTTGACATGGGAAGAAGACTTTGTATCTTTGGTTTTTGCCGTTCTGTTGAGATGCTAAGTGATGTTGTGGAGGACATTG
TTTTGGAGCAAGGTGGAGAGGTTGTAGCAGCAGAGAAGGCAAGTAAAGGGGGTTTGCAGGAAAAACTAACCATGACAGTTGCTGTGCCACTTCTATGGGGGGTTCCTCCT
GCTTCTGAAACTCTTCATTTAGCTGTTCAGAGTGGTGGAGGGGGAAAGAAGAAAGCACCTAAAGGAGCGAAGGTTGCTTGTCAAGAACGTTGGCTCTCGTTGGGTGGTGG
CTTAGGTGGACGTATACGTCCTGTCCTATGCCAAAGACCCCTGCATCATGACCCACAAGGTCTTATGGGCCCAAGTGCCAAAGGTAGAAGGTGA
Protein sequenceShow/hide protein sequence
MLSSMADLSLSLCFSSFSSSFCISRSLHLSPSFLLHPFLFSPRFSVPHHRPSRLLRFSVKSSSSGSFTGDDSFGLFPWADGDSEIHWVPEERVTLFTPDGLVQIGGSIVP
RRISSSDKKQGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLTMTVAVPLLWGVPP
ASETLHLAVQSGGGGKKKAPKGAKVACQERWLSLGGGLGGRIRPVLCQRPLHHDPQGLMGPSAKGRR