; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033665 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033665
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSAP30_Sin3_bdg domain-containing protein
Genome locationscaffold13:39054257..39056935
RNA-Seq ExpressionSpg033665
SyntenySpg033665
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
InterPro domainsIPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR038291 - SAP30, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152712.1 uncharacterized protein LOC101220556 isoform X1 [Cucumis sativus]3.4e-8996.63Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWN ++MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

XP_022144390.1 uncharacterized protein LOC111014080 isoform X1 [Momordica charantia]6.8e-9097.19Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWNGL+MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDL KLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

XP_022144391.1 uncharacterized protein LOC111014080 isoform X2 [Momordica charantia]2.2e-8897.19Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWNGL+MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST VDL KLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

XP_022962615.1 uncharacterized protein LOC111463011 isoform X1 [Cucurbita moschata]5.7e-8977.97Show/hide
Query:  MIEAVETSIN-GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEF
        MI+AVE+SIN GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPT NEEDDDLEF
Subjt:  MIEAVETSIN-GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEF

Query:  ENLQWNGLEMASDDAQKSHKSRHKLHKSSG-SSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLVSRRYSQPVKRAIGRSGSEAF
        ENLQWNG++MASDDAQK HKSRH+LHKSSG SSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYW+HFNLV      P K  +       F
Subjt:  ENLQWNGLEMASDDAQKSHKSRHKLHKSSG-SSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLVSRRYSQPVKRAIGRSGSEAF

Query:  HVTATGRVAGHNGFCEGCKETEDSVQMRGGEELGNP
               +    GF +  K  + +VQ+RGGE+LGNP
Subjt:  HVTATGRVAGHNGFCEGCKETEDSVQMRGGEELGNP

XP_038883974.1 uncharacterized protein LOC120074938 isoform X1 [Benincasa hispida]5.7e-8996.63Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWNG++MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSA QGST+VDLSKLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

TrEMBL top hitse value%identityAlignment
A0A0A0LNK2 SAP30_Sin3_bdg domain-containing protein1.6e-8996.63Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWN ++MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

A0A1S3BAV7 uncharacterized protein LOC103487947 isoform X11.6e-8996.63Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWN ++MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

A0A5A7VFZ9 Histone deacetylase complex subunit SAP30/SAP30-like protein1.6e-8996.63Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWN ++MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

A0A6J1CRH4 uncharacterized protein LOC111014080 isoform X13.3e-9097.19Show/hide
Query:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVE+SINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        NLQWNGL+MASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST+VDL KLEM ALWRYWRHFNLV
Subjt:  NLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

A0A6J1HFL4 uncharacterized protein LOC111463011 isoform X12.8e-8977.97Show/hide
Query:  MIEAVETSIN-GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEF
        MI+AVE+SIN GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPT NEEDDDLEF
Subjt:  MIEAVETSIN-GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEF

Query:  ENLQWNGLEMASDDAQKSHKSRHKLHKSSG-SSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLVSRRYSQPVKRAIGRSGSEAF
        ENLQWNG++MASDDAQK HKSRH+LHKSSG SSHKTMSRSLSCDSQSKSSVSAPQGST+VDLSKLEM ALWRYW+HFNLV      P K  +       F
Subjt:  ENLQWNGLEMASDDAQKSHKSRHKLHKSSG-SSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLVSRRYSQPVKRAIGRSGSEAF

Query:  HVTATGRVAGHNGFCEGCKETEDSVQMRGGEELGNP
               +    GF +  K  + +VQ+RGGE+LGNP
Subjt:  HVTATGRVAGHNGFCEGCKETEDSVQMRGGEELGNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein4.7e-6574.87Show/hide
Query:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV++S  +NGGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLQWNGLEM-----ASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        L+FEN Q NG +M     AS+D  K HKS+ +  +SS SSHKTMSRSLS DSQSKSS   P  + +VDLSKLEM AL  YWRHFNLV
Subjt:  LEFENLQWNGLEM-----ASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

AT1G19330.2 unknown protein1.9e-6676.37Show/hide
Query:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV++S  +NGGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        L+FEN Q NG +M S+D  K HKS+ +  +SS SSHKTMSRSLS DSQSKSS   P  + +VDLSKLEM AL  YWRHFNLV
Subjt:  LEFENLQWNGLEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

AT1G19330.3 unknown protein1.8e-6474.47Show/hide
Query:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV++S  +NGGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVETS--INGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLQWNGLEM-----ASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSS-VSAPQGSTRVDLSKLEMTALWRYWRHFNLV
        L+FEN Q NG +M     AS+D  K HKS+ +  +SS SSHKTMSRSLS DSQSKSS  + P+   +VDLSKLEM AL  YWRHFNLV
Subjt:  LEFENLQWNGLEM-----ASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSS-VSAPQGSTRVDLSKLEMTALWRYWRHFNLV

AT1G75060.1 unknown protein1.9e-5872.09Show/hide
Query:  GGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-G
        GGFS LQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + QWN  
Subjt:  GGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-G

Query:  LEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
         +M ++D  K HKS+ + H+SS  S K + R +SCDS SK S   P+ + +VDL+KL+M AL RYWRHFNLV
Subjt:  LEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV

AT1G75060.2 unknown protein6.1e-5772.09Show/hide
Query:  GGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-G
        GGFS LQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + QWN  
Subjt:  GGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-G

Query:  LEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV
         +M ++D  K HKS+ + H+SS  S K + R +SCDS SK S   P+    VDL+KL+M AL RYWRHFNLV
Subjt:  LEMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAAGCTGTGGAGACTTCCATCAATGGCGGTTTCTCGCACTTGCAGAGCTGTGGGGACAGTAGCGAGGAGGAGCTTTCGGTTCTTCCTCGTCATACCAAGGTCGT
CGTTACCGGAAATAATCGAACCAAATCGGTCCTCGTTGGACTTCAAGGCGTCGTCAAGAAAGCCGTTGGCCTTGGCGGCTGGCATTGGCTGGTTCTAACAAATGGAATAG
AAGTGAAACTACAACGGAATGCCCTTAGCGTGATCGAGGCTCCGACGGGTAATGAGGAAGACGACGACCTCGAATTTGAGAACTTGCAATGGAATGGATTGGAGATGGCA
TCCGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTACACAAATCATCTGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCGAAGAG
CTCGGTTTCTGCACCGCAAGGATCCACGAGGGTTGACCTTAGTAAATTGGAGATGACTGCATTATGGAGATATTGGCGACACTTCAATCTCGTAAGTAGACGCTATTCCC
AACCCGTCAAAAGAGCAATTGGTAGATCTGGTTCAGAGGCATTTCATGTCACAGCAACTGGACGAGTTGCAGGTCATAATGGGTTTTGTGAAGGCTGCAAAGAGACTGAA
GACAGTGTGCAAATGAGAGGAGGAGAGGAACTGGGGAATCCATCG
mRNA sequenceShow/hide mRNA sequence
ATGATTGAAGCTGTGGAGACTTCCATCAATGGCGGTTTCTCGCACTTGCAGAGCTGTGGGGACAGTAGCGAGGAGGAGCTTTCGGTTCTTCCTCGTCATACCAAGGTCGT
CGTTACCGGAAATAATCGAACCAAATCGGTCCTCGTTGGACTTCAAGGCGTCGTCAAGAAAGCCGTTGGCCTTGGCGGCTGGCATTGGCTGGTTCTAACAAATGGAATAG
AAGTGAAACTACAACGGAATGCCCTTAGCGTGATCGAGGCTCCGACGGGTAATGAGGAAGACGACGACCTCGAATTTGAGAACTTGCAATGGAATGGATTGGAGATGGCA
TCCGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTACACAAATCATCTGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCGAAGAG
CTCGGTTTCTGCACCGCAAGGATCCACGAGGGTTGACCTTAGTAAATTGGAGATGACTGCATTATGGAGATATTGGCGACACTTCAATCTCGTAAGTAGACGCTATTCCC
AACCCGTCAAAAGAGCAATTGGTAGATCTGGTTCAGAGGCATTTCATGTCACAGCAACTGGACGAGTTGCAGGTCATAATGGGTTTTGTGAAGGCTGCAAAGAGACTGAA
GACAGTGTGCAAATGAGAGGAGGAGAGGAACTGGGGAATCCATCG
Protein sequenceShow/hide protein sequence
MIEAVETSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGLEMA
SDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTRVDLSKLEMTALWRYWRHFNLVSRRYSQPVKRAIGRSGSEAFHVTATGRVAGHNGFCEGCKETE
DSVQMRGGEELGNPS