; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G014170 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G014170
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSAP30_Sin3_bdg domain-containing protein
Genome locationchr08:22413436..22416170
RNA-Seq ExpressionLsi08G014170
SyntenyLsi08G014170
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
IPR038291 - SAP30, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152712.1 uncharacterized protein LOC101220556 isoform X1 [Cucumis sativus]9.2e-9088.46Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                     VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KRLKTVCK
Subjt:  KRLKTVCK

XP_008444685.1 PREDICTED: uncharacterized protein LOC103487947 isoform X2 [Cucumis melo]7.0e-9088.89Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                    VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKTVCK
        RLKTVCK
Subjt:  RLKTVCK

XP_022144391.1 uncharacterized protein LOC111014080 isoform X2 [Momordica charantia]1.3e-8888.73Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNG+DMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                    VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKT
        RLKT
Subjt:  RLKT

XP_038883974.1 uncharacterized protein LOC120074938 isoform X1 [Benincasa hispida]1.0e-8887.98Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSA QGST                     V AIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KRLKTVCK
Subjt:  KRLKTVCK

XP_038883975.1 uncharacterized protein LOC120074938 isoform X2 [Benincasa hispida]1.2e-8988.89Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTVD--------------------AIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSA QGSTVD                    AIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTVD--------------------AIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKTVCK
        RLKTVCK
Subjt:  RLKTVCK

TrEMBL top hitse value%identityAlignment
A0A0A0LNK2 SAP30_Sin3_bdg domain-containing protein4.4e-9088.46Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                     VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KRLKTVCK
Subjt:  KRLKTVCK

A0A1S3BAV7 uncharacterized protein LOC103487947 isoform X14.4e-9088.46Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                     VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KRLKTVCK
Subjt:  KRLKTVCK

A0A1S3BAY6 uncharacterized protein LOC103487947 isoform X23.4e-9088.89Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                    VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKTVCK
        RLKTVCK
Subjt:  RLKTVCK

A0A5A7VFZ9 Histone deacetylase complex subunit SAP30/SAP30-like protein4.4e-9088.46Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWN IDMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                     VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST---------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KRLKTVCK
Subjt:  KRLKTVCK

A0A6J1CT98 uncharacterized protein LOC111014080 isoform X26.4e-8988.73Show/hide
Query:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ
        + SCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNG+DMASDDAQ
Subjt:  MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQ

Query:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
        KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST                    VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
Subjt:  KSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQGST--------------------VDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKT
        RLKT
Subjt:  RLKT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein1.1e-6971.5Show/hide
Query:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDM-----ASDDAQK
        SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NG DM     AS+D  K
Subjt:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDM-----ASDDAQK

Query:  SHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP---------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK
         HKS+ +  +SS SSHKTMSRSLS DSQSKSS   P                       + VDAIPNPSKEQL+D+VQRHFMSQQ+DELQVI+GFV+AAK
Subjt:  SHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP---------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAK

Query:  RLKTVCK
        R+K  CK
Subjt:  RLKTVCK

AT1G19330.2 unknown protein4.6e-7172.77Show/hide
Query:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQKSHKSR
        SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NG DM S+D  K HKS+
Subjt:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQKSHKSR

Query:  HKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP---------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAKRLKTV
         +  +SS SSHKTMSRSLS DSQSKSS   P                       + VDAIPNPSKEQL+D+VQRHFMSQQ+DELQVI+GFV+AAKR+K  
Subjt:  HKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP---------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAKRLKTV

Query:  CK
        CK
Subjt:  CK

AT1G19330.3 unknown protein1.5e-6971.15Show/hide
Query:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDM-----ASDDAQK
        SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NG DM     AS+D  K
Subjt:  SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDM-----ASDDAQK

Query:  SHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP----------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA
         HKS+ +  +SS SSHKTMSRSLS DSQSKSS   P                        + VDAIPNPSKEQL+D+VQRHFMSQQ+DELQVI+GFV+AA
Subjt:  SHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAP----------------------QGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAA

Query:  KRLKTVCK
        KR+K  CK
Subjt:  KRLKTVCK

AT1G75060.1 unknown protein2.5e-6163.21Show/hide
Query:  MASC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-GIDMAS
        + SC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + QWN   DM +
Subjt:  MASC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-GIDMAS

Query:  DDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQ---------------------GSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGF
        +D  K HKS+ + H+SS  S K + R +SCDS SK S   P+                      + VDA+PNP+KEQL+D++QRHFMSQQ+DELQVI+GF
Subjt:  DDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQ---------------------GSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGF

Query:  VKAAKRLKTVCK
        V+AA  +K  C+
Subjt:  VKAAKRLKTVCK

AT1G75060.2 unknown protein1.9e-6163.51Show/hide
Query:  MASC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-GIDMAS
        + SC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + QWN   DM +
Subjt:  MASC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLQWN-GIDMAS

Query:  DDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQ--------------------GSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFV
        +D  K HKS+ + H+SS  S K + R +SCDS SK S   P+                     + VDA+PNP+KEQL+D++QRHFMSQQ+DELQVI+GFV
Subjt:  DDAQKSHKSRHKLHKSSGSSHKTMSRSLSCDSQSKSSVSAPQ--------------------GSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFV

Query:  KAAKRLKTVCK
        +AA  +K  C+
Subjt:  KAAKRLKTVCK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCTGTGGGGACAGTAGTGAGGAAGAGCTCTCAGTGCTTCCTCGCCATACCAAGGTCGTCGTTACTGGAAACAATCGTACCAAATCCGTCCTCGTTGGACTTCA
AGGCGTTGTCAAGAAAGCCGTTGGCCTTGGCGGGTGGCATTGGCTGGTTCTAACAAATGGTATAGAGGTGAAACTGCAGCGGAATGCGCTTAGTGTGATTGAGGCTCCGA
CGGGTAATGAGGAAGACGACGATCTCGAATTCGAGAACTTGCAATGGAATGGAATTGATATGGCATCTGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTACAC
AAATCATCTGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCGAAGAGCTCGGTTTCTGCACCGCAAGGATCCACGGTAGATGCCATTCCGAA
CCCGTCGAAAGAGCAATTGGTAGACCTAGTTCAGAGGCATTTCATGTCACAGCAACTGGATGAGTTGCAGGTCATAATGGGTTTTGTGAAGGCTGCAAAGAGGCTGAAGA
CAGTGTGCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCTGTGGGGACAGTAGTGAGGAAGAGCTCTCAGTGCTTCCTCGCCATACCAAGGTCGTCGTTACTGGAAACAATCGTACCAAATCCGTCCTCGTTGGACTTCA
AGGCGTTGTCAAGAAAGCCGTTGGCCTTGGCGGGTGGCATTGGCTGGTTCTAACAAATGGTATAGAGGTGAAACTGCAGCGGAATGCGCTTAGTGTGATTGAGGCTCCGA
CGGGTAATGAGGAAGACGACGATCTCGAATTCGAGAACTTGCAATGGAATGGAATTGATATGGCATCTGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTACAC
AAATCATCTGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCGAAGAGCTCGGTTTCTGCACCGCAAGGATCCACGGTAGATGCCATTCCGAA
CCCGTCGAAAGAGCAATTGGTAGACCTAGTTCAGAGGCATTTCATGTCACAGCAACTGGATGAGTTGCAGGTCATAATGGGTTTTGTGAAGGCTGCAAAGAGGCTGAAGA
CAGTGTGCAAATGAGAGGAGCAGAGAAACTGGGGAATCCATTAAGTTGATTCTCATGCTAACAGATAATTCAGCTGAATCAAAAATGTTTCGATTCTCTATGTAACATAT
GTATCGATCGATCGCAAATGGTTTTTGTTTTTGGGGCCGGGATTAGAAAGGGGGTTATTAGTACGATTCTATCGTAGTAGTTAATAGAGTAATATCTTGTTTTAGTGACG
ATGTTACTTGTAAGTTGTAATGTAGATATGTGTAAACTGCAAATATTAGAGACCTCTGAGAGAAATGCCCCAAACGACTTGCTTTCTAAGCCTTTGTTTAATGTAATCCT
CACTAACTTTTGGGATTCCCATTGCCCAAAAATGTACATATTTTCTTGTTTCCAAATCCCATCTCTTTGATCTTTGCTTTC
Protein sequenceShow/hide protein sequence
MASCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLQWNGIDMASDDAQKSHKSRHKLH
KSSGSSHKTMSRSLSCDSQSKSSVSAPQGSTVDAIPNPSKEQLVDLVQRHFMSQQLDELQVIMGFVKAAKRLKTVCK