CuGenDBv2

Carg11817 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg11817
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: SAP30_Sin3_bdg domain-containing protein
Genome location: Carg_Chr05:978064..980672
RNA-Seq Expression: Carg11817
Synteny: Carg11817
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0000118 - histone deacetylase complex (cellular component)
  GO:0003712 - transcription coregulator activity (molecular function)
InterPro domains: IPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6598417.1 hypothetical protein SDJN03_08195, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.3e-93; % identity: 84.51)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST----------------------------------QLDELQVM
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                  QLDELQV+
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST----------------------------------QLDELQVM

Query:  RGFVKAAKRLKTVQIRGGEKLGNPLS
        RGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  RGFVKAAKRLKTVQIRGGEKLGNPLS

KAG7029370.1 hypothetical protein SDJN02_07708 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-98; % identity: 100)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

XP_022962615.1 uncharacterized protein LOC111463011 isoform X1 [Cucurbita moschata] (e value: 7.7e-92; % identity: 80.93)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
          QLDELQV+RGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

XP_022962616.1 uncharacterized protein LOC111463011 isoform X2 [Cucurbita moschata] (e value: 5.9e-92; % identity: 81.28)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
         QLDELQV+RGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

XP_022997312.1 uncharacterized protein LOC111492260 isoform X2 [Cucurbita maxima] (e value: 7.2e-90; % identity: 80.85)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
         QLDELQV+RGFVKAAKRLKTVQIR GEKLGNPLS
Subjt:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CT98 uncharacterized protein LOC111014080 isoform X2 (e value: 4.4e-77; % identity: 76.47)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MI+AVESSIN GGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPT NEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNG+DMASDDAQK HKSRH+LHKSSG SSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  -QLDELQVMRGFVKAAKRLKT
         QLDELQV+ GFVKAAKRLKT
Subjt:  -QLDELQVMRGFVKAAKRLKT

A0A6J1HDR7 uncharacterized protein LOC111463011 isoform X2 (e value: 2.9e-92; % identity: 81.28)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
         QLDELQV+RGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

A0A6J1HFL4 uncharacterized protein LOC111463011 isoform X1 (e value: 3.7e-92; % identity: 80.93)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
          QLDELQV+RGFVKAAKRLKTVQIRGGEKLGNPLS
Subjt:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

A0A6J1K4M6 uncharacterized protein LOC111492260 isoform X2 (e value: 3.5e-90; % identity: 80.85)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
         QLDELQV+RGFVKAAKRLKTVQIR GEKLGNPLS
Subjt:  -QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

A0A6J1K755 uncharacterized protein LOC111492260 isoform X1 (e value: 4.6e-90; % identity: 80.51)
Query:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
        MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF
Subjt:  MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEF

Query:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------
        ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST                                          
Subjt:  ENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGST------------------------------------------

Query:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
          QLDELQV+RGFVKAAKRLKTVQIR GEKLGNPLS
Subjt:  --QLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G19330.1 unknown protein (e value: 6.1e-55; % identity: 60.26)
Query:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD
        M++AV+SS +  GGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PT NEEDDD
Subjt:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD

Query:  LEFENLQWNGIDM-----ASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG------------------------------------
        L+FEN Q NG DM     AS+D  KPHKS+ R  +SS  SSHKTMSRSLS DSQSKSS   P                                      
Subjt:  LEFENLQWNGIDM-----ASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG------------------------------------

Query:  --------STQLDELQVMRGFVKAAKRLK
                S Q+DELQV+ GFV+AAKR+K
Subjt:  --------STQLDELQVMRGFVKAAKRLK

AT1G19330.2 unknown protein (e value: 2.5e-56; % identity: 61.16)
Query:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD
        M++AV+SS +  GGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PT NEEDDD
Subjt:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD

Query:  LEFENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG-----------------------------------------
        L+FEN Q NG DM S+D  KPHKS+ R  +SS  SSHKTMSRSLS DSQSKSS   P                                           
Subjt:  LEFENLQWNGIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG-----------------------------------------

Query:  ---STQLDELQVMRGFVKAAKRLK
           S Q+DELQV+ GFV+AAKR+K
Subjt:  ---STQLDELQVMRGFVKAAKRLK

AT1G19330.3 unknown protein (e value: 8.0e-55; % identity: 60)
Query:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD
        M++AV+SS +  GGF  +QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PT NEEDDD
Subjt:  MIDAVESS-INGGGFSHLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDD

Query:  LEFENLQWNGIDM-----ASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG------------------------------------
        L+FEN Q NG DM     AS+D  KPHKS+ R  +SS  SSHKTMSRSLS DSQSKSS   P                                      
Subjt:  LEFENLQWNGIDM-----ASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQG------------------------------------

Query:  ---------STQLDELQVMRGFVKAAKRLK
                 S Q+DELQV+ GFV+AAKR+K
Subjt:  ---------STQLDELQVMRGFVKAAKRLK

AT1G75060.1 unknown protein (e value: 8.6e-49; % identity: 71.61)
Query:  GGGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEFE-NLQWN-
        GGGFS LQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PT NEED+DLE + + QWN 
Subjt:  GGGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEFE-NLQWN-

Query:  GIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLD
          DM ++D  KPHKS+ R H+SS   S K + R +SCDS SK S   P+ + ++D
Subjt:  GIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLD

AT1G75060.2 unknown protein (e value: 1.9e-48; % identity: 69.09)
Query:  GGGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEFE-NLQWN-
        GGGFS LQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PT NEED+DLE + + QWN 
Subjt:  GGGFSHLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEFE-NLQWN-

Query:  GIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQ----GSTQLDELQVMR
          DM ++D  KPHKS+ R H+SS   S K + R +SCDS SK S   P+      T+LD   ++R
Subjt:  GIDMASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQ----GSTQLDELQVMR


Sequences
CDS sequence
ATGATTGACGCTGTGGAGAGTTCTATCAATGGCGGCGGTTTCTCGCACTTGCAGAGCTGCGGGGATAGTAGCGAGGAGGAGCTCTCTGTCCTTCCTCGCCATACCAAAGT
CGTCGTTACTGGAAATAATCGAACCAAATCCGTCCTCGTTGGACTGCAAGGCGTTGTCAAGAAAGCCGTGGGCCTTGGCGGCTGGCATTGGCTGGTTCTAACGAATGGCA
TAGAGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCTACCAGTAACGAGGAAGATGACGATCTCGAATTTGAGAATTTGCAATGGAATGGAATTGATATG
GCATCCGATGACGCCCAAAAACCCCACAAATCAAGGCATAGATTACACAAATCATCTGGGTCATCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTC
CAAGAGTTCTGTTTCTGCACCTCAAGGATCCACGCAACTGGATGAGTTGCAGGTGATGAGGGGTTTTGTGAAGGCTGCAAAGAGGCTGAAGACAGTGCAAATAAGAGGAG
GAGAGAAGCTGGGGAATCCATTGAGTTGA
mRNA sequence
ATCCCTCCTTTTCCTCTTCCTCTCTTCCTTCTCTCCTTGCTTTCTCTGTGTTTTTTTTTTCTTCTTCTTCTTCCTTCAAACTTCCCACACCATTGCATGGCCTTTCCTCT
TCGTTTCATTTGACACACTCCGAAATGATTGACGCTGTGGAGAGTTCTATCAATGGCGGCGGTTTCTCGCACTTGCAGAGCTGCGGGGATAGTAGCGAGGAGGAGCTCTC
TGTCCTTCCTCGCCATACCAAAGTCGTCGTTACTGGAAATAATCGAACCAAATCCGTCCTCGTTGGACTGCAAGGCGTTGTCAAGAAAGCCGTGGGCCTTGGCGGCTGGC
ATTGGCTGGTTCTAACGAATGGCATAGAGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCTACCAGTAACGAGGAAGATGACGATCTCGAATTTGAGAAT
TTGCAATGGAATGGAATTGATATGGCATCCGATGACGCCCAAAAACCCCACAAATCAAGGCATAGATTACACAAATCATCTGGGTCATCATCTCACAAGACTATGAGCAG
ATCCCTTTCCTGTGACTCACAGTCCAAGAGTTCTGTTTCTGCACCTCAAGGATCCACGCAACTGGATGAGTTGCAGGTGATGAGGGGTTTTGTGAAGGCTGCAAAGAGGC
TGAAGACAGTGCAAATAAGAGGAGGAGAGAAGCTGGGGAATCCATTGAGTTGATCGTCATGTCATGCTAACAGATAATTCAGCTGACTCGCAAGCGTTTCGATTGTCTCT
GTAACATTGTATGTATCGATCGGGTGCTAATGGTTTTTGTGTTACGGGTCCGGGTTAGTACGATTCTGTGGTAGTAGTTAATAGAGTAATATCCTGTTTTAGTGAGGATG
TTACTTGTAATGTACTAT
Protein sequence
MIDAVESSINGGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTSNEEDDDLEFENLQWNGIDM
ASDDAQKPHKSRHRLHKSSGSSSHKTMSRSLSCDSQSKSSVSAPQGSTQLDELQVMRGFVKAAKRLKTVQIRGGEKLGNPLS
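
As a rough illustration of how the CDS and protein sequences above relate, the following Python sketch translates a coding sequence into amino acids. It is not part of the database record: the function name `translate` is hypothetical, and the sketch assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene. Only the first six codons and the last three codons of the CDS listed above are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the standard code is appropriate for this nuclear-encoded gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper); whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single trailing stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 18 nt of the CDS listed above.
print(translate("ATGATTGACGCTGTGGAG"))  # MIDAVE
# Last 9 nt of the CDS listed above, ending in the TGA stop codon.
print(translate("TTGAGTTGA"))  # LS
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence would be one way to check that the two entries agree.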