CuGenDBv2

Carg28023 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg28023
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: protein DOUBLE-STRAND BREAK FORMATION
Genome location: Carg_Chr13:131445..134776
RNA-Seq Expression: Carg28023
Synteny: Carg28023
Gene Ontology terms: GO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domains: IPR044969 - Protein DOUBLE-STRAND BREAK FORMATION
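
The genome location above is given as `sequence:start..end`. As a minimal sketch, the string can be split into its parts and the gene span computed as follows. The function name `parse_location` is illustrative, and the span arithmetic assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


seqid, start, end = parse_location("Carg_Chr13:131445..134776")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```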


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6583341.1 Protein DOUBLE-STRAND BREAK FORMATION, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 4.7e-119; identity: 86.86%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA----------
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA          
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA----------

Query:  -----------TLRYEALNFRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAG
                    LRYEALNFRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAG
Subjt:  -----------TLRYEALNFRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAG

Query:  SHSVQALTSEYLKKKVTERNRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI
        SHSVQALTSEYLKKKVTERNRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG + S +Y +  +   +I
Subjt:  SHSVQALTSEYLKKKVTERNRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI

KAG7019109.1 hypothetical protein SDJN02_18067, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 2.0e-146; identity: 100%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATLRYEALNFR
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATLRYEALNFR
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATLRYEALNFR

Query:  ELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNR
        ELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNR
Subjt:  ELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNR

Query:  KISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWINVQRPYKLLAQAVATAMCLPE
        KISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWINVQRPYKLLAQAVATAMCLPE
Subjt:  KISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWINVQRPYKLLAQAVATAMCLPE

XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata] (e-value: 1.8e-118; identity: 91.37%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELL FESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKK+VTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI
        NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG + S +Y +  +   +I
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI

XP_022964955.1 uncharacterized protein LOC111464906 isoform X2 [Cucurbita moschata] (e-value: 6.8e-118; identity: 96.64%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELL FESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKK+VTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG
        NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo] (e-value: 1.2e-117; identity: 90.2%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRR DDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETV+KTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIA+KAYEQALSSLQQSDTANYTSH SSKCAEVIEKIKRLKDH+LKSAGSHSVQALTSEYLKKKVTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI
        NRKISSSCTRKFTASTLFRNGIRNHNAK+LHEYQALEG + S +Y +  +   +I
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LSV1 Uncharacterized protein (e-value: 5.0e-98; identity: 79.53%)
Query:  YSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALNFRELKSFN
        YSLF SRLRSRR DDSTLRILE F ASKD  SLMDV S  KE+LRFESLSIIRET EKTDD KLLVIEFLVRAFALVGDIE+   LRYEALNFR LKSFN
Subjt:  YSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALNFRELKSFN

Query:  QPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNRKISSSC
        QP LQVSHAEWLNFAEHSL+ GFFSI+IKAYEQALSSLQQSDTANYTSH S K  EV+EKI RLKDHAL  AGSHSVQALTS+YLKKKVTERNRKISSSC
Subjt:  QPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNRKISSSC

Query:  TRKFTASTLFRNGIRNHNAKRLHEYQALEG---------FMISPTYSLLAYPPD
        TRKFTASTLF NGIRN+NA++LHEY++ EG         F+I PT SL +YP D
Subjt:  TRKFTASTLFRNGIRNHNAKRLHEYQALEG---------FMISPTYSLLAYPPD

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X2 (e-value: 3.3e-118; identity: 96.64%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELL FESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKK+VTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG
        NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X1 (e-value: 8.7e-119; identity: 91.37%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELL FESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKK+VTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI
        NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG + S +Y +  +   +I
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI

A0A6J1I136 uncharacterized protein LOC111469552 isoform X2 (e-value: 6.9e-116; identity: 95.38%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRR DDSTLRILEFFSASKDTM  MDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSK AEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG
        NRKISSSCTRKFTASTLFRNGIRNHNAK+LHEYQALEG
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEG

A0A6J1I645 uncharacterized protein LOC111469552 isoform X1 (e-value: 1.8e-116; identity: 90.2%)
Query:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN
        MPCSVAEQYSLFCSRLRSRR DDSTLRILEFFSASKDTM  MDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIE+   LRYEALN
Subjt:  MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEA--TLRYEALN

Query:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
        FRELKSFNQPRLQVSHAEWLNFAEHSLN GFFSIAIKAYEQALSSLQQSDTANYTSH SSK AEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER
Subjt:  FRELKSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTER

Query:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI
        NRKISSSCTRKFTASTLFRNGIRNHNAK+LHEYQALEG + S +Y +  +   +I
Subjt:  NRKISSSCTRKFTASTLFRNGIRNHNAKRLHEYQALEGFMISPTYSLLAYPPDWI

SwissProt top hits (e-value, % identity, alignment)
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION (e-value: 3.6e-29; identity: 43.33%)
Query:  VAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATL--RYEALNFREL
        +A+Q  LF +R++ RR D+ +LRILE    + +  S ++V+S +++ +R ES+ I  E   ++   KL V+EF  RAFAL+GD+E+ L  RYEALN R+L
Subjt:  VAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATL--RYEALNFREL

Query:  KSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHS
        KS +   L VSH+EW  FA  S+  GF SIA KA E AL SL++       S D+S   +  EK++RL+D A     SHS
Subjt:  KSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHS

Arabidopsis top hits (e-value, % identity, alignment)
AT1G07060.1 unknown protein (e-value: 2.6e-30; identity: 43.33%)
Query:  VAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATL--RYEALNFREL
        +A+Q  LF +R++ RR D+ +LRILE    + +  S ++V+S +++ +R ES+ I  E   ++   KL V+EF  RAFAL+GD+E+ L  RYEALN R+L
Subjt:  VAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATL--RYEALNFREL

Query:  KSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHS
        KS +   L VSH+EW  FA  S+  GF SIA KA E AL SL++       S D+S   +  EK++RL+D A     SHS
Subjt:  KSFNQPRLQVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHS


Sequences
CDS sequence
ATGCCGTGTTCAGTTGCGGAGCAATACTCTCTCTTTTGTTCACGGCTTAGAAGCCGAAGATTAGATGATTCTACTTTGCGAATTCTGGAATTCTTTTCCGCTTCCAAAGA
CACGATGTCGTTGATGGATGTCAAATCCGGAGTAAAAGAATTACTCAGATTTGAATCTCTATCTATCATTCGTGAAACTGTTGAGAAAACGGATGATCAAAAGCTTCTAG
TCATCGAGTTTCTTGTTCGAGCTTTTGCTCTTGTTGGAGACATTGAGGCTACTTTGAGATATGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAACCAAGGCTT
CAAGTCTCACACGCGGAATGGTTAAACTTCGCTGAGCATTCATTGAACACTGGCTTTTTTTCAATTGCTATAAAGGCATATGAGCAAGCACTGTCAAGCCTTCAGCAGAG
TGATACTGCAAACTACACATCACATGATTCCTCTAAATGTGCGGAAGTTATTGAGAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCAGCTGGTTCCCATTCTGTTC
AGGCTCTCACATCTGAGTATTTGAAAAAGAAAGTAACTGAAAGGAACAGAAAGATTTCTTCATCCTGCACAAGAAAGTTTACGGCAAGCACTCTATTCAGAAATGGTATC
AGAAACCATAATGCAAAAAGGCTGCATGAATATCAAGCTTTGGAGGGATTCATGATCAGTCCTACATATAGTCTCCTTGCATATCCACCTGACTGGATAAATGTTCAGCG
TCCCTACAAGCTGTTGGCACAAGCCGTTGCAACTGCTATGTGCCTGCCTGAATAG
mRNA sequence
ATGCCGTGTTCAGTTGCGGAGCAATACTCTCTCTTTTGTTCACGGCTTAGAAGCCGAAGATTAGATGATTCTACTTTGCGAATTCTGGAATTCTTTTCCGCTTCCAAAGA
CACGATGTCGTTGATGGATGTCAAATCCGGAGTAAAAGAATTACTCAGATTTGAATCTCTATCTATCATTCGTGAAACTGTTGAGAAAACGGATGATCAAAAGCTTCTAG
TCATCGAGTTTCTTGTTCGAGCTTTTGCTCTTGTTGGAGACATTGAGGCTACTTTGAGATATGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAACCAAGGCTT
CAAGTCTCACACGCGGAATGGTTAAACTTCGCTGAGCATTCATTGAACACTGGCTTTTTTTCAATTGCTATAAAGGCATATGAGCAAGCACTGTCAAGCCTTCAGCAGAG
TGATACTGCAAACTACACATCACATGATTCCTCTAAATGTGCGGAAGTTATTGAGAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCAGCTGGTTCCCATTCTGTTC
AGGCTCTCACATCTGAGTATTTGAAAAAGAAAGTAACTGAAAGGAACAGAAAGATTTCTTCATCCTGCACAAGAAAGTTTACGGCAAGCACTCTATTCAGAAATGGTATC
AGAAACCATAATGCAAAAAGGCTGCATGAATATCAAGCTTTGGAGGGATTCATGATCAGTCCTACATATAGTCTCCTTGCATATCCACCTGACTGGATAAATGTTCAGCG
TCCCTACAAGCTGTTGGCACAAGCCGTTGCAACTGCTATGTGCCTGCCTGAATAG
Protein sequence
MPCSVAEQYSLFCSRLRSRRLDDSTLRILEFFSASKDTMSLMDVKSGVKELLRFESLSIIRETVEKTDDQKLLVIEFLVRAFALVGDIEATLRYEALNFRELKSFNQPRL
QVSHAEWLNFAEHSLNTGFFSIAIKAYEQALSSLQQSDTANYTSHDSSKCAEVIEKIKRLKDHALKSAGSHSVQALTSEYLKKKVTERNRKISSSCTRKFTASTLFRNGI
RNHNAKRLHEYQALEGFMISPTYSLLAYPPDWINVQRPYKLLAQAVATAMCLPE
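
The CDS and protein records above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the short test fragments are illustrative, with the fragments copied from the first and last codons of the CDS record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by iterating each position over T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the CDS record.
cds_start = "ATGCCGTGTTCAGTTGCGGAGCAATACTCT"
cds_end = "CTGCCTGAATAG"

print(translate(cds_start))  # compare with the start of the protein record
print(translate(cds_end))    # compare with the end of the protein record
```

Passing the full CDS, with its line breaks removed, to `translate` should give a string that can be compared directly with the protein record.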