; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017424 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017424
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMatK_N domain-containing protein
Genome locationChr03:14132538..14135907
RNA-Seq ExpressionHG10017424
SyntenyHG10017424
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5336145.1 unnamed protein product [Arabidopsis thaliana]7.7e-8164.18Show/hide
Query:  MLESAALLGFPHLGSL--GKDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRIS-----------------------------
        +LESAALLGFP LG +   +DQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR S                             
Subjt:  MLESAALLGFPHLGSL--GKDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRIS-----------------------------

Query:  ---------RGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKG
                 RG     LPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKG
Subjt:  ---------RGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKG

Query:  VVSDEMLRGVENKRRSGDSRIVAR---GKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESI
        VVSDEMLRGVENKRRSGDSRI A        K+  RS+ E  G  L+R  G   +   R    RS  V   ++G    AE++
Subjt:  VVSDEMLRGVENKRRSGDSRIVAR---GKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESI

KAD3640919.1 hypothetical protein E3N88_30142 [Mikania micrantha]4.5e-9774.62Show/hide
Query:  KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----ISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSI
        +DQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR     S       LPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SI
Subjt:  KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----ISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSI

Query:  YDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQ
        YDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENK RSGDSRI                             GE 
Subjt:  YDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQ

Query:  YKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSEVV
                   VE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSE V
Subjt:  YKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSEVV

KAG5070513.1 hypothetical protein JHK85_002890 [Glycine max]3.1e-8264.22Show/hide
Query:  LKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMLESAALLGFPHLGSLGKDQVG----PCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQG
        LKKDLRVSRV PGG LNAF FLLIGVISQRLAM           L    KD  G      +        N L+   Q E  S   P  +      R    
Subjt:  LKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMLESAALLGFPHLGSLGKDQVG----PCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQG

Query:  RLRISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        R +     E   L  GG     S  L L ++ ++  +D           YDLSFMDVDKILP SSTLGWHSLKVKGEVQTKKGL WIPRHPETRKGVVSD
Subjt:  RLRISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCK
        EMLRGVENKRRSGDSR   RGKESKSDSRSSGERNGSSLNRENGVVGE YK RAARRS      LDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCK
Subjt:  EMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCK

Query:  AKYSWVTDSEVVP
        AKYSWVTDSEVVP
Subjt:  AKYSWVTDSEVVP

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]1.4e-10369.61Show/hide
Query:  LESAALLGFPHLGSL--GKDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRIS-----------------------------R
        LESAALLGFP LG +   +DQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR++                             R
Subjt:  LESAALLGFPHLGSL--GKDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRIS-----------------------------R

Query:  GPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGV
        G     LPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGV
Subjt:  GPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGV

Query:  ENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWV
        ENKRRSGDSRI                             GE  + R           LDGESPVAESITSL SDPSSMGHVESRVNQQGPPCKAKYSWV
Subjt:  ENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWV

Query:  TDSEVV
        TDSE V
Subjt:  TDSEVV

OVA05688.1 hypothetical protein BVC80_4285g1 [Macleaya cordata]1.5e-9768.18Show/hide
Query:  GFPHLGSLG----KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----------------------------------ISR
        G P + S G    +DQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR                                  ISR
Subjt:  GFPHLGSLG----KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----------------------------------ISR

Query:  GPER-RWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG
        G  R   LPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV SDEMLRG
Subjt:  GPER-RWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG

Query:  VENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEA-ALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS
        VENK RSGDSRI              G+     L   N   G++        S  VE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS
Subjt:  VENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEA-ALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS

Query:  WVTDSEVV
        WVTDSE V
Subjt:  WVTDSEVV

TrEMBL top hitse value%identityAlignment
A0A200Q5G5 Uncharacterized protein ycf687.4e-9868.18Show/hide
Query:  GFPHLGSLG----KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----------------------------------ISR
        G P + S G    +DQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR                                  ISR
Subjt:  GFPHLGSLG----KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----------------------------------ISR

Query:  GPER-RWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG
        G  R   LPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV SDEMLRG
Subjt:  GPER-RWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG

Query:  VENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEA-ALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS
        VENK RSGDSRI              G+     L   N   G++        S  VE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS
Subjt:  VENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEA-ALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYS

Query:  WVTDSEVV
        WVTDSE V
Subjt:  WVTDSEVV

A0A2N9G477 Uncharacterized protein ycf686.1e-9261.63Show/hide
Query:  KKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMLESAALLGFPH----------LGSLGKDQ-------------VGP----------------------C
        K+ L    +G  G LNAFFFLLIGVISQRLAM+      G  H            + GKD+             VGP                       
Subjt:  KKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMLESAALLGFPH----------LGSLGKDQ-------------VGP----------------------C

Query:  EQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRL-----RISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFM
        E+LDALSPFNPLSEMRQKEGKSM RPHRLHPVGTTRSPQGRL     RISRGPERR         FESAYLQLVNLADTKLYDST FFRFGSSIYDLSFM
Subjt:  EQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRL-----RISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFM

Query:  DVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAA
        DVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRI                             GE       
Subjt:  DVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAA

Query:  RRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPP
             VE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPP
Subjt:  RRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPP

A0A2N9IA97 Uncharacterized protein ycf686.7e-9196.26Show/hide
Query:  LPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRS
        LPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRS
Subjt:  LPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRS

Query:  GDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPP
        GDSRI ARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRS GVE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPP
Subjt:  GDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPP

A0A5N6MLP8 Uncharacterized protein ycf682.2e-9774.62Show/hide
Query:  KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----ISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSI
        +DQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR     S       LPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SI
Subjt:  KDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR----ISRGPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSI

Query:  YDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQ
        YDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENK RSGDSRI                             GE 
Subjt:  YDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQ

Query:  YKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSEVV
                   VE   LDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSE V
Subjt:  YKRRAARRSGGVE-AALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSEVV

A0A6N2MWA8 Uncharacterized protein ycf688.8e-8357.18Show/hide
Query:  MGPGDLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAM---------LESA-----------------ALLGFPHLGSLGKDQVGPCEQL----------
        MG G LKKDLRVS+VGPGG LNAFFFLLIGVIS+RLAM         LESA                 A   F H   LG   +    +L          
Subjt:  MGPGDLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAM---------LESA-----------------ALLGFPHLGSLGKDQVGPCEQL----------

Query:  ----------DAL-------------------------SPFNPLSEMRQKEGKSM-DRPHRLHPVGTTRSPQGRL---------------------RISR
                  D++                         SP  P S +  +E + + DRP     +  +R+    L                     RISR
Subjt:  ----------DAL-------------------------SPFNPLSEMRQKEGKSM-DRPHRLHPVGTTRSPQGRL---------------------RISR

Query:  GPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGV
        GP    LPCGGCQRFESAYLQLV+L  TKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGV
Subjt:  GPERRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGV

Query:  ENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGV
        ENKRRSGDSRIV RGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGV
Subjt:  ENKRRSGDSRIVARGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCAGGAGATTTGAAAAAGGATCTTAGAGTGTCTAGGGTTGGGCCAGGAGGGTTTCTTAACGCCTTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGC
CATGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGA
GCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGAATCAGTCGGGGGCCTGAG
AGGCGGTGGTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGG
CAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAG
GCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGTAGCC
AGAGGAAAAGAAAGCAAAAGCGATTCCCGTAGTAGCGGCGAGCGAAATGGGAGCAGCCTAAACCGTGAAAACGGGGTTGTGGGAGAGCAATACAAGCGTCGTGCTGCTAG
GCGAAGCGGTGGAGTGGAGGCTGCACTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGA
ATCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAGTGAAGTAGTACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGACCAGGAGATTTGAAAAAGGATCTTAGAGTGTCTAGGGTTGGGCCAGGAGGGTTTCTTAACGCCTTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGC
CATGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGA
GCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGAATCAGTCGGGGGCCTGAG
AGGCGGTGGTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGG
CAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAG
GCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGTAGCC
AGAGGAAAAGAAAGCAAAAGCGATTCCCGTAGTAGCGGCGAGCGAAATGGGAGCAGCCTAAACCGTGAAAACGGGGTTGTGGGAGAGCAATACAAGCGTCGTGCTGCTAG
GCGAAGCGGTGGAGTGGAGGCTGCACTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGA
ATCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAGTGAAGTAGTACCGTGA
Protein sequenceShow/hide protein sequence
MGPGDLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMLESAALLGFPHLGSLGKDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRISRGPE
RRWLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIVA
RGKESKSDSRSSGERNGSSLNRENGVVGEQYKRRAARRSGGVEAALDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDSEVVP