CuGenDBv2

ClCG08G009780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G009780
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Damaged dna-binding 2, putative isoform 1
Genome location: CG_Chr08:22514074..22515570
RNA-Seq Expression: ClCG08G009780
Synteny: ClCG08G009780
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA
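The genome location field can be split into a chromosome name and a start/end pair. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts.

    Hypothetical helper for illustration only.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("CG_Chr08:22514074..22515570")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```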


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa] | e value: 1.2e-103 | % identity: 89.74
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV
        MSIALESN+RIPPSVFSQ GLP YCSVLNT+GI  VVRRE AVADAVAP +VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSY+GPL MESLEEV
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV

Query:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPP--PIRPP
        LPIRRGISNFYNGKSKSFTSLADASSSSSIK+IAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESH +NDLNS +PP  PIRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPP--PIRPP

Query:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        LHPNGRASR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

XP_004144215.1 uncharacterized protein LOC101211014 [Cucumis sativus] | e value: 2.8e-97 | % identity: 86.75
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV
        MSIALE+N     +VFSQ GLPSYCSVLNT+GI  VVRRE A+ADAVAPA+VDRC+SSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL MESLEEV
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV

Query:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPP--IRPP
        LPIRRGISNFYNGKSKSFTSL DASSSSSIKDIAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSE H +NDLNS LPPP  IRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPP--IRPP

Query:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        L+PNGR SR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

XP_008445543.1 PREDICTED: uncharacterized protein LOC103488525 [Cucumis melo] | e value: 7.8e-108 | % identity: 87.65
Query:  FWFISFVP--LDLNHSVMSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE
        F  I  VP  LD NHSVMSIALESN+RIPPSVFSQ GLP YCSVLNT+GI  VVRRE AVADAVAP +VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE
Subjt:  FWFISFVP--LDLNHSVMSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE

Query:  AESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHN
        AESSY+GPL MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIK+IAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESH 
Subjt:  AESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHN

Query:  SNDLNSRLPP--PIRPPLHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        +NDLNS +PP  PIRPPLHPNGRASR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  SNDLNSRLPP--PIRPPLHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata] | e value: 4.4e-95 | % identity: 85.41
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE
        MSIALESNSRIPPSVFSQG LPSYCSVLNT+G+  VVRRE  V D VAPAE VDRCSSSSSSSIGENS FSVRS ++DDGEDNEAESSYK  L MESLEE
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL DASS+SSIKDIAKPENA SRKRRNLLASNLIAGGISKRPIISS  SSLALAV +SSSE  +  DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL

Query:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        HP GRASRSNSG+AVPLLCKFP WRSYSLANIQ
Subjt:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

XP_038883984.1 uncharacterized protein LOC120074946 [Benincasa hispida] | e value: 4.4e-103 | % identity: 91.85
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRE-TAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE
        MSIALESNSRIPPSVFSQ GLPSYCSVLNT+G   VVR+E  AV DAVA AEVD CSSSSSSSIGENSGFSVRSSDND+GEDNEAESSYKGPL MESLEE
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRE-TAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENA SRKRRNLLASNLIAGGISKRPII+SSRSSLALAVVLSSSESHNSNDLNSRL PPIRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL

Query:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        HPNGRASRSNSG+ VPLLCKFP WRSYSLANIQ
Subjt:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KFI7 Uncharacterized protein | e value: 1.3e-97 | % identity: 86.75
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV
        MSIALE+N     +VFSQ GLPSYCSVLNT+GI  VVRRE A+ADAVAPA+VDRC+SSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL MESLEEV
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV

Query:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPP--IRPP
        LPIRRGISNFYNGKSKSFTSL DASSSSSIKDIAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSE H +NDLNS LPPP  IRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPP--IRPP

Query:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        L+PNGR SR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

A0A1S3BCZ8 uncharacterized protein LOC103488525 | e value: 3.8e-108 | % identity: 87.65
Query:  FWFISFVP--LDLNHSVMSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE
        F  I  VP  LD NHSVMSIALESN+RIPPSVFSQ GLP YCSVLNT+GI  VVRRE AVADAVAP +VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE
Subjt:  FWFISFVP--LDLNHSVMSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNE

Query:  AESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHN
        AESSY+GPL MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIK+IAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESH 
Subjt:  AESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHN

Query:  SNDLNSRLPP--PIRPPLHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        +NDLNS +PP  PIRPPLHPNGRASR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  SNDLNSRLPP--PIRPPLHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 1 | e value: 5.6e-104 | % identity: 89.74
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV
        MSIALESN+RIPPSVFSQ GLP YCSVLNT+GI  VVRRE AVADAVAP +VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSY+GPL MESLEEV
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEV

Query:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPP--PIRPP
        LPIRRGISNFYNGKSKSFTSLADASSSSSIK+IAKPENA SRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESH +NDLNS +PP  PIRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPP--PIRPP

Query:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        LHPNGRASR NSG+AVP LCKFP WRSYS+ANIQ
Subjt:  LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

A0A6J1HF10 uncharacterized protein LOC111462922 | e value: 2.1e-95 | % identity: 85.41
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE
        MSIALESNSRIPPSVFSQG LPSYCSVLNT+G+  VVRRE  V D VAPAE VDRCSSSSSSSIGENS FSVRS ++DDGEDNEAESSYK  L MESLEE
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL DASS+SSIKDIAKPENA SRKRRNLLASNLIAGGISKRPIISS  SSLALAV +SSSE  +  DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL

Query:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        HP GRASRSNSG+AVPLLCKFP WRSYSLANIQ
Subjt:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

A0A6J1K8D9 uncharacterized protein LOC111492074 | e value: 4.0e-94 | % identity: 84.98
Query:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE
        MSIALESNSRIPPSVFSQG LPSYCSVLNT+G+  VVRRE  V D VAPAE VDRCSSSSSSSIGENS FSVRS ++DDGEDNEAESSYK  L MESLEE
Subjt:  MSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAE-VDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL
        VL IRRGISNFYNGKSKSFTSL DASS+SSIKDIAKPENA SRKRRNLLASNLIAGGISKRPIISS  SSLALAV +SSSE  + + LNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPPL

Query:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
        HPNGRASRSNSG+AVPLLCKFP WRSYSLANIQ
Subjt:  HPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24550.1 unknown protein | e value: 1.8e-09 | % identity: 43.81
Query:  SSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL--AMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLI
        SS SSSSIGE+      S + ++ E+++A S  +G L     SLE+ LPI+RG+SN Y GKSKSF +L +A+S +  KD+ K EN  +++RR ++A+ L 
Subjt:  SSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL--AMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLI

Query:  AGGIS
          G S
Subjt:  AGGIS

AT3G43850.1 unknown protein | e value: 4.1e-22 | % identity: 51.08
Query:  SSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL-AMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIA
        SS+SS SIGEN       SD+D+G +NE ESSY GPL  MESLEE LPI+R IS FY GKSKSF SL++ +SS  +KD+ KPEN  SR+RRNLL+  + +
Subjt:  SSSSSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL-AMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIA

Query:  -GGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLP
         GGISK+P  S         + +S  E  +S+  +  LP
Subjt:  -GGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLP

AT4G31510.1 unknown protein | e value: 7.5e-08 | % identity: 35.98
Query:  RRETAVADAVAPAEVD------RCSSS----SSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL--AMESLEEVLPIRRGISNFYNGKSKSFTSLADAS
        R      D   PA +       RC  S    SSSS+GE       +S+N++ ED+   SS    L     SLE+ LPI+RG+SN Y GKSKSF +L +AS
Subjt:  RRETAVADAVAPAEVD------RCSSS----SSSSIGENSGFSVRSSDNDDGEDNEAESSYKGPL--AMESLEEVLPIRRGISNFYNGKSKSFTSLADAS

Query:  SSSSIKDIAKPENALSRKRRNLLASNL-IAGGISKRPIIS--SSRSSLALAVVLSSSESHNSND
        +++   D+ K E+ L+++RR L+A+ L     +S   I +  +  S   LA+  S +E H  ND
Subjt:  SSSSIKDIAKPENALSRKRRNLLASNL-IAGGISKRPIIS--SSRSSLALAVVLSSSESHNSND

AT5G21940.1 unknown protein | e value: 1.4e-27 | % identity: 44.81
Query:  SSSSSSSIGENSGFSVRSSDN--DDGEDNEAESSYKGPL-AMESLEEVLPIRRGISNFYNGKSKSFTSL-ADA----SSSSSIKDIAKPENALSRKRRNL
        SSS+SSSIG NS    +SS++  DD  +NE ES YKGPL  MESLE+VLP+R+GIS +Y+GKSKSFT+L A+A    +SSSS+KD+AKPEN  SR+RRNL
Subjt:  SSSSSSSIGENSGFSVRSSDN--DDGEDNEAESSYKGPL-AMESLEEVLPIRRGISNFYNGKSKSFTSL-ADA----SSSSSIKDIAKPENALSRKRRNL

Query:  LASNL-------IAGGISKRPIISSSRSSLALAVVLSS------------SESHNSNDLNSRLPP----------PIRPPLHPNGRASRSNSGAAVPLLC
        L   +         GGISK+ ++SSSRS+L LA+ +++              S  S+   S  PP             PPL+P  + S  N  ++   L 
Subjt:  LASNL-------IAGGISKRPIISSSRSSLALAVVLSS------------SESHNSNDLNSRLPP----------PIRPPLHPNGRASRSNSGAAVPLLC

Query:  KFPPWRSYSLAN
         F  WRS+S+A+
Subjt:  KFPPWRSYSLAN

AT5G24890.1 unknown protein | e value: 6.7e-09 | % identity: 40.82
Query:  SSSSSSIGE--NSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNL
        SS SSSIG   +S      S+N++ + +  E   +G  +M SLE+ LP +RG+SN Y GKSKSF +L +     S+K++AK EN L+++RR  + + L
Subjt:  SSSSSSIGE--NSGFSVRSSDNDDGEDNEAESSYKGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNL


Sequences
CDS sequence
ATGAAGGAAAAGGATAAAAAGTTTTGGTTTATCTCGTTTGTCCCTTTGGATTTGAATCACTCAGTCATGTCAATTGCTTTGGAAAGCAATAGCAGGATTCCGCCGTCTGT
TTTCTCTCAAGGTGGCTTGCCGTCGTACTGCTCCGTCTTGAATACGTCGGGAATTAATACGGTAGTTCGGCGAGAGACGGCTGTTGCTGATGCGGTGGCGCCGGCAGAGG
TGGATAGATGTAGTTCGTCTTCATCGTCGTCGATCGGAGAAAACAGTGGTTTCTCTGTACGATCTTCGGATAATGACGACGGAGAGGATAATGAGGCGGAAAGTTCGTAT
AAAGGACCTCTAGCAATGGAATCGTTGGAAGAAGTTCTGCCTATCAGGAGAGGAATTTCGAATTTCTACAACGGAAAATCGAAATCCTTCACAAGCTTGGCAGACGCTTC
CTCTTCTTCCTCCATTAAAGACATAGCAAAGCCTGAAAACGCTCTCTCTCGGAAACGCAGAAATCTCCTTGCATCGAATCTCATCGCCGGCGGCATATCGAAGCGACCGA
TTATAAGTTCAAGTCGAAGCTCGTTAGCGTTGGCCGTCGTCCTGAGCAGTTCTGAGAGCCACAACAGTAACGATCTGAATTCAAGATTGCCTCCGCCGATTCGTCCTCCA
TTGCATCCCAACGGACGAGCATCTCGCAGCAATTCAGGTGCTGCAGTTCCTCTTCTCTGTAAATTCCCCCCTTGGCGATCATACTCCTTGGCCAATATACAGTAG
mRNA sequence
ATGAAGGAAAAGGATAAAAAGTTTTGGTTTATCTCGTTTGTCCCTTTGGATTTGAATCACTCAGTCATGTCAATTGCTTTGGAAAGCAATAGCAGGATTCCGCCGTCTGT
TTTCTCTCAAGGTGGCTTGCCGTCGTACTGCTCCGTCTTGAATACGTCGGGAATTAATACGGTAGTTCGGCGAGAGACGGCTGTTGCTGATGCGGTGGCGCCGGCAGAGG
TGGATAGATGTAGTTCGTCTTCATCGTCGTCGATCGGAGAAAACAGTGGTTTCTCTGTACGATCTTCGGATAATGACGACGGAGAGGATAATGAGGCGGAAAGTTCGTAT
AAAGGACCTCTAGCAATGGAATCGTTGGAAGAAGTTCTGCCTATCAGGAGAGGAATTTCGAATTTCTACAACGGAAAATCGAAATCCTTCACAAGCTTGGCAGACGCTTC
CTCTTCTTCCTCCATTAAAGACATAGCAAAGCCTGAAAACGCTCTCTCTCGGAAACGCAGAAATCTCCTTGCATCGAATCTCATCGCCGGCGGCATATCGAAGCGACCGA
TTATAAGTTCAAGTCGAAGCTCGTTAGCGTTGGCCGTCGTCCTGAGCAGTTCTGAGAGCCACAACAGTAACGATCTGAATTCAAGATTGCCTCCGCCGATTCGTCCTCCA
TTGCATCCCAACGGACGAGCATCTCGCAGCAATTCAGGTGCTGCAGTTCCTCTTCTCTGTAAATTCCCCCCTTGGCGATCATACTCCTTGGCCAATATACAGTAGCGTAG
GGTAAGGGTTTTTCCATGGCCGCCTCATGGAAGCTAATCAGTCACCATCAAAGACTAACTTGACCTTGCGAAACCGATTTTCATCAAATCGGTTTCTCACCAATAACTCA
TTTTCTTTCATATTGTCCACGATCTTACTTCTGC
Protein sequence
MKEKDKKFWFISFVPLDLNHSVMSIALESNSRIPPSVFSQGGLPSYCSVLNTSGINTVVRRETAVADAVAPAEVDRCSSSSSSSIGENSGFSVRSSDNDDGEDNEAESSY
KGPLAMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSSIKDIAKPENALSRKRRNLLASNLIAGGISKRPIISSSRSSLALAVVLSSSESHNSNDLNSRLPPPIRPP
LHPNGRASRSNSGAAVPLLCKFPPWRSYSLANIQ
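
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the record does not say which translation table was used, and the names `CODON_TABLE` and `translate` are illustrative, not part of CuGenDB. Only the first 36 bases of the CDS are used here; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS sequence listed in this record.
cds_prefix = "ATGAAGGAAAAGGATAAAAAGTTTTGGTTTATCTCG"
print(translate(cds_prefix))  # MKEKDKKFWFIS
```

The printed prefix matches the start of the protein sequence listed in this record.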