; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025230 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025230
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDamaged dna-binding 2, putative isoform 1
Genome locationscaffold13:41144783..41146812
RNA-Seq ExpressionSpg025230
SyntenySpg025230
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa]3.2e-9785.96Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV DAVAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_008445543.1 PREDICTED: uncharacterized protein LOC103488525 [Cucumis melo]3.2e-9785.96Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV DAVAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022131782.1 uncharacterized protein LOC111004861 [Momordica charantia]2.6e-10289.27Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE
        MSIAL+ N+RIPPSVFSQGGLPSYCSVLNT GIIPVVRRE AVGDAVAPAEE+DRCSSSSSSSIGENSGLSV+S DNDD+ E+NEAESSYKGPLGMESLE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL
        EVLP+RRGISNFYNGKSKSFTSLAEASS A+IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM  SGS S  DLNSR PPPIRPPL
Subjt:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata]1.6e-9685.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGD VAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_023547301.1 uncharacterized protein LOC111806162 [Cucurbita pepo subsp. pepo]3.6e-9685.41Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRR+  VGD VAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

TrEMBL top hitse value%identityAlignment
A0A1S3BCZ8 uncharacterized protein LOC1034885251.6e-9785.96Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV DAVAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 11.6e-9785.96Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV DAVAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1BS00 uncharacterized protein LOC1110048611.2e-10289.27Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE
        MSIAL+ N+RIPPSVFSQGGLPSYCSVLNT GIIPVVRRE AVGDAVAPAEE+DRCSSSSSSSIGENSGLSV+S DNDD+ E+NEAESSYKGPLGMESLE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL
        EVLP+RRGISNFYNGKSKSFTSLAEASS A+IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM  SGS S  DLNSR PPPIRPPL
Subjt:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1HF10 uncharacterized protein LOC1114629227.8e-9785.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGD VAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1K8D9 uncharacterized protein LOC1114920741.1e-9585.41Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGD VAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSESD----LNSRLPPPIRPPL
        VL IRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE      LNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSESD----LNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein3.4e-1247.62Show/hide
Query:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLI
        SS SSSSIGE+S       + ++EE+++A S  +G L     SLE+ LPI+RG+SN Y GKSKSF +L EA+S A  KD+ K EN F+++RR ++A+ L 
Subjt:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLI

Query:  AGGIS
          G S
Subjt:  AGGIS

AT3G43850.1 unknown protein1.9e-2354.96Show/hide
Query:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIA
        SS+SS SIGENS       D+D+  +NE ESSY GPL  MESLEE LPI+R IS FY GKSKSF SL+E SS   +KD+ KPEN +SR+RRNLL+  + +
Subjt:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIA

Query:  -GGISKRPISSSRSSLALAVAMSGSESDLNS
         GGISK+P  S        +AMS  E D +S
Subjt:  -GGISKRPISSSRSSLALAVAMSGSESDLNS

AT4G31510.1 unknown protein1.6e-0939.52Show/hide
Query:  VVRRETAVGDAVAPAEEVDRCSSS----SSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTA
        V   + AV  +++    + RC  S    SSSS+GE S       +N+++ED+   SS    L     SLE+ LPI+RG+SN Y GKSKSF +L EAS+T 
Subjt:  VVRRETAVGDAVAPAEEVDRCSSS----SSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTA

Query:  SIKDIAKPENAFSRKRRNLLASNL
           D+ K E+  +++RR L+A+ L
Subjt:  SIKDIAKPENAFSRKRRNLLASNL

AT5G21940.1 unknown protein5.8e-2844.34Show/hide
Query:  APAEEVDRCSSSSSSSIGENSGLSVRSLDN--DDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-AEA----SSTASIKDIAKPEN
        +P++     SSS+SSSIG NS    +S ++  DD  +NE ES YKGPL  MESLE+VLP+R+GIS +Y+GKSKSFT+L AEA    +S++S+KD+AKPEN
Subjt:  APAEEVDRCSSSSSSSIGENSGLSVRSLDN--DDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-AEA----SSTASIKDIAKPEN

Query:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLALAVAM-----------SGSESDLNSRL----PPPIR-----------PPLHPNGRASRSN
         +SR+RRNLL   +         GGISK+ + SSSRS+L LA+A+           SG +S   S       PP +           PPL+P  + S  N
Subjt:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLALAVAM-----------SGSESDLNSRL----PPPIR-----------PPLHPNGRASRSN

Query:  LGSAVPLLCKYPTWRSYSLAD
        L S+   L  +  WRS+S+AD
Subjt:  LGSAVPLLCKYPTWRSYSLAD

AT5G24890.1 unknown protein3.5e-0939.8Show/hide
Query:  SSSSSSIGE--NSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNL
        SS SSSIG   +S       +N++++ +  E   +G   M SLE+ LP +RG+SN Y GKSKSF +L E     S+K++AK EN  +++RR  + + L
Subjt:  SSSSSSIGE--NSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATTGCTTTGGAGAGCAATAACAGGATTCCGCCGTCGGTTTTCTCTCAAGGCGGCTTGCCGTCGTATTGTTCCGTCTTGAATACCACGGGAATTATTCCGGTAGT
CCGACGAGAGACTGCCGTTGGTGATGCGGTGGCGCCGGCCGAGGAGGTGGATAGATGCAGTTCGTCTTCGTCGTCGTCGATTGGGGAAAACAGTGGTTTATCTGTTAGAT
CGTTGGATAATGACGACGAGGAGGATAATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCAAAT
TTCTACAACGGAAAATCGAAATCCTTCACGAGCCTGGCAGAGGCTTCCTCAACGGCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTCTCTCGGAAACGGAGAAA
TCTTCTTGCATCCAATCTCATCGCCGGCGGCATATCGAAGCGACCGATTAGTTCAAGCCGAAGCTCGTTGGCGCTGGCCGTCGCCATGAGTGGTTCTGAAAGCGATCTGA
ATTCGAGATTGCCTCCGCCGATTCGACCTCCATTGCACCCTAACGGACGCGCATCTCGCAGCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATACCCCACTTGGCGA
TCGTATTCCTTGGCCGATATACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAATTGCTTTGGAGAGCAATAACAGGATTCCGCCGTCGGTTTTCTCTCAAGGCGGCTTGCCGTCGTATTGTTCCGTCTTGAATACCACGGGAATTATTCCGGTAGT
CCGACGAGAGACTGCCGTTGGTGATGCGGTGGCGCCGGCCGAGGAGGTGGATAGATGCAGTTCGTCTTCGTCGTCGTCGATTGGGGAAAACAGTGGTTTATCTGTTAGAT
CGTTGGATAATGACGACGAGGAGGATAATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCAAAT
TTCTACAACGGAAAATCGAAATCCTTCACGAGCCTGGCAGAGGCTTCCTCAACGGCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTCTCTCGGAAACGGAGAAA
TCTTCTTGCATCCAATCTCATCGCCGGCGGCATATCGAAGCGACCGATTAGTTCAAGCCGAAGCTCGTTGGCGCTGGCCGTCGCCATGAGTGGTTCTGAAAGCGATCTGA
ATTCGAGATTGCCTCCGCCGATTCGACCTCCATTGCACCCTAACGGACGCGCATCTCGCAGCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATACCCCACTTGGCGA
TCGTATTCCTTGGCCGATATACAGTAG
Protein sequenceShow/hide protein sequence
MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDAVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISN
FYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAMSGSESDLNSRLPPPIRPPLHPNGRASRSNLGSAVPLLCKYPTWR
SYSLADIQ