; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004109 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004109
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDamaged dna-binding 2, putative isoform 1
Genome locationchr6:1174141..1175002
RNA-Seq ExpressionLag0004109
SyntenyLag0004109
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa]9.4e-9785.53Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV D VAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_008445543.1 PREDICTED: uncharacterized protein LOC103488525 [Cucumis melo]9.4e-9785.53Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV D VAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022131782.1 uncharacterized protein LOC111004861 [Momordica charantia]5.7e-10288.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE
        MSIAL+ N+RIPPSVFSQGGLPSYCSVLNT GIIPVVRRE AVGD VAPAEE+DRCSSSSSSSIGENSGLSV+S DNDD+ E+NEAESSYKGPLGMESLE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL
        EVLP+RRGISNFYNGKSKSFTSLAEASS A+IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM  SGS S  DLNSR PPPIRPPL
Subjt:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata]4.2e-9786.27Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGDVVAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_023547301.1 uncharacterized protein LOC111806162 [Cucurbita pepo subsp. pepo]9.4e-9785.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRR+  VGDVVAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

TrEMBL top hitse value%identityAlignment
A0A1S3BCZ8 uncharacterized protein LOC1034885254.6e-9785.53Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV D VAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 14.6e-9785.53Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN RIPPSVFSQ GLP YCSVLNTTGIIPVVRRE AV D VAP E+VDRCSSSSSSSIGENSG SVRS DNDD EDNEAESSY+GPLGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP
        VLPIRRGISNFYNGKSKSFTSLA+ASS++SIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLALAV +S SES    DLNS +PP  PIRP
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLALAVAMSGSES----DLNSRLPP--PIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1BS00 uncharacterized protein LOC1110048612.8e-10288.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE
        MSIAL+ N+RIPPSVFSQGGLPSYCSVLNT GIIPVVRRE AVGD VAPAEE+DRCSSSSSSSIGENSGLSV+S DNDD+ E+NEAESSYKGPLGMESLE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDE-EDNEAESSYKGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL
        EVLP+RRGISNFYNGKSKSFTSLAEASS A+IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM  SGS S  DLNSR PPPIRPPL
Subjt:  EVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAM--SGSES--DLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1HF10 uncharacterized protein LOC1114629222.1e-9786.27Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGDVVAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL
        VLPIRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE     DLNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSE----SDLNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1K8D9 uncharacterized protein LOC1114920743.0e-9685.84Show/hide
Query:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE
        MSIALESN+RIPPSVFSQG LPSYCSVLNTTG+IPVVRRE  VGDVVAPAE VDRCSSSSSSSIGENS  SVRS+++DD EDNEAESSYK  LGMESLEE
Subjt:  MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEE

Query:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSESD----LNSRLPPPIRPPL
        VL IRRGISNFYNGKSKSFTSL +ASST+SIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLALAV MS SE      LNSRL P IRPPL
Subjt:  VLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLALAVAMSGSESD----LNSRLPPPIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        HPNGRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein3.4e-1247.62Show/hide
Query:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLI
        SS SSSSIGE+S       + ++EE+++A S  +G L     SLE+ LPI+RG+SN Y GKSKSF +L EA+S A  KD+ K EN F+++RR ++A+ L 
Subjt:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLI

Query:  AGGIS
          G S
Subjt:  AGGIS

AT3G43850.1 unknown protein1.5e-2354.96Show/hide
Query:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIA
        SS+SS SIGENS       D+D+  +NE ESSY GPL  MESLEE LPI+R IS FY GKSKSF SL+E SS   +KD+ KPEN +SR+RRNLL+  + +
Subjt:  SSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIA

Query:  -GGISKRPISSSRSSLALAVAMSGSESDLNS
         GGISK+P  S        +AMS  E D +S
Subjt:  -GGISKRPISSSRSSLALAVAMSGSESDLNS

AT4G31510.1 unknown protein1.6e-0943.81Show/hide
Query:  RCSSS----SSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNL
        RC  S    SSSS+GE S       +N+++ED+   SS    L     SLE+ LPI+RG+SN Y GKSKSF +L EAS+T    D+ K E+  +++RR L
Subjt:  RCSSS----SSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNL

Query:  LASNL
        +A+ L
Subjt:  LASNL

AT5G21940.1 unknown protein4.5e-2844.34Show/hide
Query:  APAEEVDRCSSSSSSSIGENSGLSVRSLDN--DDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-AEA----SSTASIKDIAKPEN
        +P++     SSS+SSSIG NS    +S ++  DD  +NE ES YKGPL  MESLE+VLP+R+GIS +Y+GKSKSFT+L AEA    +S++S+KD+AKPEN
Subjt:  APAEEVDRCSSSSSSSIGENSGLSVRSLDN--DDEEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-AEA----SSTASIKDIAKPEN

Query:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLALAVAM-----------SGSESDLNSRL----PPPIR-----------PPLHPNGRASRSN
         +SR+RRNLL   +         GGISK+ + SSSRS+L LA+A+           SG +S   S       PP +           PPL+P  + S  N
Subjt:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLALAVAM-----------SGSESDLNSRL----PPPIR-----------PPLHPNGRASRSN

Query:  LGSAVPLLCKYPTWRSYSLAD
        L S+   L  +  WRS+S+AD
Subjt:  LGSAVPLLCKYPTWRSYSLAD

AT5G24890.1 unknown protein3.5e-0939.8Show/hide
Query:  SSSSSSIGE--NSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNL
        SS SSSIG   +S       +N++++ +  E   +G   M SLE+ LP +RG+SN Y GKSKSF +L E     S+K++AK EN  +++RR  + + L
Subjt:  SSSSSSIGE--NSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATTGCTTTGGAGAGCAACAACAGGATTCCGCCGTCGGTTTTCTCTCAAGGCGGCTTGCCGTCGTATTGTTCCGTCTTGAATACCACGGGAATTATTCCGGTAGT
CCGACGAGAGACTGCCGTTGGTGATGTGGTGGCGCCGGCCGAGGAGGTGGATAGATGCAGTTCGTCTTCGTCGTCGTCGATTGGGGAAAACAGCGGTTTATCTGTTAGAT
CGTTGGATAATGACGACGAGGAGGATAATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCAAAT
TTCTACAACGGAAAATCGAAATCCTTCACGAGCCTGGCAGAGGCTTCCTCAACGGCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTCTCTCGGAAACGGAGAAA
TCTTCTTGCATCCAATCTCATCGCCGGCGGCATATCGAAGCGACCGATTAGTTCAAGCCGAAGCTCGTTGGCGCTGGCCGTCGCCATGAGTGGTTCTGAAAGCGATCTGA
ATTCGAGATTGCCTCCGCCGATTCGACCTCCATTGCACCCCAACGGACGCGCATCTCGCTCCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATACCCCACTTGGCGA
TCATATTCCTTGGCCGATATTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAATTGCTTTGGAGAGCAACAACAGGATTCCGCCGTCGGTTTTCTCTCAAGGCGGCTTGCCGTCGTATTGTTCCGTCTTGAATACCACGGGAATTATTCCGGTAGT
CCGACGAGAGACTGCCGTTGGTGATGTGGTGGCGCCGGCCGAGGAGGTGGATAGATGCAGTTCGTCTTCGTCGTCGTCGATTGGGGAAAACAGCGGTTTATCTGTTAGAT
CGTTGGATAATGACGACGAGGAGGATAATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCAAAT
TTCTACAACGGAAAATCGAAATCCTTCACGAGCCTGGCAGAGGCTTCCTCAACGGCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTCTCTCGGAAACGGAGAAA
TCTTCTTGCATCCAATCTCATCGCCGGCGGCATATCGAAGCGACCGATTAGTTCAAGCCGAAGCTCGTTGGCGCTGGCCGTCGCCATGAGTGGTTCTGAAAGCGATCTGA
ATTCGAGATTGCCTCCGCCGATTCGACCTCCATTGCACCCCAACGGACGCGCATCTCGCTCCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATACCCCACTTGGCGA
TCATATTCCTTGGCCGATATTCAGTAG
Protein sequenceShow/hide protein sequence
MSIALESNNRIPPSVFSQGGLPSYCSVLNTTGIIPVVRRETAVGDVVAPAEEVDRCSSSSSSSIGENSGLSVRSLDNDDEEDNEAESSYKGPLGMESLEEVLPIRRGISN
FYNGKSKSFTSLAEASSTASIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLALAVAMSGSESDLNSRLPPPIRPPLHPNGRASRSNLGSAVPLLCKYPTWR
SYSLADIQ