CuGenDBv2

Lsi08G008290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi08G008290
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Damaged dna-binding 2, putative isoform 1
Genome location: chr08:16701038..16702884
RNA-Seq Expression: Lsi08G008290
Synteny: Lsi08G008290
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA
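
The genome location field above packs the chromosome and coordinate range into one string. A minimal sketch of splitting it apart and computing the span follows; the function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into its parts and compute the span.

    Assumption (not stated in the record): coordinates are 1-based and
    inclusive, so the span is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr08:16701038..16702884")
print(chrom, start, end, span)
```

Under that assumption the gene spans 1847 bp on chr08.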


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa] | e-value: 1.2e-99 | identity: 90.18%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF
        IPPSVFSQ GLP YCSVLNTTGIIPVVRREAAV DAVAP +VDRCSSSSSSSIGENSGFSVRS DNDDGEDNEAESSY+GPLGMESLEEVLPIRRGISNF
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF

Query:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS
        YNGKSKSFTSLADA SSSSIK+IAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSESH N+D NS +  P PIRPPLHPNGRASR 
Subjt:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS

Query:  NSGSTVPLLCKFPTWRSYSLANIQ
        NSGS VP LCKFPTWRSYS+ANIQ
Subjt:  NSGSTVPLLCKFPTWRSYSLANIQ
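
The % identity reported for a hit can be recovered from the alignment's middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those columns for one alignment block; `midline_stats` is a hypothetical helper, not part of any BLAST tool, and it assumes the midline has been stripped of its leading indentation.

```python
def midline_stats(query, midline):
    """Return (identical, positives, percent_identity) for one block.

    Assumes `query` and `midline` cover the same alignment columns;
    a midline that lost trailing spaces is padded back to full length.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    # Letters in the midline are identical columns.
    identical = sum(ch.isalpha() for ch in midline)
    # Positives are identical columns plus conservative substitutions.
    positives = identical + midline.count("+")
    return identical, positives, 100.0 * identical / length


# Last block of the first GenBank hit above.
print(midline_stats("NSGSTVPLLCKFPTWRSYSLANIQ",
                    "NSGS VP LCKFPTWRSYS+ANIQ"))
```

Summed over all three blocks of this hit, the same count appears to give 202 identical columns out of 224, which agrees with the reported 90.18%.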

XP_004144215.1 uncharacterized protein LOC101211014 [Cucumis sativus] | e-value: 4.6e-96 | identity: 89.14%
Query:  SVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG
        +VFSQ GLPSYCSVLNTTGIIPVVRREAA+ DAVAPA+VDRC+SSSSSSIGENSGFSVRS DNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG
Subjt:  SVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG

Query:  KSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLP--IRPPLHPNGRASRSNSG
        KSKSFTSL DA SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSE H N+D NS LP P  IRPPL+PNGR SR NSG
Subjt:  KSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLP--IRPPLHPNGRASRSNSG

Query:  STVPLLCKFPTWRSYSLANIQ
        S VP LCKFPTWRSYS+ANIQ
Subjt:  STVPLLCKFPTWRSYSLANIQ

XP_008445543.1 PREDICTED: uncharacterized protein LOC103488525 [Cucumis melo] | e-value: 1.2e-99 | identity: 90.18%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF
        IPPSVFSQ GLP YCSVLNTTGIIPVVRREAAV DAVAP +VDRCSSSSSSSIGENSGFSVRS DNDDGEDNEAESSY+GPLGMESLEEVLPIRRGISNF
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF

Query:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS
        YNGKSKSFTSLADA SSSSIK+IAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSESH N+D NS +  P PIRPPLHPNGRASR 
Subjt:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS

Query:  NSGSTVPLLCKFPTWRSYSLANIQ
        NSGS VP LCKFPTWRSYS+ANIQ
Subjt:  NSGSTVPLLCKFPTWRSYSLANIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata] | e-value: 2.8e-93 | identity: 87%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN
        IPPSVFSQG LPSYCSVLNTTG+IPVVRREA VGD VAPAE VDRCSSSSSSSIGENS FSVRS+++DDGEDNEAESSYK  LGMESLEEVLPIRRGISN
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN

Query:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN
        FYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII S  SSLALAV +SSSE  + +D NSRL   IRPPLHP GRASRSN
Subjt:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN

Query:  SGSTVPLLCKFPTWRSYSLANIQ
        SGS VPLLCKFPTWRSYSLANIQ
Subjt:  SGSTVPLLCKFPTWRSYSLANIQ

XP_038883984.1 uncharacterized protein LOC120074946 [Benincasa hispida] | e-value: 3.4e-99 | identity: 92.38%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRRE-AAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN
        IPPSVFSQ GLPSYCSVLNTTG IPVVR+E AAVGDAVA AEVD CSSSSSSSIGENSGFSVRS DND+GEDNEAESSYKGPLGMESLEEVLPIRRGISN
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRRE-AAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN

Query:  FYNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN
        FYNGKSKSFTSLADA SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSESHN++D NSRL  PIRPPLHPNGRASRSN
Subjt:  FYNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN

Query:  SGSTVPLLCKFPTWRSYSLANIQ
        SGS VPLLCKFP+WRSYSLANIQ
Subjt:  SGSTVPLLCKFPTWRSYSLANIQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KFI7 Uncharacterized protein | e-value: 2.2e-96 | identity: 89.14%
Query:  SVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG
        +VFSQ GLPSYCSVLNTTGIIPVVRREAA+ DAVAPA+VDRC+SSSSSSIGENSGFSVRS DNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG
Subjt:  SVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNG

Query:  KSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLP--IRPPLHPNGRASRSNSG
        KSKSFTSL DA SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSE H N+D NS LP P  IRPPL+PNGR SR NSG
Subjt:  KSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLP--IRPPLHPNGRASRSNSG

Query:  STVPLLCKFPTWRSYSLANIQ
        S VP LCKFPTWRSYS+ANIQ
Subjt:  STVPLLCKFPTWRSYSLANIQ

A0A1S3BCZ8 uncharacterized protein LOC103488525 | e-value: 5.6e-100 | identity: 90.18%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF
        IPPSVFSQ GLP YCSVLNTTGIIPVVRREAAV DAVAP +VDRCSSSSSSSIGENSGFSVRS DNDDGEDNEAESSY+GPLGMESLEEVLPIRRGISNF
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF

Query:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS
        YNGKSKSFTSLADA SSSSIK+IAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSESH N+D NS +  P PIRPPLHPNGRASR 
Subjt:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS

Query:  NSGSTVPLLCKFPTWRSYSLANIQ
        NSGS VP LCKFPTWRSYS+ANIQ
Subjt:  NSGSTVPLLCKFPTWRSYSLANIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 1 | e-value: 5.6e-100 | identity: 90.18%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF
        IPPSVFSQ GLP YCSVLNTTGIIPVVRREAAV DAVAP +VDRCSSSSSSSIGENSGFSVRS DNDDGEDNEAESSY+GPLGMESLEEVLPIRRGISNF
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNF

Query:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS
        YNGKSKSFTSLADA SSSSIK+IAKPENAFSRKRRNLLASNLIAGGISKRPII SSRSSLALAVVLSSSESH N+D NS +  P PIRPPLHPNGRASR 
Subjt:  YNGKSKSFTSLADA-SSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRL--PLPIRPPLHPNGRASRS

Query:  NSGSTVPLLCKFPTWRSYSLANIQ
        NSGS VP LCKFPTWRSYS+ANIQ
Subjt:  NSGSTVPLLCKFPTWRSYSLANIQ

A0A6J1HF10 uncharacterized protein LOC111462922 | e-value: 1.3e-93 | identity: 87%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN
        IPPSVFSQG LPSYCSVLNTTG+IPVVRREA VGD VAPAE VDRCSSSSSSSIGENS FSVRS+++DDGEDNEAESSYK  LGMESLEEVLPIRRGISN
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN

Query:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN
        FYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII S  SSLALAV +SSSE  + +D NSRL   IRPPLHP GRASRSN
Subjt:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN

Query:  SGSTVPLLCKFPTWRSYSLANIQ
        SGS VPLLCKFPTWRSYSLANIQ
Subjt:  SGSTVPLLCKFPTWRSYSLANIQ

A0A6J1K8D9 uncharacterized protein LOC111492074 | e-value: 1.1e-92 | identity: 87%
Query:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN
        IPPSVFSQG LPSYCSVLNTTG+IPVVRREA VGD VAPAE VDRCSSSSSSSIGENS FSVRS+++DDGEDNEAESSYK  LGMESLEEVL IRRGISN
Subjt:  IPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAE-VDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISN

Query:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN
        FYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPII S  SSLALAV +SSSE  + D  NSRL   IRPPLHPNGRASRSN
Subjt:  FYNGKSKSFTSLADASS-SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSN

Query:  SGSTVPLLCKFPTWRSYSLANIQ
        SGS VPLLCKFPTWRSYSLANIQ
Subjt:  SGSTVPLLCKFPTWRSYSLANIQ

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G24550.1 unknown protein | e-value: 1.4e-10 | identity: 37.5%
Query:  SSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNLIA
        SS SSSSIGE+S       + ++ E+++A S  +G L     SLE+ LPI+RG+SN Y GKSKSF +L +A+S + KD+ K EN F+++RR ++A+ L  
Subjt:  SSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNLIA

Query:  GGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPP
         G S                 +S+S  ++  +PNS   L ++ P
Subjt:  GGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPP

AT3G43850.1 unknown protein | e-value: 5.9e-25 | identity: 54.2%
Query:  SSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNLIA-
        SS+SS SIGENS       D+D+G +NE ESSY GPL  MESLEE LPI+R IS FY GKSKSF SL++ SS  +KD+ KPEN +SR+RRNLL+  + + 
Subjt:  SSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNLIA-

Query:  GGISKRPIIRSSRSSLALAVVLSSSESHNND
        GGISK+P     +S LA++     S S  +D
Subjt:  GGISKRPIIRSSRSSLALAVVLSSSESHNND

AT4G31510.1 unknown protein | e-value: 3.8e-08 | identity: 35.98%
Query:  RREAAVGDAVAPAEVD------RCSSS----SSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADAS
        R      D   PA +       RC  S    SSSS+GE S       +N++ ED+   SS    L     SLE+ LPI+RG+SN Y GKSKSF +L +AS
Subjt:  RREAAVGDAVAPAEVD------RCSSS----SSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADAS

Query:  SSSIKDIAKPENAFSRKRRNLLASNL-IAGGISKRPIIR--SSRSSLALAVVLSSSESHN-NDD
        +++  D+ K E+  +++RR L+A+ L     +S   I    +  S   LA+  S +E H  NDD
Subjt:  SSSIKDIAKPENAFSRKRRNLLASNL-IAGGISKRPIIR--SSRSSLALAVVLSSSESHN-NDD

AT5G21940.1 unknown protein | e-value: 4.4e-28 | identity: 43.4%
Query:  SSSSSSSIGENSGFSVRSLDN--DDGEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL------ADASSSSIKDIAKPENAFSRKRRNL
        SSS+SSSIG NS    +S ++  DD  +NE ES YKGPL  MESLE+VLP+R+GIS +Y+GKSKSFT+L      A  SSSS+KD+AKPEN +SR+RRNL
Subjt:  SSSSSSSIGENSGFSVRSLDN--DDGEDNEAESSYKGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL------ADASSSSIKDIAKPENAFSRKRRNL

Query:  LASNL-------IAGGISKRPIIRSSRSSLALAVVLSS-------SESHNNDDPNS---------------RLPLPIRPPLHPNGRASRSNSGSTVPLLC
        L   +         GGISK+ ++ SSRS+L LA+ +++       S S  +  P S               +  +   PPL+P  + S  N  S+   L 
Subjt:  LASNL-------IAGGISKRPIIRSSRSSLALAVVLSS-------SESHNNDDPNS---------------RLPLPIRPPLHPNGRASRSNSGSTVPLLC

Query:  KFPTWRSYSLAN
         F  WRS+S+A+
Subjt:  KFPTWRSYSLAN

AT5G24890.1 unknown protein | e-value: 1.3e-08 | identity: 39.18%
Query:  SSSSSSIGE--NSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNL
        SS SSSIG   +S       +N++ + +  E   +G   M SLE+ LP +RG+SN Y GKSKSF +L +    S+K++AK EN  +++RR  + + L
Subjt:  SSSSSSIGE--NSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLADASSSSIKDIAKPENAFSRKRRNLLASNL


Sequences
CDS sequence
ATGGCGATTCCGCCGTCTGTTTTCTCGCAAGGTGGCTTGCCGTCGTACTGCTCCGTCTTGAATACGACGGGGATTATTCCGGTAGTTCGGCGAGAGGCGGCCGTTGGTGA
TGCGGTGGCGCCGGCAGAGGTGGATAGATGTAGTTCGTCTTCATCGTCGTCGATCGGAGAAAACAGTGGTTTCTCTGTTCGATCGTTGGATAATGACGATGGAGAGGATA
ATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCTTTGGAAGAAGTTTTGCCTATCAGGAGAGGAATTTCAAATTTCTACAACGGAAAATCGAAATCCTTC
ACAAGCTTGGCAGACGCTTCCTCTTCTTCCATTAAAGACATAGCAAAGCCTGAAAACGCCTTCTCTCGGAAACGGAGAAATCTTCTTGCATCTAATCTCATCGCCGGCGG
CATATCAAAGCGACCGATTATTAGGTCAAGTCGAAGCTCGTTAGCGTTGGCCGTCGTCCTGAGCAGTTCTGAAAGCCACAACAATGACGATCCGAACTCGAGATTGCCTC
TGCCGATTCGTCCTCCATTGCACCCCAACGGACGAGCATCTCGCAGCAATTCAGGTTCAACAGTTCCTCTTCTCTGTAAATTCCCCACTTGGCGATCATACTCCTTGGCC
AATATACAGTAG
mRNA sequence
ATGGCGATTCCGCCGTCTGTTTTCTCGCAAGGTGGCTTGCCGTCGTACTGCTCCGTCTTGAATACGACGGGGATTATTCCGGTAGTTCGGCGAGAGGCGGCCGTTGGTGA
TGCGGTGGCGCCGGCAGAGGTGGATAGATGTAGTTCGTCTTCATCGTCGTCGATCGGAGAAAACAGTGGTTTCTCTGTTCGATCGTTGGATAATGACGATGGAGAGGATA
ATGAGGCGGAAAGTTCGTATAAAGGACCTCTAGGAATGGAATCTTTGGAAGAAGTTTTGCCTATCAGGAGAGGAATTTCAAATTTCTACAACGGAAAATCGAAATCCTTC
ACAAGCTTGGCAGACGCTTCCTCTTCTTCCATTAAAGACATAGCAAAGCCTGAAAACGCCTTCTCTCGGAAACGGAGAAATCTTCTTGCATCTAATCTCATCGCCGGCGG
CATATCAAAGCGACCGATTATTAGGTCAAGTCGAAGCTCGTTAGCGTTGGCCGTCGTCCTGAGCAGTTCTGAAAGCCACAACAATGACGATCCGAACTCGAGATTGCCTC
TGCCGATTCGTCCTCCATTGCACCCCAACGGACGAGCATCTCGCAGCAATTCAGGTTCAACAGTTCCTCTTCTCTGTAAATTCCCCACTTGGCGATCATACTCCTTGGCC
AATATACAGTAGCATAGGGTAAGGCTTTTTCCATGGCTGCCTCCATGGAAGCTAATCATCATCAAAGACTAACTCGACCTTGCGAAACCGATCTTCATCAAATCGGTTTC
TCACGAATAACTCATTTCCTTTCAAATTGTCCACTATCTTACTTCTGCAGCAACTCTGTTTTGATTTTTTTTTCCTTTCTGTATGTATGTATTGTAAATTGCAAAGAGAA
AAAGAATAGCAGTTCATCCATCGAATCGATGTCTGTTTGTTTATAGAAATTGAAATAGCAAATTGAGAGATCTGAGAGTTGCTTTTCCAGATAAAGTTAAAGGAAAAATT
ACAGATTATTGTGTAGCAGAAACTTTGGGGCAAATTGAAAACGAATGATAATGAAGTGTCAATGTCAATGTCAATGGTGCAAGGAATTGTGTGGAGATGGTGATTAGGGC
CCAACTTGGGGTGGTTTGCTGGTCTTCTCTGGTCACACTCCAGTTTCGATGAGAACACACGGCCAGTTCCTTTCGACTGCATGTCTCTGCCCCTTTTAATTTCTCCCCGT
TCAGTTTGTTTTGAAGAGCGAG
Protein sequence
MAIPPSVFSQGGLPSYCSVLNTTGIIPVVRREAAVGDAVAPAEVDRCSSSSSSSIGENSGFSVRSLDNDDGEDNEAESSYKGPLGMESLEEVLPIRRGISNFYNGKSKSF
TSLADASSSSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPIIRSSRSSLALAVVLSSSESHNNDDPNSRLPLPIRPPLHPNGRASRSNSGSTVPLLCKFPTWRSYSLA
NIQ
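
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below builds the standard genetic code (NCBI translation table 1) and translates the first and last codons of the CDS. The helper name `translate` is hypothetical, and using the standard table for this nuclear gene is an assumption rather than something the record states.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGCGATTCCGCCGTCTGTTTTC"))  # MAIPPSVF
print(translate("AATATACAGTAG"))              # NIQ*
```

Both fragments agree with the start (MAIPPSVF) and end (NIQ) of the listed protein sequence, with the final TAG codon acting as the stop.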