CuGenDBv2

Sed0016963 (gene) of Chayote v1 genome

Gene ID: Sed0016963
Organism: Sechium edule (Chayote v1)
Description: Damaged dna-binding 2, putative isoform 1
Genome location: LG08:32800970..32802335
RNA-Seq Expression: Sed0016963
Synteny: Sed0016963
Gene Ontology terms: NA
InterPro domains: NA
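
The genome location field above follows a `SEQID:START..END` pattern. As a minimal sketch, the snippet below splits that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG08:32800970..32802335")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1366 bp on LG08. If the coordinates were 0-based and half-open instead, the length would be `end - start`.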


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa] | e value: 3.2e-67 | % identity: 70.94
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA
        MS+A E  +R           P YCSVLN TG+IPV+RREA+V DA A E+V+  SSSSSSSIGENS FSVRSSD+DD EDNEAESSY+ PL MESLE+ 
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA

Query:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP
        LPIRRGISNFYNGKSKSFTSLADA    SI +IAKPENAFSRKRRNLLAS L +G ISKRP ISSSRSSLALAV +SSS      DLNS   P   IRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        LHPNGRASR N GSAVP LCK+P WRS SMA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata] | e value: 5.9e-69 | % identity: 72.96
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK
        MS+A E  SR           PSYCSVLN TGVIPV+RREA V D  A AE V+  SSSSSSSIGENSDFSVRS +DDD EDNEAESSYK+ L MESLE+
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK

Query:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL
         LPIRRGISNFYNGKSKSFTSL DA    SI DIAKPENAFSRKRRNLLAS L +G ISKRPI SSR SSLALAV MSSS      DLNS  SP+IRPPL
Subjt:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HP GRASRSN GSAVPLLCK+P WRS S+A+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

XP_022997020.1 uncharacterized protein LOC111492074 [Cucurbita maxima] | e value: 2.9e-68 | % identity: 72.53
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK
        MS+A E  SR           PSYCSVLN TGVIPV+RREA V D  A AE V+  SSSSSSSIGENSDFSVRS +DDD EDNEAESSYK+ L MESLE+
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK

Query:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSSD------LNSDFSPSIRPPL
         L IRRGISNFYNGKSKSFTSL DA    SI DIAKPENAFSRKRRNLLAS L +G ISKRPI SSR SSLALAV MSSS+      LNS  SP+IRPPL
Subjt:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSSD------LNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HPNGRASRSN GSAVPLLCK+P WRS S+A+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

XP_023547301.1 uncharacterized protein LOC111806162 [Cucurbita pepo subsp. pepo] | e value: 1.3e-68 | % identity: 72.53
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK
        MS+A E  SR           PSYCSVLN TGVIPV+RR+A V D  A AE V+  SSSSSSSIGENSDFSVRS +DDD EDNEAESSYK+ L MESLE+
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK

Query:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL
         LPIRRGISNFYNGKSKSFTSL DA    SI DIAKPENAFSRKRRNLLAS L +G ISKRPI SSR SSLALAV MSSS      DLNS  SP+IRPPL
Subjt:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HP GRASRSN GSAVPLLCK+P WRS S+A+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

XP_038883984.1 uncharacterized protein LOC120074946 [Benincasa hispida] | e value: 4.5e-69 | % identity: 72.41
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRRE-ASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA
        MS+A E  SR           PSYCSVLN TG IPV+R+E A+V DA A EV+  SSSSSSSIGENS FSVRSSD+D+ EDNEAESSYK PL MESLE+ 
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRRE-ASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA

Query:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSPSIRPPLH
        LPIRRGISNFYNGKSKSFTSLADA    SI DIAKPENAFSRKRRNLLAS L +G ISKRP I+SSRSSLALAV +SSS      DLNS  SP IRPPLH
Subjt:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSPSIRPPLH

Query:  PNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        PNGRASRSN GS VPLLCK+P+WRS S+A+IQ
Subjt:  PNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BCZ8 uncharacterized protein LOC103488525 | e value: 1.6e-67 | % identity: 70.94
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA
        MS+A E  +R           P YCSVLN TG+IPV+RREA+V DA A E+V+  SSSSSSSIGENS FSVRSSD+DD EDNEAESSY+ PL MESLE+ 
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA

Query:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP
        LPIRRGISNFYNGKSKSFTSLADA    SI +IAKPENAFSRKRRNLLAS L +G ISKRP ISSSRSSLALAV +SSS      DLNS   P   IRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        LHPNGRASR N GSAVP LCK+P WRS SMA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 1 | e value: 1.6e-67 | % identity: 70.94
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA
        MS+A E  +R           P YCSVLN TG+IPV+RREA+V DA A E+V+  SSSSSSSIGENS FSVRSSD+DD EDNEAESSY+ PL MESLE+ 
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAAA-EEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKA

Query:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP
        LPIRRGISNFYNGKSKSFTSLADA    SI +IAKPENAFSRKRRNLLAS L +G ISKRP ISSSRSSLALAV +SSS      DLNS   P   IRPP
Subjt:  LPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRP-ISSSRSSLALAVAMSSS------DLNSDFSP--SIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        LHPNGRASR N GSAVP LCK+P WRS SMA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

A0A6J1BS00 uncharacterized protein LOC111004861 | e value: 1.6e-67 | % identity: 70.39
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAA--AEEVESYSSSSSSSIGENSDFSVRSSD-DDDEEDNEAESSYKKPLEMESLE
        MS+A +  SR           PSYCSVLN  G+IPV+RREA+V DA   AEE++  SSSSSSSIGENS  SV+SSD DDD E+NEAESSYK PL MESLE
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCDAA--AEEVESYSSSSSSSIGENSDFSVRSSD-DDDEEDNEAESSYKKPLEMESLE

Query:  KALPIRRGISNFYNGKSKSFTSLADAS----IGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSRSSLALAVAM------SSSDLNSDFSPSIRPPL
        + LP+RRGISNFYNGKSKSFTSLA+AS    I DIAKPENA+SRKRRNLLAS L +G ISKRPISSSRSSLALAVAM      SS DLNS   P IRPPL
Subjt:  KALPIRRGISNFYNGKSKSFTSLADAS----IGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSRSSLALAVAM------SSSDLNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HPNGR+SRSNL S V LLCKYP WRS S+ADIQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

A0A6J1HF10 uncharacterized protein LOC111462922 | e value: 2.8e-69 | % identity: 72.96
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK
        MS+A E  SR           PSYCSVLN TGVIPV+RREA V D  A AE V+  SSSSSSSIGENSDFSVRS +DDD EDNEAESSYK+ L MESLE+
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK

Query:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL
         LPIRRGISNFYNGKSKSFTSL DA    SI DIAKPENAFSRKRRNLLAS L +G ISKRPI SSR SSLALAV MSSS      DLNS  SP+IRPPL
Subjt:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSS------DLNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HP GRASRSN GSAVPLLCK+P WRS S+A+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

A0A6J1K8D9 uncharacterized protein LOC111492074 | e value: 1.4e-68 | % identity: 72.53
Query:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK
        MS+A E  SR           PSYCSVLN TGVIPV+RREA V D  A AE V+  SSSSSSSIGENSDFSVRS +DDD EDNEAESSYK+ L MESLE+
Subjt:  MSVAFEITSR----------SPSYCSVLNATGVIPVLRREASVCD--AAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEK

Query:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSSD------LNSDFSPSIRPPL
         L IRRGISNFYNGKSKSFTSL DA    SI DIAKPENAFSRKRRNLLAS L +G ISKRPI SSR SSLALAV MSSS+      LNS  SP+IRPPL
Subjt:  ALPIRRGISNFYNGKSKSFTSLADA----SIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSR-SSLALAVAMSSSD------LNSDFSPSIRPPL

Query:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
        HPNGRASRSN GSAVPLLCK+P WRS S+A+IQ
Subjt:  HPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24550.1 unknown protein | e value: 5.0e-10 | % identity: 45.36
Query:  SSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE--MESLEKALPIRRGISNFYNGKSKSFTSLADAS--IGDIAKPENAFSRKRRNLLASKL
        SS SSSSIGE+S+      ++++EE+++A S  +  L+    SLE +LPI+RG+SN Y GKSKSF +L +A+    D+ K EN F+++RR ++A+KL
Subjt:  SSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE--MESLEKALPIRRGISNFYNGKSKSFTSLADAS--IGDIAKPENAFSRKRRNLLASKL

AT3G43850.1 unknown protein | e value: 2.4e-20 | % identity: 50.72
Query:  SSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE-MESLEKALPIRRGISNFYNGKSKSFTSLADAS---IGDIAKPENAFSRKRRNLLASKLFS-
        SS+SS SIGEN       SDDD+  +NE ESSY  PL+ MESLE+ALPI+R IS FY GKSKSF SL++ S   + D+ KPEN +SR+RRNLL+ ++ S 
Subjt:  SSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE-MESLEKALPIRRGISNFYNGKSKSFTSLADAS---IGDIAKPENAFSRKRRNLLASKLFS-

Query:  GSISKRPISSSRSSLALAVAMSSSDLNS---DFSPSIR
        G ISK+P  S      LA++    D +S   D  P++R
Subjt:  GSISKRPISSSRSSLALAVAMSSSDLNS---DFSPSIR

AT4G31510.1 unknown protein | e value: 1.2e-08 | % identity: 44.09
Query:  SSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE--MESLEKALPIRRGISNFYNGKSKSFTSLADAS-IGDIAKPENAFSRKRRNLLASKL
        SSSS+GE       +S+++++ED+   SS  + L     SLE +LPI+RG+SN Y GKSKSF +L +AS   D+ K E+  +++RR L+A+KL
Subjt:  SSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLE--MESLEKALPIRRGISNFYNGKSKSFTSLADAS-IGDIAKPENAFSRKRRNLLASKL

AT5G21940.1 unknown protein | e value: 6.6e-26 | % identity: 41.38
Query:  PVLRREASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDD--DDEEDNEAESSYKKPLE-MESLEKALPIRRGISNFYNGKSKSFTSL---------AD
        PV+    S   + ++   S SSS+SSSIG NSD   +SS+D  DD  +NE ES YK PLE MESLE+ LP+R+GIS +Y+GKSKSFT+L         + 
Subjt:  PVLRREASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDD--DDEEDNEAESSYKKPLE-MESLEKALPIRRGISNFYNGKSKSFTSL---------AD

Query:  ASIGDIAKPENAFSRKRRNLLASKLFS-------GSISKRPI-SSSRSSLALAVAMSS-------SDLNSDFSPSIR---------------------PP
        +S+ D+AKPEN +SR+RRNLL  +++        G ISK+ + SSSRS+L LA+A+++       S    D SP                        PP
Subjt:  ASIGDIAKPENAFSRKRRNLLASKLFS-------GSISKRPI-SSSRSSLALAVAMSS-------SDLNSDFSPSIR---------------------PP

Query:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMAD
        L+P  + S  NL S+   L  + AWRS S+AD
Subjt:  LHPNGRASRSNLGSAVPLLCKYPAWRSCSMAD

AT5G24890.1 unknown protein | e value: 3.6e-08 | % identity: 36.88
Query:  AFEITSRSPSYCSVLNATGVIPVLRREASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPL------EMESLEKALPIRRGIS
        A E  S S S     N  GV      E+ +    + +   Y SS SSSIG   D    S +D++E +NE +    K L       M SLE +LP +RG+S
Subjt:  AFEITSRSPSYCSVLNATGVIPVLRREASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPL------EMESLEKALPIRRGIS

Query:  NFYNGKSKSFTSLAD-ASIGDIAKPENAFSRKRRNLLASKL
        N Y GKSKSF +L +  S+ ++AK EN  +++RR  + +KL
Subjt:  NFYNGKSKSFTSLAD-ASIGDIAKPENAFSRKRRNLLASKL


Sequences
CDS sequence
ATGTCAGTTGCTTTCGAGATCACTAGCAGATCGCCGTCCTACTGCTCCGTCTTGAATGCCACCGGAGTTATCCCGGTACTCCGGCGAGAGGCTTCCGTTTGTGATGCGGC
GGCGGAGGAGGTGGAAAGTTACAGTTCGTCTTCATCGTCGTCGATTGGAGAAAACAGTGATTTCTCGGTCAGATCGTCGGATGACGACGACGAAGAAGATAACGAGGCGG
AAAGTTCGTACAAAAAACCTCTAGAAATGGAGTCGTTGGAAAAAGCCTTGCCGATCAGGAGAGGAATTTCGAATTTCTACAACGGAAAATCGAAATCGTTCACAAGTCTG
GCAGACGCTTCGATCGGAGACATAGCAAAGCCTGAAAACGCGTTTTCTCGGAAGCGGAGAAATCTTCTTGCATCGAAGCTTTTCTCCGGCAGCATATCGAAGCGGCCGAT
CAGTTCGAGTCGAAGCTCGTTGGCTTTGGCCGTTGCAATGAGCAGTTCCGATCTGAATTCGGACTTTTCTCCGTCGATTCGGCCGCCGTTGCATCCCAACGGGCGAGCAT
CTCGGAGCAATTTAGGCTCTGCGGTTCCTCTTCTCTGTAAATATCCCGCTTGGCGATCGTGTTCCATGGCCGACATACAGTAA
mRNA sequence
TGGAAGCCCGGCAGATGAGGATAAGGCCTTCGGCTTTCCTTTTCTGAATTATTTATCAAAGATATAATATTTCGCTATTCTCCCCCTTCTTCATTAATTGTTCCCATAAT
CCCCATTTTCTAATTTCTATATATAAATTTCTTGTCTCTTTTTCTTTCAACAATCTCATGTTCTTCATTTCAATTCTCTTCACTTCAAATCTGGAATTGCTTTGGTTTCT
TCTTCTTGTTCCTCGATTTCGATCTGAAACACTCGATCATGTCAGTTGCTTTCGAGATCACTAGCAGATCGCCGTCCTACTGCTCCGTCTTGAATGCCACCGGAGTTATC
CCGGTACTCCGGCGAGAGGCTTCCGTTTGTGATGCGGCGGCGGAGGAGGTGGAAAGTTACAGTTCGTCTTCATCGTCGTCGATTGGAGAAAACAGTGATTTCTCGGTCAG
ATCGTCGGATGACGACGACGAAGAAGATAACGAGGCGGAAAGTTCGTACAAAAAACCTCTAGAAATGGAGTCGTTGGAAAAAGCCTTGCCGATCAGGAGAGGAATTTCGA
ATTTCTACAACGGAAAATCGAAATCGTTCACAAGTCTGGCAGACGCTTCGATCGGAGACATAGCAAAGCCTGAAAACGCGTTTTCTCGGAAGCGGAGAAATCTTCTTGCA
TCGAAGCTTTTCTCCGGCAGCATATCGAAGCGGCCGATCAGTTCGAGTCGAAGCTCGTTGGCTTTGGCCGTTGCAATGAGCAGTTCCGATCTGAATTCGGACTTTTCTCC
GTCGATTCGGCCGCCGTTGCATCCCAACGGGCGAGCATCTCGGAGCAATTTAGGCTCTGCGGTTCCTCTTCTCTGTAAATATCCCGCTTGGCGATCGTGTTCCATGGCCG
ACATACAGTAAACTTAGGGTTTCTTCGTTGATGACTAACTTGACGTTTGGAGACCGATTGTTTCGTCAAATTGGTCTGTTCGTATGATTTGTTTTTCTTTTGAAATGTTC
TTGAGCTTACTTTTACGGCATCTTTGTTTTTAATATTTTTGTATGTAAATTGAAAAGGGGAAATGAATAGAAATTGATCCATGGTATCTATGTTTATTTGTTGATGGGAC
TAGGAAAGAAATGGGAAACCAACTGTATTCATTTTTTGGAAAAATGTTAATTATGAATTACACATTAATATATTTAACATATATGGTGAATTAAGTGGA
Protein sequence
MSVAFEITSRSPSYCSVLNATGVIPVLRREASVCDAAAEEVESYSSSSSSSIGENSDFSVRSSDDDDEEDNEAESSYKKPLEMESLEKALPIRRGISNFYNGKSKSFTSL
ADASIGDIAKPENAFSRKRRNLLASKLFSGSISKRPISSSRSSLALAVAMSSSDLNSDFSPSIRPPLHPNGRASRSNLGSAVPLLCKYPAWRSCSMADIQ
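
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch using only the Python standard library and assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical. It translates the first 30 nucleotides and the last 12 nucleotides of the CDS shown above rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
cds_start = "ATGTCAGTTGCTTTCGAGATCACTAGCAGA"
cds_end = "GACATACAGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MSVAFEITSR` and `DIQ`, which agree with the first ten and last three residues of the protein sequence. Running `translate` on the full CDS, joined into a single string, would extend the same check to the whole record.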