CuGenDBv2

Tan0010174 (gene) of Snake gourd v1 genome

Gene ID: Tan0010174
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Damaged dna-binding 2, putative isoform 1
Genome location: LG02:72916166..72917887
RNA-Seq Expression: Tan0010174
Synteny: Tan0010174
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064705.1 Damaged dna-binding 2, putative isoform 1 [Cucumis melo var. makuwa] (e value: 1.1e-92; % identity: 84.32)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESN+RI PSVFSQ G P YCSVLNTTGIIPVVRREAAV D A AP E+VDRCSSSSSSSIGENSGFSVRSSDNDD EDNEAESS+ GPLGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR
        EVLPIRRGISNFYNGKSKSFTSLADASS+SSIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLA+A+ +SSS      DLNS +  P PIR
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR

Query:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PPLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_008445543.1 PREDICTED: uncharacterized protein LOC103488525 [Cucumis melo] (e value: 1.1e-92; % identity: 84.32)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESN+RI PSVFSQ G P YCSVLNTTGIIPVVRREAAV D A AP E+VDRCSSSSSSSIGENSGFSVRSSDNDD EDNEAESS+ GPLGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR
        EVLPIRRGISNFYNGKSKSFTSLADASS+SSIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLA+A+ +SSS      DLNS +  P PIR
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR

Query:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PPLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022131782.1 uncharacterized protein LOC111004861 [Momordica charantia] (e value: 4.7e-96; % identity: 85.04)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDE-EDNEAESSFEGPLGMESL
        MSIAL+ NSRI PSVFSQGG PSYCSVLNT GIIPVVRREAAVGD A APAEE+DRCSSSSSSSIGENSG SV+SSDNDD+ E+NEAESS++GPLGMESL
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDE-EDNEAESSFEGPLGMESL

Query:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLAMAIAM------SSSDLNSRLPLPIRPP
        EEVLP+RRGISNFYNGKSKSFTSLA+ASSA++IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLA+A+AM      SS DLNSR P PIRPP
Subjt:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLAMAIAM------SSSDLNSRLPLPIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        LHPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_022962518.1 uncharacterized protein LOC111462922 [Cucurbita moschata] (e value: 1.3e-90; % identity: 83.76)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESNSRI PSVFSQG  PSYCSVLNTTG+IPVVRREA VGD   APAE VDRCSSSSSSSIGENS FSVRS ++DD EDNEAESS++  LGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSS------DLNSRLPLPIRPP
        EVLPIRRGISNFYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLA+A+ MSSS      DLNSRL   IRPP
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSS------DLNSRLPLPIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        LHP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

XP_038883984.1 uncharacterized protein LOC120074946 [Benincasa hispida] (e value: 1.8e-92; % identity: 84.68)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRRE-AAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESL
        MSIALESNSRI PSVFSQ G PSYCSVLNTTG IPVVR+E AAVGD  AA   EVD CSSSSSSSIGENSGFSVRSSDND+ EDNEAESS++GPLGMESL
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRRE-AAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESL

Query:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRLPLPIRP
        EEVLPIRRGISNFYNGKSKSFTSLADASS+SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP I+SSRSSLA+A+ +SSS      DLNSRL  PIRP
Subjt:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRLPLPIRP

Query:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PLHPNGRASRSN GS VPLLCK+P+WRSYSLA+IQ
Subjt:  PLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BCZ8 uncharacterized protein LOC103488525 (e value: 5.2e-93; % identity: 84.32)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESN+RI PSVFSQ G P YCSVLNTTGIIPVVRREAAV D A AP E+VDRCSSSSSSSIGENSGFSVRSSDNDD EDNEAESS+ GPLGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR
        EVLPIRRGISNFYNGKSKSFTSLADASS+SSIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLA+A+ +SSS      DLNS +  P PIR
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR

Query:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PPLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A5A7VFP0 Damaged dna-binding 2, putative isoform 1 (e value: 5.2e-93; % identity: 84.32)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESN+RI PSVFSQ G P YCSVLNTTGIIPVVRREAAV D A AP E+VDRCSSSSSSSIGENSGFSVRSSDNDD EDNEAESS+ GPLGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR
        EVLPIRRGISNFYNGKSKSFTSLADASS+SSIK+IAKPENAFSRKRRNLLASNLIAGGISKRP ISSSRSSLA+A+ +SSS      DLNS +  P PIR
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRP-ISSSRSSLAMAIAMSSS------DLNSRL--PLPIR

Query:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        PPLHPNGRASR N GSAVP LCK+PTWRSYS+A+IQ
Subjt:  PPLHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1BS00 uncharacterized protein LOC111004861 (e value: 2.3e-96; % identity: 85.04)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDE-EDNEAESSFEGPLGMESL
        MSIAL+ NSRI PSVFSQGG PSYCSVLNT GIIPVVRREAAVGD A APAEE+DRCSSSSSSSIGENSG SV+SSDNDD+ E+NEAESS++GPLGMESL
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDE-EDNEAESSFEGPLGMESL

Query:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLAMAIAM------SSSDLNSRLPLPIRPP
        EEVLP+RRGISNFYNGKSKSFTSLA+ASSA++IKDIAKPENA+SRKRRNLLASNLIAGGISKRPISSSRSSLA+A+AM      SS DLNSR P PIRPP
Subjt:  EEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLAMAIAM------SSSDLNSRLPLPIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        LHPNGR+SRSNL S V LLCKYPTWRSYSLADIQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1HF10 uncharacterized protein LOC111462922 (e value: 6.4e-91; % identity: 83.76)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESNSRI PSVFSQG  PSYCSVLNTTG+IPVVRREA VGD   APAE VDRCSSSSSSSIGENS FSVRS ++DD EDNEAESS++  LGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSS------DLNSRLPLPIRPP
        EVLPIRRGISNFYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLA+A+ MSSS      DLNSRL   IRPP
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSS------DLNSRLPLPIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        LHP GRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

A0A6J1K8D9 uncharacterized protein LOC111492074 (e value: 3.2e-90; % identity: 83.33)
Query:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE
        MSIALESNSRI PSVFSQG  PSYCSVLNTTG+IPVVRREA VGD   APAE VDRCSSSSSSSIGENS FSVRS ++DD EDNEAESS++  LGMESLE
Subjt:  MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLE

Query:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSSD------LNSRLPLPIRPP
        EVL IRRGISNFYNGKSKSFTSL DASS SSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPI SSR SSLA+A+ MSSS+      LNSRL   IRPP
Subjt:  EVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSR-SSLAMAIAMSSSD------LNSRLPLPIRPP

Query:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ
        LHPNGRASRSN GSAVPLLCK+PTWRSYSLA+IQ
Subjt:  LHPNGRASRSNLGSAVPLLCKYPTWRSYSLADIQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G24550.1 unknown protein (e value: 8.4e-11; % identity: 46.67)
Query:  SSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLI
        SS SSSSIGE+      S + ++EE+++A S   G L     SLE+ LPI+RG+SN Y GKSKSF +L +A  AS  KD+ K EN F+++RR ++A+ L 
Subjt:  SSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLI

Query:  AGGIS
          G S
Subjt:  AGGIS

AT3G43850.1 unknown protein (e value: 4.0e-21; % identity: 52.24)
Query:  SSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIA
        SS+SS SIGEN       SD+D+  +NE ESS+ GPL  MESLEE LPI+R IS FY GKSKSF SL++ SS   +KD+ KPEN +SR+RRNLL+  + +
Subjt:  SSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIA

Query:  -GGISKRPISSSRSSLAMAIAMSSSDLNSRLPLP
         GGISK+P    +S LAM+     S  +    LP
Subjt:  -GGISKRPISSSRSSLAMAIAMSSSDLNSRLPLP

AT4G31510.1 unknown protein (e value: 6.7e-08; % identity: 41.9)
Query:  RCSSS----SSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNL
        RC  S    SSSS+GE       +S+N+++ED+   SS    L     SLE+ LPI+RG+SN Y GKSKSF +L +AS+ +   D+ K E+  +++RR L
Subjt:  RCSSS----SSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLG--MESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNL

Query:  LASNL
        +A+ L
Subjt:  LASNL

AT5G21940.1 unknown protein (e value: 2.4e-26; % identity: 42.99)
Query:  APAEEVDRCSSSSSSSIGENSGFSVRSSDN--DDEEDNEAESSFEGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-ADASSA----SSIKDIAKPEN
        +P++     SSS+SSSIG NS    +SS++  DD  +NE ES ++GPL  MESLE+VLP+R+GIS +Y+GKSKSFT+L A+A+SA    SS+KD+AKPEN
Subjt:  APAEEVDRCSSSSSSSIGENSGFSVRSSDN--DDEEDNEAESSFEGPLG-MESLEEVLPIRRGISNFYNGKSKSFTSL-ADASSA----SSIKDIAKPEN

Query:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLAMAIAMS------------------SSDLNSRLP----------LPIRPPLHPNGRASRSN
         +SR+RRNLL   +         GGISK+ + SSSRS+L +A+A++                  SS   S  P          +   PPL+P  + S  N
Subjt:  AFSRKRRNLLASNL-------IAGGISKRPI-SSSRSSLAMAIAMS------------------SSDLNSRLP----------LPIRPPLHPNGRASRSN

Query:  LGSAVPLLCKYPTWRSYSLAD
        L S+   L  +  WRS+S+AD
Subjt:  LGSAVPLLCKYPTWRSYSLAD

AT5G24890.1 unknown protein (e value: 1.8e-08; % identity: 39.8)
Query:  SSSSSSIGE--NSGFSVRSSDNDDEEDNEAESSFEGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNL
        SS SSSIG   +S      S+N++++ +  E    G   M SLE+ LP +RG+SN Y GKSKSF +L +     S+K++AK EN  +++RR  + + L
Subjt:  SSSSSSIGE--NSGFSVRSSDNDDEEDNEAESSFEGPLGMESLEEVLPIRRGISNFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNL


Sequences
CDS sequence
ATGTCAATTGCTTTGGAAAGCAATAGCAGGATTTCGCCGTCCGTTTTCTCGCAAGGCGGCTTTCCCTCGTACTGCTCCGTCTTGAATACCACCGGAATTATTCCGGTTGT
CCGACGAGAGGCTGCCGTTGGTGATCATGCGGCGGCACCGGCGGAGGAGGTGGATAGATGTAGTTCGTCGTCGTCGTCGTCGATTGGGGAGAACAGTGGTTTCTCTGTAA
GATCGTCGGATAATGACGATGAAGAGGATAATGAGGCGGAAAGTTCGTTTGAAGGACCTTTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCA
AATTTCTACAACGGAAAATCGAAATCCTTCACGAGCCTGGCAGATGCTTCCTCGGCTTCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTTTCTCGAAAACGGAG
AAATCTTCTTGCATCGAATCTCATCGCCGGCGGGATATCGAAGCGACCGATCAGTTCAAGTCGAAGCTCGTTGGCTATGGCCATCGCCATGAGCAGTTCTGATCTGAATT
CGAGACTACCTCTGCCGATTCGACCTCCATTGCACCCCAACGGACGAGCATCTCGCAGCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATATCCCACTTGGCGATCA
TACTCCTTGGCCGATATACAGTAG
mRNA sequence
GGATAAGGCCTTCGGCTTTCCTTTTCTGAATTCTTTATCCTTTTGTTTTCATATCATTAAAAACTCTCTCATCGAAAATAATTTTACTGTTTTTCTTCATTAATTATTCC
GAATCCATATTTTCTATATATAAATTCTTGTCTCTTCTTCCTTGAACAATCTCAGTTTCTCAATCTGATTTATTTGCAATTCCATGGCGCTTTTCTTCATCCAGTGATTG
AATGATCACATCCCTTCCTTCAAATTCTCTCTTCAGTTGAAATTTCCAATTATTCTGGTTTATCTCCTTCATCGCTTTGGATCTGAAACAGTTACTCATGTCAATTGCTT
TGGAAAGCAATAGCAGGATTTCGCCGTCCGTTTTCTCGCAAGGCGGCTTTCCCTCGTACTGCTCCGTCTTGAATACCACCGGAATTATTCCGGTTGTCCGACGAGAGGCT
GCCGTTGGTGATCATGCGGCGGCACCGGCGGAGGAGGTGGATAGATGTAGTTCGTCGTCGTCGTCGTCGATTGGGGAGAACAGTGGTTTCTCTGTAAGATCGTCGGATAA
TGACGATGAAGAGGATAATGAGGCGGAAAGTTCGTTTGAAGGACCTTTAGGAATGGAATCGTTGGAGGAAGTTTTGCCTATCAGGAGAGGAATTTCAAATTTCTACAACG
GAAAATCGAAATCCTTCACGAGCCTGGCAGATGCTTCCTCGGCTTCCTCCATTAAAGACATAGCAAAGCCTGAGAATGCCTTTTCTCGAAAACGGAGAAATCTTCTTGCA
TCGAATCTCATCGCCGGCGGGATATCGAAGCGACCGATCAGTTCAAGTCGAAGCTCGTTGGCTATGGCCATCGCCATGAGCAGTTCTGATCTGAATTCGAGACTACCTCT
GCCGATTCGACCTCCATTGCACCCCAACGGACGAGCATCTCGCAGCAATTTAGGTTCTGCAGTTCCTCTTCTCTGTAAATATCCCACTTGGCGATCATACTCCTTGGCCG
ATATACAGTAGACAGACACTAGGGTTTCTCCATGGCCTCATGAAAAGCTAATCACCATTAAAGACTAACTCGACCTTTCGAATTTCGAAACCGACTTCCATCAAATCGGT
CTCTCCCCAATAATTCATTTTCTTTCAATTTTTTCAAGATCTTACTTCCGCAGCATCTCTGTTATGAGTTTTTTTTGGGTATGTAAATTGAAAAAGGAAAAGAAGAGCAG
TTGATCCATCGAATCGATGTTCGTTTGTTAATAGAATTTGAATAGCTTTTTGCGCATAAAGTTATGGGAAAAATACAGATATTGGCCGGCAGAAAGTTTGGCACAATTGA
AAATGAAGTGTCAATGTCCATTGTTTCGAGAATTGTATGAAGAAAGGTGATTAGATCAGGAATTGAACATAAAGAACATGGAGCCAATTGACTAAAAATGGAATTAGAAG
TATCATGAGAGCATCTCTCTTTTCTGTGAATATATGATCTCTCTTTCTTTTTTCTTTTTTCTTTTTTTAATCTCTAGTTTTAGTACATGAAATCTTCAGTTTTCTAAAAT
TGAAAATTTTGTACCGATGGT
Protein sequence
MSIALESNSRISPSVFSQGGFPSYCSVLNTTGIIPVVRREAAVGDHAAAPAEEVDRCSSSSSSSIGENSGFSVRSSDNDDEEDNEAESSFEGPLGMESLEEVLPIRRGIS
NFYNGKSKSFTSLADASSASSIKDIAKPENAFSRKRRNLLASNLIAGGISKRPISSSRSSLAMAIAMSSSDLNSRLPLPIRPPLHPNGRASRSNLGSAVPLLCKYPTWRS
YSLADIQ
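
The CDS and protein sequences above should agree under the standard genetic code. The following is a minimal Python sketch of that consistency check, not part of the CuGenDB record: the function name `translate` and the choice to render unrecognized codons as `X` are assumptions made for illustration. The two test fragments are the first 30 and last 24 nucleotides of the CDS listed above.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# `translate` is a hypothetical helper, not a CuGenDB or BLAST function.

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence copied as
    wrapped lines can be passed in directly. Codons containing
    characters other than A, C, G, T are rendered as 'X' (assumption).
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the CDS above; the protein starts with MSIALESNSR.
    print(translate("ATGTCAATTGCTTTGGAAAGCAATAGCAGG"))
    # Last 24 nt of the CDS above (in frame); the protein ends with YSLADIQ.
    print(translate("TACTCCTTGGCCGATATACAGTAG"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its wrapped lines, would complete the check.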