; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G00100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G00100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110414781
Genome locationClcChr04:291842..293820
RNA-Seq ExpressionClc04G00100
SyntenyClc04G00100
Gene Ontology termsGO:0031326 - regulation of cellular biosynthetic process (biological process)
GO:0072593 - reactive oxygen species metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016667 - oxidoreductase activity, acting on a sulfur group of donors (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043532.1 uncharacterized protein E6C27_scaffold335G00260 [Cucumis melo var. makuwa]5.9e-8668.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF +NI G K SRE+SAFCDYSN+SSS+LT SD SS TPP E LLF T P  H+L LN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TSAPIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIRQVIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

KGN53858.2 hypothetical protein Csa_019116 [Cucumis sativus]4.1e-8768.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF E  I  K SREMSAFC++SNSSSS+LT SD SS TPP E LLFSTSP  H+LKLN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TS PIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIR+VIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

XP_004149405.1 uncharacterized protein LOC101216264 [Cucumis sativus]4.1e-8768.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF E  I  K SREMSAFC++SNSSSS+LT SD SS TPP E LLFSTSP  H+LKLN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TS PIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIR+VIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

XP_016902598.1 PREDICTED: uncharacterized protein LOC103499533 [Cucumis melo]5.9e-8668.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF +NI G K SRE+SAFCDYSN+SSS+LT SD SS TPP E LLF T P  H+L LN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TSAPIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIRQVIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

XP_038882356.1 uncharacterized protein LOC120073620 [Benincasa hispida]5.7e-8969.05Show/hide
Query:  MAEFPSPLERSVASALLLLSTSP--PPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKL
        MA+FPSPLER+VASALLLLST P  PPPPPPSPPPTP SQ +WLF  N+IG K S E+S FCDYSNSSSS+LT S+ESS+T  +EPLLFST P L ELKL
Subjt:  MAEFPSPLERSVASALLLLSTSP--PPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKL

Query:  NSNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSS
                                                                VVRKSRSK+IRISE  NLSS D+VTLSS SASSETTSCLSSSSS
Subjt:  NSNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSS

Query:  VVTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        VVT APIHRLV RAEKKLEMIR AWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
Subjt:  VVTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

TrEMBL top hitse value%identityAlignment
A0A0A0L130 Uncharacterized protein2.0e-8768.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF E  I  K SREMSAFC++SNSSSS+LT SD SS TPP E LLFSTSP  H+LKLN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TS PIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIR+VIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

A0A1S4E302 uncharacterized protein LOC1034995332.8e-8668.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF +NI G K SRE+SAFCDYSN+SSS+LT SD SS TPP E LLF T P  H+L LN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TSAPIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIRQVIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

A0A5A7TJT5 Uncharacterized protein2.8e-8668.49Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS
        MAEFPSPLER+VASALLLLS S      PSPP TPIS+ +WLF +NI G K SRE+SAFCDYSN+SSS+LT SD SS TPP E LLF T P  H+L LN 
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNS

Query:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV
                                                              VVRKSRSKL+RISENRNLSS DEVTLSSGSASSETTSCLSSSSSVV
Subjt:  NLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVV

Query:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        TSAPIHRLVTRAEKKLEMIR AWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIRQVIGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK
Subjt:  TSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK

A0A6J1EL58 uncharacterized protein LOC1114355593.6e-8166.1Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPT-PISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLN
        MAEFP  LER+VASALLLLSTSPPPPP P P P+ PISQ +WLF E I+G K S EMS FCD S S SSVLT SDESS+T  QE LLFSTS    ELKLN
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPT-PISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLN

Query:  SNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSV
                                                               VVRKSRS+ +RIS NRNL+  D+VTLSSGSASSETT CLSSSSSV
Subjt:  SNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSV

Query:  VTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMY
         TSAPI RLVTRAEKKLEMIR AWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQV+GDSPDTSKALRMLLKLEEIKRSGTGGRQDPY+Y
Subjt:  VTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMY

A0A6J1HWL4 uncharacterized protein LOC1114673044.0e-8065.75Show/hide
Query:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPT-PISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLN
        MAEFP  LER+VASALLLLSTSPPPP  P P P+ PISQ +WLF E I+G K S EMS FCD S S SSVLT SDESS+T  QE LLFSTS    ELKLN
Subjt:  MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPT-PISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLN

Query:  SNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSV
                                                               VVRKSRS+ +RIS NRNL+  D+VTLSSGSASSETT CLSSSSSV
Subjt:  SNLFLFENAPLSDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSV

Query:  VTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMY
         TSAPI RLVTRAEKKLEMIR AWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQV+GDSPDTSKALRMLLKLEEIKRSGTGGRQDPY+Y
Subjt:  VTSAPIHRLVTRAEKKLEMIREAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G57440.1 unknown protein3.2e-2148Show/hide
Query:  QLHGFQVVRKSRSKLIRISEN---RNLSSRDEV-TLSSGSASSETTSCLSSSSSVVTSAPIHRLVTRAEKKLEMIREAWR--KKQVASAHMRRRAEAILS
        +L  F+  RK RS++I  S N    +L        LS+ S  S+  SCLS+ SS V+S    R+  R +K  E +R   +  K+   S+ +RRRA+ IL 
Subjt:  QLHGFQVVRKSRSKLIRISEN---RNLSSRDEV-TLSSGSASSETTSCLSSSSSVVTSAPIHRLVTRAEKKLEMIREAWR--KKQVASAHMRRRAEAILS

Query:  YLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK
        +LS   SSEV IRQ++GDSPDTSKALRMLLK+EE+KR GTGGR DP++YK
Subjt:  YLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAATTCCCTTCCCCTCTTGAACGCAGCGTTGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCGCCTCCTCCTCCGCCCTCTCCTCCACCTACACCCATTTC
TCAACACCAATGGCTGTTTGGGGAGAACATTATTGGAGCAAAATTCTCCAGAGAGATGTCCGCGTTTTGTGATTATTCAAACTCTTCCTCTTCGGTACTCACTGGATCAG
ATGAATCGTCCGATACTCCCCCGCAGGAACCCTTGCTGTTCTCTACTTCGCCTCGTCTCCACGAGCTAAAGCTTAATTCGAATTTATTTTTGTTCGAAAATGCTCCGCTA
TCGGATCCCTCCTGTTGTTGTTTCTTCTTCGGTATCAACTTCAATCTGATCAAAATCTTGAAAGCTCTACTTCCTCCGATTCTCCTGATCACTTACCTGTCTGTCACTGA
ATTGCAATTACATGGATTCCAAGTCGTGAGAAAGAGTCGTTCGAAGCTAATAAGGATTTCCGAGAACCGGAATCTCAGTTCTAGAGACGAGGTTACTTTGTCTTCAGGCT
CAGCGTCCTCGGAGACGACTTCTTGTTTGTCAAGCAGTTCAAGCGTGGTCACAAGCGCGCCGATCCATCGGCTGGTTACGAGAGCAGAGAAGAAGTTAGAAATGATTCGT
GAGGCGTGGAGGAAAAAGCAGGTGGCATCGGCTCATATGCGGCGGCGTGCGGAAGCCATTCTGAGTTATCTCTCCGGCGGGTGTTCCTCTGAGGTGAAGATACGGCAAGT
GATTGGTGACAGCCCTGACACAAGCAAGGCTCTCAGAATGCTGTTGAAACTGGAAGAGATCAAAAGATCCGGAACAGGTGGACGTCAAGATCCCTATATGTACAAGAGAG
TCTCAGAAAAATCCCAACTGATTGGGAAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAATTCCCTTCCCCTCTTGAACGCAGCGTTGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCGCCTCCTCCTCCGCCCTCTCCTCCACCTACACCCATTTC
TCAACACCAATGGCTGTTTGGGGAGAACATTATTGGAGCAAAATTCTCCAGAGAGATGTCCGCGTTTTGTGATTATTCAAACTCTTCCTCTTCGGTACTCACTGGATCAG
ATGAATCGTCCGATACTCCCCCGCAGGAACCCTTGCTGTTCTCTACTTCGCCTCGTCTCCACGAGCTAAAGCTTAATTCGAATTTATTTTTGTTCGAAAATGCTCCGCTA
TCGGATCCCTCCTGTTGTTGTTTCTTCTTCGGTATCAACTTCAATCTGATCAAAATCTTGAAAGCTCTACTTCCTCCGATTCTCCTGATCACTTACCTGTCTGTCACTGA
ATTGCAATTACATGGATTCCAAGTCGTGAGAAAGAGTCGTTCGAAGCTAATAAGGATTTCCGAGAACCGGAATCTCAGTTCTAGAGACGAGGTTACTTTGTCTTCAGGCT
CAGCGTCCTCGGAGACGACTTCTTGTTTGTCAAGCAGTTCAAGCGTGGTCACAAGCGCGCCGATCCATCGGCTGGTTACGAGAGCAGAGAAGAAGTTAGAAATGATTCGT
GAGGCGTGGAGGAAAAAGCAGGTGGCATCGGCTCATATGCGGCGGCGTGCGGAAGCCATTCTGAGTTATCTCTCCGGCGGGTGTTCCTCTGAGGTGAAGATACGGCAAGT
GATTGGTGACAGCCCTGACACAAGCAAGGCTCTCAGAATGCTGTTGAAACTGGAAGAGATCAAAAGATCCGGAACAGGTGGACGTCAAGATCCCTATATGTACAAGAGAG
TCTCAGAAAAATCCCAACTGATTGGGAAGTTTTGA
Protein sequenceShow/hide protein sequence
MAEFPSPLERSVASALLLLSTSPPPPPPPSPPPTPISQHQWLFGENIIGAKFSREMSAFCDYSNSSSSVLTGSDESSDTPPQEPLLFSTSPRLHELKLNSNLFLFENAPL
SDPSCCCFFFGINFNLIKILKALLPPILLITYLSVTELQLHGFQVVRKSRSKLIRISENRNLSSRDEVTLSSGSASSETTSCLSSSSSVVTSAPIHRLVTRAEKKLEMIR
EAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKRVSEKSQLIGKF