CuGenDBv2

ClCG08G005180 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G005180
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Regulator of Vps4 activity in the MVB pathway protein
Genome location: CG_Chr08:16364169..16366694
RNA-Seq Expression: ClCG08G005180
Synteny: ClCG08G005180
Gene Ontology terms: GO:0015031 - protein transport (biological process)
InterPro domains: IPR005061 - Vacuolar protein sorting-associated protein Ist1
                  IPR042277 - Vacuolar protein sorting-associated protein IST1-like
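
The genome location field above follows a `seqid:start..end` pattern. A minimal sketch of splitting it into its parts is shown below; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("CG_Chr08:16364169..16366694")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 2526 positions on CG_Chr08.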


Homology
GenBank top hits
XP_022139373.1 uncharacterized protein LOC111010325 [Momordica charantia] | e value: 4.9e-31 | % identity: 48.31
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M R L +L G R+F AS FR ++ LA+SR+A L  +R  + S         +    H    + AL R    IK QN LDAY++IEGYLN LIER DL   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL
         R+CP ELKEA SG++FAA+RC + P+LQEIKS+LT+RFG+EF   AVELR+ N      + +KLS  +P+ ES+MN+
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL

XP_022139637.1 uncharacterized protein LOC111010490 [Momordica charantia] | e value: 4.3e-35 | % identity: 51.12
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G R+F +SNFR ++  A +R+  L   R  + S +   +   +  D      NLAL RC   IK+QN LDAY +IEGYLN L++RI L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL
         R+CP ELKEAASGVV+AA+RC + P+L+EIKS+LTSRFGREF G AVELR+ +  ++L + +KLS  +P+ ESKMNL
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL

XP_038884436.1 uncharacterized protein LOC120075284 isoform X1 [Benincasa hispida] | e value: 2.9e-31 | % identity: 48.07
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G RSF +S+FR +V+ A +R+  LK     + S     +   +          LAL RC + I++QN LDAY +IEGYLN L+E I L G 
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPN-VSDKLKLAEKLSEMRPSPESKMNLHK
        GR+CP ELKEA S VVFA++R  +  +L++IKSILTS+FG+EF G AVELR+ N V+D   + +KLS  +P+ +SKMNL K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPN-VSDKLKLAEKLSEMRPSPESKMNLHK

XP_038886174.1 uncharacterized protein LOC120076425 isoform X1 [Benincasa hispida] | e value: 2.7e-74 | % identity: 53.03
Query:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL
        MR +L+ +K   GRR FAAS FRE V L +SRIA LKEERSN CSANL++IQ+FIPY      +  AL RCGD IKNQNRLDAYIIIEGYLN L+ERI L
Subjt:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL

Query:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHKSPPPPPPPPPPPPPPPP
        FG+GRDCP ELKEAASGVVFAATRC EI +L++IKSILTS FGR+FIG AVEL + N   + +L EKLSEM+PSPESKMNL  +        PP P    
Subjt:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHKSPPPPPPPPPPPPPPPP

Query:  PPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK----------------------------VVGL
                              E+M  ++  +R WSKRG SEMIE S+++FGSEDSSNS+ SRK+K+K                            +VGL
Subjt:  PPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK----------------------------VVGL

Query:  KISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRSSSKK
          +P  TS DSF  +     KPSS   TTQ +K       + KPTGV NLLSSK   Q NKVVPL  +PTPTNF P+       TRRY TRSSS K
Subjt:  KISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRSSSKK

XP_038886175.1 uncharacterized protein LOC120076425 isoform X2 [Benincasa hispida] | e value: 3.1e-70 | % identity: 48.48
Query:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL
        MR +L+ +K   GRR FAAS FRE V L +SRIA LKEERSN CSANL++IQ+FIPY      +  AL RCGD IKNQNRLDAYIIIEGYLN L+ERI L
Subjt:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL

Query:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHKSPPPPPPPPPPPPPPPP
        FG+GRDCP ELKEAASGVVFAATRC EI +L++IKSILTS FGR+FIG AVEL + N                                           
Subjt:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHKSPPPPPPPPPPPPPPPP

Query:  PPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK----------------------------VVGL
                          P   E+M  ++  +R WSKRG SEMIE S+++FGSEDSSNS+ SRK+K+K                            +VGL
Subjt:  PPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK----------------------------VVGL

Query:  KISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRSSSKK
          +P  TS DSF  +     KPSS   TTQ +K       + KPTGV NLLSSK   Q NKVVPL  +PTPTNF P+       TRRY TRSSS K
Subjt:  KISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRSSSKK

TrEMBL top hits
A0A1S3BD77 uncharacterized protein LOC103488605 | e value: 3.1e-31 | % identity: 47.22
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  + G RSF +S F+ +V+ + +R+  L + R  + S +L  +   +    H     LAL RC   IK+QN LDAY +IEGYLN L+ERI L G 
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
         R+CP ELKEAAS VVFAA+R  +  +L +IKSI TS+FG+EF   AVELR+ N  ++  + +KLS  +P  +SKMNL K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

A0A5B6ZVT4 Uncharacterized protein (Fragment) | e value: 2.4e-31 | % identity: 46.11
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M + L +L G R+F  S F+ +V LAISR+A LK++R ++CS  L  I  F+    H      AL R    I+ QN LD +++IEGY   LIER++L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
         + CP ELKEA S ++FA+TRC E P+LQE++SI TSRFG+EF+G A+ELR+ N    LK+ +KLS  +PS E+++ + K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

A0A6I9SMR2 uncharacterized protein LOC105156397 | e value: 1.2e-30 | % identity: 46.47
Query:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE
        RR F  S F+  V LA+SR+A LK +R  +CS     +  F+    H    + AL R    IK QN LD ++++EGY + LIER++LF   + CP ELKE
Subjt:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE

Query:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
        A S ++FAATRC E P+LQ+I++I TSRFG+EF   AVELR+ N     K+ +KLS   PS E+KM   K
Subjt:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

A0A6J1CEI4 uncharacterized protein LOC111010490 | e value: 2.1e-35 | % identity: 51.12
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G R+F +SNFR ++  A +R+  L   R  + S +   +   +  D      NLAL RC   IK+QN LDAY +IEGYLN L++RI L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL
         R+CP ELKEAASGVV+AA+RC + P+L+EIKS+LTSRFGREF G AVELR+ +  ++L + +KLS  +P+ ESKMNL
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL

A0A6J1CFG6 uncharacterized protein LOC111010325 | e value: 2.4e-31 | % identity: 48.31
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M R L +L G R+F AS FR ++ LA+SR+A L  +R  + S         +    H    + AL R    IK QN LDAY++IEGYLN LIER DL   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL
         R+CP ELKEA SG++FAA+RC + P+LQEIKS+LT+RFG+EF   AVELR+ N      + +KLS  +P+ ES+MN+
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNL

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 1.6e-27 | % identity: 38.33
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M + L +L G RSF  + F+ ++ LA++R++ LK +R  + S  +  +   +    H      A HR    +K+QN LD    I GY    ++RI LF +
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
         RDCP EL EA SG++FAA+R  E P+LQEI+++L SRFG++    ++ELRS N     K+ +KLS   P  E +M   K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 9.5e-17 | % identity: 31.64
Query:  LLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRD
        LL  L  R  F A   +  + LAI+R+  L+ +R  +     + I  F+     P    +A  R    I+  N   AY I+E +   ++ R+ +  + ++
Subjt:  LLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRD

Query:  CPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
        CP EL+EA + ++FAA RC+E+P L +IK++  +++G+EFI  A ELR P+      + EKLS   PS  +++ + K
Subjt:  CPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein | e value: 5.6e-17 | % identity: 30.77
Query:  RSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKEA
        + F A+  + ++KL I RI  ++  R  +     + I   +      +A     H     I+ +  + A  I+E +   +  R+ +    R+CP +LKEA
Subjt:  RSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKEA

Query:  ASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
         S V FAA RC+++ +LQ+++ +  S++G+EF+  A EL+ P+     KL E LS   PSPE+K+ L K
Subjt:  ASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 2.1e-16 | % identity: 29.41
Query:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE
        RR F +S  +   K+A++RI  ++ +R        + I V +      +A     H     I+ QN   A  IIE +   ++ R+ +    + CP +LKE
Subjt:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE

Query:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
          + ++FAA RC+EIP+L +++ I   ++G++F+  A +LR P+      L +KLS   P  E K+ + K
Subjt:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHK
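
The alignments above use the usual BLAST-style layout, in which the middle line repeats a residue where query and subject are identical and shows `+` for a conservative substitution. As a rough illustration of how a percent identity can be derived from that middle line, the sketch below counts identical columns over a short excerpt. The function name `percent_identity` is hypothetical, the excerpt is the first ten columns of the first GenBank alignment as they appear in this record, and the result applies to that excerpt only, not to the full-length values reported for each hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline repeats the query residue.

    Assumption: a letter on the midline marks an identical residue,
    '+' marks a conservative substitution, and a space marks a mismatch
    or gap, as in standard BLAST pairwise output.
    """
    if not query:
        raise ValueError("empty query segment")
    # Trailing spaces on the midline are often lost; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First ten alignment columns of the XP_022139373.1 hit.
excerpt_query = "MRRLLRSLKG"
excerpt_midline = "M R L +L G"
print(percent_identity(excerpt_query, excerpt_midline))
```

Five of the ten excerpt columns are identical, so the sketch reports 50.0 for this fragment.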


Sequences
CDS sequence
ATGAGGAGATTGTTGAGAAGTTTGAAGGGCAGAAGAAGCTTTGCGGCATCAAATTTTAGGGAAGTGGTGAAATTAGCCATCTCTCGCATCGCAGGCCTAAAGGAGGAGCG
CTCTAACAAATGCTCAGCAAATCTCCAATCCATCCAAGTCTTCATTCCATACGATCCCCACCCTTCCGCTCTTAACCTTGCCCTCCATCGCTGTGGGGACTTCATTAAAA
ATCAAAATCGTTTGGATGCTTATATTATCATTGAAGGGTATCTCAATCCCTTGATAGAAAGGATCGATCTCTTTGGAAATGGAAGAGATTGTCCAGCTGAACTGAAGGAG
GCAGCATCAGGTGTGGTATTTGCAGCGACAAGATGTAATGAAATTCCAAAACTTCAAGAGATCAAATCAATTTTGACTTCTCGTTTTGGTAGAGAATTTATTGGCCATGC
TGTTGAACTACGCAGCCCTAACGTTTCAGATAAATTGAAGCTAGCGGAAAAATTGTCAGAAATGAGGCCAAGTCCGGAGAGTAAAATGAATCTTCATAAAAGCCCGCCGC
CGCCGCCGCCGCCGCCGCCGCCGCCGCCGCCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCAGGTATGGCCGAA
CAAATGACTAGAGAGCAAATATGGAGAAGGTCATGGAGCAAGCGAGGCATGAGTGAAATGATAGAGACTTCAAGTAGCGTTTTTGGATCAGAAGACTCATCCAATTCTTC
AGGAAGTAGAAAAAAGAAAAGCAAAGTTGTGGGTTTGAAGATAAGCCCTACTGAAACCAGTTTGGACTCATTTTATACTTCTACACAAACCGGTGTTAAGCCATCAAGTT
CTAATCAGACTACACAAAAGAGTAAAGGTGTGGGTTTCAAAAACACATCCCTTAAACCAACTGGTGTTAAAAACCTATTGAGTTCAAAGCAAACAAAACAAAATAACAAA
GTTGTGCCTTTAAACATAACCCCTACTCCAACCAACTTTGATCCAGAAACAAGACGTTATCAAACTCGTAGTTCGAGCAAAAAGAGATAA
mRNA sequence
ATGAGGAGATTGTTGAGAAGTTTGAAGGGCAGAAGAAGCTTTGCGGCATCAAATTTTAGGGAAGTGGTGAAATTAGCCATCTCTCGCATCGCAGGCCTAAAGGAGGAGCG
CTCTAACAAATGCTCAGCAAATCTCCAATCCATCCAAGTCTTCATTCCATACGATCCCCACCCTTCCGCTCTTAACCTTGCCCTCCATCGCTGTGGGGACTTCATTAAAA
ATCAAAATCGTTTGGATGCTTATATTATCATTGAAGGGTATCTCAATCCCTTGATAGAAAGGATCGATCTCTTTGGAAATGGAAGAGATTGTCCAGCTGAACTGAAGGAG
GCAGCATCAGGTGTGGTATTTGCAGCGACAAGATGTAATGAAATTCCAAAACTTCAAGAGATCAAATCAATTTTGACTTCTCGTTTTGGTAGAGAATTTATTGGCCATGC
TGTTGAACTACGCAGCCCTAACGTTTCAGATAAATTGAAGCTAGCGGAAAAATTGTCAGAAATGAGGCCAAGTCCGGAGAGTAAAATGAATCTTCATAAAAGCCCGCCGC
CGCCGCCGCCGCCGCCGCCGCCGCCGCCGCCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCAGGTATGGCCGAA
CAAATGACTAGAGAGCAAATATGGAGAAGGTCATGGAGCAAGCGAGGCATGAGTGAAATGATAGAGACTTCAAGTAGCGTTTTTGGATCAGAAGACTCATCCAATTCTTC
AGGAAGTAGAAAAAAGAAAAGCAAAGTTGTGGGTTTGAAGATAAGCCCTACTGAAACCAGTTTGGACTCATTTTATACTTCTACACAAACCGGTGTTAAGCCATCAAGTT
CTAATCAGACTACACAAAAGAGTAAAGGTGTGGGTTTCAAAAACACATCCCTTAAACCAACTGGTGTTAAAAACCTATTGAGTTCAAAGCAAACAAAACAAAATAACAAA
GTTGTGCCTTTAAACATAACCCCTACTCCAACCAACTTTGATCCAGAAACAAGACGTTATCAAACTCGTAGTTCGAGCAAAAAGAGATAA
Protein sequence
MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQVFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE
AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLHKSPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPGMAE
QMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSKVVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNK
VVPLNITPTPTNFDPETRRYQTRSSSKKR
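
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 39 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters outside TCAG are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS sequence listed above.
cds_start = "ATGAGGAGATTGTTGAGAAGTTTGAAGGGCAGAAGAAGC"
print(translate(cds_start))
```

The output, MRRLLRSLKGRRS, matches the first 13 residues of the protein sequence in this record. Passing the full CDS, with line breaks left in, should work the same way.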