; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04970 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04970
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationClcChr08:15304645..15307180
RNA-Seq ExpressionClc08G04970
SyntenyClc08G04970
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445649.1 PREDICTED: uncharacterized protein LOC103488605 [Cucumis melo]8.2e-3448.39Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  + G RSF +S F+ +V+ + +R+  L + R  + S +L  + + +    H     LAL RC   IK+QN LDAY +IEGYLN L+ERI L G 
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHK
         R+CP ELKEAAS VVFAA+R  +  +L +IKSI TS+FG+EF   AVELR+ N  ++  + +KLS  +P  +SKMNLLK IAS K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHK

XP_022139637.1 uncharacterized protein LOC111010490 [Momordica charantia]7.2e-3852.17Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G R+F +SNFR ++  A +R+  L   R  + S +   +   +  D      NLAL RC   IK+QN LDAY +IEGYLN L++RI L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS
         R+CP ELKEAASGVV+AA+RC + P+L+EIKS+LTSRFGREF G AVELR+ +  ++L + +KLS  +P+ ESKMNLL+AIAS
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS

XP_038884436.1 uncharacterized protein LOC120075284 isoform X1 [Benincasa hispida]1.1e-3349.19Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G RSF +S+FR +V+ A +R+  LK     + S     +   +          LAL RC + I++QN LDAY +IEGYLN L+E I L G 
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPN-VSDKLKLAEKLSEMRPSPESKMNLLKAIAS
        GR+CP ELKEA S VVFA++R  +  +L++IKSILTS+FG+EF G AVELR+ N V+D   + +KLS  +P+ +SKMNLLK IAS
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPN-VSDKLKLAEKLSEMRPSPESKMNLLKAIAS

XP_038886174.1 uncharacterized protein LOC120076425 isoform X1 [Benincasa hispida]4.7e-8253.47Show/hide
Query:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL
        MR +L+ +K   GRR FAAS FRE V L +SRIA LKEERSN CSANL++IQ FIPY      +  AL RCGD IKNQNRLDAYIIIEGYLN L+ERI L
Subjt:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL

Query:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKSPPPPPPPPPP
        FG+GRDCP ELKEAASGVVFAATRC EI +L++IKSILTS FGR+FIG AVEL + N   + +L EKLSEM+PSPESKMNLL AIAS K+  PP P    
Subjt:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKSPPPPPPPPPP

Query:  LPPPPPPPPPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK------------------------
                                      E+M  ++  +R WSKRG SEMIE S+++FGSEDSSNS+ SRK+K+K                        
Subjt:  LPPPPPPPPPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK------------------------

Query:  ----VVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRS
            +VGL  +P  TS DSF  +     KPSS   TTQ +K       + KPTGV NLLSSK   Q NKVVPL  +PTPTNF P+       TRRY TRS
Subjt:  ----VVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRS

Query:  SSKK
        SS K
Subjt:  SSKK

XP_038886175.1 uncharacterized protein LOC120076425 isoform X2 [Benincasa hispida]6.0e-6947.52Show/hide
Query:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL
        MR +L+ +K   GRR FAAS FRE V L +SRIA LKEERSN CSANL++IQ FIPY      +  AL RCGD IKNQNRLDAYIIIEGYLN L+ERI L
Subjt:  MRRLLRSLK---GRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDL

Query:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKSPPPPPPPPPP
        FG+GRDCP ELKEAASGVVFAATRC EI +L++IKSILTS FGR+FIG AVEL + N                                           
Subjt:  FGNGRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKSPPPPPPPPPP

Query:  LPPPPPPPPPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK------------------------
                                  P   E+M  ++  +R WSKRG SEMIE S+++FGSEDSSNS+ SRK+K+K                        
Subjt:  LPPPPPPPPPPPPPPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSK------------------------

Query:  ----VVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRS
            +VGL  +P  TS DSF  +     KPSS   TTQ +K       + KPTGV NLLSSK   Q NKVVPL  +PTPTNF P+       TRRY TRS
Subjt:  ----VVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFKNTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPE-------TRRYQTRS

Query:  SSKK
        SS K
Subjt:  SSKK

TrEMBL top hitse value%identityAlignment
A0A1S3BD77 uncharacterized protein LOC1034886054.0e-3448.39Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  + G RSF +S F+ +V+ + +R+  L + R  + S +L  + + +    H     LAL RC   IK+QN LDAY +IEGYLN L+ERI L G 
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHK
         R+CP ELKEAAS VVFAA+R  +  +L +IKSI TS+FG+EF   AVELR+ N  ++  + +KLS  +P  +SKMNLLK IAS K
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHK

A0A5B6ZVT4 Uncharacterized protein (Fragment)1.5e-3347.28Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M + L +L G R+F  S F+ +V LAISR+A LK++R ++CS  L  I  F+    H      AL R    I+ QN LD +++IEGY   LIER++L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS
         + CP ELKEA S ++FA+TRC E P+LQE++SI TSRFG+EF+G A+ELR+ N    LK+ +KLS  +PS E+++ +LK IAS
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS

A0A6I9SMR2 uncharacterized protein LOC1051563977.5e-3347.7Show/hide
Query:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE
        RR F  S F+  V LA+SR+A LK +R  +CS     +  F+    H    + AL R    IK QN LD ++++EGY + LIER++LF   + CP ELKE
Subjt:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE

Query:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS
        A S ++FAATRC E P+LQ+I++I TSRFG+EF   AVELR+ N     K+ +KLS   PS E+KM  LK IAS
Subjt:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS

A0A6J1CEI4 uncharacterized protein LOC1110104903.5e-3852.17Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        MRR+L  L G R+F +SNFR ++  A +R+  L   R  + S +   +   +  D      NLAL RC   IK+QN LDAY +IEGYLN L++RI L   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS
         R+CP ELKEAASGVV+AA+RC + P+L+EIKS+LTSRFGREF G AVELR+ +  ++L + +KLS  +P+ ESKMNLL+AIAS
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIAS

A0A6J1CFG6 uncharacterized protein LOC1110103255.2e-3448.66Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M R L +L G R+F AS FR ++ LA+SR+A L  +R  + S         +    H    + AL R    IK QN LDAY++IEGYLN LIER DL   
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS
         R+CP ELKEA SG++FAA+RC + P+LQEIKS+LT+RFG+EF   AVELR+ N      + +KLS  +P+ ES+MN+L+AIAS  +
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein1.7e-2938.5Show/hide
Query:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN
        M + L +L G RSF  + F+ ++ LA++R++ LK +R  + S  +  +   +    H      A HR    +K+QN LD    I GY    ++RI LF +
Subjt:  MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGN

Query:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS
         RDCP EL EA SG++FAA+R  E P+LQEI+++L SRFG++    ++ELRS N     K+ +KLS   P  E +M  LK IA+  +
Subjt:  GRDCPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein1.0e-1832.61Show/hide
Query:  LLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRD
        LL  L  R  F A   +  + LAI+R+  L+ +R  +     + I  F+     P    +A  R    I+  N   AY I+E +   ++ R+ +  + ++
Subjt:  LLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRD

Query:  CPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS
        CP EL+EA + ++FAA RC+E+P L +IK++  +++G+EFI  A ELR P+      + EKLS   PS  +++ +LK IA   S
Subjt:  CPAELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKS

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein1.0e-1831.98Show/hide
Query:  RSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKEA
        + F A+  + ++KL I RI  ++  R  +     + I   +      +A     H     I+ +  + A  I+E +   +  R+ +    R+CP +LKEA
Subjt:  RSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKEA

Query:  ASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIA
         S V FAA RC+++ +LQ+++ +  S++G+EF+  A EL+ P+     KL E LS   PSPE+K+ LLK IA
Subjt:  ASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIA

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein3.3e-1729.48Show/hide
Query:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE
        RR F +S  +   K+A++RI  ++ +R        + I   +      +A     H     I+ QN   A  IIE +   ++ R+ +    + CP +LKE
Subjt:  RRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCPAELKE

Query:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIA
          + ++FAA RC+EIP+L +++ I   ++G++F+  A +LR P+      L +KLS   P  E K+ ++K IA
Subjt:  AASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGATTGTTGAGAAGTTTGAAGGGCAGAAGAAGCTTTGCGGCATCAAATTTTAGGGAAGTGGTGAAATTAGCCATCTCTCGCATCGCAGGCCTAAAGGAG
GAGCGCTCTAACAAATGCTCAGCAAATCTCCAATCCATCCAAGCCTTCATTCCATACGATCCCCACCCTTCCGCTCTTAACCTTGCCCTCCATCGCTGTGGGGAC
TTCATTAAAAATCAAAATCGTTTGGATGCTTATATTATCATTGAAGGGTATCTCAATCCCTTGATAGAAAGGATCGATCTCTTTGGAAATGGAAGAGATTGTCCA
GCTGAACTGAAGGAGGCAGCATCAGGTGTGGTATTTGCAGCGACAAGATGTAATGAAATTCCAAAACTTCAAGAGATCAAATCAATTTTGACTTCTCGTTTTGGT
AGAGAATTTATTGGCCATGCTGTTGAACTACGCAGCCCTAACGTTTCAGATAAATTGAAGCTAGCGGAAAAATTGTCAGAAATGAGGCCAAGTCCGGAGAGTAAA
ATGAATCTTCTAAAAGCCATTGCTTCTCATAAAAGCCCGCCGCCGCCGCCGCCGCCGCCGCCGCCGCTGCCACCACCACCACCACCACCACCACCACCACCACCA
CCACCACCACCACCACCACCACCACCACCACCACCACCACCAGGTATGGCCGAACAAATGACTAGAGAGCAAATATGGAGAAGGTCATGGAGCAAGCGAGGCATG
AGTGAAATGATAGAGACTTCAAGTAGCGTTTTTGGATCAGAAGACTCATCCAATTCTTCAGGAAGTAGAAAAAAGAAAAGCAAAGTTGTGGGTTTGAAGATAAGC
CCTACTGAAACCAGTTTGGACTCATTTTATACTTCTACACAAACCGGTGTTAAGCCATCAAGTTCTAATCAGACTACACAAAAGAGTAAAGGTGTGGGTTTCAAA
AACACATCCCTTAAACCAACTGGTGTTAAAAACCTATTGAGTTCAAAGCAAACAAAACAAAATAACAAAGTTGTGCCTTTAAACATAACCCCTACTCCAACCAAC
TTTGATCCAGAAACAAGACGTTATCAAACTCGTAGTTCGAGCAAAAAGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGATTGTTGAGAAGTTTGAAGGGCAGAAGAAGCTTTGCGGCATCAAATTTTAGGGAAGTGGTGAAATTAGCCATCTCTCGCATCGCAGGCCTAAAGGAG
GAGCGCTCTAACAAATGCTCAGCAAATCTCCAATCCATCCAAGCCTTCATTCCATACGATCCCCACCCTTCCGCTCTTAACCTTGCCCTCCATCGCTGTGGGGAC
TTCATTAAAAATCAAAATCGTTTGGATGCTTATATTATCATTGAAGGGTATCTCAATCCCTTGATAGAAAGGATCGATCTCTTTGGAAATGGAAGAGATTGTCCA
GCTGAACTGAAGGAGGCAGCATCAGGTGTGGTATTTGCAGCGACAAGATGTAATGAAATTCCAAAACTTCAAGAGATCAAATCAATTTTGACTTCTCGTTTTGGT
AGAGAATTTATTGGCCATGCTGTTGAACTACGCAGCCCTAACGTTTCAGATAAATTGAAGCTAGCGGAAAAATTGTCAGAAATGAGGCCAAGTCCGGAGAGTAAA
ATGAATCTTCTAAAAGCCATTGCTTCTCATAAAAGCCCGCCGCCGCCGCCGCCGCCGCCGCCGCCGCTGCCACCACCACCACCACCACCACCACCACCACCACCA
CCACCACCACCACCACCACCACCACCACCACCACCACCACCAGGTATGGCCGAACAAATGACTAGAGAGCAAATATGGAGAAGGTCATGGAGCAAGCGAGGCATG
AGTGAAATGATAGAGACTTCAAGTAGCGTTTTTGGATCAGAAGACTCATCCAATTCTTCAGGAAGTAGAAAAAAGAAAAGCAAAGTTGTGGGTTTGAAGATAAGC
CCTACTGAAACCAGTTTGGACTCATTTTATACTTCTACACAAACCGGTGTTAAGCCATCAAGTTCTAATCAGACTACACAAAAGAGTAAAGGTGTGGGTTTCAAA
AACACATCCCTTAAACCAACTGGTGTTAAAAACCTATTGAGTTCAAAGCAAACAAAACAAAATAACAAAGTTGTGCCTTTAAACATAACCCCTACTCCAACCAAC
TTTGATCCAGAAACAAGACGTTATCAAACTCGTAGTTCGAGCAAAAAGAGATAA
Protein sequenceShow/hide protein sequence
MRRLLRSLKGRRSFAASNFREVVKLAISRIAGLKEERSNKCSANLQSIQAFIPYDPHPSALNLALHRCGDFIKNQNRLDAYIIIEGYLNPLIERIDLFGNGRDCP
AELKEAASGVVFAATRCNEIPKLQEIKSILTSRFGREFIGHAVELRSPNVSDKLKLAEKLSEMRPSPESKMNLLKAIASHKSPPPPPPPPPPLPPPPPPPPPPPP
PPPPPPPPPPPPPPGMAEQMTREQIWRRSWSKRGMSEMIETSSSVFGSEDSSNSSGSRKKKSKVVGLKISPTETSLDSFYTSTQTGVKPSSSNQTTQKSKGVGFK
NTSLKPTGVKNLLSSKQTKQNNKVVPLNITPTPTNFDPETRRYQTRSSSKKR