CuGenDBv2

ClCG08G012800 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G012800
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Protein of unknown function (DUF1997)
Genome location: CG_Chr08:25693723..25697534
RNA-Seq Expression: ClCG08G012800
Synteny: ClCG08G012800
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
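
The genome location field follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is given here; the names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'CG_Chr08:25693723..25697534'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("CG_Chr08:25693723..25697534")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region spans 3812 bases on CG_Chr08.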


Homology
GenBank top hits    e value    % identity    Alignment
KAA0064985.1 DUF1997 family protein [Cucumis melo var. makuwa]    4.5e-119    91.32
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MV+LTTKWL QG GSFPLL  HKFSSPRKKFE IK SKATNS+TNTK+ANLSV RREKI+LP+YS G GRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVERQN HFSA+MINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WI+QQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
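
In these BLAST-style alignments, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch. A minimal Python sketch for counting those columns in one block follows; `midline_counts` is a hypothetical helper, and the strings are copied from the final block of the hit above.

```python
from typing import Tuple


def midline_counts(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one alignment midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives are counted as identities plus '+' columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Final block of the KAA0064985.1 alignment (query line and midline).
query = "NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS"
midline = "NPGNLMLQALLDNLVPLLLRQL+QDYE WI+QQLDHSQLSIS"

identities, positives = midline_counts(midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
```

Summing these counts over all blocks of a hit and dividing by the total alignment length is the usual way a percent identity is derived. Whether CuGenDB computes its reported value exactly this way is an assumption.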

XP_004138757.1 uncharacterized protein LOC101204116 [Cucumis sativus]    1.7e-118    91.74
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MVVLTTK L QG GSFPLL  HKFSSPRKKFE  KLSKATNS+TNTK+ANLSV +REKI+LP+YS G GRTYHI+EFLNHPSGIEAMLNKNALKSFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQN+HFSALMINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WISQQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS

XP_008445108.1 PREDICTED: uncharacterized protein LOC103488246 [Cucumis melo]    1.3e-118    91.32
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MV+LTTKWL QG GSFPLL  HKFSSPRKKFE IK SKATNS+TNTK+ANLSV RREKI+LP YS G GRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVERQN HFSA+MINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WI+QQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS

XP_022929609.1 uncharacterized protein LOC111436145 [Cucurbita moschata]    2.2e-118    91.36
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MVVL TKWL QGKGSFPLLATHKFSSPRKKFE IK+SKATNS+TNTKRANLSVTR+EKIKLP+YSG GGRTYHIREFLNHPSGIEAM+NKNALKSFQ LD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTL KLQLLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVERQN+HFSALMINHLTWD+V SNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WIS QQ+DHS +SIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS

XP_038885685.1 uncharacterized protein LOC120075989 isoform X1 [Benincasa hispida]    3.9e-123    94.21
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MVVLTTKWL QGKGSFPLLA HKFSSPRKKFEVIKLSKATNS+TNTKRANLSVTRREKI+LP+YS   GR YHIREFLNHPSGIEAMLNKNAL+SFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVERQN+HFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WISQQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS

TrEMBL top hits    e value    % identity    Alignment
A0A0A0LSA8 Uncharacterized protein    1.1e-134    84.16
Query:  MELPLNLLFVSVKDQWHFLNSPSQNSLLSVADFRIPVAVFRRPIDNRRSIKIHSK-----RMVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKA
        M L L     S    +HFLNS SQ+SL S+ADFR+P A  RRPID RR+IK HSK     RMVVLTTK L QG GSFPLL  HKFSSPRKKFE  KLSKA
Subjt:  MELPLNLLFVSVKDQWHFLNSPSQNSLLSVADFRIPVAVFRRPIDNRRSIKIHSK-----RMVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKA

Query:  TNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLS
        TNS+TNTK+ANLSV +REKI+LP+YS G GRTYHI+EFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLS
Subjt:  TNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLS

Query:  CKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQL
        CKFEGSELVERQN+HFSALMINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQL+QDYE WISQQLDHSQL
Subjt:  CKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQL

Query:  SIS
        SIS
Subjt:  SIS

A0A1S3BBW5 uncharacterized protein LOC103488246    6.4e-119    91.32
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MV+LTTKWL QG GSFPLL  HKFSSPRKKFE IK SKATNS+TNTK+ANLSV RREKI+LP YS G GRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVERQN HFSA+MINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WI+QQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS

A0A5A7VGG9 DUF1997 family protein    2.2e-119    91.32
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MV+LTTKWL QG GSFPLL  HKFSSPRKKFE IK SKATNS+TNTK+ANLSV RREKI+LP+YS G GRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTLPKLQLLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVERQN HFSA+MINHLTWDT+DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WI+QQLDHSQLSIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHSQLSIS

A0A6J1EP97 uncharacterized protein LOC111436145    1.1e-118    91.36
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MVVL TKWL QGKGSFPLLATHKFSSPRKKFE IK+SKATNS+TNTKRANLSVTR+EKIKLP+YSG GGRTYHIREFLNHPSGIEAM+NKNALKSFQ LD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTL KLQLLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVERQN+HFSALMINHLTWD+V SNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WIS QQ+DHS +SIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS

A0A6J1K9Q3 uncharacterized protein LOC111492370    1.4e-118    91.36
Query:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD
        MVVL TKWL QGKGSFPLLATHKFSSPRKKFE IKLSKATNS+TNTKRANLSVTR+EKIKLP+YSG GGRTYHIREFLNHPSGIEAM+NKNALKSFQ LD
Subjt:  MVVLTTKWLAQGKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD

Query:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE
        ANTYRCTL KLQLLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVERQN+HFSALMINHLTWD+V SNSYLEVDVKL LSLEIYTLPFTLMPTAAVE
Subjt:  ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVE

Query:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS
        NPGNLMLQALLDNLVPLLLRQL+QDYE WIS QQ+DHS +SIS
Subjt:  NPGNLMLQALLDNLVPLLLRQLLQDYENWIS-QQLDHSQLSIS

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT4G31115.1 Protein of unknown function (DUF1997)    1.4e-54    48.28
Query:  AQGKGSFPLLATHKFSSPRKKFEV--IKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD--ANTYR
        + G     LL ++  + PR++  +  +   K     ++ K+AN+S +R+++IKL      G +     EFL HPSG+EA++N  AL+S+ L+D   +TYR
Subjt:  AQGKGSFPLLATHKFSSPRKKFEV--IKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD--ANTYR

Query:  CTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNL
        CTLPK+QL++FE  P L LRV PT ED TVE+LSCK EGSEL+E Q++ FSA+M N +TW+      +LEVDV+LN++LEI T PFT++P +AVE PGNL
Subjt:  CTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNL

Query:  MLQALLDNLVPLLLRQLLQDYENWISQQLDHS
        ++Q L+D LVPLLL+QLL+DY+ WI +Q  +S
Subjt:  MLQALLDNLVPLLLRQLLQDYENWISQQLDHS

AT4G31115.2 Protein of unknown function (DUF1997)    5.4e-54    54.08
Query:  TNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCK
        ++ K+AN+S +R+++IKL      G +     EFL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK+QL++FE  P L LRV PT ED TVE+LSCK
Subjt:  TNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLD--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFTVEMLSCK

Query:  FEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHS
         EGSEL+E Q++ FSA+M N +TW+      +LEVDV+LN++LEI T PFT++P +AVE PGNL++Q L+D LVPLLL+QLL+DY+ WI +Q  +S
Subjt:  FEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWISQQLDHS

AT5G04440.1 Protein of unknown function (DUF1997)    7.6e-16    29
Query:  SKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYH--IREFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFT
        S AT+S T+  R + S T + +           R     + E+++ P+   ++L+   +   + +D NT+RC +   +  NFE  P L +RV        
Subjt:  SKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYH--IREFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDEDFT

Query:  VEMLSCKFEGSELVERQNKHFSALMINHLTWDTV---DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWIS
        +++LSCK EGS +V  QN  F A M+N ++ D+     S   +  D  + +++EI    F + P  A+E  G  +L  +L  ++P  L QL +DY  W S
Subjt:  VEMLSCKFEGSELVERQNKHFSALMINHLTWDTV---DSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWIS


Sequences
CDS sequence
ATGGTGGACATGTTGGTTGTCACCGGGTTTGCCCCGATGGGCCGACCGTATGAGAGTAAAATAAGGGATTCCACTTACGATCAACTACCACGTGTTTCCCATAATTCAGC
GTATCATCATCATCTGGATATGGAGCTTCCCCTCAATTTGCTTTTCGTTTCTGTTAAAGATCAGTGGCATTTCCTTAATTCTCCCTCACAAAATTCTTTACTATCCGTCG
CAGACTTCAGAATTCCGGTCGCCGTTTTTCGCCGTCCGATTGACAATCGGAGGAGCATAAAAATTCACTCCAAGAGGATGGTGGTTTTAACTACTAAATGGCTTGCGCAA
GGAAAAGGCTCGTTTCCATTGCTTGCTACACACAAGTTTAGCTCACCGAGGAAGAAATTTGAAGTGATTAAGCTATCCAAGGCCACCAATTCTGACACTAATACCAAGAG
GGCAAACTTATCTGTCACAAGAAGGGAGAAGATCAAATTGCCCAATTACAGTGGCGGGGGAGGCAGGACATATCATATCAGAGAATTCCTGAATCACCCTTCAGGAATTG
AAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTCCAGTTACTTGATGCTAACACATACAGATGCACTCTGCCTAAATTGCAACTTTTGAACTTTGAAGCTGCCCCTACA
CTTGATTTACGAGTGATCCCGACAGACGAAGATTTTACCGTTGAGATGCTTTCTTGCAAGTTTGAAGGTTCAGAATTGGTGGAACGCCAAAACAAACATTTTTCAGCCTT
GATGATTAATCACTTGACATGGGACACAGTTGATTCAAATTCATATCTGGAAGTTGATGTGAAGTTGAATTTGTCTCTGGAGATTTATACCCTTCCCTTCACTCTGATGC
CTACTGCAGCAGTCGAGAATCCAGGGAATTTGATGCTTCAAGCTCTGTTGGACAACCTTGTACCTCTGCTGCTGCGGCAATTATTGCAAGATTATGAAAACTGGATCAGT
CAGCAGCTAGATCATTCCCAACTCTCCATCTCTTGA
mRNA sequence
ATGGTGGACATGTTGGTTGTCACCGGGTTTGCCCCGATGGGCCGACCGTATGAGAGTAAAATAAGGGATTCCACTTACGATCAACTACCACGTGTTTCCCATAATTCAGC
GTATCATCATCATCTGGATATGGAGCTTCCCCTCAATTTGCTTTTCGTTTCTGTTAAAGATCAGTGGCATTTCCTTAATTCTCCCTCACAAAATTCTTTACTATCCGTCG
CAGACTTCAGAATTCCGGTCGCCGTTTTTCGCCGTCCGATTGACAATCGGAGGAGCATAAAAATTCACTCCAAGAGGATGGTGGTTTTAACTACTAAATGGCTTGCGCAA
GGAAAAGGCTCGTTTCCATTGCTTGCTACACACAAGTTTAGCTCACCGAGGAAGAAATTTGAAGTGATTAAGCTATCCAAGGCCACCAATTCTGACACTAATACCAAGAG
GGCAAACTTATCTGTCACAAGAAGGGAGAAGATCAAATTGCCCAATTACAGTGGCGGGGGAGGCAGGACATATCATATCAGAGAATTCCTGAATCACCCTTCAGGAATTG
AAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTCCAGTTACTTGATGCTAACACATACAGATGCACTCTGCCTAAATTGCAACTTTTGAACTTTGAAGCTGCCCCTACA
CTTGATTTACGAGTGATCCCGACAGACGAAGATTTTACCGTTGAGATGCTTTCTTGCAAGTTTGAAGGTTCAGAATTGGTGGAACGCCAAAACAAACATTTTTCAGCCTT
GATGATTAATCACTTGACATGGGACACAGTTGATTCAAATTCATATCTGGAAGTTGATGTGAAGTTGAATTTGTCTCTGGAGATTTATACCCTTCCCTTCACTCTGATGC
CTACTGCAGCAGTCGAGAATCCAGGGAATTTGATGCTTCAAGCTCTGTTGGACAACCTTGTACCTCTGCTGCTGCGGCAATTATTGCAAGATTATGAAAACTGGATCAGT
CAGCAGCTAGATCATTCCCAACTCTCCATCTCTTGATGCAATGTGAAGTGAAAAGATAGTCTTCCATTTTCAAGCTACTAGAGAAGCTTAGATTAAGAGAAGGCTTTTTC
TCTTTTTAGTGTCTGGTTTCACTGTACATTTCTACATGCTACAATGTTTACTAAGCTTGGGCTCCATCTCCCATGAGGCTATGCCTACAGAAGGGAAGTAAAGTAGTCAG
TTCAAGGCACAAGATAGTGAAAACTGAAAAGTTTAGAGACCTTGTTTGGTTTGTAATTCGAAAAACAAGTAATTTTATGAAAG
Protein sequence
MVDMLVVTGFAPMGRPYESKIRDSTYDQLPRVSHNSAYHHHLDMELPLNLLFVSVKDQWHFLNSPSQNSLLSVADFRIPVAVFRRPIDNRRSIKIHSKRMVVLTTKWLAQ
GKGSFPLLATHKFSSPRKKFEVIKLSKATNSDTNTKRANLSVTRREKIKLPNYSGGGGRTYHIREFLNHPSGIEAMLNKNALKSFQLLDANTYRCTLPKLQLLNFEAAPT
LDLRVIPTDEDFTVEMLSCKFEGSELVERQNKHFSALMINHLTWDTVDSNSYLEVDVKLNLSLEIYTLPFTLMPTAAVENPGNLMLQALLDNLVPLLLRQLLQDYENWIS
QQLDHSQLSIS
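
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch that assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are hypothetical names, and only the first 24 and final 36 nucleotides of the CDS listed above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and final 36 nucleotides of the CDS sequence listed above.
cds_start = "ATGGTGGACATGTTGGTTGTCACC"
cds_end = "CAGCAGCTAGATCATTCCCAACTCTCCATCTCTTGA"

print(translate(cds_start))  # expected to match the start of the protein
print(translate(cds_end))    # expected to match the end, plus a stop '*'
```

The two fragments translate to `MVDMLVVT` and `QQLDHSQLSIS*`, which agree with the first and last residues of the protein sequence listed above.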