; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G10580 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G10580
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAlpha-amylase
Genome locationClcChr04:24104400..24119169
RNA-Seq ExpressionClc04G10580
SyntenyClc04G10580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595177.1 hypothetical protein SDJN03_11730, partial [Cucurbita argyrosperma subsp. sororia]3.2e-8884.34Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
        M AI++STK  IR+T VNSRP ILS MASHPSLVFV WNTR RQ SK S V SPC RN  L PKSS++N+DI+PSED+PEDGVSLGTMKLPSDIDIA+F+
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ

Query:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEIS
        VLLFQWANSLCQGANLPLPVPL+VDKI SGVRLGFI+IGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEI+
Subjt:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEIS

XP_004143327.1 uncharacterized protein LOC101214488 [Cucumis sativus]1.1e-9389.05Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF
        MDAISYST QFIRY NVNSRPC+ S M SH PSLVF A NTR+RQIS FS+V  PCHR  KLVPKSSD+N+DIVPSEDDPEDGVSLGTMKLP D DIARF
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF

Query:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR
        QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLL ALKKSVEISR
Subjt:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR

Query:  V
        V
Subjt:  V

XP_008462584.1 PREDICTED: uncharacterized protein LOC103500909 [Cucumis melo]1.8e-9489.55Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF
        MDAISYST QF R+ NVNSRPC+LSPM SH PSLVF A NTRHRQIS FS+   PCHR S LVPKSSD+N+DIVPSEDDPEDGVSLGTMKLP D DIARF
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF

Query:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR
        QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLL ALKKSVEISR
Subjt:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR

Query:  V
        V
Subjt:  V

XP_022972616.1 uncharacterized protein LOC111471157 [Cucurbita maxima]4.2e-8884Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
        M AI++STK  IR+T VNSRP ILS MASHPSLVFV WNTR RQ SK S V SPC RN  L PKSS++N+DI PSED+PEDGVSLGTMKLPSDIDIA+F+
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ

Query:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV
        VLLFQWANSLCQGANLPLPVPL+VDKI SGVRLGFI+IGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEI+ V
Subjt:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV

XP_038881940.1 uncharacterized protein LOC120073272 [Benincasa hispida]2.4e-9992Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
        M+AIS+STKQFI YT VNSRPCILSPMASHPSLVFVAWNTRHRQIS+FS++ SPC RN+KLVPKSSD+N DIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ

Query:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV
        VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATA+T+PIFRAIRNGRL+DQSPPGEPRIMRSLL ALKKSVEISRV
Subjt:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV

TrEMBL top hitse value%identityAlignment
A0A0A0KFD4 Uncharacterized protein5.6e-9489.05Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF
        MDAISYST QFIRY NVNSRPC+ S M SH PSLVF A NTR+RQIS FS+V  PCHR  KLVPKSSD+N+DIVPSEDDPEDGVSLGTMKLP D DIARF
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF

Query:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR
        QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLL ALKKSVEISR
Subjt:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR

Query:  V
        V
Subjt:  V

A0A1S3CH91 uncharacterized protein LOC1035009098.6e-9589.55Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF
        MDAISYST QF R+ NVNSRPC+LSPM SH PSLVF A NTRHRQIS FS+   PCHR S LVPKSSD+N+DIVPSEDDPEDGVSLGTMKLP D DIARF
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF

Query:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR
        QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLL ALKKSVEISR
Subjt:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR

Query:  V
        V
Subjt:  V

A0A5A7SHD0 Uncharacterized protein8.6e-9589.55Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF
        MDAISYST QF R+ NVNSRPC+LSPM SH PSLVF A NTRHRQIS FS+   PCHR S LVPKSSD+N+DIVPSEDDPEDGVSLGTMKLP D DIARF
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASH-PSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARF

Query:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR
        QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLL ALKKSVEISR
Subjt:  QVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISR

Query:  V
        V
Subjt:  V

A0A6J1HDQ9 uncharacterized protein LOC1114632294.6e-8883Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
        M  I++STK  IR+T VNSRP ILS MASHPSLVFV WNTR RQ SK S + SPC RN  L PKSS++N+DI+PSED+PEDGVSLGTMKLPSDIDIA+F+
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ

Query:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV
        VLLFQWANSLCQGANLPLPVPL+VDKI SGVRLGFI+IGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEI+ V
Subjt:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV

A0A6J1I6G0 uncharacterized protein LOC1114711572.0e-8884Show/hide
Query:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ
        M AI++STK  IR+T VNSRP ILS MASHPSLVFV WNTR RQ SK S V SPC RN  L PKSS++N+DI PSED+PEDGVSLGTMKLPSDIDIA+F+
Subjt:  MDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQ

Query:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV
        VLLFQWANSLCQGANLPLPVPL+VDKI SGVRLGFI+IGDGKTEVLVYIDCLVFPATAS+SPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEI+ V
Subjt:  VLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36145.1 unknown protein6.2e-4561.39Show/hide
Query:  QISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGD-GK
        Q+   +   SP      +V  +S  N  +   E   EDGVSLGTMKLP D D+ARF+ LLFQWANSLCQGANLPLPVPLKVD+I  G RLGFI + D GK
Subjt:  QISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGD-GK

Query:  TEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV
        T+V VYIDCLVF  T     +F+A RNGR KD++PPGE RIMRSLL ALKK+VEI+RV
Subjt:  TEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTGAGATCCATGGACGCCATTTCTTATTCAACAAAGCAATTCATTCGTTACACAAATGTCAATTCTCGGCCATGTATTTTGTCTCCAATGGCGTCTCATCCGTC
ACTGGTTTTCGTCGCTTGGAACACTCGCCATCGGCAAATCTCCAAGTTCTCCGAAGTCTTAAGTCCATGCCACCGGAATTCGAAGCTGGTGCCCAAGTCTTCGGATCAGA
ACAGCGATATTGTGCCCTCCGAAGATGACCCCGAAGATGGAGTATCGCTTGGGACCATGAAATTGCCTTCGGACATTGACATTGCCAGATTTCAGGTCTTACTCTTCCAG
TGGGCCAATAGTCTTTGCCAGGGAGCTAACTTGCCGCTTCCAGTGCCTTTGAAGGTTGACAAAATACCCAGTGGAGTTAGACTTGGTTTTATCACAATTGGAGATGGAAA
GACAGAAGTTCTCGTGTATATAGATTGCTTGGTTTTTCCTGCTACTGCCAGTACTAGTCCAATTTTTCGAGCCATAAGAAATGGACGCTTAAAGGACCAGTCACCTCCTG
GTGAACCGAGAATTATGAGGAGTCTTTTGAGTGCTTTGAAAAAATCAGTTGAAATTTCTAGAGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTGAGATCCATGGACGCCATTTCTTATTCAACAAAGCAATTCATTCGTTACACAAATGTCAATTCTCGGCCATGTATTTTGTCTCCAATGGCGTCTCATCCGTC
ACTGGTTTTCGTCGCTTGGAACACTCGCCATCGGCAAATCTCCAAGTTCTCCGAAGTCTTAAGTCCATGCCACCGGAATTCGAAGCTGGTGCCCAAGTCTTCGGATCAGA
ACAGCGATATTGTGCCCTCCGAAGATGACCCCGAAGATGGAGTATCGCTTGGGACCATGAAATTGCCTTCGGACATTGACATTGCCAGATTTCAGGTCTTACTCTTCCAG
TGGGCCAATAGTCTTTGCCAGGGAGCTAACTTGCCGCTTCCAGTGCCTTTGAAGGTTGACAAAATACCCAGTGGAGTTAGACTTGGTTTTATCACAATTGGAGATGGAAA
GACAGAAGTTCTCGTGTATATAGATTGCTTGGTTTTTCCTGCTACTGCCAGTACTAGTCCAATTTTTCGAGCCATAAGAAATGGACGCTTAAAGGACCAGTCACCTCCTG
GTGAACCGAGAATTATGAGGAGTCTTTTGAGTGCTTTGAAAAAATCAGTTGAAATTTCTAGAGTATGAGCTAATCTTTGTTTATGTAAATGTAACGAACTTGTCTTTTCT
TGTGAAGGTTCACTTTGACATCATATCTGGCTCTCATTTTTTTTTCTTTTTATATCTTAACAATTCAATGTGGCTTGAATTAGAAAAATATTTATTTTGGCTTTAGTCTC
ATTCAGAAAACTATGACATAAGATTCTCTTAGATCAAAACAGCATGGCAGCATAATGCTGATGTTTCGGATGTCTTCTACTGATTTAGAATGCACAAGGCTTGCTTTCCA
CTTTTTGGAATATTCACAAATTGTTGATGGAGACGTTGGAAAAATGCTGTGAAGAAAGCTATAACAAGGAAATTCATTGGAAGTCTTATTTATTATTCAGTGTTTGGTTT
ATTCACACTGAAACCAAAATTAGCTGATGTGTTGGCCATTGCACTCCTTGTCAAGGCATGAAACTAGAACAAAGTAATTAGAAGGGGAAAAAAATGAAGAAAGCTATGCG
CCGATAATTATATAGTTTGTCATCAGCAATACATTCAGATAAATATGTTGATCTAAACCTGAAGATTCTTTGTTATACTTGCTTTCTATGATATACTTAAATTGACCAGT
AGAATGAACTTTGCATAACTTTCTTAACTTCAAAGAGATGGAAGAACTGAAACCTGCTGATAGAATGTGTGGTAATGAAGTAGAGGACGGTTTCGATGCAATTGTTATAG
GGTCAGGATATGGTGGTTCTGTTGCTGCATGTCGGATGTCTATGGCAGGAATAAAAGTTTGCTTACTTGAGAAGGGCCGCAAATGGGAATCTCAGGATTTTGTTACTGAC
AGCATGAAAATAACTTCAGCCGTGAGGATAGAAAATCACAATTTAGGCTTAAGCTTTGGTCCTAAAGATGCATTATTCCAGGTATTTGAACAAAATGATTCTCTAGCAAC
TGTAGCCTGTGGGCTAGGGGGAGGTTCACTAGTGAATGCTGGAGTGATGCTTCCAACCCCAGTTCTTGTTAGAAGGGATCCAAACTGGCCAAAAGAATGGGAAAGGGATT
GGTCTTTCTGTGAAGCTGCTGCTGCTGCCATGTTGAAGGTACAAAGTATTCCCATCAAGTTTCCTTCCGCCAAAGTTTTAGAAGAAACTGTTGACGAAGAGATTGAAGGA
AGTTTTGAGTCTTCGGTGAATCTTAGCATTAACTTCGATCTTGAGGAATCACTGTCTAATTCAAAGAAAATCCAACAGCGGGGTAACTGCTTGGCTTGTGGAAATTGCCT
CGCTGGATGTCCTTATAATGCAAAGAGTTCAACAGACAAAAATTATTTATTGACAGCCATCCAGGCAGGATGTGTTGTTCATACTACATGTCAAGTTCAGTATGTTGTAA
AAAATTCACCTAACCAAGAAGGCAGAACCTCCCGAAAAAGAAGATGGTCTGTTTACTTGAATGAGATTGATTTTATCACCTGTGATTTTGTAATCCTTTCAGCTGGAGTT
TTTGGTACAACTGAGATACTCTTTCGGTCTCAAATGAGAGGACTAGATGTTTCTGAAGCACTTGGCTGTGGATTTAGCTGTAACGGAAATGCTGTGGCCTACCTTGCTGG
GAGTCCTGCACCTTTGAATGCTTATGGGTTAGATAGAGAGCAGCTATGGAAAAAAGCTTTTCATGAACGGCCAGGACCATCTATCTCTTCTTCGTACACTTCTTCATTGG
GATTCACAATTCAGAGTGCAGTACTTCCTTCGGCATATCCTAACCTGCTTTTTAAAGGCATTACAACTTATGGATGGCCCAATGGCTACTGGTTCTTTCATGGGATTTTA
GATAAATTGAAACAAGTTCTAAGCTTCAAAGCCAGCCAAGCAATTGTTCTGAACGCAATGGGTTATGACAAGGGCGATGGGAAGATTATGTTGCAAAGGGACACAGATAA
AATTTCCTTTTTTCCACCATTTGATCCTTTACTACCACAAAAAGTAAATGTCTTTCAAAGAATCACAAAAAAGTTAGGGGGAGTTCTTTTCATTTCAAGGTACCGAAGCA
CATCAGTTCACCATTTAGGTGGGTGCAACGTGGCATCTGATTCTTCTCGTGGTGTTTGCAATGCCAGTGGTCAGGTTTTTTATCCCAAGAATCCTGCTTCTGTGCATCCA
GGCCTCTATGTTTGCGATGCTTCATTAATTCCACGTTCTGTTGGTGTAAATCCATCTTTTACAATCGCAATTGTTTCTGAACATGTAAGCAAGCATCTTGTGAGTGATAT
TCTCAAGTACAAGTGCCAACATGGCATTGAGGTTTCTGCTAGCAATGATAATAAGCATTCTATCCACAAAACAAATATAAATAGATCCCAGAGGTCAATAGTCATGGTTA
AAGAAACCATGAAGGGTTATGTGGGAGGAATGCCTTGTGCTGTTTATCTCATAATGAAGATGAACTCCGAGGGTCGGAAAGATTTCTATCAATCAAAAGGAAGTTTTGGA
GAATGTCATCCACTTCTTAGAGGAAAAGTTGGTGGGTATGTAGAATTTCGGGCCATTGAGAAGGACAATCTATACATTATCGATGGGGAAGTAAATTTGTGTTATACTGG
TTGCAGAACTCCCTTCACTCAGTATATGACTTATCACCTTCTCCTTGCAGCTTCTTCTGGTTCAAGATATATTCTGAAGGGGAAGAAGACCTTGAATCCTTATCTCTTTG
GTTTATATGCTTGGCAAGAGACGACGACACTGCATGTGAGAGTTGAAAAAGTTGCAGAAAATAGTTCGATGAATGATGTTGCCATTTTAGAAGGGGAACTTAGCATCTCA
ATTTTAGAACTTCTCAAGAGTTTCTTAAGCCTTAAGGGAGAAAAGAGAGGACAGTTCATTAGTCTTTTGTTAAAGACTTTTGTGAGAACCTATATCTTACAGATGCCACG
GTTGACTTACAAAAACCCAACACCACTGGGCTTCTTAGAAAACCTCTACGGTCATGGATACACTTCTCGTTTTGAAATCACAACAGAAGATGGAATTACCATCTGTTGCA
TAAAATTTAGCTGTGCCCAATATCCATCGAGGGTTCAAGAAGGAAAACAACGTAATCCAGTTATCCTGATTAATGGCTATTCAACAGAGAGTTACTATCTGCCAACAGAA
CCCACTGATTTGGTTAGAACTTTACTTGGAGAAGGGCACGATGTCTGGCTATTGCAATCAAGATTACACCCTCTAAATCCTTCTAACGACTTCACAATAGCAGATGTTGG
CAGATTTGACATCCCTGCTGCAATCAACAAGATCCTAGAAATGGATGGGTCCTGCAGAAAGGTACATATTGTTGCACACTGTGTTGGTGGGTTGGCATCACACATTTCTC
TCATGGGAGGACATGTCTCTAATTCTTGTGTGGCCTCTCTCTCTTGTACCAACTCTTCAATGTTTTTCAAGCTAACTGTTTCGTCAATGGTCAAAATGTGGCTTCCTCTG
GTCCCAGTGAGTACAAGTTTCTGAATGGTTACAGTTAACGGCTCCCATTTGTCCTGAATTTATTTGCTTGAGTTCATACTTTTCTGAACAGTTGTGTCAAAATGTGATTG
TTTAGGATCTGGAAAATGTCTAGACTACACATCTTAATTAGACAACTCTGAGTTTCGTTTCTCTTTAGGTTAATGAAAAACGAAATAAAACTCTTTGGGTGCATTTGGTT
TTATGCAGATATCTATGGCTATACTTGGAAAGAACAAGATTCTCCCTCTCTTGGGAACATCTCGTATCAGCAGAAGGCATCAGCTCCTAAAATTGATAGCCCATTTGTTA
CCGCGGTACGAGAGGTGCACTTGCAACGAATGTGAAGTCTTCTCTGGCATATTTGGCAGCACATTTTGGCATGAAAATGTGAGTCCTTCTCTTCATCACTGGTTAAACAA
GGAAAGCTCCACAATGCTCCCCATGGCAGCATTTCCTCACCTCAGAAAAATTTGCAATGCTGGTTTCATCGTGGACGACAAAGGGAACAACAATTACTTGATACATCCAG
AGAGAATGGCATTCCCAACGCAATACATATCAGGTGGAAGGAGTCTTCTGGTAAGTCCTCTCACTTCCTTTCTGGCCAACAAATACATGAAGTTGCATCAGCCAAAATTC
AGACATGAAAGGGTGGTTGTGGATGGTTTTGGGCATTCTGATTTGTTGATTGGAGAGAAGTCTTGTAAGGAAGTATTCCCTCATATTCTGTCACATATTGAATTAGCTGA
AAAGGAAGGTGCAATCACTGGTGATGCCAGAGAGAGATACAATAGGGGGGAGGCATTGTCTTGGAGTGAAGATCCACATGATGGGTACAAAAATGGAAAGTAGGAACTGG
TAGGTAAGCAGAATGGCAAACGCTTCTCAAAGCTGAAGCCTCGCAATCGAATTACCCTATCCCTTTGTTCATTCTACTTGTTCTTCCATTTCCCAATTCTCCCTCAAAAC
CCCTTAATCCTTCGCTTTCTTCTCTCGTAATCTCTCTCTATTGCCGTTTTCGATGTCACTGCCTTCTACGTTCGCTCCTTCCTCTCATTTTCGCCTCCCAATTTCACATT
TCAAGCCCTCTTACCATTCCAGAACTTCTTTTTTCTTATCCATCGACTATTGGTGCTGCAAATCCCTCAATTCACCCTCTAGAGGCACTTCCCGACGATTGCCGCTCATC
TGCTCTTCTTCGTCCGATGGCGCGTCTGGTTCAGTTCCCTCTGATAGTGATAACATTCCCAGTAACTTCTGTATCATAGAAGGACCGGAGACCGTTCAGGATTTTGTTCA
GATGCAATTCCAGGAAATCCAGGACAATATAAGGAGTCGTCGTAATAAAATTTTTCTTCTAATGGAAGAGGTAAGAAGATTACGAATTCAACAACGCTTAAAGAATCTAA
AAGTTATTGATGAGAATGACAATGAAGAGGCGAATGAAATGCCTGACATTCCATCATCTATTCCTTTTCTTCCCCACGTGACACCAAAGACGTTGAAGCAGCAATATTTA
ACCAGCTTGTCAGTTATATGGGGAATAATTGTATTTGGTGGCCTTATTGCCCCAATTCTGGAGCTAAAATTGGGATTAGGTGGCACTTCGTACGAAGATTTCATCCACAA
CATGCATTTGCCTATGCAATTAAGTCAAGTGGATCCCATTGTGGCGTCATTTTCAGGTGGAGCTGTAGGTGTCATTTCTGCCTTGATGTTAATTGAAGCTAACAACGTTG
AGCAACAAGAGAAAAAAAGGTGCAAATATTGTCATGGAACGGGGTATCTGGCTTGTGCCCGATGTTCTTCAAGTGGTGTATGCTTAAGTGCTGACCCCATCTCACTATCT
GCTTCTTCTAGCCGCCCTTTACGAATGCCCAAAACTCAAAGATGTCTCAACTGTTCTGGTGCAGGAAAGGTAATGTGCCCAACATGTCTTTGTACGGGGATGTTGATGGC
AAGTGAGCACGACCCAAGAATCGACCCATTCGACTAAGGCTAAACTTGTACGTTGCATCGGAACTCTTTGTTTTCATCTTTTTTTCAAAGTTTTCTTTCTTGTTTGTCAT
CTAGATAATGAACTTGAAAACTCTCGTATATAAAAAAAATATAAATGTTGGAGAATTGTTTGTATATTA
Protein sequenceShow/hide protein sequence
MQVRSMDAISYSTKQFIRYTNVNSRPCILSPMASHPSLVFVAWNTRHRQISKFSEVLSPCHRNSKLVPKSSDQNSDIVPSEDDPEDGVSLGTMKLPSDIDIARFQVLLFQ
WANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATASTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISRV