; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G004040 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G004040
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionMicrobial collagenase
Genome locationGy14Chr1:2565290..2569850
RNA-Seq ExpressionCsGy1G004040
SyntenyCsGy1G004040
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK28472.1 uncharacterized protein E5676_scaffold629G001050 [Cucumis melo var. makuwa]3.44e-17694.1Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSS SLTRFA SSSNNSL P K+ LKVP NA S+S ISFKSSN PSIYRFPSLKTCA LDGKDPNGATPVLV+EESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPA EVLSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_004137329.1 uncharacterized protein LOC101217184 [Cucumis sativus]1.74e-18799.63Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVL KVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_008453454.1 PREDICTED: uncharacterized protein LOC103494159 [Cucumis melo]1.78e-17894.83Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSS SLTRFA SSSNNSL PPK+ LKVP NA S+S ISFKSSN PSIYRFPSLKTCA LDGKDPNGATPVLV+EESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEEVLSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_023005200.1 uncharacterized protein LOC111498299 [Cucurbita maxima]1.41e-14078.23Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        M+SL V  S+  S   + FA SSSNNSL PPK+  K P NA +   IS KSSN PS  RFP  K  A L  KDPNGA P+ V EESS+S  IV+EEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLK+AAKTR+V AEEVLSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKL+KGRYFP+TAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        K RILAFIFER+RIK GPL PLEISLG+K+EREPS+KDP FIWFYVDEE+AVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_038879808.1 uncharacterized protein LOC120071551 isoform X1 [Benincasa hispida]4.07e-16388.93Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASS SS  LT+FA SSSNN L P K+ LK+P NAN+Q +ISFKSSN PSI R P LKTCAALD KDPNGATPVLVEEE S   NIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKV AEEVLSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFER+RIKIGPLNPLEISLGQKEEREPS KDP FIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

TrEMBL top hitse value%identityAlignment
A0A0A0LPV7 Uncharacterized protein8.43e-18899.63Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVL KVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A1S3BXG3 uncharacterized protein LOC1034941598.61e-17994.83Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSS SLTRFA SSSNNSL PPK+ LKVP NA S+S ISFKSSN PSIYRFPSLKTCA LDGKDPNGATPVLV+EESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEEVLSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A5A7USJ0 Uncharacterized protein8.61e-17994.83Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSS SLTRFA SSSNNSL PPK+ LKVP NA S+S ISFKSSN PSIYRFPSLKTCA LDGKDPNGATPVLV+EESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEEVLSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A5D3DY46 Uncharacterized protein1.66e-17694.1Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        MSSLTVAASSFSS SLTRFA SSSNNSL P K+ LKVP NA S+S ISFKSSN PSIYRFPSLKTCA LDGKDPNGATPVLV+EESSTSSNIVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPA EVLSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A6J1KWT0 uncharacterized protein LOC1114982996.83e-14178.23Show/hide
Query:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS
        M+SL V  S+  S   + FA SSSNNSL PPK+  K P NA +   IS KSSN PS  RFP  K  A L  KDPNGA P+ V EESS+S  IV+EEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLK+AAKTR+V AEEVLSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKL+KGRYFP+TAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SW
Subjt:  VKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        K RILAFIFER+RIK GPL PLEISLG+K+EREPS+KDP FIWFYVDEE+AVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18060.1 unknown protein2.3e-7977.09Show/hide
Query:  IVNEEVEKSVKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGS
        I N++V +SV VLK AAKTRKV A+E+L+AFS +EKAK+DPS F  TLGG  SPGRTWMLIFTAEKKL KGRYFP+TA+QRFDAAGKRIENGV+LGP G+
Subjt:  IVNEEVEKSVKVLKNAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGS

Query:  LTFEGRLSWKTRILAFIFERVRIKIGPLNPLEISLGQKEE-REPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRV
        LTFEGR SWK RILAF+FE++RIKIGPL+PLE SLG+K+   EPS KDP FIWFY+DEEIAVARGRSGGTAFWCRCRR+
Subjt:  LTFEGRLSWKTRILAFIFERVRIKIGPLNPLEISLGQKEE-REPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCACTCACTGTGGCAGCTTCCTCTTTCAGCTCCTTCTCCTTAACCCGGTTCGCTCACAGTTCATCCAACAACTCTCTCATCCCCCCAAAAGTCCTTCTC
AAAGTCCCTTTAAATGCCAACTCCCAATCTTCGATTTCCTTCAAATCTTCAAATACGCCTTCAATTTACCGGTTTCCGAGTTTGAAAACCTGCGCAGCTTTAGAC
GGAAAAGACCCAAATGGAGCGACCCCAGTTCTGGTTGAGGAAGAGAGTTCCACTTCCAGCAATATTGTGAATGAGGAAGTGGAAAAGAGCGTTAAAGTGCTAAAA
AATGCGGCAAAGACAAGAAAGGTACCAGCAGAAGAAGTTTTGTCTGCTTTTTCTGTACTTGAGAAGGCTAAACTTGACCCTTCAAAGTTTTTTAATACACTTGGT
GGAACAAGCTCTCCTGGTAGAACCTGGATGCTTATTTTTACTGCTGAGAAAAAATTGAAAAAGGGTCGCTACTTCCCTGTTACAGCCATCCAGAGGTTTGATGCT
GCTGGAAAAAGAATAGAGAATGGAGTGTTTCTGGGACCTATTGGAAGCTTAACGTTCGAAGGTAGACTTTCATGGAAGACAAGAATACTAGCGTTCATTTTCGAA
CGAGTTCGAATAAAAATTGGACCTTTAAACCCTTTAGAGATTAGTCTTGGTCAAAAAGAAGAAAGGGAGCCAAGCACCAAGGATCCTTGCTTTATCTGGTTTTAT
GTTGATGAGGAAATAGCTGTTGCTCGTGGTAGAAGTGGGGGAACTGCATTTTGGTGCCGGTGTCGCCGTGTCAATACTTAG
mRNA sequenceShow/hide mRNA sequence
AATTTCCATTTGAAGAAAAATAAAGAACCGTTCATATTCATCAAATCAGAGGATTGTTGGCAATCATGGCTTACGAGGATTTGATCCTCACCAGATATTTGTATG
GGTTAAAGGAAGACGAAGAACTTTCCTCTCTAGACGAAGTGAATAACGATGAGCTCACTCACTGTGGCAGCTTCCTCTTTCAGCTCCTTCTCCTTAACCCGGTTC
GCTCACAGTTCATCCAACAACTCTCTCATCCCCCCAAAAGTCCTTCTCAAAGTCCCTTTAAATGCCAACTCCCAATCTTCGATTTCCTTCAAATCTTCAAATACG
CCTTCAATTTACCGGTTTCCGAGTTTGAAAACCTGCGCAGCTTTAGACGGAAAAGACCCAAATGGAGCGACCCCAGTTCTGGTTGAGGAAGAGAGTTCCACTTCC
AGCAATATTGTGAATGAGGAAGTGGAAAAGAGCGTTAAAGTGCTAAAAAATGCGGCAAAGACAAGAAAGGTACCAGCAGAAGAAGTTTTGTCTGCTTTTTCTGTA
CTTGAGAAGGCTAAACTTGACCCTTCAAAGTTTTTTAATACACTTGGTGGAACAAGCTCTCCTGGTAGAACCTGGATGCTTATTTTTACTGCTGAGAAAAAATTG
AAAAAGGGTCGCTACTTCCCTGTTACAGCCATCCAGAGGTTTGATGCTGCTGGAAAAAGAATAGAGAATGGAGTGTTTCTGGGACCTATTGGAAGCTTAACGTTC
GAAGGTAGACTTTCATGGAAGACAAGAATACTAGCGTTCATTTTCGAACGAGTTCGAATAAAAATTGGACCTTTAAACCCTTTAGAGATTAGTCTTGGTCAAAAA
GAAGAAAGGGAGCCAAGCACCAAGGATCCTTGCTTTATCTGGTTTTATGTTGATGAGGAAATAGCTGTTGCTCGTGGTAGAAGTGGGGGAACTGCATTTTGGTGC
CGGTGTCGCCGTGTCAATACTTAGTTCTTTTCTCCATCAGATATTTGATCTTCTGCTTTGTACATGTCGAATTTGTATGCTTGTATCAAATTATCATGTACCTCG
ACGCTGTCATATTCAAATGTTCAATGTGCTATATTTATTAAGAACGTGTACATCCTTTGAACTACCTTCTCTCATTTGAAAGTTTGCAATTGGCATAGAAACCTT
TCGAATTGGTCTACATGAGTTTCACTTCATCTGTCCTATTTGCCAACTCTGAACGAATCAATTATTTATTTATATAGAGCATACTATTTGCTCCAGCAGAAAGTT
TATTATGATTTTTGGGCTTCTTTTTAGGTAACTGCCACTTCTCTTTGGATGGCTCTTGGAGATACAGTGTTTGAGTGTAGCTTGAACAGCAATCAAGTAATGATT
CTATTGAGCTCACCTTAGCTTACTCTTTCTTTCTCTAGAATTTAGTTTTGGAAGGCTGGTGCCAATGATCGTCTATCATTTTGGGGGGGTCAAGTGTAGCTGTGC
TGCTATCCTCTTAATTTTTTAAAAAAATTATTTTTACAAGTTAACAAGATATCTTATAAATGTTGAATGATTCATGTAGACAGCAACGAGCAAACCTAGAATGCT
CATCTGTTCCATGAATCAACAAGTAATCCACTCTTCTGAGTTCTGCGTAGATTTCATCTTGGGAGTGATGGATAATTTAGACGAAGTCAGTTCAGTTTAGAAGGG
AGGAAAGGAGTTTATATGTTACTCTTGTTATTCCAGTTTCTTTCATTAATTCATGCTTTCGTATTAATTGATTAAAATGGTCAAAAAATATAAGGAAAAAAATGT
GGAAGAGCTGAACAATCAATGAAGCAGATGCATATCATCCAGGGTAGTAAAGAGTATCCAAACACAAACACCTCCTATGATTTTTTAACCTATGAGATTTGTTTC
ACATTGACCATCTGTATTCTTACTGTTTGACAGAATAAACTTATTTTTAACGAAATGCAAACTTTGGACGAGACATCTTAGTTCTTATTTGCAGCCAAGGACCGG
ACGTAGCTCAGACTCCCCTTGAAGTAGTTGACTAGAGTTTCTCTGGGTGGAAGCGATTCTTCGATAACTGGAACGTGGATGAAGTTTTGAGCCCATTCATGAAGG
GATGGGACCCTTTCTCTGTCAAACACATTCATTTCTCCCACTTCATCCAGAACATTGAGCCAGTGGCAAATCCAACCTGCAGCTAAATCTAGGTAACCAATTTGC
TCTCCACCAAAGAATTTCTTTCCTTGTATTTCTTTGTCAAGCAGTGCCAAATTTTGTATTGCAGCCTCTACAGCCTTCTCTTTCTCTTCTCCTTCAGCCTGGCAA
GCTTCCCAGGCACCGATCAAACCCTGCGTTAAGAAAAGTATGGTTATTACAATATTCTGACGATAAGCCATTATCTTGTGAAGTTACAGATTTGGAGTGCAACCA
GTTTTGTGTCATTTTCTGTAACCACTCCATTTAGTTTGAAGGTCTTAAGTTTTGCAAAACTAAGGTTGCACTAGATAACTATTTGGTTTAGTTTTTGAAAATTAA
TCTTGTAATCATTTCTCACATTTATAAGTTTATTTGTTTTGTTATACATTGGTACTTACCAATGTTTTCAAAAACAAAACAAAGTTTTAGAAACTATAAAAACTG
AAAACTTGTTTCTGATATTGGAACTCCGTTAAGAATTTAAAGGGAGAAAACAAGCAATAAATTAAAAAAACAGAAACTAAAAACAAAATTGTTATCAAACAGGAC
CTAAATAGTTGTATTGTATGCCAAAAATTGTTTATATGACCTACCATTTATGCTATCACTGAATTTTTATCTTTAAAGCTTAATTCAAATTAAATTTCAGGACTT
GGAAGCACAAAAATGCATGAAAAAGGAAGTTAATAGCAAAAGAACTATGGTTACCTTCTCGTCTAAAAACTTAGCCCAGAAGCGAGCATTGGCTCTGTCATATGG
ATCTTCAGGCAATATCGGGTTCTCCTTCCACGTCTCGTCGATGTATTCAATAATGAGAAGTGACTCGGAGATGGCTTTGTCATTGTGCAAAAACACAGGTATTTT
CTTGTGTACAGGGTTGGACTTGAGAAGCAGCTCACTTTTGTTTCTCAAGTCTTCAACTATGTACTCAAACTCAATTCCTTTGAGCTTCAAGGCCCATTCGACCCT
TATGCAGAATAGGCTTCCAGCTGATCCAATAATCTTTACCTCTGCCATTGTTGGTGATATTTGATATCTCCTTTAGCTCAAGTTTCTTGCCTTTTATAGTCTGTG
ACTCTGTATTGCTTCTAGCTGATGTTATTATTATTGTCCAATTTTGACCTTTCAAACGTAGGAAATAGCTTCTTTGACTAGGAGCTGGCATCAAAGGATAGTGCT
AATGAGTTTTTAAACGAGGCCCCATTTTTTTTTTTTAAACAGCCACCAAATGCATCACTTTGCTTTTTCTTCTTCAGAATGAGCATGTTTATTAGTAAACAATGT
TACCTTTTCAAAAGTAAATTAGCTATCAAATAAGGGGTAAGAGACCATCTACAGAAAGAATTGATATAAAATATATGGAAGGAATGAAAATTGAAGAACCTAGAT
CTGATCTGTTCAAGTG
Protein sequenceShow/hide protein sequence
MSSLTVAASSFSSFSLTRFAHSSSNNSLIPPKVLLKVPLNANSQSSISFKSSNTPSIYRFPSLKTCAALDGKDPNGATPVLVEEESSTSSNIVNEEVEKSVKVLK
NAAKTRKVPAEEVLSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKTRILAFIFE
RVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT