; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G045980 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G045980
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionMicrobial collagenase
Genome locationchrH02:27275450..27277247
RNA-Seq ExpressionChy2G045980
SyntenyChy2G045980
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK28472.1 uncharacterized protein E5676_scaffold629G001050 [Cucumis melo var. makuwa]8.90e-17091.88Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSS SLTRFASSSSNNSLNP KL LKVPFNA S+S ISFKSSN PSIYRFPS KTCA LDGKDPNGATPVLV+        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPA E+LSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_004137329.1 uncharacterized protein LOC101217184 [Cucumis sativus]4.64e-17494.46Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSSFSLTRFA SSSNNSL PPK+L KVP NANSQSSISFKSSNTPSIYRFPS KTCAALDGKDPNGATPVLVE        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEE+LSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_008453454.1 PREDICTED: uncharacterized protein LOC103494159 [Cucumis melo]4.61e-17292.62Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSS SLTRFASSSSNNSLNPPKL LKVPFNA S+S ISFKSSN PSIYRFPS KTCA LDGKDPNGATPVLV+        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEE+LSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_023005200.1 uncharacterized protein LOC111498299 [Cucurbita maxima]8.05e-13877.32Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLV------EIVNEEVEKSVK
        M+SL V  S+  S   + FASSSSNNSLNPPK+  K P NA +   IS KSSN PS  RFP  K  A L  KDPNGA P+ V       IV+EEVEKSVK
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLV------EIVNEEVEKSVK

Query:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT
        VLK+AAKTR+V AEE+LSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKL+KGRYFP+TAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SWK 
Subjt:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT

Query:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        RILAFIFER+RIK GPL PLEISLG+K+EREPS+KDP FIWFYVDEE+AVARGRSGGTAFWCRCRRVNT
Subjt:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

XP_038879808.1 uncharacterized protein LOC120071551 isoform X1 [Benincasa hispida]1.15e-16088.48Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE------IVNEEVEKSVK
        MSSLTVAASS SS  LT+FASSSSNN LNP KL LK+P NAN+Q +ISFKSSN PSI R P  KTCAALD KDPNGATPVLVE      IVNEEVEKSVK
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE------IVNEEVEKSVK

Query:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT
        VLKNAAKTRKV AEE+LSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SWKT
Subjt:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT

Query:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        RILAFIFER+RIKIGPLNPLEISLGQKEEREPS KDP FIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

TrEMBL top hitse value%identityAlignment
A0A0A0LPV7 Uncharacterized protein1.7e-13594.46Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSSFSLTRFA SSSNNSL PPK +LKVP NANSQSSISFKSSNTPSIYRFPS KTCAALDGKDPNGATPVLVE        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEE+LSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A1S3BXG3 uncharacterized protein LOC1034941595.4e-13492.62Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSS SLTRFASSSSNNSLNPPKL LKVPFNA S+S ISFKSSN PSIYRFPS KTCA LDGKDPNGATPVLV+        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEE+LSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A5A7USJ0 Uncharacterized protein5.4e-13492.62Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSS SLTRFASSSSNNSLNPPKL LKVPFNA S+S ISFKSSN PSIYRFPS KTCA LDGKDPNGATPVLV+        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPAEE+LSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A5D3DY46 Uncharacterized protein3.0e-13291.88Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS
        MSSLTVAASSFSS SLTRFASSSSNNSLNP KL LKVPFNA S+S ISFKSSN PSIYRFPS KTCA LDGKDPNGATPVLV+        IVNEEVEKS
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVE--------IVNEEVEKS

Query:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
        VKVLKNAAKTRKVPA E+LSA SVLEKAKLDPS FFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW
Subjt:  VKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSW

Query:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
Subjt:  KTRILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

A0A6J1KWT0 uncharacterized protein LOC1114982996.7e-10877.32Show/hide
Query:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLV------EIVNEEVEKSVK
        M+SL V  S+  S   + FASSSSNNSLNPPK+  K P NA +   IS KSSN PS  RFP  K  A L  KDPNGA P+ V       IV+EEVEKSVK
Subjt:  MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLV------EIVNEEVEKSVK

Query:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT
        VLK+AAKTR+V AEE+LSA SVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKL+KGRYFP+TAIQRFDAAGKRIENGVFLGPIGSLTFEGR+SWK 
Subjt:  VLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKT

Query:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT
        RILAFIFER+RIK GPL PLEISLG+K+EREPS+KDP FIWFYVDEE+AVARGRSGGTAFWCRCRRVNT
Subjt:  RILAFIFERVRIKIGPLNPLEISLGQKEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18060.1 unknown protein9.0e-8172.96Show/hide
Query:  AALDGKDPNGATPVLVEIVNEEVEKSVKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFD
        A +DG++     P   +I N++V +SV VLK AAKTRKV A+EIL+AFS +EKAK+DPS F  TLGG  SPGRTWMLIFTAEKKL KGRYFP+TA+QRFD
Subjt:  AALDGKDPNGATPVLVEIVNEEVEKSVKVLKNAAKTRKVPAEEILSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFD

Query:  AAGKRIENGVFLGPIGSLTFEGRLSWKTRILAFIFERVRIKIGPLNPLEISLGQKEE-REPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRV
        AAGKRIENGV+LGP G+LTFEGR SWK RILAF+FE++RIKIGPL+PLE SLG+K+   EPS KDP FIWFY+DEEIAVARGRSGGTAFWCRCRR+
Subjt:  AAGKRIENGVFLGPIGSLTFEGRLSWKTRILAFIFERVRIKIGPLNPLEISLGQKEE-REPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCACTCACTGTGGCAGCTTCCTCTTTCAGCTCCTTCTCCTTAACCCGGTTCGCTTCCAGTTCATCCAACAACTCTCTCAATCCCCCAAAACTCCTTCTCAAAGT
CCCTTTTAACGCCAACTCCCAATCTTCGATTTCCTTCAAATCTTCAAATACGCCTTCAATTTACCGGTTTCCGAGTTTTAAAACCTGCGCAGCTTTAGACGGAAAAGACC
CAAATGGAGCGACCCCAGTTCTGGTTGAGATTGTGAACGAGGAAGTGGAAAAGAGCGTTAAAGTGCTAAAAAATGCGGCAAAGACAAGAAAGGTACCAGCAGAAGAAATT
TTGTCTGCTTTTTCTGTACTTGAGAAGGCTAAACTTGACCCTTCAAAGTTTTTTAATACACTTGGTGGAACAAGCTCTCCTGGTAGAACCTGGATGCTTATTTTTACTGC
TGAGAAAAAATTGAAAAAGGGTCGGTACTTCCCTGTTACAGCCATCCAGAGGTTTGATGCTGCTGGAAAAAGAATAGAGAATGGAGTGTTTCTGGGACCTATTGGAAGCT
TAACGTTCGAAGGTAGACTTTCATGGAAGACAAGAATACTAGCGTTCATTTTCGAACGAGTTCGAATAAAAATTGGACCTTTAAACCCTTTAGAGATTAGTCTTGGTCAA
AAAGAAGAAAGGGAGCCAAGCACCAAGGATCCTTGCTTTATCTGGTTTTATGTTGATGAGGAAATAGCTGTTGCTCGTGGTAGAAGTGGGGGAACTGCATTTTGGTGCCG
GTGTCGCCGTGTCAATACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCACTCACTGTGGCAGCTTCCTCTTTCAGCTCCTTCTCCTTAACCCGGTTCGCTTCCAGTTCATCCAACAACTCTCTCAATCCCCCAAAACTCCTTCTCAAAGT
CCCTTTTAACGCCAACTCCCAATCTTCGATTTCCTTCAAATCTTCAAATACGCCTTCAATTTACCGGTTTCCGAGTTTTAAAACCTGCGCAGCTTTAGACGGAAAAGACC
CAAATGGAGCGACCCCAGTTCTGGTTGAGATTGTGAACGAGGAAGTGGAAAAGAGCGTTAAAGTGCTAAAAAATGCGGCAAAGACAAGAAAGGTACCAGCAGAAGAAATT
TTGTCTGCTTTTTCTGTACTTGAGAAGGCTAAACTTGACCCTTCAAAGTTTTTTAATACACTTGGTGGAACAAGCTCTCCTGGTAGAACCTGGATGCTTATTTTTACTGC
TGAGAAAAAATTGAAAAAGGGTCGGTACTTCCCTGTTACAGCCATCCAGAGGTTTGATGCTGCTGGAAAAAGAATAGAGAATGGAGTGTTTCTGGGACCTATTGGAAGCT
TAACGTTCGAAGGTAGACTTTCATGGAAGACAAGAATACTAGCGTTCATTTTCGAACGAGTTCGAATAAAAATTGGACCTTTAAACCCTTTAGAGATTAGTCTTGGTCAA
AAAGAAGAAAGGGAGCCAAGCACCAAGGATCCTTGCTTTATCTGGTTTTATGTTGATGAGGAAATAGCTGTTGCTCGTGGTAGAAGTGGGGGAACTGCATTTTGGTGCCG
GTGTCGCCGTGTCAATACTTAG
Protein sequenceShow/hide protein sequence
MSSLTVAASSFSSFSLTRFASSSSNNSLNPPKLLLKVPFNANSQSSISFKSSNTPSIYRFPSFKTCAALDGKDPNGATPVLVEIVNEEVEKSVKVLKNAAKTRKVPAEEI
LSAFSVLEKAKLDPSKFFNTLGGTSSPGRTWMLIFTAEKKLKKGRYFPVTAIQRFDAAGKRIENGVFLGPIGSLTFEGRLSWKTRILAFIFERVRIKIGPLNPLEISLGQ
KEEREPSTKDPCFIWFYVDEEIAVARGRSGGTAFWCRCRRVNT