; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00810 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00810
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLysM domain-containing protein
Genome locationClcChr08:1443198..1447827
RNA-Seq ExpressionClc08G00810
SyntenyClc08G00810
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR018392 - LysM domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039254.1 uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa]7.4e-10578.78Show/hide
Query:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE
        R+WA T  +F NQ+R ++LRWRFQL D+SK QLS KHH+VHI+EG+ESL SIS++NGDP +SIVI NKKI+DTDLEQKGQNIKIQN RA++   D +QLE
Subjt:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQSALNGL+ YKKL AL SSRLPPARTTSFIVLVPL++FC RCIIGASYARVF T +L+ ++K+EGERHKFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPS DEQISVEDLSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

XP_008459633.1 PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo]7.4e-10578.37Show/hide
Query:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE
        R+WA T  +F NQ+R ++LRWRFQL D+SK QLS KHH+VHI+EG+ESL SIS++NGDP +SIVI NKKI+DTD EQKGQNIKIQNPR ++   D +QLE
Subjt:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQ+ALNGL+ YKKL AL SSRLPPARTTSFIVLVPL++FC RCIIGASYARVF T +L+ ++K+EGERHKFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPSEDEQISVEDLSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

XP_011656102.1 uncharacterized protein LOC101208955 isoform X2 [Cucumis sativus]6.9e-10379.18Show/hide
Query:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNP---RAITDVYQLE
        +WA T  +F NQ+R + LRWRFQ L DISK QLS KHH+VHI+EG+ESLTS S+QNGDP HSIV+ANKKIMDTDLEQK QNIKIQNP   R I + +QLE
Subjt:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNP---RAITDVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQSALNGLR YKKL AL SS  PPARTTSFIVLVPL++FCARCIIGASYAR F T +L+ +DK+EGER KFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPSEDEQISVE+LSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

XP_011656104.1 uncharacterized protein LOC101208955 isoform X3 [Cucumis sativus]6.9e-10379.18Show/hide
Query:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEG---SESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVYQLE
        +WA T  +F NQ+R + LRWRFQ L DISK QLS KHH+VHI+EG   +ESLTS S+QNGDP HSIV+ANKKIMDTDLEQK QNIKIQNPR I + +QLE
Subjt:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEG---SESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQSALNGLR YKKL AL SS  PPARTTSFIVLVPL++FCARCIIGASYAR F T +L+ +DK+EGER KFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPSEDEQISVE+LSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

XP_038890844.1 uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida]8.2e-11284.68Show/hide
Query:  PSNFYPRDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVY
        PS  + R+WA T  + KNQ RA+TLRWRFQLQDISK+QLS KHH VHIVEGSESLT   +QNGDP HSI +ANK+I DTDLEQKGQNIKIQNPRAI DVY
Subjt:  PSNFYPRDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVY

Query:  QLEEKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPI
        QLEEKLQSALN LRNYKKL AL SS LPPARTTSFIVLVPLIVFCARCIIGASYARVF TSRLETVDKREG+ HKFR+GHWRSALRDIRELDGLDCESPI
Subjt:  QLEEKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPI

Query:  DSTSPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        DS SPSEDEQIS EDLSH YKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  DSTSPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

TrEMBL top hitse value%identityAlignment
A0A0A0KSX5 Uncharacterized protein3.4e-10379.18Show/hide
Query:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNP---RAITDVYQLE
        +WA T  +F NQ+R + LRWRFQ L DISK QLS KHH+VHI+EG+ESLTS S+QNGDP HSIV+ANKKIMDTDLEQK QNIKIQNP   R I + +QLE
Subjt:  DWATT--TFKNQVRAVTLRWRFQ-LQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNP---RAITDVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQSALNGLR YKKL AL SS  PPARTTSFIVLVPL++FCARCIIGASYAR F T +L+ +DK+EGER KFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPSEDEQISVE+LSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

A0A1S3CB50 uncharacterized protein LOC103498697 isoform X15.7e-10375.89Show/hide
Query:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE
        R+WA T  +F NQ+R ++LRWRFQL D+SK QLS KHH+VHI+EG+ESL SIS++NGDP +SIVI NKKI+DTD EQKGQNIKIQNPR ++   D +QLE
Subjt:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPP--------ARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLD
        EKLQ+ALNGL+ YKKL AL SSRLPP        ARTTSFIVLVPL++FC RCIIGASYARVF T +L+ ++K+EGERHKFR+GHWRSALRDIRELDGLD
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPP--------ARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLD

Query:  CESPIDSTSPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        CE+PIDSTSPSEDEQISVEDLSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  CESPIDSTSPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

A0A1S3CBV5 uncharacterized protein LOC103498697 isoform X23.6e-10578.37Show/hide
Query:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE
        R+WA T  +F NQ+R ++LRWRFQL D+SK QLS KHH+VHI+EG+ESL SIS++NGDP +SIVI NKKI+DTD EQKGQNIKIQNPR ++   D +QLE
Subjt:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQ+ALNGL+ YKKL AL SSRLPPARTTSFIVLVPL++FC RCIIGASYARVF T +L+ ++K+EGERHKFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPSEDEQISVEDLSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

A0A5A7T6Z4 LysM domain-containing protein3.6e-10578.78Show/hide
Query:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE
        R+WA T  +F NQ+R ++LRWRFQL D+SK QLS KHH+VHI+EG+ESL SIS++NGDP +SIVI NKKI+DTDLEQKGQNIKIQN RA++   D +QLE
Subjt:  RDWATT--TFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAIT---DVYQLE

Query:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST
        EKLQSALNGL+ YKKL AL SSRLPPARTTSFIVLVPL++FC RCIIGASYARVF T +L+ ++K+EGERHKFR+GHWRSALRDIRELDGLDCE+PIDST
Subjt:  EKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDST

Query:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR
        SPS DEQISVEDLSHAYKKLDQDY+KFLSECGLSKWGYWRGGT+R
Subjt:  SPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRR

A0A6J1CKQ6 uncharacterized protein LOC111012032 isoform X32.5e-9074.17Show/hide
Query:  WATTTFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVYQLEEKLQSAL
        +A  + KNQ  AV LRWRFQLQDI +DQ   KHH+V IVEG E+ TSI  QNG   HSIVI N+KI DTDLE KGQ+ KI+NP AI DVYQL+EKLQS+L
Subjt:  WATTTFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVYQLEEKLQSAL

Query:  NGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPID---STSPSE
        NGL+NYKKL    S RLPPARTTSFIVLVPLIVFCARCIIGASYARV +TS+L+T+DK EGE HKFR+GHWRSALRDIRELDGLD ES  D   S SPS 
Subjt:  NGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPID---STSPSE

Query:  DEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTR
        DEQISVEDLSHAYKKLD+DY+KFLSECGLS  GYWRG  R
Subjt:  DEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G09970.1 unknown protein2.5e-1024.78Show/hide
Query:  RWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAITDVYQLEEKLQSALNG
        R RF +Q +S+++   KH      + SESL  I  Q G    +P  S    + ++ D D E+K   +         K+     + D+ ++E+  ++    
Subjt:  RWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAITDVYQLEEKLQSALNG

Query:  LRNYKKLSALTSSRLPPARTTSFIV-LVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRE---LDGLDCESPIDSTSPSED
        +     LS      LP   T   +  L+P++ FC  CIIG  +           + ++  + H   +  WR+AL D  E    DG D  SP    + +  
Subjt:  LRNYKKLSALTSSRLPPARTTSFIV-LVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRE---LDGLDCESPIDSTSPSED

Query:  EQISVEDLSHAYKKLDQDYQKFLSECGLSK
        E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  EQISVEDLSHAYKKLDQDYQKFLSECGLSK

AT4G09970.2 unknown protein4.8e-0924.67Show/hide
Query:  RWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAITDVYQLEEKLQSALNG
        +W F +Q +S+++   KH      + SESL  I  Q G    +P  S    + ++ D D E+K   +         K+     + D+ ++E+  ++    
Subjt:  RWRFQLQDISKDQLSAKHHYVHIVEGSESLTSISDQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAITDVYQLEEKLQSALNG

Query:  LRNYKKLSALTSSRLPPARTTSFIV-LVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDSTSPSEDEQI
        +     LS      LP   T   +  L+P++ FC  CIIG  +           + ++  + H   +  WR+AL D  E    D     DS SP   E  
Subjt:  LRNYKKLSALTSSRLPPARTTSFIV-LVPLIVFCARCIIGASYARVFETSRLETVDKREGERHKFRNGHWRSALRDIRELDGLDCESPIDSTSPSEDEQI

Query:  SVEDLSHAYKKLDQDYQKFLSECGLSK
        + ++++ AY +++ +Y++FL ECG+ +
Subjt:  SVEDLSHAYKKLDQDYQKFLSECGLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGGCCAATTTCAACGGTTTCTGTGATGGAAGTGAAGCTCAGCCAGAGAAACAGAGCACACCGTTTTTCTCTCCTCCCCAAGTTGCTTCCACACCCAACGCTCC
CTTCTTTAATTCTCAGAAACTTACACAACTTGCTTTGCTTTTGTGCTTTTTCCCCTCCAATTTCTACCCCAGAGATTGGGCCACCACCACATTCAAGAATCAAGTCAGAG
CCGTTACTCTGAGATGGAGGTTTCAGCTTCAGGATATATCCAAAGATCAACTCTCCGCCAAGCACCACTATGTTCATATTGTAGAAGGGAGCGAGAGCTTGACCTCAATT
TCAGATCAGAATGGAGATCCTGCACATTCCATTGTCATAGCTAATAAGAAGATAATGGACACGGATCTAGAACAAAAAGGGCAGAATATCAAGATTCAAAACCCTCGAGC
GATTACAGATGTATATCAATTAGAAGAAAAGCTTCAAAGTGCTTTGAATGGACTTCGAAACTATAAGAAGCTTTCCGCACTAACCTCCTCTCGTCTACCTCCTGCTAGAA
CCACTAGTTTTATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCTTATGCTAGAGTTTTCGAAACATCGAGGCTTGAAACTGTTGATAAA
CGAGAAGGAGAACGTCACAAGTTCAGAAACGGGCACTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTGTGAGTCCCCCATAGATTCTACTAGTCC
TTCAGAAGATGAACAGATCTCAGTTGAAGATTTATCACATGCTTACAAGAAACTGGATCAGGATTACCAAAAATTTCTATCAGAATGTGGACTGAGTAAATGGGGCTACT
GGCGTGGGGGTACCCGGAGAGACTTGAACAGGAATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATGGCCAATTTCAACGGTTTCTGTGATGGAAGTGAAGCTCAGCCAGAGAAACAGAGCACACCGTTTTTCTCTCCTCCCCAAGTTGCTTCCACACCCAACGCTCC
CTTCTTTAATTCTCAGAAACTTACACAACTTGCTTTGCTTTTGTGCTTTTTCCCCTCCAATTTCTACCCCAGAGATTGGGCCACCACCACATTCAAGAATCAAGTCAGAG
CCGTTACTCTGAGATGGAGGTTTCAGCTTCAGGATATATCCAAAGATCAACTCTCCGCCAAGCACCACTATGTTCATATTGTAGAAGGGAGCGAGAGCTTGACCTCAATT
TCAGATCAGAATGGAGATCCTGCACATTCCATTGTCATAGCTAATAAGAAGATAATGGACACGGATCTAGAACAAAAAGGGCAGAATATCAAGATTCAAAACCCTCGAGC
GATTACAGATGTATATCAATTAGAAGAAAAGCTTCAAAGTGCTTTGAATGGACTTCGAAACTATAAGAAGCTTTCCGCACTAACCTCCTCTCGTCTACCTCCTGCTAGAA
CCACTAGTTTTATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCTTATGCTAGAGTTTTCGAAACATCGAGGCTTGAAACTGTTGATAAA
CGAGAAGGAGAACGTCACAAGTTCAGAAACGGGCACTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTGTGAGTCCCCCATAGATTCTACTAGTCC
TTCAGAAGATGAACAGATCTCAGTTGAAGATTTATCACATGCTTACAAGAAACTGGATCAGGATTACCAAAAATTTCTATCAGAATGTGGACTGAGTAAATGGGGCTACT
GGCGTGGGGGTACCCGGAGAGACTTGAACAGGAATGAATAG
Protein sequenceShow/hide protein sequence
MAMANFNGFCDGSEAQPEKQSTPFFSPPQVASTPNAPFFNSQKLTQLALLLCFFPSNFYPRDWATTTFKNQVRAVTLRWRFQLQDISKDQLSAKHHYVHIVEGSESLTSI
SDQNGDPAHSIVIANKKIMDTDLEQKGQNIKIQNPRAITDVYQLEEKLQSALNGLRNYKKLSALTSSRLPPARTTSFIVLVPLIVFCARCIIGASYARVFETSRLETVDK
REGERHKFRNGHWRSALRDIRELDGLDCESPIDSTSPSEDEQISVEDLSHAYKKLDQDYQKFLSECGLSKWGYWRGGTRRDLNRNE