CuGenDBv2

CSPI04G12260 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G12260
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: protein MODIFYING WALL LIGNIN-2-like
Genome location: Chr4:10626375..10628366
RNA-Seq Expression: CSPI04G12260
Synteny: CSPI04G12260
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
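
The genome location field above packs a chromosome name and two coordinates into one string. A minimal sketch of splitting it apart follows; the names `GenomeLocation` and `parse_location` are hypothetical helpers, and the assumption that both coordinates are 1-based and inclusive is not stated on this page.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chromosome: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Span length under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'Chr4:10626375..10628366'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


loc = parse_location("Chr4:10626375..10628366")
print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1992 bp on Chr4.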


Homology
GenBank top hits | e value | % identity | Alignment
XP_004150759.2 protein MODIFYING WALL LIGNIN-1 isoform X2 [Cucumis sativus] | 1.3e-125 | 100
Query:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
        MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
Subjt:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK

Query:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS
        LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS
Subjt:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS

Query:  AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
Subjt:  AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

XP_008444240.1 PREDICTED: uncharacterized protein LOC103487629 [Cucumis melo] | 8.8e-85 | 89.73
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME PSSSSFVISFS+VAILTLASFASC+AAEFNRTKKEDLKLN K CFLPESEAFKLGIGGL+CLIMAQIIG+TLI HSYWPKEHRKSCSVKKPLLSIAL
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        LISWVSFVIAVIM+SGATSMSRRQEYA+GWVEGECYLVKDGIFV AA+LVLINGGSTI SAAIG R  R NHV+K PNQIHAQIG
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

XP_023535227.1 uncharacterized protein LOC111796718 [Cucurbita pepo subsp. pepo] | 9.1e-74 | 78.92
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME P  SSFVISFS+VA+LTLASFASCMAAEFNRTKK+DLKLN + CFLPESEAFKLG+ G++CLIMA IIG T+ICH+YWPKEHRKSCSVK+PLLS  L
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        LISWVSF IAV M+ GATSMSRRQEY KGWVEGECYLVKDG+FV AA+LVLINGGSTI SAAIG RR       K PNQ+HAQIG
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

XP_031740353.1 protein MODIFYING WALL LIGNIN-1 isoform X1 [Cucumis sativus] | 1.1e-122 | 99.17
Query:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
        MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
Subjt:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK

Query:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISW-VSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFV
        LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLIS  VSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFV
Subjt:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISW-VSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFV

Query:  SAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        SAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
Subjt:  SAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

XP_038896171.1 protein MODIFYING WALL LIGNIN-1 [Benincasa hispida] | 6.5e-80 | 84.32
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME P S  FVISFS+VA+LT+ASFASCMAAEFNRTKKEDLKLN +LCFLPESEAFKLG+GGL+CLIMAQIIG  +ICHSYWPKEHRKSCSVK+P+LSIAL
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        LISWVSF IAV+M+SGATSMSRRQEY KGWVEGECY+VKDGIFV AAVLVLINGGSTI+SAAIG    RTNHV K PNQIHAQIG
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L099 Uncharacterized protein | 6.5e-126 | 100
Query:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
        MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK
Subjt:  MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK

Query:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS
        LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS
Subjt:  LCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVS

Query:  AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
Subjt:  AAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

A0A1S3B9G0 uncharacterized protein LOC103487629 | 4.2e-85 | 89.73
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME PSSSSFVISFS+VAILTLASFASC+AAEFNRTKKEDLKLN K CFLPESEAFKLGIGGL+CLIMAQIIG+TLI HSYWPKEHRKSCSVKKPLLSIAL
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        LISWVSFVIAVIM+SGATSMSRRQEYA+GWVEGECYLVKDGIFV AA+LVLINGGSTI SAAIG R  R NHV+K PNQIHAQIG
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

A0A6J1CC41 uncharacterized protein LOC111010123 isoform X1 | 3.9e-70 | 78.21
Query:  SSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVS
        S F ISFS+VA LTL SFASCMAAEFNRTKK+DLKL+ + CFLPESEAFKLG+  L+CL+MAQIIG T+ICHSYWPKE RKSCSVK+PLLS  LLISWVS
Subjt:  SSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVS

Query:  FVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG
        F IAV M+SGATSMSRRQEY KGWVEGECY+VKDGIFV AA+LVLINGGSTI SAAIG    R +HV   P+QIHAQIG
Subjt:  FVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQIG

A0A6J1F9Y8 uncharacterized protein LOC111443438 | 2.2e-73 | 78.8
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME P  SSFVISFS+VA+LTLASFASCMAAEFNRTKK+DLKLN + CFLPESEAFKLG+ G++CLIMA IIG T+ICH+YWPKEHRKSCSVK+PLLS  L
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQI
        LISWVSF IAV M+ GATSMSRRQEY KGWVEGECYLVKDG+FV AA+LVLINGGSTI SAAIG RR       K PNQ+HAQI
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQI

A0A6J1II23 uncharacterized protein LOC111477105 | 4.1e-72 | 78.26
Query:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL
        ME+P  SSFVISFS+VA+LTLASFASCMAAEFNRTKK+DLKLN + CFLPESEAFKLG+ G++CLIMA IIG T+ICH+YWPKEHRKSCSVK+PLL+  L
Subjt:  MESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIAL

Query:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQI
        LISWVSF IAV M+ GATSMSRRQEY KGWVEGECYLVKDG+FV AA+LVLINGGSTI SAAIG RR       K PNQ+HAQI
Subjt:  LISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMRRWRTNHVIKPPNQIHAQI

SwissProt top hits | e value | % identity | Alignment
A2RVU1 Protein MODIFYING WALL LIGNIN-1 | 7.6e-23 | 35.4
Query:  VAILTLASFASCMAAEFNRTKK----------EDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWV
        + +  LA+F  C++AEF + K           +DLK + + C+LPE+ AF LGI  L+C+ +AQI+G  +IC  +   +  ++         I LL SWV
Subjt:  VAILTLASFASCMAAEFNRTKK----------EDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWV

Query:  SFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMR
        +F +AV ++S   SM+R Q Y KGW+  ECYLVKDG+F ++  L +    + + + A  ++
Subjt:  SFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMR

O65708 Protein MODIFYING WALL LIGNIN-2 | 1.9e-26 | 38.73
Query:  FSVVAILTLASFASCMAAEFNRTKKEDLKLNA-KLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKS--CSVKKPLLSIALLISWVSFVI
        +SVV  L L SF +C AAEF RT+KED++ +  + C++P S AF LG   +LC  +AQI+G  ++  ++  +  R+         L ++ LL+SW +FV+
Subjt:  FSVVAILTLASFASCMAAEFNRTKKEDLKLNA-KLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKS--CSVKKPLLSIALLISWVSFVI

Query:  AVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLIN-GGSTIASAAIGMRRWR--TNHVIKPPNQ
         V+++S A SMSR Q Y +GW++ +CYLVKDG+F ++  L ++  G  TI++  I +++ +     VIK  NQ
Subjt:  AVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLIN-GGSTIASAAIGMRRWR--TNHVIKPPNQ

Arabidopsis top hits | e value | % identity | Alignment
AT1G31720.1 Protein of unknown function (DUF1218) | 2.4e-24 | 35.37
Query:  FSVVAILTLASFASCMAAEFNRTKK----------EDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLI
        F  + +  LA+F  C++AEF + K           +DLK + + C+LPE+ AF LGI  L+C+ +AQI+G  +IC  +   +  ++         I LL 
Subjt:  FSVVAILTLASFASCMAAEFNRTKK----------EDLKLNAKLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLI

Query:  SWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMR
        SWV+F +AV ++S   SM+R Q Y KGW+  ECYLVKDG+F ++  L +    + + + A  ++
Subjt:  SWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGMR

AT4G19370.1 Protein of unknown function (DUF1218) | 1.4e-27 | 38.73
Query:  FSVVAILTLASFASCMAAEFNRTKKEDLKLNA-KLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKS--CSVKKPLLSIALLISWVSFVI
        +SVV  L L SF +C AAEF RT+KED++ +  + C++P S AF LG   +LC  +AQI+G  ++  ++  +  R+         L ++ LL+SW +FV+
Subjt:  FSVVAILTLASFASCMAAEFNRTKKEDLKLNA-KLCFLPESEAFKLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKS--CSVKKPLLSIALLISWVSFVI

Query:  AVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLIN-GGSTIASAAIGMRRWR--TNHVIKPPNQ
         V+++S A SMSR Q Y +GW++ +CYLVKDG+F ++  L ++  G  TI++  I +++ +     VIK  NQ
Subjt:  AVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLIN-GGSTIASAAIGMRRWR--TNHVIKPPNQ

AT5G17210.1 Protein of unknown function (DUF1218) | 8.4e-09 | 30.46
Query:  VISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK----LCFLPESEAFKLGIGGLLCLIMAQII---GTTLICHSYWPKEHRKSCSVKKPLLSIALLI
        ++   V+ +L L S  +   AE  R K+  + +        C  P S AF LG    L L+MAQII    +   C    P   R +  +      I  ++
Subjt:  VISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAK----LCFLPESEAFKLGIGGLLCLIMAQII---GTTLICHSYWPKEHRKSCSVKKPLLSIALLI

Query:  SWVSFVIA-VIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLI
        SW +FVIA ++++SGA       E +       CY+VK G+F + AVL L+
Subjt:  SWVSFVIA-VIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLI

AT5G17210.2 Protein of unknown function (DUF1218) | 1.6e-07 | 34.55
Query:  CFLPESEAFKLGIGGLLCLIMAQII---GTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIA-VIMVSGATSMSRRQEYAKGWVEGECYLVKDGI
        C  P S AF LG    L L+MAQII    +   C    P   R +  +      I  ++SW +FVIA ++++SGA       E +       CY+VK G+
Subjt:  CFLPESEAFKLGIGGLLCLIMAQII---GTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIA-VIMVSGATSMSRRQEYAKGWVEGECYLVKDGI

Query:  FVSAAVLVLI
        F + AVL L+
Subjt:  FVSAAVLVLI
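
The unlabelled middle row of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of recomputing percent identity from that row follows. The function name `midline_stats` is hypothetical, and the alignment length is taken from the query row because trailing spaces in the middle row may have been lost when the page was extracted.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identities, alignment_length, percent_identity).

    `query` is the aligned query row (gaps included) and `midline` is the
    unlabelled row beneath it. Letters in the midline count as identities;
    '+' and spaces do not.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment row")
    # Restore any trailing spaces that were stripped from the midline.
    padded = midline.ljust(length)[:length]
    identities = sum(1 for ch in padded if ch.isalpha())
    return identities, length, round(100.0 * identities / length, 2)


# First 20 columns of the XP_008444240.1 (Cucumis melo) alignment.
query_segment = "MESPSSSSFVISFSVVAILT"
midline_segment = "ME PSSSSFVISFS+VAILT"
print(midline_stats(query_segment, midline_segment))  # (18, 20, 90.0)
```

To reproduce a full-hit value from the tables, concatenate the rows of every block belonging to that hit before calling the function. BLAST reports identity over the full alignment length, gap columns included.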


Sequences
CDS sequence
ATGAAGATGAAGTCTAAGATTTCACAAGTCAAGAGAGAAAATCATGAATTTTCCACCACTAACCTCCAACATTCATTCCTTTATTTCCAAAACCATGATTTAAACACTTC
TTCTTCTTCTTCTTCTTCTTCTTCTTCCTTCCGTCTTCTTCTTCTTCAACCCGCCATGGAAAGCCCCTCTTCTTCTAGCTTTGTAATAAGCTTTTCCGTTGTTGCCATCC
TCACACTCGCCTCTTTCGCATCATGTATGGCGGCTGAATTCAACAGGACGAAAAAAGAGGACCTGAAATTGAATGCAAAATTATGCTTTCTGCCTGAAAGTGAAGCATTC
AAATTGGGAATTGGAGGTTTGCTTTGTTTAATAATGGCTCAGATCATTGGAACTACCTTAATCTGCCATAGCTATTGGCCTAAAGAGCATAGAAAAAGCTGCAGTGTCAA
AAAACCTCTGCTTTCCATCGCCCTTCTCATCTCTTGGGTTAGTTTCGTAATAGCGGTGATAATGGTGAGTGGAGCAACAAGCATGAGCAGGAGACAAGAGTACGCGAAGG
GATGGGTGGAGGGAGAATGCTATTTGGTCAAAGACGGAATCTTTGTATCCGCCGCCGTATTGGTTCTCATTAACGGAGGCTCCACCATCGCCTCCGCGGCCATTGGGATG
AGGAGGTGGAGGACCAACCATGTTATTAAACCACCCAATCAAATACATGCTCAGATTGGCTAA
mRNA sequence
AAGAATTGTAAATTTGAATCTCTATTTCTCTCTATCTATTTCACCACTGATGAAGATGAAGTCTAAGATTTCACAAGTCAAGAGAGAAAATCATGAATTTTCCACCACTA
ACCTCCAACATTCATTCCTTTATTTCCAAAACCATGATTTAAACACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTCCGTCTTCTTCTTCTTCAACCCGCCATGGAA
AGCCCCTCTTCTTCTAGCTTTGTAATAAGCTTTTCCGTTGTTGCCATCCTCACACTCGCCTCTTTCGCATCATGTATGGCGGCTGAATTCAACAGGACGAAAAAAGAGGA
CCTGAAATTGAATGCAAAATTATGCTTTCTGCCTGAAAGTGAAGCATTCAAATTGGGAATTGGAGGTTTGCTTTGTTTAATAATGGCTCAGATCATTGGAACTACCTTAA
TCTGCCATAGCTATTGGCCTAAAGAGCATAGAAAAAGCTGCAGTGTCAAAAAACCTCTGCTTTCCATCGCCCTTCTCATCTCTTGGGTTAGTTTCGTAATAGCGGTGATA
ATGGTGAGTGGAGCAACAAGCATGAGCAGGAGACAAGAGTACGCGAAGGGATGGGTGGAGGGAGAATGCTATTTGGTCAAAGACGGAATCTTTGTATCCGCCGCCGTATT
GGTTCTCATTAACGGAGGCTCCACCATCGCCTCCGCGGCCATTGGGATGAGGAGGTGGAGGACCAACCATGTTATTAAACCACCCAATCAAATACATGCTCAGATTGGCT
AACAAAAACAATATTACACACCTCATCAATACATTTTGTTTTTGTATTTATTTCATTGTTGGAATGATGAACAATGAACAAAAAAGATGTCACCACATTCCCATTTATAG
TTTTTCATTTTTTTTCTATTTTCTTCTTTCTTCTCCTAGTTTGTATAGTGCCCCAACAAAAGAAAATACCAATACTTTACAATCTTTCACATCA
Protein sequence
MKMKSKISQVKRENHEFSTTNLQHSFLYFQNHDLNTSSSSSSSSSSFRLLLLQPAMESPSSSSFVISFSVVAILTLASFASCMAAEFNRTKKEDLKLNAKLCFLPESEAF
KLGIGGLLCLIMAQIIGTTLICHSYWPKEHRKSCSVKKPLLSIALLISWVSFVIAVIMVSGATSMSRRQEYAKGWVEGECYLVKDGIFVSAAVLVLINGGSTIASAAIGM
RRWRTNHVIKPPNQIHAQIG
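
The protein sequence above should be the conceptual translation of the CDS sequence. A minimal sketch of checking that follows, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    A trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGAAGATGAAGTCTAAGATTTCACAAGTC"  # first 30 nt of the CDS
cds_end = "CATGCTCAGATTGGCTAA"                # last 18 nt of the CDS
print(translate(cds_start))  # MKMKSKISQV
print(translate(cds_end))    # HAQIG
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence extends the same check to the whole record.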