; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G028510 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G028510
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionO-fucosyltransferase family protein
Genome locationchr02:34688236..34691318
RNA-Seq ExpressionLsi02G028510
SyntenyLsi02G028510
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR018786 - Protein of unknown function DUF2343


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059352.1 hypothetical protein E6C27_scaffold242G00780 [Cucumis melo var. makuwa]1.9e-11180.87Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI---SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGI
        MRARLVVFPIRGRNWCFSRSIDP  SDSASAQTPSTFKDLWTKI   SSSSSSKS ALS  N S+NAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGI
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI---SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGI

Query:  GLKLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKL
        GLKLLSRVKPSEIFLKSITKDVTSVE TYPSSLNPRLVRRRLRHIA RGT+IHRKYFYGS+S+LP+ SAFT                        GSEKL
Subjt:  GLKLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKL

Query:  LQLVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        LQLVSDRSYP NSSSD KK E KVQQY G AL++QPSKELDKFLSQMEASGDITAI+DICKMFDLNI NVLKYKD L
Subjt:  LQLVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

XP_008462234.1 PREDICTED: uncharacterized protein LOC103500639 [Cucumis melo]8.7e-11281.45Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI-SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGL
        MRARLVVFPIRGRNWCFSRSIDP  SDSASAQTPSTFKDLWTKI SSSSSSKS ALS  N S+NAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGIGL
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI-SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGL

Query:  KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQ
        KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIA RGT+IHRKYFYGS+S+ P+ SAFT                        GSEKLLQ
Subjt:  KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQ

Query:  LVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        LVSDRSYP NSSSD KK E KVQQY G AL++QPSKELDKFLSQMEASGDITAI+DICKMFDLNI NVLKYKD L
Subjt:  LVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

XP_031744510.1 uncharacterized protein C23H3.12c [Cucumis sativus]1.5e-11180.29Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRSI+P  SDS+SAQTPSTFKDLWTKISSSSSSKSDALS  N S+NAEIVTDFIS KMNKAWTALEKAP+GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL
        LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIA RGT+IHRKYFY S+S+LP+ SAFT                        GSEKLLQL
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        VSDRSYP NSSSD KK E K+QQY G AL++QPS++LDKFLSQMEASGDITAIKDICKMFDLNI NVLKYKD L
Subjt:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

XP_038896482.1 uncharacterized protein LOC120084733 isoform X1 [Benincasa hispida]1.9e-11985.77Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDA     VSSNAEIVTDFISFKMNKAWTALEKAP GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL
        LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRG  IHRKYFYGS+SMLP+TSAFT                        GSEKLLQL
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        VSDRSYPC+SSSDDKKTEHKVQQYPGSAL+++PSKELDKFLSQMEASGDITAIKDICKMFDLN+TNVLKYKDTL
Subjt:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

XP_038896483.1 uncharacterized protein LOC120084733 isoform X2 [Benincasa hispida]1.7e-12394Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDA     VSSNAEIVTDFISFKMNKAWTALEKAP GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFTGSEKLLQLVSDRSYPCNSSSDDKKTEHKVQQY
        LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRG  IHRKYFYGS+SMLP+TSAFTGSEKLLQLVSDRSYPC+SSSDDKKTEHKVQQY
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFTGSEKLLQLVSDRSYPCNSSSDDKKTEHKVQQY

Query:  PGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        PGSAL+++PSKELDKFLSQMEASGDITAIKDICKMFDLN+TNVLKYKDTL
Subjt:  PGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

TrEMBL top hitse value%identityAlignment
A0A0A0K9A3 Uncharacterized protein7.2e-11280.29Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRSI+P  SDS+SAQTPSTFKDLWTKISSSSSSKSDALS  N S+NAEIVTDFIS KMNKAWTALEKAP+GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL
        LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIA RGT+IHRKYFY S+S+LP+ SAFT                        GSEKLLQL
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        VSDRSYP NSSSD KK E K+QQY G AL++QPS++LDKFLSQMEASGDITAIKDICKMFDLNI NVLKYKD L
Subjt:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

A0A1S3CGF0 uncharacterized protein LOC1035006394.2e-11281.45Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI-SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGL
        MRARLVVFPIRGRNWCFSRSIDP  SDSASAQTPSTFKDLWTKI SSSSSSKS ALS  N S+NAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGIGL
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI-SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGL

Query:  KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQ
        KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIA RGT+IHRKYFYGS+S+ P+ SAFT                        GSEKLLQ
Subjt:  KLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQ

Query:  LVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        LVSDRSYP NSSSD KK E KVQQY G AL++QPSKELDKFLSQMEASGDITAI+DICKMFDLNI NVLKYKD L
Subjt:  LVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

A0A5D3BYB6 Uncharacterized protein9.4e-11280.87Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI---SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGI
        MRARLVVFPIRGRNWCFSRSIDP  SDSASAQTPSTFKDLWTKI   SSSSSSKS ALS  N S+NAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGI
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKI---SSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGI

Query:  GLKLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKL
        GLKLLSRVKPSEIFLKSITKDVTSVE TYPSSLNPRLVRRRLRHIA RGT+IHRKYFYGS+S+LP+ SAFT                        GSEKL
Subjt:  GLKLLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKL

Query:  LQLVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL
        LQLVSDRSYP NSSSD KK E KVQQY G AL++QPSKELDKFLSQMEASGDITAI+DICKMFDLNI NVLKYKD L
Subjt:  LQLVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDITAIKDICKMFDLNITNVLKYKDTL

A0A6J1HLD1 uncharacterized protein LOC1114645841.1e-10778.18Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRS+DPAASDSAS QTPST KDLWTKISS SSSKSDALSS+  SSNAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL
        LLSRVKPSEIFLKSITKDVTSVEI YPSSLNPRLVRRRLRHIALRGT IH+K+FYGS+S+LP+ SAFT                        GSE LLQL
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDI-TAIKDICKMFDLNITNVLKYKDTL
        VSDRSY CNSS+D  KT + VQQ+PGS L+LQPSKELDKFLSQME S DI T IKDICK+FDLN+ NVLKYKD +
Subjt:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDI-TAIKDICKMFDLNITNVLKYKDTL

A0A6J1KLS4 uncharacterized protein LOC1114944211.7e-10878.55Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPIRGRNWCFSRS+DPAASDSAS QTPST KDLWTKISS SSSKSDALSS+  SSNAEIVTDFISFKMNKAWTALEKAP+GSFKNKLHGIGLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL
        LLSRVKPSEIFLKSITKDVTSVEI YPSSLNPRLVRRRLRHIALRGT IH+K+FYGS+S+LP+ SAFT                        GSE LLQL
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFT------------------------GSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDI-TAIKDICKMFDLNITNVLKYKDTL
        VSDRSY CNSS+D  KT + VQQ+PGS L+LQPSKELDKFLSQME SGDI T IKDICK+FDLN+ NVLKYKD +
Subjt:  VSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQMEASGDI-TAIKDICKMFDLNITNVLKYKDTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53760.1 unknown protein1.2e-6650.9Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPI+G+ WCFSRS+DP A+ S S  TP+T + LW KISS S           +++NAE++ DFIS KMNKAW  LEKAP+GS KNK+HG GLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAF------------------------TGSEKLLQL
        LL+RVKPSEIFLKSI+K+VTSV++TYP SL+PRLVRRRLRHIA+ GT++H+KY  GS+++LP+TSAF                         GSEKLL+L
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAF------------------------TGSEKLLQL

Query:  VSDRSYPCNSSSDDKKTEHK------VQQYPGSALELQPSKELDKFLSQMEASG-DITAIKDICKMFDLNITNVLKYKD
        +S+ + P    S D   E K       Q+       L PS+EL + + +    G D   I +ICK FDLN  +VLKY++
Subjt:  VSDRSYPCNSSSDDKKTEHK------VQQYPGSALELQPSKELDKFLSQMEASG-DITAIKDICKMFDLNITNVLKYKD

AT1G53760.2 unknown protein7.7e-5865.27Show/hide
Query:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK
        MRARLVVFPI+G+ WCFSRS+DP A+ S S  TP+T + LW KISS S           +++NAE++ DFIS KMNKAW  LEKAP+GS KNK+HG GLK
Subjt:  MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLK

Query:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAF
        LL+RVKPSEIFLKSI+K+VTSV++TYP SL+PRLVRRRLRHIA+ GT++H+KY  GS+++LP+TSAF
Subjt:  LLSRVKPSEIFLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCCAGATTGGTAGTGTTTCCAATCAGAGGAAGAAATTGGTGTTTCAGCAGATCCATCGACCCGGCCGCTTCGGATTCTGCTTCTGCTCAGACTCCTTCCACTTT
CAAAGATCTCTGGACCAAAATCTCTTCGTCTTCTTCCTCTAAATCAGACGCTCTTTCGAGCACTAACGTTAGTAGCAATGCGGAGATTGTGACCGATTTCATCTCTTTCA
AGATGAATAAAGCTTGGACTGCTCTTGAGAAGGCTCCTGAAGGATCGTTTAAAAATAAGCTTCACGGGATCGGATTGAAGCTTTTATCTCGAGTTAAGCCGTCTGAGATA
TTCTTGAAGTCTATAACTAAAGATGTTACGAGCGTCGAAATAACATATCCATCGAGTTTGAATCCACGGCTCGTTCGCAGGAGGCTACGGCATATTGCCCTCAGGGGAAC
TCTCATCCACAGGAAATACTTCTATGGTTCAATCTCGATGCTTCCAATAACAAGTGCATTTACTGGAAGTGAAAAGCTCCTTCAGTTGGTCTCAGATAGATCTTACCCAT
GCAACTCATCCTCTGATGACAAGAAAACCGAGCACAAAGTCCAACAGTATCCAGGTTCAGCACTGGAGCTGCAGCCATCGAAGGAACTTGACAAATTTCTGAGCCAAATG
GAGGCATCTGGTGATATAACTGCAATTAAGGATATCTGCAAGATGTTTGATTTAAACATAACTAATGTTTTGAAGTACAAAGATACTTTGTGA
mRNA sequenceShow/hide mRNA sequence
TCAAAGTGAAAGTGATGCGGCGGAATTTTTCTCCATCTGCGTTCTTGCATTACTCGCAACAGCACGTATTTAACTCTCATTTCCTTCACTTCTGAGTTCTCTCAATATTT
TCTCACATTCGGTTCAGAAAATCCAATTTGCTTAGGGGATTAGGGTTTATGTGAGAGAAGGAGGAGTTCGTGTACGAACGATAAGGCATTTTCTCCATTGGAATTCTGGA
ACGATGAGAGCCAGATTGGTAGTGTTTCCAATCAGAGGAAGAAATTGGTGTTTCAGCAGATCCATCGACCCGGCCGCTTCGGATTCTGCTTCTGCTCAGACTCCTTCCAC
TTTCAAAGATCTCTGGACCAAAATCTCTTCGTCTTCTTCCTCTAAATCAGACGCTCTTTCGAGCACTAACGTTAGTAGCAATGCGGAGATTGTGACCGATTTCATCTCTT
TCAAGATGAATAAAGCTTGGACTGCTCTTGAGAAGGCTCCTGAAGGATCGTTTAAAAATAAGCTTCACGGGATCGGATTGAAGCTTTTATCTCGAGTTAAGCCGTCTGAG
ATATTCTTGAAGTCTATAACTAAAGATGTTACGAGCGTCGAAATAACATATCCATCGAGTTTGAATCCACGGCTCGTTCGCAGGAGGCTACGGCATATTGCCCTCAGGGG
AACTCTCATCCACAGGAAATACTTCTATGGTTCAATCTCGATGCTTCCAATAACAAGTGCATTTACTGGAAGTGAAAAGCTCCTTCAGTTGGTCTCAGATAGATCTTACC
CATGCAACTCATCCTCTGATGACAAGAAAACCGAGCACAAAGTCCAACAGTATCCAGGTTCAGCACTGGAGCTGCAGCCATCGAAGGAACTTGACAAATTTCTGAGCCAA
ATGGAGGCATCTGGTGATATAACTGCAATTAAGGATATCTGCAAGATGTTTGATTTAAACATAACTAATGTTTTGAAGTACAAAGATACTTTGTGATCAGTGGTCCCATC
TTCAGATGGGGGGAAAGAAGGTGCAACATTCAATCAAAGGAGATGCTCATTGATATTCATGGGGTGAAAGATGATCGACCAAAATCATTGAAGATGAAACATGAGGGCCA
CCACATACGCTTGGACTGGTGGGTGGCTTTAATGTGTTGGCTAGTGTGATCTTATCCATTCCCAGGGACAAGAATCGGATCCGTAGAATACAACTGAAATGTCGTTGGCA
CCAGATCCCATACTGGTCAGTAACTAGCTTTCTTTGCTTATAAAAACTAAGTCTTTTATTTTATTTTTGCTTTCGTGTTTGTAATTTAATAGCACCTTATTTTGTTCATT
TTATTATAAAATATTAAACTATTTGATCT
Protein sequenceShow/hide protein sequence
MRARLVVFPIRGRNWCFSRSIDPAASDSASAQTPSTFKDLWTKISSSSSSKSDALSSTNVSSNAEIVTDFISFKMNKAWTALEKAPEGSFKNKLHGIGLKLLSRVKPSEI
FLKSITKDVTSVEITYPSSLNPRLVRRRLRHIALRGTLIHRKYFYGSISMLPITSAFTGSEKLLQLVSDRSYPCNSSSDDKKTEHKVQQYPGSALELQPSKELDKFLSQM
EASGDITAIKDICKMFDLNITNVLKYKDTL