; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G017730 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G017730
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionO-fucosyltransferase family protein
Genome locationchr01:16696945..16701358
RNA-Seq ExpressionLsi01G017730
SyntenyLsi01G017730
Gene Ontology termsGO:0006004 - fucose metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144331.1 uncharacterized protein LOC101219097 [Cucumis sativus]2.1e-21787.7Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPKPRSPLIFFFV+L+AIAFLFLFSSLISTNG+SSFPSSNSIQKIFR KNLTQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDAA+K                               IKGLLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKT IRTFKEDDTDLSLTDDPKKN K WQIPVYT +E R
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

XP_008455718.1 PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo]6.9e-22188.84Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNLTQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++LPYKFLPSMAAENLRDAA+K                               IKGLLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

XP_022967807.1 uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima]6.5e-21184.81Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG +SSFPSSNSI++IFRFKNL QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+QV
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV

Query:  SRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYD
        SR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDA++K                               IK LLGDYD
Subjt:  SRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYD

Query:  AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMA
        AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQLFMIERLIMA
Subjt:  AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMA

Query:  GAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS
        GAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Subjt:  GAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS

XP_023545162.1 uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-21084.39Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG--ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG  +SSFPSSNSI++IFRFKNL QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG--ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ
        HQESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+Q
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ

Query:  VSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDY
        VSR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDA++K                               IK LLGDY
Subjt:  VSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDY

Query:  DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIM
        DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQLFMIERL+M
Subjt:  DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIM

Query:  AGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS
        AGAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Subjt:  AGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS

XP_038881641.1 uncharacterized protein LOC120073097 [Benincasa hispida]1.4e-22189.52Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPKPRSPLIFFFVALAAIAFLFLFSSL+STNGASSF SSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQR FVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRD+SRYS+LLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAEN+RDAA+K                               IK LLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNER PGFFSPLS RYKLAYS NYS ILDPVVKNNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKTFIRTFKEDDTDLSLTDDPKKN KIWQIPVYTADEER
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

TrEMBL top hitse value%identityAlignment
A0A0A0L0X3 Uncharacterized protein1.0e-21787.7Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPKPRSPLIFFFV+L+AIAFLFLFSSLISTNG+SSFPSSNSIQKIFR KNLTQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDAA+K                               IKGLLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKT IRTFKEDDTDLSLTDDPKKN K WQIPVYT +E R
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

A0A1S3C1H9 O-fucosyltransferase family protein3.3e-22188.84Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNLTQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++LPYKFLPSMAAENLRDAA+K                               IKGLLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

A0A5D3DWC1 O-fucosyltransferase family protein3.3e-22188.84Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNLTQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA
        R+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++LPYKFLPSMAAENLRDAA+K                               IKGLLGDYDA
Subjt:  RVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDA

Query:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
        IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG
Subjt:  IHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAG

Query:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
        AKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Subjt:  AKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

A0A6J1DKB9 O-fucosyltransferase family protein7.0e-21184.84Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASS--FPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MAFPR  K KPKPRSPL FFFVALAAIAFLFLFSSLISTNGASS  F SSNSIQKIFRF N+ +K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASS--FPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ
        HQESSLRCALEEAMFL+R FVMPSRMCINPIHNKKG+LHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVE+
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ

Query:  VSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDY
        VSR EL+D++RYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAA+K                               IKGLLGDY
Subjt:  VSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDY

Query:  DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIM
        DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDP+VKNNYQLFMIERLIM
Subjt:  DAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIM

Query:  AGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS
        AGAKTFIRTFKEDDTDLSLTDDPKKN KIWQ PVYT DEE+S
Subjt:  AGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS

A0A6J1HRU4 O-fucosyltransferase family protein3.1e-21184.81Show/hide
Query:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG +SSFPSSNSI++IFRFKNL QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+QV
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV

Query:  SRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYD
        SR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDA++K                               IK LLGDYD
Subjt:  SRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYD

Query:  AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMA
        AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQLFMIERLIMA
Subjt:  AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMA

Query:  GAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS
        GAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Subjt:  GAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04280.1 unknown protein1.7e-1522.87Show/hide
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR
        + C+ + H   S  CAL EA +L RT VM   +C++ I+   G         +EE  +         +D + + +   V LD ++ W Q      K   R
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR

Query:  AVAHVEQVSRVELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRN
           H+ +  RV     +   + L++ +  S  P +++    + +  S +  P+  L                          R+ +++V      S+I +
Subjt:  AVAHVEQVSRVELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRN

Query:  RIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD--------
        R+     DYDA+H+ RG+K +        ++ + P+L+ DT P  +L  +   V  GR L+IA+NE    FF+PL  +Y   +  +Y D+ D        
Subjt:  RIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD--------

Query:  --------PVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTD
                PV  + Y    ++  +    K  I TF +   D
Subjt:  --------PVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTD

AT2G41150.1 unknown protein3.8e-8459.92Show/hide
Query:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        + HK K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL
        RCALEEAMFL RTFVMPSRMCINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST MKL  R  AHV   +R EL
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLIL
         DSS ++NLLLINRTASPL+WF+ECKDR NRS ++LPY FL +MAA  LRDAA+K + L I+
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLIL

AT2G41150.2 unknown protein1.4e-14761.2Show/hide
Query:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        + HK K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL
        RCALEEAMFL RTFVMPSRMCINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST MKL  R  AHV   +R EL
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDAIHVR
         DSS ++NLLLINRTASPL+WF+ECKDR NRS ++LPY FL +MAA  LRDAA+K                               IK  LGDYDAIHVR
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDAIHVR

Query:  RGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTF
        RGDK+KTRKDRF V+RS  PHLDRDTRPEF++ RI K +P GRTLFI SNER P FFSPL+ RYK+AYSSN+S+ILDP+++NNYQLFM+ERLIM GAKTF
Subjt:  RGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTF

Query:  IRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADE
         +TF+E +TDL+LTDDPKKN K W+IPVYT DE
Subjt:  IRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADE

AT3G56750.1 unknown protein1.8e-14760.92Show/hide
Query:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        +  + KP   S  +  F  +   +FL LFSS+IST G    P   ++   F +        R +   S+++K+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL
        RCALEEAMFL RTFVMPS MCINPIHNKKG+L++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ VLST MKLG R +AHV  V+R  L
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDAIHVR
        ++ S YSNLL+INRTASPL+WF+ECKDR+NRSA++LPY FLP+MAA  LR+AA+K                               IK  LGDYDAIHVR
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDAIHVR

Query:  RGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTF
        RGDK+KTRKDRFGV+R   PHLDRDTRPEF+L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S+ILDP+++NNYQLFM+ERL+M GAKT+
Subjt:  RGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTF

Query:  IRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER
         +TFKE +TDL+LTDDPKKN K W+IPVYT DE R
Subjt:  IRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER

AT4G12700.1 unknown protein2.2e-1522.87Show/hide
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR
        + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   +  +      +E        L +   + D V    D  K WY+    G+KL   
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR

Query:  AVAHVEQVSRVELRDSSRYSNLLLINR--TASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRN
            V  +  V+++D+      L++ +  T  P +++    +    S +  P+  L                          ++ +++V      S+I +
Subjt:  AVAHVEQVSRVELRDSSRYSNLLLINR--TASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRN

Query:  RIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD--------
        R+     DYDAIH+ RGDK +        ++ + P+L++DT P  +L  +   +  GR L+IA+NE    FF+PL  +YK  +   + D+ D        
Subjt:  RIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD--------

Query:  --------PVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTD
                PV  + Y    ++  +    K  I TF +   D
Subjt:  --------PVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTTCCCAGAACCCACAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCCTCTTTTCCTCACTGAT
TTCTACCAATGGGGCTTCTTCTTTTCCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGACCCAGAAACAGAGACGTAATCGGCATGTTTTTAGTGTAA
ATGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAA
GCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAG
TTGGGAAGCAAACTCTTGTGCCATGGATTCTTTGTACGATATGGACCTTATATCTGATACCGTGCCAGTGATTTTAGACAACTCAAAATTGTGGTATCAGGTGCTGTCAA
CTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCC
AGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGAGATGCAGC
TGATAAGAGACGAAGTTTGCTTATACTGTTCATAAAAACATTGAGAAAAGATTTACAAGTTGTCTCTTTTTCTTTTTTGGATAGTTCTATACGTAACCGTATTAAAGGAC
TACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTCGACAGGGATACA
CGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCGCCCCTCTCTGCTCG
GTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTATTCATGATCGAAAGGCTCATTATGGCGGGTGCTAAGACAT
TCATCAGAACATTCAAAGAAGACGATACGGATCTAAGCCTCACCGACGACCCAAAGAAGAACATGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGC
TGA
mRNA sequenceShow/hide mRNA sequence
ATTGGACACATGGTGGTTAGGTGAGAGGCAAAACCACAAAAATCTTTTGCATCAAAATTTCTTCGATTCAAAGTTACATATCAATCAATTATTCCATTTATGATTAATTT
ACAATTCTCACTACTCACTGATACTAATGCTCGATCCAAGTTCTTGTCTTTCAGCGGATGCGATCGATTTCTGTAGCATTTCCTTGTTCGAGTGAATCGAATTTCAATCA
TTCTTGCGCTATACATTTCAATTTCTCTTCCACTCTGCAAATTCCATGGCATTTCCCAGAACCCACAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTCGT
TGCCCTCGCCGCCATTGCCTTTCTTTTCCTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCTTTTCCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGA
ATCTGACCCAGAAACAGAGACGTAATCGGCATGTTTTTAGTGTAAATGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAG
GGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAA
GAAAGGGCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCCATGGATTCTTTGTACGATATGGACCTTATATCTGATACCGTGC
CAGTGATTTTAGACAACTCAAAATTGTGGTATCAGGTGCTGTCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGA
GACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTA
TAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGAGATGCAGCTGATAAGAGACGAAGTTTGCTTATACTGTTCATAAAAACATTGAGAAAAGATTTACAAGTTGTCT
CTTTTTCTTTTTTGGATAGTTCTATACGTAACCGTATTAAAGGACTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGG
TTTGGTGTTGATAGAAGCTTACATCCGCATCTCGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGC
TTCAAATGAGAGAATTCCTGGATTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAATTATC
AGTTATTCATGATCGAAAGGCTCATTATGGCGGGTGCTAAGACATTCATCAGAACATTCAAAGAAGACGATACGGATCTAAGCCTCACCGACGACCCAAAGAAGAACATG
AAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGAGGAATTATTGCCCAGTTATTTCTCTGGGAAAATATCTTGGAGAGATCTTTTTGATCCACCAA
CAAAAGCCTTCCATTGTTCATTGAAAAGCTTCAGAGTGAGGATAAGAATGTTTTAGGGAAGGTAGATGCTTAGCCAATGTTGTAAATGTGTGGTATAAATTGTGATCCAG
GTTTATGAATTTTATTAGTTCCATTTGTTTAGTGCCCACACCATAAATTTGTGACCAGTTTTTTTTTGTTTATGTTCTCTTCTCTTTACAGAAAGTAGTTTACTACCATT
TTTTAAATCATGTTTTCTATTTTTAGAGAATGGTAATATGAATTTAGTGTGTACTCTGTAG
Protein sequenceShow/hide protein sequence
MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEE
AMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTA
SPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKRRSLLILFIKTLRKDLQVVSFSFLDSSIRNRIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS