; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022595 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022595
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionalpha-(1,4)-fucosyltransferase
Genome locationtig00000289:1399662..1409531
RNA-Seq ExpressionSgr022595
SyntenySgr022595
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0007165 - signal transduction (biological process)
GO:0036065 - fucosylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032580 - Golgi cisterna membrane (cellular component)
GO:0008417 - fucosyltransferase activity (molecular function)
InterPro domainsIPR001503 - Glycosyl transferase family 10
IPR008862 - T-complex 11
IPR038577 - GT10-like, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016381.1 T-complex protein 11-like protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0069.21Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEE+SERKG V LYFPANDDE S  SSSTPP+LP RL RRLMESKAPSTAEEIEAKL+KADLRRQAKRQRAGY M+RRRT DIVRANMKGM K+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        + IIARCWR+FVQ RKTTF LAKAFQALDIT+ESVKSMEFEQLASKINATATIQTV+ALLVRLESR++ILRTTSG+KLSME VDHLLKRVG HGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K  RS+TIG RKAAK PSKLSRYPAKVVLFAYMILGHP+TVFIGKSEFENALLESA NF+QEFELL+KI+LEGPL T+ EEQ S+ P   RSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLE FMAQTSKL LEGD+ NI HD Q SEEQ IL+E LQQLG+ ENS SVAGSSS E D+  +P  R  E+
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI
        SK    + S  +L +     + N +  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD
        AEKIDIEI+ QILNSGT DVGYFK+LLDF+LVTLQKLS+PAKEKEMEASYQKLMEELGDVSCSGE SK  FALLMV+GLRF+LHQIQ   + + ++NA  
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD

Query:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH
          ++   +  AG+E                                       ++TDS+A AI+KN GVQPE LPSTIRTGGSSLIPSKI S + GTS H
Subjt:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH

Query:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV
        GKEQPECKGERLDLLIRLGLLKLVNQI+GLSSDTLPETLKLNLARL  VQSRLQRIIVISTSLLV+RQILLNE LVSNPSE DSILSTCAKRLC+LLD V
Subjt:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV

Query:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV
        EN GILEIVEAL +VL D DS                +KS ++                        SRKGRQL EA+LLPIGAG+LT KVVEAAE L+V
Subjt:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV

Query:  MAVVSVIVHGDWYRELIKN
        MAVVSVIVHGDWYRELIKN
Subjt:  MAVVSVIVHGDWYRELIKN

XP_022133963.1 uncharacterized protein LOC111006371 [Momordica charantia]0.0e+0070.98Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEEE+ERKG V LYFPAND EASS SSSTPPRLP RLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRA YL+QRRR CDI R N KGMPK+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        STIIAR WRNFV  RKTTF LAKAFQALD++ ESVKSMEFE+LASKINATATIQT+RALLVRL SRYTIL TTSG+KLS E+VDHLLKRVGLHGRS NQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLRE--EQQSSAPPLFRSQLEIF
        TK SRS TIG RKAAK P KLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNF+QEFELL KI+LEGPL TL E  +QQS APP FRSQLEIF
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLRE--EQQSSAPPLFRSQLEIF

Query:  DKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQA
        DKRWCSYLHH++ WKDKDAIFFE+NM GVA QLELFMAQTSKLTLEGD+GNI HDMQASEEQ I RE LQ+LG+ ENSLS++GSSSSE D V +PESRQA
Subjt:  DKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQA

Query:  ESSKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRS
        ESSKS    +S   + +     + N++  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRS
Subjt:  ESSKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRS

Query:  EIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNA
        EIAEKIDIEIL QILNSGTL VGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQKLMEELGDV+ SGEKS  SFALLMV+GLRFVLHQIQ+    + ++NA
Subjt:  EIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNA

Query:  ------------------RDDGIKRFG-------------------RCGAGME--DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSL
                          R    KR G                   R  A +E  ++TDSVAAIT+N G QP+ILPSTIR GGSSLIPSKISS+ LGT  
Subjt:  ------------------RDDGIKRFG-------------------RCGAGME--DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSL

Query:  HGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDT
        HGKEQPECKGERLDLLIRLGLLKLV QIRGL+SDTLPETLKLNLARL IVQSRLQRIIVISTSLLV+RQILL+ESLVSNPSE DSILSTCAKRLCDLLD+
Subjt:  HGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDT

Query:  VENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLV
        VENAGILEI+EAL  VL DRDS                +KS ++                          KGRQL EAALL IGAGAL DKVVEAAEVLV
Subjt:  VENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLV

Query:  VMAVVSVIVHGDWYRELIKN
        VMAVV+ IVHGDWY ELIKN
Subjt:  VMAVVSVIVHGDWYRELIKN

XP_022992643.1 uncharacterized protein LOC111488930 [Cucurbita maxima]0.0e+0069.68Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEE+SERKG V LYFPANDDE S  SSSTPP+LP RL RRLMESKAPSTAEEIEAKL+KADLRRQAKRQRAGY M+RRRT DIVRANMKGMPK+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        + IIARCWR+FVQ RKTTF LAKAFQALDIT+ESVKSMEFEQLASKINATATIQTV+ALLVRLESR++ILRTTSG+KLSME VDHLLKRVG HGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K  RS+TIG  KAAK P KLSRY  KVVLFAYMILGHP+TVFIGKSEFENALLESASNF+QEFELL+KI+LEGPL T+ EE QSS P   RSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLE FMAQTSKL LEGD+ NI HD Q SEEQ ILRE LQQLG+ ENS SVAGSSS E D+  +P  R  ES
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI
        SK    + S  +L +     + N +  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNAR-
        AEKIDIEI+ QILNSGT DVGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQ+LMEELGDVS SGE SK  FALLMVKGLRF+LHQIQ+  +    ++ R 
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNAR-

Query:  -------DDGIKRF-----GRCGAGME------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGK
                 G++        RCG+  +                        ++TDSVA AI+K+ GVQPE LPSTIRTGGSSLIPSKI S + GTS HGK
Subjt:  -------DDGIKRF-----GRCGAGME------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGK

Query:  EQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVEN
        EQPECKGERLDLLIRLGLLKLVNQI+GLSSDTLPETLKLNLARL  VQSRLQRIIVISTSLLV+RQILLNE LVSNPSE DSILSTCAK LC+LLD VEN
Subjt:  EQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVEN

Query:  AGILEIVEALSMVLGDRDS---TLKSFKQ-----------------------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMA
         GILEIVEAL +VL D DS    L++ KQ                                    SRKGRQL EAALLPIGAGALT KVVEAAE L+VMA
Subjt:  AGILEIVEALSMVLGDRDS---TLKSFKQ-----------------------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMA

Query:  VVSVIVHGDWYRELIKN
        VVSVIVHGDWYRELIKN
Subjt:  VVSVIVHGDWYRELIKN

XP_023550419.1 uncharacterized protein LOC111808569 [Cucurbita pepo subsp. pepo]0.0e+0069.42Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEE+SERKG V LYFPANDDE S  SSSTPP+LP RL RRLMESKAPSTAEEIEAKL+KADLRRQAKRQRAGY M+RRRT DIVRANMKGM K+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        + IIARCWR+FVQ RKTTF LAKAFQALDIT+ESVKSMEFEQLASKINATATIQTV+ALLVRLES ++ILRTTSG+KLSME VDHLLKRVG HGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K  RS+TIG RKAAK PSKLSRYPAKVVLFAYMILGHP+TV IGKSEFENALLESASNF+QEFELL+KI+LEGPL T+ EE QSS P   RSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLE FMAQTSKL LEGD+ NI HD Q SEEQ I RE LQQLG+LENS SVAGSSS ES++  +P  R  ES
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI
        SK    + S  +L +     + N +  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD
        AEKIDIEI+ QILNSGT DVGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQKLMEELGDVSC GE SK  FALLMVKGLRF+LHQIQ+    + ++NA  
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD

Query:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH
          ++   +  AG+E                                       ++TDSVA AI+KN GVQPE LPSTIRTGGSSLIPSKI S + GTS H
Subjt:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH

Query:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV
        GKEQPECKGERLDLLIRLGLLKLVNQI+GLSSDTLPETLKLNLARL  VQSRLQRIIVISTSLLV+RQILLNE LVSNPSE D+ILSTCAKRLC+LLD V
Subjt:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV

Query:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV
        EN GILEIVEAL +VL D DS                +KS ++                         RKGRQL EA+LLPIGAG+LT KVVEAAE L+V
Subjt:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV

Query:  MAVVSVIVHGDWYRELIKN
        MAVVSVIVHGDWYRELIKN
Subjt:  MAVVSVIVHGDWYRELIKN

XP_038885048.1 uncharacterized protein LOC120075584 [Benincasa hispida]0.0e+0069.72Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEEESERKG V LYFPANDDE SS SSSTPP+LP RLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAG+LM+RRRTCD V AN+KGMPK DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        S +IAR WR+FVQ+RKTTF LAKA+Q LDIT ESVKSMEFEQLASKINATATIQTVRALLVR+ESR+TIL+T SG+KLS+E +DHLLKRVGLHGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K SRSETIG RKAAK  SKLSRYPAKVVLFAYMILGHP+TVFIGKSE ENALLESASNF+QEFELL+KI+LEGPL T  +E QSS PP FRSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLELFMAQTSKL LEGD+GNI HD+Q +EEQ I+RE LQQ+G  ENSLSVAGS SSE D   +   R+AES
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI
        SKS  H+ S  +L +     + N +  D         N+ +  +       K   E+      + S E       WV+KVLKEVR+E+CEMSPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD
        AEKIDI IL QILNSGTLDVGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSG+  K SFALLMVKGLRFVLHQIQ+    + ++NA  
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD

Query:  DGIKRFGRCGAGME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHG
          ++   +  AG+E                                       ++TDSVAA      VQPEILPSTIRTGGSSL+PSKIS TS G S+HG
Subjt:  DGIKRFGRCGAGME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHG

Query:  KEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVE
        KEQPECKGERLDLLIRLGLLKLVNQI GLSSDTLPET KLNLARL +VQSRLQRIIVISTSLLV+RQILLNE LVS+PSE DSILSTC KRLC+LLDTVE
Subjt:  KEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVE

Query:  NAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVM
        NAGILEIVEAL  VL DRDS                +KS ++                         RKGRQL E  LLPIGAGALT+KVVEAAEVLVVM
Subjt:  NAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVM

Query:  AVVSVIVHGDWYRELIKN
        AVVSVIVHGDWYRELIKN
Subjt:  AVVSVIVHGDWYRELIKN

TrEMBL top hitse value%identityAlignment
A0A1S3C3X1 uncharacterized protein LOC103496554 isoform X29.6e-30368.19Show/hide
Query:  MAVGL--EEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESK-APSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPK
        MAVGL  EEESERKG V L+FPANDD+ SS SSSTPP+LP RLRRRLMESK APSTAEEIEAKL KADLRRQAKRQRAGYLM+RRRT DIVRANMKGM K
Subjt:  MAVGL--EEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESK-APSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPK

Query:  EDPSTIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSS
        ++PS IIAR WR+FVQ RKTTF LAKAFQ LDIT E VKSMEFEQLASKINAT+TIQTVRALLVR+ESR+TIL+TTSG+KLS+E VDHLLKRVGLHGRS 
Subjt:  EDPSTIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSS

Query:  NQVTKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEI
        NQV+K SRSET GSRKAA   SKLSRYPAKVVLFAYMILGHP+TVFIGKSE ENALLESASNF+QEFELL+KI+LEGPL TL EE QSS PP FRSQLEI
Subjt:  NQVTKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEI

Query:  FDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQ
        FDKRWCSYLHHFVVWKDKD+IFFEENM GVA QLELFMAQTSKL LEGD+GN  HD Q + E  I+RE LQQLG+LE  LSVAGSSSS  D       +Q
Subjt:  FDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQ

Query:  AESSKSGVHSLSDVLLALVSLSAMMNNLAED-----------RCFNSSSFGKNDNSPSAKHIAEQHISSSE------IWVIKVLKEVRDEVCEMSPPSWR
        AESSKS  H+ S  +L +     + N +  D                +SF         K   +  + S E       WVIKVLKEVRDE+CEMSPPSWR
Subjt:  AESSKSGVHSLSDVLLALVSLSAMMNNLAED-----------RCFNSSSFGKNDNSPSAKHIAEQHISSSE------IWVIKVLKEVRDEVCEMSPPSWR

Query:  SEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSN
        SEIAEKIDIEIL QILNSGTLDV YFK+LLDF++VTLQKLSAPAKEKEM+ASYQKLMEELG+VS SGE  K SFALLMV+GLRFVLHQIQ     + ++N
Subjt:  SEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSN

Query:  ARDDGIKRFGRCGAGME---------------------------------------DYTDSV-AAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGT
        A    ++   +  AG+E                                       +YTDSV AA+++N GVQPEILPSTIRTGGSSLIPSK SSTS   
Subjt:  ARDDGIKRFGRCGAGME---------------------------------------DYTDSV-AAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGT

Query:  SLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLL
         +HGKEQPECKGERLDLLIRLGLLKLVNQI GLS DTLPETL LNLARL +VQSRLQRIIVISTSLLV+RQILLNE LVSN SE D+ILS CAKRLC LL
Subjt:  SLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLL

Query:  DTVENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEV
        D VENAGI EIVEAL  VL DR S                +KS ++                         RKGRQ  EAAL+PIGAGALT KVVEAAEV
Subjt:  DTVENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEV

Query:  LVVMAVVSVIVHGDWYRELIK
        LVVMAVVSV VHGDWYRELIK
Subjt:  LVVMAVVSVIVHGDWYRELIK

A0A5A7SKI9 T-complex protein 119.6e-30368.19Show/hide
Query:  MAVGL--EEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESK-APSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPK
        MAVGL  EEESERKG V L+FPANDD+ SS SSSTPP+LP RLRRRLMESK APSTAEEIEAKL KADLRRQAKRQRAGYLM+RRRT DIVRANMKGM K
Subjt:  MAVGL--EEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESK-APSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPK

Query:  EDPSTIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSS
        ++PS IIAR WR+FVQ RKTTF LAKAFQ LDIT E VKSMEFEQLASKINAT+TIQTVRALLVR+ESR+TIL+TTSG+KLS+E VDHLLKRVGLHGRS 
Subjt:  EDPSTIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSS

Query:  NQVTKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEI
        NQV+K SRSET GSRKAA   SKLSRYPAKVVLFAYMILGHP+TVFIGKSE ENALLESASNF+QEFELL+KI+LEGPL TL EE QSS PP FRSQLEI
Subjt:  NQVTKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEI

Query:  FDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQ
        FDKRWCSYLHHFVVWKDKD+IFFEENM GVA QLELFMAQTSKL LEGD+GN  HD Q + E  I+RE LQQLG+LE  LSVAGSSSS  D       +Q
Subjt:  FDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQ

Query:  AESSKSGVHSLSDVLLALVSLSAMMNNLAED-----------RCFNSSSFGKNDNSPSAKHIAEQHISSSE------IWVIKVLKEVRDEVCEMSPPSWR
        AESSKS  H+ S  +L +     + N +  D                +SF         K   +  + S E       WVIKVLKEVRDE+CEMSPPSWR
Subjt:  AESSKSGVHSLSDVLLALVSLSAMMNNLAED-----------RCFNSSSFGKNDNSPSAKHIAEQHISSSE------IWVIKVLKEVRDEVCEMSPPSWR

Query:  SEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSN
        SEIAEKIDIEIL QILNSGTLDV YFK+LLDF++VTLQKLSAPAKEKEM+ASYQKLMEELG+VS SGE  K SFALLMV+GLRFVLHQIQ     + ++N
Subjt:  SEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSN

Query:  ARDDGIKRFGRCGAGME---------------------------------------DYTDSV-AAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGT
        A    ++   +  AG+E                                       +YTDSV AA+++N GVQPEILPSTIRTGGSSLIPSK SSTS   
Subjt:  ARDDGIKRFGRCGAGME---------------------------------------DYTDSV-AAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGT

Query:  SLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLL
         +HGKEQPECKGERLDLLIRLGLLKLVNQI GLS DTLPETL LNLARL +VQSRLQRIIVISTSLLV+RQILLNE LVSN SE D+ILS CAKRLC LL
Subjt:  SLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLL

Query:  DTVENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEV
        D VENAGI EIVEAL  VL DR S                +KS ++                         RKGRQ  EAAL+PIGAGALT KVVEAAEV
Subjt:  DTVENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEV

Query:  LVVMAVVSVIVHGDWYRELIK
        LVVMAVVSV VHGDWYRELIK
Subjt:  LVVMAVVSVIVHGDWYRELIK

A0A6J1C0N5 uncharacterized protein LOC1110063710.0e+0070.98Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEEE+ERKG V LYFPAND EASS SSSTPPRLP RLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRA YL+QRRR CDI R N KGMPK+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        STIIAR WRNFV  RKTTF LAKAFQALD++ ESVKSMEFE+LASKINATATIQT+RALLVRL SRYTIL TTSG+KLS E+VDHLLKRVGLHGRS NQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLRE--EQQSSAPPLFRSQLEIF
        TK SRS TIG RKAAK P KLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNF+QEFELL KI+LEGPL TL E  +QQS APP FRSQLEIF
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLRE--EQQSSAPPLFRSQLEIF

Query:  DKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQA
        DKRWCSYLHH++ WKDKDAIFFE+NM GVA QLELFMAQTSKLTLEGD+GNI HDMQASEEQ I RE LQ+LG+ ENSLS++GSSSSE D V +PESRQA
Subjt:  DKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQA

Query:  ESSKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRS
        ESSKS    +S   + +     + N++  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRS
Subjt:  ESSKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRS

Query:  EIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNA
        EIAEKIDIEIL QILNSGTL VGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQKLMEELGDV+ SGEKS  SFALLMV+GLRFVLHQIQ+    + ++NA
Subjt:  EIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNA

Query:  ------------------RDDGIKRFG-------------------RCGAGME--DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSL
                          R    KR G                   R  A +E  ++TDSVAAIT+N G QP+ILPSTIR GGSSLIPSKISS+ LGT  
Subjt:  ------------------RDDGIKRFG-------------------RCGAGME--DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSL

Query:  HGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDT
        HGKEQPECKGERLDLLIRLGLLKLV QIRGL+SDTLPETLKLNLARL IVQSRLQRIIVISTSLLV+RQILL+ESLVSNPSE DSILSTCAKRLCDLLD+
Subjt:  HGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDT

Query:  VENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLV
        VENAGILEI+EAL  VL DRDS                +KS ++                          KGRQL EAALL IGAGAL DKVVEAAEVLV
Subjt:  VENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLV

Query:  VMAVVSVIVHGDWYRELIKN
        VMAVV+ IVHGDWY ELIKN
Subjt:  VMAVVSVIVHGDWYRELIKN

A0A6J1FI11 uncharacterized protein LOC1114455180.0e+0068.99Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEE+SERKG V LYFPANDDE S  SSSTPP+LP RL RRLMESKAPSTAEEIEAKL+KADLRRQAKRQRAGY M+RRRT DIVRANMKG+ K+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        + IIARCWR+FVQ RKTTF LAKAFQALDIT+ESVKSM+FEQLASKINATATIQTV+ALLVRLESR++ILRTTSG+KLSME VDHLLKRVG HGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K  RS+TIG RKAA+ PSKLSRYPAKVVLFAYMILGHP+TVFIGKSEFENALLESASNF+QEFELL+KI+LEG L T+ EE QSSAP   RSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLE FMAQTSKL LEGD+ NI HD Q SEEQ IL+E LQQLG+ ENS SVAGSSS E D+  +P  R  E+
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAED----RCFNSSSFGKNDNSPSAK-----------HIAE--QHISSSEIWVIKVLKEVRDEVCEMSPPSWRSEI
        SK    + S  +L +     + N +  D       +S++  + +NS  AK            I E  +   S   WVIKVLKEVRDE+CE SPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAED----RCFNSSSFGKNDNSPSAK-----------HIAE--QHISSSEIWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD
        AEKIDIEI+ QILNSG  DVGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGE SK  FALLMV+GLRF+LHQIQ+    + ++NA  
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARD

Query:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH
          ++   +  AG+E                                       ++TDS+A AI+KN GVQPE LPSTIRTGGSSLIPSKI S + GTS H
Subjt:  DGIKRFGRCGAGME---------------------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLH

Query:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV
        GKEQPECKGERLDLLIRLGLLKLVNQI+GLSSDTLPETLKLNLA+L  VQSRLQRIIVISTSLLV+RQILLNE LVSNPSE DSILSTCAKRLC+LLD V
Subjt:  GKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTV

Query:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV
        EN GILEIVEAL +VL D DS                +KS ++                        SRKGRQL EA+LLPIGAG+LT KVVEAAE L+V
Subjt:  ENAGILEIVEALSMVLGDRDS---------------TLKSFKQ-----------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVV

Query:  MAVVSVIVHGDWYRELIKN
        MAVVSVIVHGDWYREL+KN
Subjt:  MAVVSVIVHGDWYRELIKN

A0A6J1JZT9 uncharacterized protein LOC1114889300.0e+0069.68Show/hide
Query:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP
        MAVGLEE+SERKG V LYFPANDDE S  SSSTPP+LP RL RRLMESKAPSTAEEIEAKL+KADLRRQAKRQRAGY M+RRRT DIVRANMKGMPK+DP
Subjt:  MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDP

Query:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV
        + IIARCWR+FVQ RKTTF LAKAFQALDIT+ESVKSMEFEQLASKINATATIQTV+ALLVRLESR++ILRTTSG+KLSME VDHLLKRVG HGRSSNQV
Subjt:  STIIARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQV

Query:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK
         K  RS+TIG  KAAK P KLSRY  KVVLFAYMILGHP+TVFIGKSEFENALLESASNF+QEFELL+KI+LEGPL T+ EE QSS P   RSQLEIFDK
Subjt:  TKASRSETIGSRKAAKGPSKLSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDK

Query:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES
        RWCSYLHHFVVWKDKDAIFFEENM GVARQLE FMAQTSKL LEGD+ NI HD Q SEEQ ILRE LQQLG+ ENS SVAGSSS E D+  +P  R  ES
Subjt:  RWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAES

Query:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI
        SK    + S  +L +     + N +  D         N+ +  +N      K   E+      + S E       WVIKVLKEVRDE+CEMSPPSWRSEI
Subjt:  SKSGVHSLSDVLLALVSLSAMMNNLAEDR------CFNSSSFGKNDNSPSAKHIAEQH-----ISSSE------IWVIKVLKEVRDEVCEMSPPSWRSEI

Query:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNAR-
        AEKIDIEI+ QILNSGT DVGYFK+LLDF+LVTLQKLSAPAKEKEMEASYQ+LMEELGDVS SGE SK  FALLMVKGLRF+LHQIQ+  +    ++ R 
Subjt:  AEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNAR-

Query:  -------DDGIKRF-----GRCGAGME------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGK
                 G++        RCG+  +                        ++TDSVA AI+K+ GVQPE LPSTIRTGGSSLIPSKI S + GTS HGK
Subjt:  -------DDGIKRF-----GRCGAGME------------------------DYTDSVA-AITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGK

Query:  EQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVEN
        EQPECKGERLDLLIRLGLLKLVNQI+GLSSDTLPETLKLNLARL  VQSRLQRIIVISTSLLV+RQILLNE LVSNPSE DSILSTCAK LC+LLD VEN
Subjt:  EQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVEN

Query:  AGILEIVEALSMVLGDRDS---TLKSFKQ-----------------------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMA
         GILEIVEAL +VL D DS    L++ KQ                                    SRKGRQL EAALLPIGAGALT KVVEAAE L+VMA
Subjt:  AGILEIVEALSMVLGDRDS---TLKSFKQ-----------------------------------ESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMA

Query:  VVSVIVHGDWYRELIKN
        VVSVIVHGDWYRELIKN
Subjt:  VVSVIVHGDWYRELIKN

SwissProt top hitse value%identityAlignment
Q5F2L1 Alpha-(1,3)-fucosyltransferase 101.3e-1732.84Show/hide
Query:  SKNN-----DTLVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTE
        SKNN       LVY  S  C P   R+   R+L++ +   S+G+CL N         L  +  N AS+     D  +  ++ YKF+LA EN + + Y+TE
Subjt:  SKNN-----DTLVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTE

Query:  KLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAAS--------------LDTLPCRL
        K +  L  G VP+Y+G+P + D++P + S I  S+FS   ELAS+++RL  D   Y  Y  W+  G + N R   A                +DT  C +
Subjt:  KLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAAS--------------LDTLPCRL

Query:  CDAV
        C  V
Subjt:  CDAV

Q6A198 Alpha-(1,3)-fucosyltransferase 107.5e-1830.9Show/hide
Query:  ALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYAGALFHNGRNYHVSSSKNNDTLVYWSSSRCLP--QRNPLARKLLSLLPH
        ALF+E S     +    P+    +  A   R     L   Y    +V ++    +    +N+     K+   LVY  S  C P   R+   R+L++ +  
Subjt:  ALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYAGALFHNGRNYHVSSSKNNDTLVYWSSSRCLP--QRNPLARKLLSLLPH

Query:  HSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSL
         S+G+CL N         L P+ +N AS+     D     ++ YKF+LA EN + + Y+TEKL+  L  G VP+Y+G+P++ D++P + S I  S+FS  
Subjt:  HSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSL

Query:  EELASYVKRLADDPVAYAEYHAWRRCGTLGNYR
         ELASY++ L  D   Y  Y  W+  G + N R
Subjt:  EELASYVKRLADDPVAYAEYHAWRRCGTLGNYR

Q6NTZ6 Alpha-(1,3)-fucosyltransferase 108.8e-1930.9Show/hide
Query:  SYHAEDDVQSTY-AGALFHNGRNYHVSSSKNN------DTLVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLK
        S H+   + S Y  G       +Y VS  K N        L Y  S  C P   R+   ++L+  +   S+G+CL+N           P+  N+ S    
Subjt:  SYHAEDDVQSTY-AGALFHNGRNYHVSSSKNN------DTLVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLK

Query:  WWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNY
             H  +  YKF+LA EN + E Y+TEKL+  L  G+VP+YFGAPNV D++P + S I  S+FS  ++LA+Y+K+L  +   Y ++  W+  G + N 
Subjt:  WWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNY

Query:  RRTRAAS--------------LDTLPCRLCDAV
        R   A +              +D   C +C  V
Subjt:  RRTRAAS--------------LDTLPCRLCDAV

Q6P4F1 Alpha-(1,3)-fucosyltransferase 107.5e-1836.02Show/hide
Query:  LVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSV
        LVY  S  C P   R+   R+L++ +   S+G+CL N    D+   L    + DA       D  +  ++ YKF+LA EN + + Y+TEK +  L  G V
Subjt:  LVYWSSSRCLP--QRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTMTESYVTEKLYYALDSGSV

Query:  PIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYR
        P+Y+G+P++ D++P + S I  S+FS   ELASY++RL  D   Y  Y  W+  G + N R
Subjt:  PIYFGAPNVWDFVPPH-SIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYR

Q9C8W3 Alpha-(1,4)-fucosyltransferase7.2e-13058.91Show/hide
Query:  MLLKPLNAFATVFMLGFALFLLFFAGLLEFPSASTSIRPVDDLSLSAVEFTPDPFTDLI----------------PPSRNGTLWSVAIGS---------G
        M ++ LNA A + M+ F L +L F G+LEFPSASTS+    D      + T DPF+D++                   ++  L +V+ GS          
Subjt:  MLLKPLNAFATVFMLGFALFLLFFAGLLEFPSASTSIRPVDDLSLSAVEFTPDPFTDLI----------------PPSRNGTLWSVAIGS---------G

Query:  TKSLAVWPMG----------LRLPRCNKSAATLVQNAGAETESPDKPDALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYA
         K + V   G          L   RC  +      +  A     D PDAL +ET+TPP QRR GDPLR+YM+LEAGRKRSG ED+F+SYHA+DDVQ+TYA
Subjt:  TKSLAVWPMG----------LRLPRCNKSAATLVQNAGAETESPDKPDALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYA

Query:  GALFHNGRNYHVSSSKNNDTLVYWSSSRCLPQRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTM
        G+LFHN RNYH+S  KNND LVYWSSSRCLP R+ LA+ LL L+PHHSFGKCLNNVGGLD ALS+YPEC  + +   KW+DHLHCAMSHYKFVLAIENT 
Subjt:  GALFHNGRNYHVSSSKNNDTLVYWSSSRCLPQRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTM

Query:  TESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPHSIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAASLDTLPCRLCDAVSRKG
         ESYVTEKL+YALDSGSVPIYFGA NV DFVPPHS+IDGSKF S++ELA+YVKRL DDPVAY+EYHAWRRCG +GNY +TRA SLDTLPCRLC+ +SR+G
Subjt:  TESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPHSIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAASLDTLPCRLCDAVSRKG

Query:  GRNA
        G+NA
Subjt:  GRNA

Arabidopsis top hitse value%identityAlignment
AT1G22930.1 T-complex protein 113.1e-10433.77Show/hide
Query:  EEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDPSTI---IARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATA
        +++E KLQ+      AKR R+ +L QRRR  D +      M +ED   +   ++RCWR FV+ ++TT  LAKA+  L I E    S+ FEQLA  + +  
Subjt:  EEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDPSTI---IARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATA

Query:  TIQTVRALLVRLESRYTILR--TTSGSKLSMENVDHLLKRVGLHGRSSNQVTKASR-SETIGSRKAAKGPS-KLSRYPAKVVLFAYMILGHPDTVFIGKS
        T++TV++LL RLE R    +  TT      ++N+DHLLKRV    R +   T  SR  + + S +   G S K+SRYP +VVL A+MILGHPD VF G+ 
Subjt:  TIQTVRALLVRLESRYTILR--TTSGSKLSMENVDHLLKRVGLHGRSSNQVTKASR-SETIGSRKAAKGPS-KLSRYPAKVVLFAYMILGHPDTVFIGKS

Query:  EFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDS
        + E AL  +A  F++E +LL+ ++ EGP+     E +       RSQL++FDK WCS+L+ FV+WK KDA   E++++  A QLEL M Q  KLT EG  
Subjt:  EFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDS

Query:  GNITHD-----MQASEEQMILRENLQQLG------NLENSL-----------------------------------SVAGSSSSESDAVCAPESRQAESS
          +THD     MQ +++Q +L E ++ L        +E++L                                   SV+ SSS   D++    S +   S
Subjt:  GNITHD-----MQASEEQMILRENLQQLG------NLENSL-----------------------------------SVAGSSSSESDAVCAPESRQAESS

Query:  --------KSGVHSLSDVLLALVS--LSAMMNNLAEDRCFN--SSSFGKNDNSPSAKHIAEQHISSSEIW----------------VIKVLKEVRDEVCE
                 SG   +S+  +  VS     M+N    D   N    S  K++     + I E        W                +  ++KEV DE+C+
Subjt:  --------KSGVHSLSDVLLALVS--LSAMMNNLAEDRCFN--SSSFGKNDNSPSAKHIAEQHISSSEIW----------------VIKVLKEVRDEVCE

Query:  MSPPSWRSEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSF-ALLMVKGLRFVLHQIQHQ
        M P SW+ EI E ID++IL Q+LNSGTLD+ Y  K+L+F L TL+KLSAPA ++E E++++ L++EL  + C  E    +F A+ +VKG+RF+L QIQ  
Subjt:  MSPPSWRSEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSF-ALLMVKGLRFVLHQIQHQ

Query:  FQSKYVSNARDDGIKRFGRCGAGM--------------------------------------EDYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKIS
           + +   R   +K F +  AG                                       E++ ++++A+  NV V+   +  +++TGGS L P   +
Subjt:  FQSKYVSNARDDGIKRFGRCGAGM--------------------------------------EDYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKIS

Query:  STSLGTSLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAK
        S S      G +  ECKGER+DL +RLGLLKLVNQ+ GL+ + LPET +LNL R+  +Q+ +Q IIV++TSLL+ RQ+L         ++++S   + AK
Subjt:  STSLGTSLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAK

Query:  RLCDLLDTVENAGILEIVE-ALSMVLGDRDSTLKSFKQES----------------------------RKGRQLVEAALLPI-GAGALTDKVVEAAEVLV
        +L +LLD  E AG+ EI+E  +S   G++   ++    +S                              G+++VE  +  + G G L ++V+E A  L 
Subjt:  RLCDLLDTVENAGILEIVE-ALSMVLGDRDSTLKSFKQES----------------------------RKGRQLVEAALLPI-GAGALTDKVVEAAEVLV

Query:  VMAVVSVIVHGDWYRELI
        V+A VSV VHG W  +L+
Subjt:  VMAVVSVIVHGDWYRELI

AT1G22930.2 T-complex protein 113.1e-10433.77Show/hide
Query:  EEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDPSTI---IARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATA
        +++E KLQ+      AKR R+ +L QRRR  D +      M +ED   +   ++RCWR FV+ ++TT  LAKA+  L I E    S+ FEQLA  + +  
Subjt:  EEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDPSTI---IARCWRNFVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATA

Query:  TIQTVRALLVRLESRYTILR--TTSGSKLSMENVDHLLKRVGLHGRSSNQVTKASR-SETIGSRKAAKGPS-KLSRYPAKVVLFAYMILGHPDTVFIGKS
        T++TV++LL RLE R    +  TT      ++N+DHLLKRV    R +   T  SR  + + S +   G S K+SRYP +VVL A+MILGHPD VF G+ 
Subjt:  TIQTVRALLVRLESRYTILR--TTSGSKLSMENVDHLLKRVGLHGRSSNQVTKASR-SETIGSRKAAKGPS-KLSRYPAKVVLFAYMILGHPDTVFIGKS

Query:  EFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDS
        + E AL  +A  F++E +LL+ ++ EGP+     E +       RSQL++FDK WCS+L+ FV+WK KDA   E++++  A QLEL M Q  KLT EG  
Subjt:  EFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQLELFMAQTSKLTLEGDS

Query:  GNITHD-----MQASEEQMILRENLQQLG------NLENSL-----------------------------------SVAGSSSSESDAVCAPESRQAESS
          +THD     MQ +++Q +L E ++ L        +E++L                                   SV+ SSS   D++    S +   S
Subjt:  GNITHD-----MQASEEQMILRENLQQLG------NLENSL-----------------------------------SVAGSSSSESDAVCAPESRQAESS

Query:  --------KSGVHSLSDVLLALVS--LSAMMNNLAEDRCFN--SSSFGKNDNSPSAKHIAEQHISSSEIW----------------VIKVLKEVRDEVCE
                 SG   +S+  +  VS     M+N    D   N    S  K++     + I E        W                +  ++KEV DE+C+
Subjt:  --------KSGVHSLSDVLLALVS--LSAMMNNLAEDRCFN--SSSFGKNDNSPSAKHIAEQHISSSEIW----------------VIKVLKEVRDEVCE

Query:  MSPPSWRSEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSF-ALLMVKGLRFVLHQIQHQ
        M P SW+ EI E ID++IL Q+LNSGTLD+ Y  K+L+F L TL+KLSAPA ++E E++++ L++EL  + C  E    +F A+ +VKG+RF+L QIQ  
Subjt:  MSPPSWRSEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSF-ALLMVKGLRFVLHQIQHQ

Query:  FQSKYVSNARDDGIKRFGRCGAGM--------------------------------------EDYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKIS
           + +   R   +K F +  AG                                       E++ ++++A+  NV V+   +  +++TGGS L P   +
Subjt:  FQSKYVSNARDDGIKRFGRCGAGM--------------------------------------EDYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKIS

Query:  STSLGTSLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAK
        S S      G +  ECKGER+DL +RLGLLKLVNQ+ GL+ + LPET +LNL R+  +Q+ +Q IIV++TSLL+ RQ+L         ++++S   + AK
Subjt:  STSLGTSLHGKEQPECKGERLDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAK

Query:  RLCDLLDTVENAGILEIVE-ALSMVLGDRDSTLKSFKQES----------------------------RKGRQLVEAALLPI-GAGALTDKVVEAAEVLV
        +L +LLD  E AG+ EI+E  +S   G++   ++    +S                              G+++VE  +  + G G L ++V+E A  L 
Subjt:  RLCDLLDTVENAGILEIVE-ALSMVLGDRDSTLKSFKQES----------------------------RKGRQLVEAALLPI-GAGALTDKVVEAAEVLV

Query:  VMAVVSVIVHGDWYRELI
        V+A VSV VHG W  +L+
Subjt:  VMAVVSVIVHGDWYRELI

AT1G71990.1 fucosyltransferase 135.1e-13158.91Show/hide
Query:  MLLKPLNAFATVFMLGFALFLLFFAGLLEFPSASTSIRPVDDLSLSAVEFTPDPFTDLI----------------PPSRNGTLWSVAIGS---------G
        M ++ LNA A + M+ F L +L F G+LEFPSASTS+    D      + T DPF+D++                   ++  L +V+ GS          
Subjt:  MLLKPLNAFATVFMLGFALFLLFFAGLLEFPSASTSIRPVDDLSLSAVEFTPDPFTDLI----------------PPSRNGTLWSVAIGS---------G

Query:  TKSLAVWPMG----------LRLPRCNKSAATLVQNAGAETESPDKPDALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYA
         K + V   G          L   RC  +      +  A     D PDAL +ET+TPP QRR GDPLR+YM+LEAGRKRSG ED+F+SYHA+DDVQ+TYA
Subjt:  TKSLAVWPMG----------LRLPRCNKSAATLVQNAGAETESPDKPDALFYETSTPPRQRRKGDPLRIYMDLEAGRKRSGFEDLFVSYHAEDDVQSTYA

Query:  GALFHNGRNYHVSSSKNNDTLVYWSSSRCLPQRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTM
        G+LFHN RNYH+S  KNND LVYWSSSRCLP R+ LA+ LL L+PHHSFGKCLNNVGGLD ALS+YPEC  + +   KW+DHLHCAMSHYKFVLAIENT 
Subjt:  GALFHNGRNYHVSSSKNNDTLVYWSSSRCLPQRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHYKFVLAIENTM

Query:  TESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPHSIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAASLDTLPCRLCDAVSRKG
         ESYVTEKL+YALDSGSVPIYFGA NV DFVPPHS+IDGSKF S++ELA+YVKRL DDPVAY+EYHAWRRCG +GNY +TRA SLDTLPCRLC+ +SR+G
Subjt:  TESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPHSIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAASLDTLPCRLCDAVSRKG

Query:  GRNA
        G+NA
Subjt:  GRNA

AT4G09150.1 T-complex protein 116.1e-11634.85Show/hide
Query:  RLPGRLRRRLMESKAPSTAEEIEAKLQKADLR---RQAKRQRAGYLMQRR--RTCDIVRANMKGMPKEDPSTIIARCWRNFVQVRKTTFTLAKAFQALDI
        R   RL R    + +  + +E E +  K  L    ++AK+ +A Y+ +RR   +C   R+      +     ++ RCWR F + +K+TF LA+A+  L I
Subjt:  RLPGRLRRRLMESKAPSTAEEIEAKLQKADLR---RQAKRQRAGYLMQRR--RTCDIVRANMKGMPKEDPSTIIARCWRNFVQVRKTTFTLAKAFQALDI

Query:  TEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRV---GLHGRSSNQVTKASRSETIGSRKAAKGPSKLSRYPAK
         E+S++S+ FEQ A ++N+ + IQTV+ALL RLE R T+ + +     ++EN++HLLK +      G+S + ++K  ++    S+   +   K++RYPA+
Subjt:  TEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRV---GLHGRSSNQVTKASRSETIGSRKAAKGPSKLSRYPAK

Query:  VVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQ--SSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMM
        + L AYMI  HP  +F G+ E E AL+ESA+  I+EFELLVK++LEGP  TL       +  P  FRSQLE FDK WCSYL  FVVWK  DA   E+++ 
Subjt:  VVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQ--SSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMM

Query:  GVARQLELFMAQTSKLTLEG---DSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAESSKSGVHSLSDVLLALVSLSA--
          AR  E  +++ SK T      DSG     ++AS      R    +      S + A S    S +     S  + S  SG  ++S   +   S  A  
Subjt:  GVARQLELFMAQTSKLTLEG---DSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAESSKSGVHSLSDVLLALVSLSA--

Query:  ------MMNNLAEDRCFNSSSFGKN------DNSPSAKHIAE-----------QHISSSE---IWVIKVLKEVRDEVCEMSPPSWRSEIAEKIDIEILLQ
              ++N +  D   NSSSF  +      D S     + E           + +  S+    WVIK++KEVRDE+CE+SP  WR EI + ID ++L Q
Subjt:  ------MMNNLAEDRCFNSSSFGKN------DNSPSAKHIAE-----------QHISSSE---IWVIKVLKEVRDEVCEMSPPSWRSEIAEKIDIEILLQ

Query:  ILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARDDGIKRFGRCGA
        +L SG +D+GY   +L+F+L  L KLSAPA E+E+  ++ KLM ELG++  +   S  S+A+LMVKGLRFVL QI  Q   K +S +R   ++   +  A
Subjt:  ILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARDDGIKRFGRCGA

Query:  GME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGKEQPECKGERL
        G+E                                       ++ D+++A+  N      +  +T+RTGG+    SK+++ S  +   G E  ECKGE +
Subjt:  GME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGKEQPECKGERL

Query:  DLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVENAGILEIVEAL
        DLL+R+GLLK+V++I GL+ +T+PET +LNL+RL  VQS++Q+I ++S S+L+++Q L++E+  S+  + ++I  TC  RL ++LD   +AG+ EI+E L
Subjt:  DLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVENAGILEIVEAL

Query:  SMVLGDRDSTLKS---------------------------------FKQESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMAVVSVIVHGDWYRELI
        S +L   D+  K                                      + K +QLVE  L  IGA +L+DKV+E +++LV++A VS  VHG WY EL+
Subjt:  SMVLGDRDSTLKS---------------------------------FKQESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMAVVSVIVHGDWYRELI

Query:  K
        K
Subjt:  K

AT4G09150.2 T-complex protein 112.3e-11535.03Show/hide
Query:  RLPGRLRRRLMESKAPSTAEEIEAKLQKADLR---RQAKRQRAGYLMQRR--RTCDIVRANMKGMPKEDPSTIIARCWRNFVQVRKTTFTLAKAFQALDI
        R   RL R    + +  + +E E +  K  L    ++AK+ +A Y+ +RR   +C   R+      +     ++ RCWR F + +K+TF LA+A+  L I
Subjt:  RLPGRLRRRLMESKAPSTAEEIEAKLQKADLR---RQAKRQRAGYLMQRR--RTCDIVRANMKGMPKEDPSTIIARCWRNFVQVRKTTFTLAKAFQALDI

Query:  TEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRV---GLHGRSSNQVTKASRSETIGSRKAAKGPSKLSRYPAK
         E+S++S+ FEQ A ++N+ + IQTV+ALL RLE R T+ + +     ++EN++HLLK +      G+S + ++K  ++    S+   +   K++RYPA+
Subjt:  TEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRV---GLHGRSSNQVTKASRSETIGSRKAAKGPSKLSRYPAK

Query:  VVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQ--SSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMM
        + L AYMI  HP  +F G+ E E AL+ESA+  I+EFELLVK++LEGP  TL       +  P  FRSQLE FDK WCSYL  FVVWK  DA   E+++ 
Subjt:  VVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQ--SSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMM

Query:  GVARQLELFMAQTSKLTLEG---DSG---------NITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAESSKSGVHSLSDVLL
          AR  E  +++ SK T      DSG         + T+    SE             +L +S S +  SS+ S ++ +  S +A S+ + V +  D  L
Subjt:  GVARQLELFMAQTSKLTLEG---DSG---------NITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAESSKSGVHSLSDVLL

Query:  ALVSLSAMMNNLAEDRCFNSSSFGKN------DNSPSAKHIAE-----------QHISSSE---IWVIKVLKEVRDEVCEMSPPSWRSEIAEKIDIEILL
        A  +   ++N +  D   NSSSF  +      D S     + E           + +  S+    WVIK++KEVRDE+CE+SP  WR EI + ID ++L 
Subjt:  ALVSLSAMMNNLAEDRCFNSSSFGKN------DNSPSAKHIAE-----------QHISSSE---IWVIKVLKEVRDEVCEMSPPSWRSEIAEKIDIEILL

Query:  QILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARDDGIKRFGRCG
        Q+L SG +D+GY   +L+F+L  L KLSAPA E+E+  ++ KLM ELG++  +   S  S+A+LMVKGLRFVL QI  Q   K +S +R   ++   +  
Subjt:  QILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKSKPSFALLMVKGLRFVLHQIQHQFQSKYVSNARDDGIKRFGRCG

Query:  AGME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGKEQPECKGER
        AG+E                                       ++ D+++A+  N      +  +T+RTGG+    SK+++ S  +   G E  ECKGE 
Subjt:  AGME---------------------------------------DYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGKEQPECKGER

Query:  LDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVENAGILEIVEA
        +DLL+R+GLLK+V++I GL+ +T+PET +LNL+RL  VQS++Q+I ++S S+L+++Q L++E+  S+  + ++I  TC  RL ++LD   +AG+ EI+E 
Subjt:  LDLLIRLGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVENAGILEIVEA

Query:  LSMVLGDRDSTLKS---------------------------------FKQESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMAVVSVIVHGDWYREL
        LS +L   D+  K                                      + K +QLVE  L  IGA +L+DKV+E +++LV++A VS  VHG WY EL
Subjt:  LSMVLGDRDSTLKS---------------------------------FKQESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMAVVSVIVHGDWYREL

Query:  IK
        +K
Subjt:  IK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTGGGACTGGAGGAGGAGTCGGAGAGGAAGGGCGCAGTAGTGCTGTATTTTCCGGCGAATGACGATGAGGCTTCATCTGTATCGTCTTCGACTCCTCCTAGGCT
GCCTGGCAGGCTTCGACGCCGTCTTATGGAGTCTAAGGCGCCGTCTACTGCTGAAGAGATTGAAGCCAAGCTCCAAAAAGCCGATCTCCGTCGTCAGGCTAAGAGACAAA
GAGCTGGGTATTTGATGCAGAGAAGAAGAACATGTGATATTGTTCGTGCTAATATGAAGGGGATGCCTAAGGAGGACCCATCGACAATAATAGCAAGATGCTGGAGGAAT
TTTGTACAAGTGAGGAAAACCACATTTACTTTGGCAAAAGCATTTCAAGCATTAGATATTACTGAAGAATCTGTGAAGTCAATGGAATTTGAGCAGCTTGCTTCTAAGAT
CAATGCAACTGCAACCATACAAACTGTAAGAGCTTTACTTGTTCGTCTAGAGAGTCGATATACAATCTTGAGAACAACTTCTGGAAGTAAATTGAGCATGGAGAATGTAG
ATCACCTTCTCAAGCGTGTTGGATTGCATGGTAGAAGCAGCAACCAAGTAACCAAGGCAAGTAGGTCAGAGACTATTGGATCCAGGAAAGCTGCAAAAGGTCCCTCTAAA
TTATCTAGGTACCCTGCTAAAGTAGTGCTTTTTGCTTACATGATATTGGGGCATCCAGACACGGTTTTTATTGGGAAGAGTGAGTTTGAGAATGCGTTGCTGGAGTCGGC
ATCAAATTTTATTCAGGAATTTGAGTTGTTGGTTAAGATTTTGTTAGAGGGTCCCTTGTGGACCCTTCGTGAGGAGCAACAATCTTCCGCACCACCTTTGTTTAGATCCC
AGTTGGAGATATTTGATAAAAGATGGTGCTCTTACCTTCATCACTTTGTGGTGTGGAAAGATAAGGATGCTATATTTTTTGAGGAGAATATGATGGGTGTTGCCCGCCAG
TTGGAGCTTTTTATGGCGCAAACTTCTAAGCTGACATTAGAAGGTGATAGTGGTAATATCACACATGATATGCAGGCCAGTGAAGAGCAGATGATACTGAGAGAAAATTT
ACAGCAACTTGGGAATTTAGAAAATTCTTTATCAGTTGCAGGGTCAAGCTCCTCTGAATCTGATGCTGTATGTGCACCAGAGTCTCGACAAGCAGAAAGTTCGAAGTCAG
GTGTGCATAGTTTATCAGATGTTTTGTTGGCTTTGGTTTCTTTGTCTGCAATGATGAATAATCTTGCTGAAGATAGATGTTTTAATTCTTCCTCCTTTGGTAAAAATGAC
AATTCTCCTTCTGCCAAACACATTGCAGAACAACATATCAGCTCCAGTGAAATCTGGGTTATTAAGGTCCTGAAGGAGGTCAGGGATGAAGTGTGTGAGATGTCTCCACC
ATCTTGGAGATCAGAGATTGCCGAGAAAATTGATATTGAAATTCTTTTGCAGATTCTAAATTCAGGAACTCTGGATGTGGGTTACTTCAAAAAACTCCTGGACTTTACGT
TAGTCACTCTGCAAAAACTTTCTGCTCCAGCTAAAGAGAAAGAGATGGAGGCAAGTTACCAGAAGTTGATGGAAGAGCTAGGAGACGTTTCCTGTTCTGGAGAGAAGTCA
AAGCCTTCATTTGCACTCTTGATGGTCAAAGGTCTACGTTTTGTTCTACATCAGATTCAGCATCAGTTCCAATCCAAGTATGTTTCTAATGCAAGGGATGATGGAATTAA
ACGCTTTGGAAGATGCGGAGCTGGAATGGAAGACTACACCGATTCTGTGGCTGCCATAACAAAAAATGTAGGAGTACAGCCAGAGATCCTGCCCTCTACCATACGAACGG
GAGGAAGCAGTCTGATTCCATCAAAGATCAGCTCAACTTCTTTGGGAACAAGTCTCCATGGCAAAGAACAGCCGGAGTGCAAGGGAGAAAGACTTGATTTGTTAATTAGG
CTTGGGTTGCTAAAGTTGGTAAATCAAATAAGGGGGCTGAGCAGCGATACGCTGCCAGAGACTCTTAAGCTTAATCTCGCCAGGCTCACGATTGTTCAGTCTCGACTTCA
GAGGATCATCGTGATTTCTACCAGCTTGTTGGTCGTGCGTCAAATTCTTCTCAACGAGAGTTTGGTCTCGAATCCCAGTGAAGCCGATAGCATCTTATCGACATGCGCCA
AACGTCTCTGCGACCTCTTGGACACTGTAGAAAATGCGGGAATACTGGAGATTGTTGAAGCCCTTAGTATGGTCTTAGGAGATCGAGATTCGACCCTGAAAAGCTTCAAG
CAAGAAAGCAGAAAGGGAAGGCAACTGGTGGAGGCGGCGCTTTTGCCCATCGGAGCAGGAGCTCTTACAGATAAGGTGGTTGAGGCTGCAGAAGTTCTGGTTGTGATGGC
TGTTGTATCTGTGATTGTTCATGGAGATTGGTATAGAGAACTAATCAAAAACTGTCCCTCTTTTCTCTCTCTTCGGAGAGAGAAACCGACCGGCGACCGCCATTTCCGAC
CAACCATGCTCTTAAAACCCCTCAACGCCTTCGCCACCGTATTCATGCTCGGTTTCGCATTGTTCCTCCTCTTCTTCGCTGGCCTCCTCGAATTCCCATCGGCATCCACC
TCCATCCGACCGGTCGACGATCTGAGTCTTTCAGCAGTCGAGTTTACTCCCGACCCTTTCACCGACTTGATCCCGCCTTCAAGAAATGGGACTCTCTGGTCGGTTGCGAT
CGGTTCCGGGACAAAATCGCTGGCGGTTTGGCCAATGGGTCTTCGTCTTCCTCGCTGCAACAAGTCGGCGGCGACTCTAGTTCAGAATGCGGGGGCTGAAACTGAATCAC
CCGACAAACCTGACGCCTTGTTCTACGAGACCAGTACGCCTCCACGGCAGAGGCGCAAGGGAGATCCTCTTCGCATATATATGGATCTCGAGGCTGGTCGGAAGCGCTCA
GGTTTTGAGGATCTATTTGTTAGTTATCATGCAGAAGATGATGTTCAGTCAACTTATGCAGGAGCACTCTTTCACAATGGTAGAAACTATCACGTATCTTCTTCTAAAAA
CAATGATACACTTGTTTATTGGTCGTCGTCCCGTTGTCTTCCACAAAGAAATCCGCTTGCTAGGAAGCTTCTAAGCTTGCTGCCTCACCATTCATTTGGCAAGTGCTTGA
ACAATGTAGGCGGACTCGACATGGCCCTTTCTCTGTACCCTGAATGTGCAAATGATGCAAGTGTCACCCTGAAATGGTGGGATCACTTGCATTGTGCAATGTCTCATTAC
AAATTCGTTCTCGCTATCGAAAACACAATGACCGAGAGTTACGTGACGGAGAAGCTATATTACGCGCTAGACTCTGGTTCGGTTCCAATCTATTTTGGTGCCCCAAATGT
CTGGGATTTCGTCCCTCCACATTCTATAATTGATGGATCTAAATTCAGCTCCTTGGAAGAGTTAGCTTCTTACGTAAAGCGACTCGCCGATGACCCGGTAGCTTATGCCG
AGTACCATGCATGGAGACGATGCGGCACGTTGGGTAACTATAGAAGGACCCGGGCCGCAAGCCTCGACACGCTGCCGTGCCGGTTGTGTGATGCTGTGAGCAGAAAGGGT
GGTAGAAATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGTGGGACTGGAGGAGGAGTCGGAGAGGAAGGGCGCAGTAGTGCTGTATTTTCCGGCGAATGACGATGAGGCTTCATCTGTATCGTCTTCGACTCCTCCTAGGCT
GCCTGGCAGGCTTCGACGCCGTCTTATGGAGTCTAAGGCGCCGTCTACTGCTGAAGAGATTGAAGCCAAGCTCCAAAAAGCCGATCTCCGTCGTCAGGCTAAGAGACAAA
GAGCTGGGTATTTGATGCAGAGAAGAAGAACATGTGATATTGTTCGTGCTAATATGAAGGGGATGCCTAAGGAGGACCCATCGACAATAATAGCAAGATGCTGGAGGAAT
TTTGTACAAGTGAGGAAAACCACATTTACTTTGGCAAAAGCATTTCAAGCATTAGATATTACTGAAGAATCTGTGAAGTCAATGGAATTTGAGCAGCTTGCTTCTAAGAT
CAATGCAACTGCAACCATACAAACTGTAAGAGCTTTACTTGTTCGTCTAGAGAGTCGATATACAATCTTGAGAACAACTTCTGGAAGTAAATTGAGCATGGAGAATGTAG
ATCACCTTCTCAAGCGTGTTGGATTGCATGGTAGAAGCAGCAACCAAGTAACCAAGGCAAGTAGGTCAGAGACTATTGGATCCAGGAAAGCTGCAAAAGGTCCCTCTAAA
TTATCTAGGTACCCTGCTAAAGTAGTGCTTTTTGCTTACATGATATTGGGGCATCCAGACACGGTTTTTATTGGGAAGAGTGAGTTTGAGAATGCGTTGCTGGAGTCGGC
ATCAAATTTTATTCAGGAATTTGAGTTGTTGGTTAAGATTTTGTTAGAGGGTCCCTTGTGGACCCTTCGTGAGGAGCAACAATCTTCCGCACCACCTTTGTTTAGATCCC
AGTTGGAGATATTTGATAAAAGATGGTGCTCTTACCTTCATCACTTTGTGGTGTGGAAAGATAAGGATGCTATATTTTTTGAGGAGAATATGATGGGTGTTGCCCGCCAG
TTGGAGCTTTTTATGGCGCAAACTTCTAAGCTGACATTAGAAGGTGATAGTGGTAATATCACACATGATATGCAGGCCAGTGAAGAGCAGATGATACTGAGAGAAAATTT
ACAGCAACTTGGGAATTTAGAAAATTCTTTATCAGTTGCAGGGTCAAGCTCCTCTGAATCTGATGCTGTATGTGCACCAGAGTCTCGACAAGCAGAAAGTTCGAAGTCAG
GTGTGCATAGTTTATCAGATGTTTTGTTGGCTTTGGTTTCTTTGTCTGCAATGATGAATAATCTTGCTGAAGATAGATGTTTTAATTCTTCCTCCTTTGGTAAAAATGAC
AATTCTCCTTCTGCCAAACACATTGCAGAACAACATATCAGCTCCAGTGAAATCTGGGTTATTAAGGTCCTGAAGGAGGTCAGGGATGAAGTGTGTGAGATGTCTCCACC
ATCTTGGAGATCAGAGATTGCCGAGAAAATTGATATTGAAATTCTTTTGCAGATTCTAAATTCAGGAACTCTGGATGTGGGTTACTTCAAAAAACTCCTGGACTTTACGT
TAGTCACTCTGCAAAAACTTTCTGCTCCAGCTAAAGAGAAAGAGATGGAGGCAAGTTACCAGAAGTTGATGGAAGAGCTAGGAGACGTTTCCTGTTCTGGAGAGAAGTCA
AAGCCTTCATTTGCACTCTTGATGGTCAAAGGTCTACGTTTTGTTCTACATCAGATTCAGCATCAGTTCCAATCCAAGTATGTTTCTAATGCAAGGGATGATGGAATTAA
ACGCTTTGGAAGATGCGGAGCTGGAATGGAAGACTACACCGATTCTGTGGCTGCCATAACAAAAAATGTAGGAGTACAGCCAGAGATCCTGCCCTCTACCATACGAACGG
GAGGAAGCAGTCTGATTCCATCAAAGATCAGCTCAACTTCTTTGGGAACAAGTCTCCATGGCAAAGAACAGCCGGAGTGCAAGGGAGAAAGACTTGATTTGTTAATTAGG
CTTGGGTTGCTAAAGTTGGTAAATCAAATAAGGGGGCTGAGCAGCGATACGCTGCCAGAGACTCTTAAGCTTAATCTCGCCAGGCTCACGATTGTTCAGTCTCGACTTCA
GAGGATCATCGTGATTTCTACCAGCTTGTTGGTCGTGCGTCAAATTCTTCTCAACGAGAGTTTGGTCTCGAATCCCAGTGAAGCCGATAGCATCTTATCGACATGCGCCA
AACGTCTCTGCGACCTCTTGGACACTGTAGAAAATGCGGGAATACTGGAGATTGTTGAAGCCCTTAGTATGGTCTTAGGAGATCGAGATTCGACCCTGAAAAGCTTCAAG
CAAGAAAGCAGAAAGGGAAGGCAACTGGTGGAGGCGGCGCTTTTGCCCATCGGAGCAGGAGCTCTTACAGATAAGGTGGTTGAGGCTGCAGAAGTTCTGGTTGTGATGGC
TGTTGTATCTGTGATTGTTCATGGAGATTGGTATAGAGAACTAATCAAAAACTGTCCCTCTTTTCTCTCTCTTCGGAGAGAGAAACCGACCGGCGACCGCCATTTCCGAC
CAACCATGCTCTTAAAACCCCTCAACGCCTTCGCCACCGTATTCATGCTCGGTTTCGCATTGTTCCTCCTCTTCTTCGCTGGCCTCCTCGAATTCCCATCGGCATCCACC
TCCATCCGACCGGTCGACGATCTGAGTCTTTCAGCAGTCGAGTTTACTCCCGACCCTTTCACCGACTTGATCCCGCCTTCAAGAAATGGGACTCTCTGGTCGGTTGCGAT
CGGTTCCGGGACAAAATCGCTGGCGGTTTGGCCAATGGGTCTTCGTCTTCCTCGCTGCAACAAGTCGGCGGCGACTCTAGTTCAGAATGCGGGGGCTGAAACTGAATCAC
CCGACAAACCTGACGCCTTGTTCTACGAGACCAGTACGCCTCCACGGCAGAGGCGCAAGGGAGATCCTCTTCGCATATATATGGATCTCGAGGCTGGTCGGAAGCGCTCA
GGTTTTGAGGATCTATTTGTTAGTTATCATGCAGAAGATGATGTTCAGTCAACTTATGCAGGAGCACTCTTTCACAATGGTAGAAACTATCACGTATCTTCTTCTAAAAA
CAATGATACACTTGTTTATTGGTCGTCGTCCCGTTGTCTTCCACAAAGAAATCCGCTTGCTAGGAAGCTTCTAAGCTTGCTGCCTCACCATTCATTTGGCAAGTGCTTGA
ACAATGTAGGCGGACTCGACATGGCCCTTTCTCTGTACCCTGAATGTGCAAATGATGCAAGTGTCACCCTGAAATGGTGGGATCACTTGCATTGTGCAATGTCTCATTAC
AAATTCGTTCTCGCTATCGAAAACACAATGACCGAGAGTTACGTGACGGAGAAGCTATATTACGCGCTAGACTCTGGTTCGGTTCCAATCTATTTTGGTGCCCCAAATGT
CTGGGATTTCGTCCCTCCACATTCTATAATTGATGGATCTAAATTCAGCTCCTTGGAAGAGTTAGCTTCTTACGTAAAGCGACTCGCCGATGACCCGGTAGCTTATGCCG
AGTACCATGCATGGAGACGATGCGGCACGTTGGGTAACTATAGAAGGACCCGGGCCGCAAGCCTCGACACGCTGCCGTGCCGGTTGTGTGATGCTGTGAGCAGAAAGGGT
GGTAGAAATGCTTGA
Protein sequenceShow/hide protein sequence
MAVGLEEESERKGAVVLYFPANDDEASSVSSSTPPRLPGRLRRRLMESKAPSTAEEIEAKLQKADLRRQAKRQRAGYLMQRRRTCDIVRANMKGMPKEDPSTIIARCWRN
FVQVRKTTFTLAKAFQALDITEESVKSMEFEQLASKINATATIQTVRALLVRLESRYTILRTTSGSKLSMENVDHLLKRVGLHGRSSNQVTKASRSETIGSRKAAKGPSK
LSRYPAKVVLFAYMILGHPDTVFIGKSEFENALLESASNFIQEFELLVKILLEGPLWTLREEQQSSAPPLFRSQLEIFDKRWCSYLHHFVVWKDKDAIFFEENMMGVARQ
LELFMAQTSKLTLEGDSGNITHDMQASEEQMILRENLQQLGNLENSLSVAGSSSSESDAVCAPESRQAESSKSGVHSLSDVLLALVSLSAMMNNLAEDRCFNSSSFGKND
NSPSAKHIAEQHISSSEIWVIKVLKEVRDEVCEMSPPSWRSEIAEKIDIEILLQILNSGTLDVGYFKKLLDFTLVTLQKLSAPAKEKEMEASYQKLMEELGDVSCSGEKS
KPSFALLMVKGLRFVLHQIQHQFQSKYVSNARDDGIKRFGRCGAGMEDYTDSVAAITKNVGVQPEILPSTIRTGGSSLIPSKISSTSLGTSLHGKEQPECKGERLDLLIR
LGLLKLVNQIRGLSSDTLPETLKLNLARLTIVQSRLQRIIVISTSLLVVRQILLNESLVSNPSEADSILSTCAKRLCDLLDTVENAGILEIVEALSMVLGDRDSTLKSFK
QESRKGRQLVEAALLPIGAGALTDKVVEAAEVLVVMAVVSVIVHGDWYRELIKNCPSFLSLRREKPTGDRHFRPTMLLKPLNAFATVFMLGFALFLLFFAGLLEFPSAST
SIRPVDDLSLSAVEFTPDPFTDLIPPSRNGTLWSVAIGSGTKSLAVWPMGLRLPRCNKSAATLVQNAGAETESPDKPDALFYETSTPPRQRRKGDPLRIYMDLEAGRKRS
GFEDLFVSYHAEDDVQSTYAGALFHNGRNYHVSSSKNNDTLVYWSSSRCLPQRNPLARKLLSLLPHHSFGKCLNNVGGLDMALSLYPECANDASVTLKWWDHLHCAMSHY
KFVLAIENTMTESYVTEKLYYALDSGSVPIYFGAPNVWDFVPPHSIIDGSKFSSLEELASYVKRLADDPVAYAEYHAWRRCGTLGNYRRTRAASLDTLPCRLCDAVSRKG
GRNA