CuGenDBv2

Carg03120 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg03120
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome location: Carg_Chr20:3016775..3022589
RNA-Seq Expression: Carg03120
Synteny: Carg03120
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
GO:0016616 - oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains: IPR003406 - Glycosyl transferase, family 14
IPR003832 - Protein of unknown function DUF212
IPR044174 - Glycosyltransferase BC10-like
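
The genome location field above uses a `chromosome:start..end` form. As a minimal sketch, the span can be split into its parts and the locus length computed. The function names `parse_location` and `locus_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def locus_length(location):
    """Length of the span in base pairs.

    Assumes 1-based, inclusive coordinates (an assumption, not
    something the record confirms).
    """
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("Carg_Chr20:3016775..3022589"))
    print(locus_length("Carg_Chr20:3016775..3022589"))
```

Under the inclusive-coordinate assumption, Carg_Chr20:3016775..3022589 spans 5815 bp.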


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570899.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-181; % identity: 85.38)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDS GFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTN-------------------------------------------------LCKIRDLENELER
        SP I+REKWRKGSQWIALVRRHAEVVVND IIFPLFTN                                                 L  IRDLENELER
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTN-------------------------------------------------LCKIRDLENELER

Query:  RTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        RTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEI+HIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
Subjt:  RTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

KAG7010745.1 yuiD, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.3e-281; % identity: 100)
Query:  MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQYKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMY
        MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQYKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMY
Subjt:  MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQYKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMY

Query:  DASGVRLHAGLQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNF
        DASGVRLHAGLQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNF
Subjt:  DASGVRLHAGLQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNF

Query:  TYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGA
        TYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGA
Subjt:  TYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGA

Query:  ALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDLEN
        ALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDLEN
Subjt:  ALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDLEN

Query:  ELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        ELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
Subjt:  ELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

XP_022943740.1 uncharacterized protein LOC111448398 [Cucurbita moschata] (e value: 8.3e-183; % identity: 88.32)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDS GFQSPPKIAFLFLTRSNLPLDFIWASFFK+GDQAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLH+FSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG
        SP I+REKWRKGSQWIALVRRHAEVVVND IIFPLFTN CK                                  IR+LENELERRTLTYTMWNSSIPKG
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        DKRSWHPVTFQYSDATPLIIKEMKEI+HIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

XP_022986321.1 uncharacterized protein LOC111484100 [Cucurbita maxima] (e value: 7.2e-179; % identity: 86.41)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKV VVVCAMLLTLAILIFHSDEFK QSSNFTYQFRNNGFGE+S GFQSPPKIAFLFLTRSNLPLDFIWASFFKNG+QAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKS FFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG
         P I REKWRKGSQWIALVRRHAEVVVND IIFPLFTN CK                                  IRDLENELERRTLTYTMWNSSIPKG
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        DKRSWHPVTFQYSDATPLIIKE+KEI+HIDFES+HRTEWCRVNS YTPCFLFARKFTPGAALR+LKNG
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

XP_023512252.1 uncharacterized protein LOC111777042 [Cucurbita pepo subsp. pepo] (e value: 1.4e-182; % identity: 88.56)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDS GFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNP+M
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG
        SP I REKWRKGSQWIALVRRHAE VVND IIFPLFTN CK                                  IRDLENELERRTLTYTMWNSSIPKG
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN
        DKRSWHPVTFQYSDATPLIIKEMKEI+HIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN

TrEMBL top hits (e value, % identity, alignment)
A0A1R3J356 Glycosyl transferase, family 14 (e value: 2.2e-149; % identity: 55.14)
Query:  MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQ--------YKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAV
        MDEVMT  DA SS    +  P     LL SNLPLI+AFLA A+AQ        +KER+W+SKRM  SGGMPSSHSATVTALA+AI LQ+G+GGPAFA+AV
Subjt:  MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQ--------YKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAV

Query:  VFACVVMYDASGVRLHAGLQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDE
        V ACVVMYDASGVRLHAG QAELLNQIVCEFPPEHPLSS+RPLR+ LGHTPLQ                  AGA  G  VA                   
Subjt:  VFACVVMYDASGVRLHAGLQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDE

Query:  FKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIE
        F ++SS                     PKIAFLFL R NLPLDF+W SFFKN D+A FSIY+HSQPGFV+++S TKS FFY RQLNNSIQV+WG+STMI+
Subjt:  FKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIE

Query:  AERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNL
        AE+LLF AAL DPANQRFVLLS+SCIP++NF + Y YLMSS KSFVDSFL+V E  Y+PEMSP I  +KWRKGSQWIAL+RRHA +V +D I+FP+F   
Subjt:  AERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNL

Query:  CK-----------------------------------------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFE
        CK                                         +R L+ E+ERRTLTY+ WN S   G+  +WHP+ F+++DATP +I+E+K+I  + +E
Subjt:  CK-----------------------------------------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFE

Query:  SEHRTEWCRVNS--MYTPCFLFARKFTPGAALRVL
        +E R E C VN+  +  PCFLFARKFTP AA+R+L
Subjt:  SEHRTEWCRVNS--MYTPCFLFARKFTPGAALRVL

A0A5A7T0B3 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative isoform 1 (e value: 1.9e-148; % identity: 74.59)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKL-QSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSI
        MTKKKGST T       KV V++CA+LLTLA+L+FHSDEFKL QSSNF YQF+NNG G  S GFQSPPKIAFLFLTR  LPLDF+WA+FF+NGD+AKFSI
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKL-QSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSI

Query:  YIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPE
        YIHSQPGFVYDKSTTKS  FY RQLNNSIQVLWG+STMIEAERLLFGAAL DPANQRFVLLSDSCIPLHNFSHTY YLMSSTKSFVDSFLNV EGRYNPE
Subjt:  YIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPE

Query:  MSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK---------------------------------IRDLENELERRTLTYTMWNSSIPKG
        M P I +EKWRKGSQWI LVRRHAEVVVND IIFPLF   CK                                 IR LE ELERRTLTY+ WNSSIPK 
Subjt:  MSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK---------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLK
        DKRSWHPVTF Y DATP  IKE+KEI+HIDFESEHRTEWCRV S YT CFLFARKF+PGA LR+LK
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLK

A0A6J1CGL5 uncharacterized protein LOC111011229 (e value: 3.3e-153; % identity: 73.42)
Query:  MTKKKGSTATAG----AAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAK
        MTKKKGS   A      A GSKV V VCAMLLTLAILIFHSDEFKLQSS+FTY+FR NGFG DS GFQSPPKIAFLFL R NLPLDF+WA+FFKNGD+AK
Subjt:  MTKKKGSTATAG----AAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAK

Query:  FSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRY
        FSIYIHSQPGFV++KSTTKS FFYGRQLNNS+QVLWG+STMIEAERLLFGAAL DPANQRFVLLSDSCIPLHNFSHTY YLMSSTKSFVDSFLNV EGRY
Subjt:  FSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRY

Query:  NPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK------------------------------------------IRDLENELERRTL
        NPEMSP IQ EKWRKGSQWIALVRRHAEVVVND IIFPLF   CK                                          IR LENE+ERRTL
Subjt:  NPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK------------------------------------------IRDLENELERRTL

Query:  TYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        TY+ WN+SIPKGD+RSWHPVT  Y DATP  +KE+KEI+HID+ESEHRTEWC VNS+YTPC+LFARKFT  AALR+  NG
Subjt:  TYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

A0A6J1FY75 uncharacterized protein LOC111448398 (e value: 4.0e-183; % identity: 88.32)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDS GFQSPPKIAFLFLTRSNLPLDFIWASFFK+GDQAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLH+FSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG
        SP I+REKWRKGSQWIALVRRHAEVVVND IIFPLFTN CK                                  IR+LENELERRTLTYTMWNSSIPKG
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        DKRSWHPVTFQYSDATPLIIKEMKEI+HIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

A0A6J1JAS9 uncharacterized protein LOC111484100 (e value: 3.5e-179; % identity: 86.41)
Query:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY
        MTKKKGSTATAGAAAGSKV VVVCAMLLTLAILIFHSDEFK QSSNFTYQFRNNGFGE+S GFQSPPKIAFLFLTRSNLPLDFIWASFFKNG+QAKFSIY
Subjt:  MTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPPKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIY

Query:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
        IHSQPGFVYDKSTTKS FFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM
Subjt:  IHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEM

Query:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG
         P I REKWRKGSQWIALVRRHAEVVVND IIFPLFTN CK                                  IRDLENELERRTLTYTMWNSSIPKG
Subjt:  SPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------------------IRDLENELERRTLTYTMWNSSIPKG

Query:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
        DKRSWHPVTFQYSDATPLIIKE+KEI+HIDFES+HRTEWCRVNS YTPCFLFARKFTPGAALR+LKNG
Subjt:  DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG

SwissProt top hits (e value, % identity, alignment)
O32107 Uncharacterized membrane protein YuiD (e value: 3.1e-15; % identity: 37.5)
Query:  LTSNLPLISAFLAGAIAQ--------YKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMYDASGVRLHAGLQAELLNQI
        L +N PL+S+  A   AQ           RK +   +  +GGMPSSHSA VTAL+  +AL+ G     FAV+ +FA + M+DA+GVR HAG QA ++N++
Subjt:  LTSNLPLISAFLAGAIAQ--------YKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMYDASGVRLHAGLQAELLNQI

Query:  VC----------EFPPEHPLSSIRPLRDSLGHTPLQ
        V           +FP        + L++ LGH P++
Subjt:  VC----------EFPPEHPLSSIRPLRDSLGHTPLQ

Q65XS5 Glycosyltransferase BC10 (e value: 7.7e-83; % identity: 48.34)
Query:  KIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPL
        ++AFLF+ R+ LPLD +W +FF+   + +FSI++HS+PGFV  ++TT+S FFY RQ+NNS+QV WG+++MIEAER+L   AL DP N+RFV +SDSC+PL
Subjt:  KIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPL

Query:  HNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIR------------------------
        +NF++TY Y+MSS+ SFVDSF +   GRYNP M P I  E WRKGSQW  L R+HAEVVV D  + P F   C+ R                        
Subjt:  HNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIR------------------------

Query:  -------------DLENELERRTLTYTMWNSSIPKG-DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALR
                      LE EL RR++T++ W+ S  K  ++R WHPVT++ SDATP ++K +K+ID+I +E+E+R EWC  N    PCFLFARKFT  A L+
Subjt:  -------------DLENELERRTLTYTMWNSSIPKG-DKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALR

Query:  VL
        +L
Subjt:  VL

Arabidopsis top hits (e value, % identity, alignment)
AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 8.2e-96; % identity: 55.48)
Query:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP
        PK+AFLFL R +LPLDF+W  FFK  D A FSIYIHS PGFV+++ TT+S +FY RQLNNSI+V+WG+S+MIEAERLL  +AL D +NQRFVLLSD C P
Subjt:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP

Query:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------
        L++F + Y+YL+SS +SFVDSFL+  E RY+ +MSP I  EKWRKGSQWIAL+R HAEV+VND I+FP+F   CK                         
Subjt:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------

Query:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN
                ++ LE+E+ERRT+TYT+WN S  K + +SWHPVTF   ++ P  IKE+K+IDH+ +ESE RTEWC+ +S   PCFLFARKFT  AA+R++  
Subjt:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN

Query:  G
        G
Subjt:  G

AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 4.3e-97; % identity: 55.81)
Query:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP
        PK+AFLFL R +LPLDF+W  FFK+ DQ  FSIY+HS PGFV+D+S+T+S FFY RQL NSI+V+WG+S+MI AERLL  +AL DP+NQRFVLLSDSC+P
Subjt:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP

Query:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------
        L++F + YRYL+SS KSFVDSFL+  + RY  +M P I++EKWRKGSQWI+L+R HAEV+VND  +FP+F   CK                         
Subjt:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------

Query:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN
                +R LENE+ERRT+TYT WN S  K + +SWHP+TF   +  P  I+ +K+I+H+ +ESE+RTEWCR NS   PCFLFARKFT GAA+R+L  
Subjt:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN

Query:  G
        G
Subjt:  G

AT1G62305.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 7.9e-83; % identity: 50.5)
Query:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP
        PK+AFLFL R +LPLDF+W  FFK+ DQ  FSIY+HS PGFV+D+S+T+S FFY RQL NSI+V+WG+S+MI AERLL  +AL DP+NQRFVLLSD    
Subjt:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP

Query:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------
                        SF+D      + RY  +M P I++EKWRKGSQWI+L+R HAEV+VND  +FP+F   CK                         
Subjt:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK-------------------------

Query:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN
                +R LENE+ERRT+TYT WN S  K + +SWHP+TF   +  P  I+ +K+I+H+ +ESE+RTEWCR NS   PCFLFARKFT GAA+R+L  
Subjt:  --------IRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKN

Query:  G
        G
Subjt:  G

AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 2.2e-85; % identity: 50)
Query:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP
        P+IAFLF+ R+ LPL+F+W +FFK G+  KFSIY+HS+PGFV +++TT+S +F  RQLN+SIQV WG+STMIEAER+L   AL D  N RFV LSDSCIP
Subjt:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP

Query:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDL---------------------
        L++FS+TY Y+MS+  SFVDSF +  + RYNP M+P I    WRKGSQW+ L R+HAE+VVND  +FP+F   C+ + L                     
Subjt:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDL---------------------

Query:  --------------ENELERRTLTYTMWN-SSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRV
                      ++EL RR+LT++ W+ SS    ++R WHP+T+++SDATP +I+ +K ID+I++E+E+R EWC      +PCFLFARKFT  AALR+
Subjt:  --------------ENELERRTLTYTMWN-SSIPKGDKRSWHPVTFQYSDATPLIIKEMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRV

Query:  LK
        L+
Subjt:  LK

AT5G14550.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 5.3e-71; % identity: 51.01)
Query:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP
        P+IAFLF+ R+ LPL+F+W +FFK G+  KFSIY+HS+PGFV +++TT+S +F  RQLN+SIQV WG+STMIEAER+L   AL D  N RFV LSDSCIP
Subjt:  PKIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIP

Query:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------IRD
        L++FS+TY Y+MS+  SFVDSF +  + RYNP M+P I    WRKGSQW+ L R+HAE+VVND  +FP+F   C+                       + 
Subjt:  LHNFSHTYRYLMSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCK----------------------IRD

Query:  LENELERRTLTYTMWN-SSIPKGDKRSWHPVTFQYSDATPLIIKEMK
        +++EL RR+LT++ W+ SS    ++R WHP+T+++SDATP +I+ +K
Subjt:  LENELERRTLTYTMWN-SSIPKGDKRSWHPVTFQYSDATPLIIKEMK
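
In the alignments above, the unlabeled row between each `Query:` and `Subjt:` pair is the match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a rough sketch of how identities could be tallied from those rows. The function name `percent_identity` is hypothetical, and the result need not reproduce the listed % identity values, since the page does not say over which span they were computed.

```python
def percent_identity(query_rows, match_rows):
    """Approximate percent identity from BLAST-style alignment rows.

    query_rows: the residue strings of the 'Query:' lines (no label).
    match_rows: the corresponding match-line strings.
    The alignment length is taken from the query rows because trailing
    spaces on match lines are often lost in plain-text copies.
    """
    if len(query_rows) != len(match_rows):
        raise ValueError("each query row needs one match row")
    aligned = sum(len(row) for row in query_rows)
    if aligned == 0:
        raise ValueError("empty alignment")
    # A letter on the match line means query and subject agree there.
    identical = sum(ch.isalpha() for row in match_rows for ch in row)
    return 100.0 * identical / aligned


if __name__ == "__main__":
    # Toy rows for illustration only.
    print(percent_identity(["SPFIQREK"], ["SP I+REK"]))
```

Counting `+` characters in the same way would give the number of positives instead of identities.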


Sequences
CDS sequence
ATGGACGAGGTGATGACGGTTGGGGATGCGTTCTCGTCGTCCGTTAAACCGTCGTTGTTTCCATTGGCACCGGATCCTTTGCTCACTTCCAATCTCCCTCTTATCTCCGC
CTTCCTCGCTGGCGCCATCGCCCAGTATAAGGAAAGAAAATGGGAATCTAAACGGATGTTTGGTTCTGGCGGAATGCCTTCATCTCACTCTGCAACTGTGACTGCTTTGG
CCCTTGCTATTGCCCTGCAGGAAGGATCCGGAGGACCTGCTTTTGCCGTCGCCGTAGTCTTTGCATGTGTTGTAATGTATGATGCTTCTGGAGTCAGACTTCATGCTGGT
CTTCAAGCCGAGTTGCTGAACCAAATAGTTTGCGAGTTTCCTCCTGAACATCCTCTGTCCAGTATTAGACCATTGCGAGATTCACTTGGACACACTCCACTTCAGTCACA
GATACAGGATCAAGCAACTATGACAAAGAAGAAGGGTTCGACCGCCACCGCGGGGGCGGCCGCCGGTTCAAAGGTGGCGGTGGTTGTGTGTGCCATGCTGTTGACATTGG
CCATCCTGATATTCCATTCTGATGAGTTCAAGCTTCAGTCCTCCAACTTCACTTACCAGTTCAGAAACAATGGCTTTGGAGAAGATAGCCAAGGCTTTCAATCACCTCCC
AAGATTGCCTTTCTGTTCCTCACTCGCAGCAACCTTCCTCTTGATTTTATCTGGGCCAGCTTCTTCAAGAATGGAGACCAAGCCAAATTTTCCATTTACATTCACTCACA
ACCAGGCTTTGTTTATGACAAATCAACCACTAAGTCTCCTTTCTTCTATGGCAGACAATTGAACAACAGTATTCAGGTACTATGGGGAGATTCTACCATGATAGAAGCAG
AACGCCTGTTGTTTGGTGCAGCTCTTGCTGATCCAGCAAATCAGAGATTTGTCCTTCTTTCCGATAGCTGCATACCACTGCATAACTTCAGCCATACTTACCGTTATCTG
ATGTCTTCTACAAAAAGCTTTGTCGACAGTTTTTTGAATGTTGGCGAAGGCCGATATAATCCCGAGATGTCGCCTTTTATACAACGGGAAAAATGGCGAAAGGGCTCCCA
GTGGATCGCTTTGGTGAGGAGACATGCTGAAGTTGTAGTGAATGATGTCATAATCTTTCCTCTGTTTACGAACCTCTGTAAGATAAGAGATCTCGAGAATGAACTCGAAC
GACGAACGTTGACGTACACCATGTGGAACAGTTCCATCCCAAAAGGGGACAAAAGATCTTGGCATCCAGTTACTTTCCAATATTCAGATGCAACTCCTCTGATAATCAAA
GAAATGAAGGAAATCGACCACATTGACTTTGAATCCGAACACCGAACCGAGTGGTGTCGTGTTAACTCGATGTATACGCCATGCTTCTTGTTTGCAAGAAAGTTCACTCC
CGGAGCGGCTCTACGTGTCCTGAAAAACGGTTGA
mRNA sequence
ATGGACGAGGTGATGACGGTTGGGGATGCGTTCTCGTCGTCCGTTAAACCGTCGTTGTTTCCATTGGCACCGGATCCTTTGCTCACTTCCAATCTCCCTCTTATCTCCGC
CTTCCTCGCTGGCGCCATCGCCCAGTATAAGGAAAGAAAATGGGAATCTAAACGGATGTTTGGTTCTGGCGGAATGCCTTCATCTCACTCTGCAACTGTGACTGCTTTGG
CCCTTGCTATTGCCCTGCAGGAAGGATCCGGAGGACCTGCTTTTGCCGTCGCCGTAGTCTTTGCATGTGTTGTAATGTATGATGCTTCTGGAGTCAGACTTCATGCTGGT
CTTCAAGCCGAGTTGCTGAACCAAATAGTTTGCGAGTTTCCTCCTGAACATCCTCTGTCCAGTATTAGACCATTGCGAGATTCACTTGGACACACTCCACTTCAGTCACA
GATACAGGATCAAGCAACTATGACAAAGAAGAAGGGTTCGACCGCCACCGCGGGGGCGGCCGCCGGTTCAAAGGTGGCGGTGGTTGTGTGTGCCATGCTGTTGACATTGG
CCATCCTGATATTCCATTCTGATGAGTTCAAGCTTCAGTCCTCCAACTTCACTTACCAGTTCAGAAACAATGGCTTTGGAGAAGATAGCCAAGGCTTTCAATCACCTCCC
AAGATTGCCTTTCTGTTCCTCACTCGCAGCAACCTTCCTCTTGATTTTATCTGGGCCAGCTTCTTCAAGAATGGAGACCAAGCCAAATTTTCCATTTACATTCACTCACA
ACCAGGCTTTGTTTATGACAAATCAACCACTAAGTCTCCTTTCTTCTATGGCAGACAATTGAACAACAGTATTCAGGTACTATGGGGAGATTCTACCATGATAGAAGCAG
AACGCCTGTTGTTTGGTGCAGCTCTTGCTGATCCAGCAAATCAGAGATTTGTCCTTCTTTCCGATAGCTGCATACCACTGCATAACTTCAGCCATACTTACCGTTATCTG
ATGTCTTCTACAAAAAGCTTTGTCGACAGTTTTTTGAATGTTGGCGAAGGCCGATATAATCCCGAGATGTCGCCTTTTATACAACGGGAAAAATGGCGAAAGGGCTCCCA
GTGGATCGCTTTGGTGAGGAGACATGCTGAAGTTGTAGTGAATGATGTCATAATCTTTCCTCTGTTTACGAACCTCTGTAAGATAAGAGATCTCGAGAATGAACTCGAAC
GACGAACGTTGACGTACACCATGTGGAACAGTTCCATCCCAAAAGGGGACAAAAGATCTTGGCATCCAGTTACTTTCCAATATTCAGATGCAACTCCTCTGATAATCAAA
GAAATGAAGGAAATCGACCACATTGACTTTGAATCCGAACACCGAACCGAGTGGTGTCGTGTTAACTCGATGTATACGCCATGCTTCTTGTTTGCAAGAAAGTTCACTCC
CGGAGCGGCTCTACGTGTCCTGAAAAACGGTTGATGAACGATCCGATTGATGGAAAGATATTGTTGTAGATGGATTTAGGACATGATTTTTATTTATGATAACAAAGAAG
GGGTTCCAAAACCACTGTATTTTTAAAGGGATTTGTTGGATGATATGCTAGAAAGAACATAGAATCCTTTTTGAGTATC
Protein sequence
MDEVMTVGDAFSSSVKPSLFPLAPDPLLTSNLPLISAFLAGAIAQYKERKWESKRMFGSGGMPSSHSATVTALALAIALQEGSGGPAFAVAVVFACVVMYDASGVRLHAG
LQAELLNQIVCEFPPEHPLSSIRPLRDSLGHTPLQSQIQDQATMTKKKGSTATAGAAAGSKVAVVVCAMLLTLAILIFHSDEFKLQSSNFTYQFRNNGFGEDSQGFQSPP
KIAFLFLTRSNLPLDFIWASFFKNGDQAKFSIYIHSQPGFVYDKSTTKSPFFYGRQLNNSIQVLWGDSTMIEAERLLFGAALADPANQRFVLLSDSCIPLHNFSHTYRYL
MSSTKSFVDSFLNVGEGRYNPEMSPFIQREKWRKGSQWIALVRRHAEVVVNDVIIFPLFTNLCKIRDLENELERRTLTYTMWNSSIPKGDKRSWHPVTFQYSDATPLIIK
EMKEIDHIDFESEHRTEWCRVNSMYTPCFLFARKFTPGAALRVLKNG
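
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (an assumption; the page does not name a translation table), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 nucleotides of the CDS listed above.
    print(translate("ATGGACGAGGTGATGACGGTTGGGGAT"))
```

The first 27 nucleotides of the CDS translate to MDEVMTVGD, which matches the first nine residues of the protein sequence.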