; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G001030 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G001030
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationchr10:1606718..1609554
RNA-Seq ExpressionLsi10G001030
SyntenyLsi10G001030
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034789.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative isoform 1 [Cucumis melo var. makuwa]1.2e-14766.36Show/hide
Query:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------
        MTKKKGS   V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF        
Subjt:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------

Query:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL
                                       +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVD        SFL
Subjt:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL

Query:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN
        NVNEGRYNPEMLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  
Subjt:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN

Query:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT
        R+ T      EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRT
Subjt:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT

Query:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND
        EWCRV STY SCFLFARKF+PGAGLRILK D
Subjt:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND

TYK09590.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative isoform 1 [Cucumis melo var. makuwa]1.4e-15167.29Show/hide
Query:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------
        MTKKKGS   V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF        
Subjt:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------

Query:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL
                                       +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRY  EM  SFL
Subjt:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL

Query:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN
        NVNEGRYNPEMLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  
Subjt:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN

Query:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT
        R+ T      EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRT
Subjt:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT

Query:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND
        EWCRV STY SCFLFARKF+PGAGLRILK D
Subjt:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND

XP_008455976.1 PREDICTED: uncharacterized protein LOC103496040 [Cucumis melo]6.1e-14466.27Show/hide
Query:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF------------------
        V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF                  
Subjt:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF------------------

Query:  ---------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE
                             +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVD        SFLNVNEGRYNPE
Subjt:  ---------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE

Query:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK
        MLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  R+ T      
Subjt:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK

Query:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM
        EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRTEWCRV STY 
Subjt:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM

Query:  SCFLFARKFTPGAGLRILKND
        SCFLFARKF+PGAGLRILK D
Subjt:  SCFLFARKFTPGAGLRILKND

XP_031737240.1 glycosyltransferase BC10 [Cucumis sativus]2.9e-14665.43Show/hide
Query:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-------
        MTKKKGS   VCPPL     KVVV++CA+LLTLAIL+FH DEFKLIQSSNFTYQF+NNG+G SHGF S PKIAFLFLTRKKLPLDFLWANFFK       
Subjt:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-------

Query:  --------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL
                                        VLWGES+MIEAERLLF AALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSS KSFVD        SF 
Subjt:  --------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL

Query:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN
        NV+EGRYNP+MLPVI QEKWRKGSQWITLVR+HAE+VVND IIFPLFKKFCK W                                      PP   +  
Subjt:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN

Query:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT
        R+ T      EK+HPNCIPDEHYVQ+LLSIRGL++ELERRTLTYSTWNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRT
Subjt:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT

Query:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND
        EWC VNS Y SCFLFARKFTPGAGLRIL+ D
Subjt:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND

XP_038902391.1 glycosyltransferase BC10-like [Benincasa hispida]5.7e-15067.93Show/hide
Query:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-----------------
        V PPLTAAGSK+VVV+CAM+LTLAILIFH DEFKLIQSSNFTYQ RNNG+GDSHGFQS PKIAFLFLTRKKLPLDFLWANFFK                 
Subjt:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-----------------

Query:  ----------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE
                              VLWG+SSMIEAERLLFGAALDDPANQRFV+LSDSCIPLHNF+HTYNYLMSSTKSFVD        SFLNV+EGRYNP+
Subjt:  ----------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE

Query:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK
        MLPVI QEKWRKGSQWITLVRKHAE+VVNDAIIFPLFKK C+ W                                      PP      R+      + 
Subjt:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK

Query:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM
        EKYHPNCIPDEHYVQ+LLSIRGLENELERRTLTYS WNSSIPK DKRSWHPVTF+YPDATPLRIKEIK         EINHID+ESEHRTEWCRVNS Y 
Subjt:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM

Query:  SCFLFARKFTPGAGLRILKND
        SC+LFARKFTPGAGLRILKND
Subjt:  SCFLFARKFTPGAGLRILKND

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ0 Uncharacterized protein7.3e-14365.32Show/hide
Query:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-----------------
        VCPPL     KVVV++CA+LLTLAIL+FH DEFKLIQSSNFTYQF+NNG+G SHGF S PKIAFLFLTRKKLPLDFLWANFFK                 
Subjt:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK-----------------

Query:  ----------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE
                              VLWGES+MIEAERLLF AALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSS KSFVD        SF NV+EGRYNP+
Subjt:  ----------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE

Query:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK
        MLPVI QEKWRKGSQWITLVR+HAE+VVND IIFPLFKKFCK W                                      PP   +  R+ T      
Subjt:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK

Query:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM
        EK+HPNCIPDEHYVQ+LLSIRGL++ELERRTLTYSTWNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRTEWC VNS Y 
Subjt:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM

Query:  SCFLFARKFTPGAGLRILKND
        SCFLFARKFTPGAGLRIL+ D
Subjt:  SCFLFARKFTPGAGLRILKND

A0A1S3C1Q7 uncharacterized protein LOC1034960402.9e-14466.27Show/hide
Query:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF------------------
        V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF                  
Subjt:  VCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF------------------

Query:  ---------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE
                             +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVD        SFLNVNEGRYNPE
Subjt:  ---------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPE

Query:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK
        MLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  R+ T      
Subjt:  MLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEK

Query:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM
        EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRTEWCRV STY 
Subjt:  EKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYM

Query:  SCFLFARKFTPGAGLRILKND
        SCFLFARKF+PGAGLRILK D
Subjt:  SCFLFARKFTPGAGLRILKND

A0A5A7T0B3 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative isoform 15.7e-14866.36Show/hide
Query:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------
        MTKKKGS   V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF        
Subjt:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------

Query:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL
                                       +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVD        SFL
Subjt:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL

Query:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN
        NVNEGRYNPEMLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  
Subjt:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN

Query:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT
        R+ T      EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRT
Subjt:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT

Query:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND
        EWCRV STY SCFLFARKF+PGAGLRILK D
Subjt:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND

A0A5D3CE26 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative isoform 16.5e-15267.29Show/hide
Query:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------
        MTKKKGS   V PPL     KVVV++CA+LLTLA+L+FH DEFKLIQSSNF YQF+NNG G SHGFQS PKIAFLFLTR+KLPLDFLWANFF        
Subjt:  MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--------

Query:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL
                                       +VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRY  EM  SFL
Subjt:  -------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFL

Query:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN
        NVNEGRYNPEMLPVI QEKWRKGSQWITLVR+HAEVVVND IIFPLFKKFCK W                                      PP   +  
Subjt:  NVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLN

Query:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT
        R+ T      EKYHPNCIPDEHYVQ+LLSIRGLE ELERRTLTYS WNSSIPK DKRSWHPVTFYYPDATP  IKEIK         EINHIDFESEHRT
Subjt:  RRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFESEHRT

Query:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND
        EWCRV STY SCFLFARKF+PGAGLRILK D
Subjt:  EWCRVNSTYMSCFLFARKFTPGAGLRILKND

A0A6J1CGL5 uncharacterized protein LOC1110112292.3e-14163.36Show/hide
Query:  MTKKKGSMT-VVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK------
        MTKKKGS T   C PLTA GSKVVV +CAMLLTLAILIFH DEFKL QSS+FTY+FR NG+GDSHGFQS PKIAFLFL R+ LPLDFLWANFFK      
Subjt:  MTKKKGSMT-VVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFK------

Query:  ---------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSF
                                         VLWGES+MIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTY+YLMSSTKSFVD        SF
Subjt:  ---------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSF

Query:  LNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNL
        LNVNEGRYNPEM PVI +EKWRKGSQWI LVR+HAEVVVNDAIIFPLFKKFCK W                                      PP     
Subjt:  LNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNL

Query:  NRRL---TKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFES
         R++    KFDI++ K+HPNCIPDEHYVQ+LLSIRGLENE+ERRTLTYS WN+SIPKGD+RSWHPVT +Y DATP ++KEIK         EINHID+ES
Subjt:  NRRL---TKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDFES

Query:  EHRTEWCRVNSTYMSCFLFARKFTPGAGLRILKN
        EHRTEWC VNS Y  C+LFARKFT  A LR+  N
Subjt:  EHRTEWCRVNSTYMSCFLFARKFTPGAGLRILKN

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC106.9e-6640.95Show/hide
Query:  KIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL
        ++AFLF+ R +LPLD +W  FF+                                       V WGE+SMIEAER+L   AL DP N+RFV +SDSC+PL
Subjt:  KIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL

Query:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV
        +NF++TY+Y+MSS+ SFVD        SF +   GRYNP M P+I  E WRKGSQW  L RKHAEVVV D  + P F+K C+                  
Subjt:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV

Query:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKG-DKRSWHPVTFYYPDA
                           R  P   ++ +R +     E  K H NCIPDEHYVQ+LL+  GLE EL RR++T+S W+ S  K  ++R WHPVT+   DA
Subjt:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKG-DKRSWHPVTFYYPDA

Query:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL
        TP  +K IK         +I++I +E+E+R EWC  N     CFLFARKFT  AGL++L
Subjt:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL

Arabidopsis top hitse value%identityAlignment
AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein5.2e-7740.78Show/hide
Query:  MTKKKGSMTVVCPPLTAAGS------KVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--
        MTKK  S   + PPL+  G       K+V+     L  LA+L     ++    + +F      +           PK+AFLFL R+ LPLDF+W  FF  
Subjt:  MTKKKGSMTVVCPPLTAAGS------KVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFF--

Query:  -------------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFE
                                             KV+WGESSMIEAERLL  +AL+D +NQRFVLLSD C PL++F + Y YL+SS +SFVD     
Subjt:  -------------------------------------KVLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFE

Query:  MSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPP
           SFL+  E RY+ +M PVI +EKWRKGSQWI L+R HAEV+VND I+FP+FK+FCK                                        PP
Subjt:  MSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPP

Query:  EGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDF
         G N       F  +K +   NCIPDEHYVQ+LL+++GLE+E+ERRT+TY+ WN S  K + +SWHPVTF   ++ P  IKEIK         +I+H+ +
Subjt:  EGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDATPLRIKEIKVSVGNVNALEINHIDF

Query:  ESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL
        ESE RTEWC+ +S  + CFLFARKFT  A +RI+
Subjt:  ESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL

AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein7.8e-8145.96Show/hide
Query:  PKIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIP
        PK+AFLFL R+ LPLDFLW  FFK                                       V+WGESSMI AERLL  +AL+DP+NQRFVLLSDSC+P
Subjt:  PKIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIP

Query:  LHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIF
        L++F + Y YL+SS KSFVD        SFL+  + RY  +M PVI +EKWRKGSQWI+L+R HAEV+VND  +FP+F+KFCK                 
Subjt:  LHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIF

Query:  VKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDA
                            R+ P     L+ R     ++K ++  NCIPDEHYVQ+LL++RGLENE+ERRT+TY+TWN S  K + +SWHP+TF   + 
Subjt:  VKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDA

Query:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL
         P  I+ IK         +INH+ +ESE+RTEWCR NS  + CFLFARKFT GA +R+L
Subjt:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL

AT1G62305.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.2e-6841.78Show/hide
Query:  PKIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIP
        PK+AFLFL R+ LPLDFLW  FFK                                       V+WGESSMI AERLL  +AL+DP+NQRFVLLSD    
Subjt:  PKIAFLFLTRKKLPLDFLWANFFK---------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIP

Query:  LHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIF
                        SF+D+             + RY  +M PVI +EKWRKGSQWI+L+R HAEV+VND  +FP+F+KFCK                 
Subjt:  LHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIF

Query:  VKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDA
                            R+ P     L+ R     ++K ++  NCIPDEHYVQ+LL++RGLENE+ERRT+TY+TWN S  K + +SWHP+TF   + 
Subjt:  VKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDA

Query:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL
         P  I+ IK         +INH+ +ESE+RTEWCR NS  + CFLFARKFT GA +R+L
Subjt:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRIL

AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.8e-6741.67Show/hide
Query:  PKIAFLFLTRKKLPLDFLWANFFK--------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL
        P+IAFLF+ R +LPL+F+W  FFK                                      V WGES+MIEAER+L   AL D  N RFV LSDSCIPL
Subjt:  PKIAFLFLTRKKLPLDFLWANFFK--------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL

Query:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV
        ++FS+TYNY+MS+  SFVD        SF +  + RYNP M P+I    WRKGSQW+ L RKHAE+VVND  +FP+F++ C+                  
Subjt:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV

Query:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWN-SSIPKGDKRSWHPVTFYYPDA
        KS  +   D          R  P EG               K H NCIPDEHYVQ+LLS +G+++EL RR+LT+S W+ SS    ++R WHP+T+ + DA
Subjt:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWN-SSIPKGDKRSWHPVTFYYPDA

Query:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRILK
        TP  I+ IK          I++I++E+E+R EWC        CFLFARKFT  A LR+L+
Subjt:  TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRILK

AT5G14550.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.6e-5740.57Show/hide
Query:  PKIAFLFLTRKKLPLDFLWANFFK--------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL
        P+IAFLF+ R +LPL+F+W  FFK                                      V WGES+MIEAER+L   AL D  N RFV LSDSCIPL
Subjt:  PKIAFLFLTRKKLPLDFLWANFFK--------------------------------------VLWGESSMIEAERLLFGAALDDPANQRFVLLSDSCIPL

Query:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV
        ++FS+TYNY+MS+  SFVD        SF +  + RYNP M P+I    WRKGSQW+ L RKHAE+VVND  +FP+F++ C+                  
Subjt:  HNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFKVVQFSNEAIFV

Query:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWN-SSIPKGDKRSWHPVTFYYPDA
                              P EG               K H NCIPDEHYVQ+LLS +G+++EL RR+LT+S W+ SS    ++R WHP+T+ + DA
Subjt:  KSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWN-SSIPKGDKRSWHPVTFYYPDA

Query:  TPLRIKEIKVSVGNVNAL
        TP  I+ IKVS+   N L
Subjt:  TPLRIKEIKVSVGNVNAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAAGAAGAAAGGTTCAATGACGGTCGTATGCCCACCGTTGACAGCGGCGGGTTCGAAGGTGGTGGTGGTTATGTGTGCCATGCTGTTAACATTGGCCATATTGAT
ATTTCATTGTGATGAGTTCAAACTAATTCAATCCTCCAACTTCACTTACCAATTCAGAAACAATGGTTATGGAGATAGCCATGGCTTCCAATCACTTCCTAAGATTGCAT
TTCTGTTCCTCACTCGTAAAAAACTTCCTCTTGATTTTCTTTGGGCCAACTTTTTCAAGGTACTATGGGGAGAATCCAGCATGATCGAAGCAGAGCGCCTGCTATTTGGT
GCAGCTCTTGATGATCCAGCAAATCAAAGATTCGTTCTTCTTTCCGATAGCTGCATACCTCTGCATAACTTCAGCCATACTTACAATTATCTGATGTCTTCTACAAAAAG
CTTTGTCGACAGGTACGTATTCGAAATGTCGTGTAGTTTTTTGAATGTTAACGAAGGCCGATATAATCCCGAGATGTTGCCCGTAATAATGCAGGAAAAATGGCGAAAGG
GTTCTCAGTGGATTACTTTGGTGAGGAAACATGCTGAAGTTGTAGTGAATGATGCCATAATCTTTCCTCTGTTTAAGAAATTTTGTAAGGTGTGGAATGAAAGTTTTAAA
GTAGTTCAATTTAGTAATGAAGCAATATTTGTAAAATCATCCCATGATCAGATTAATGATCTTTTCGCGGTTTGCAGCGATGGCCACCACCGGAACACGCCACCGGAAGG
AAAAAACCTAAACAGGAGGTTGACAAAATTTGATATTGAAAAGGAGAAGTATCATCCCAATTGCATACCAGATGAGCATTATGTGCAGAGTTTACTTTCAATAAGGGGAC
TCGAGAATGAACTCGAACGACGAACGTTGACGTACTCGACGTGGAACAGTTCTATCCCAAAAGGGGACAAAAGATCTTGGCATCCAGTTACTTTCTATTATCCAGATGCA
ACTCCTTTGAGAATCAAAGAAATAAAGGTCAGTGTGGGCAATGTTAATGCATTAGAAATCAATCACATCGACTTTGAATCCGAGCACCGAACAGAGTGGTGTCGTGTTAA
CTCGACGTATATGTCGTGCTTCTTGTTTGCAAGAAAGTTCACTCCCGGGGCGGGGTTGCGAATCTTGAAAAATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAAGAAGAAAGGTTCAATGACGGTCGTATGCCCACCGTTGACAGCGGCGGGTTCGAAGGTGGTGGTGGTTATGTGTGCCATGCTGTTAACATTGGCCATATTGAT
ATTTCATTGTGATGAGTTCAAACTAATTCAATCCTCCAACTTCACTTACCAATTCAGAAACAATGGTTATGGAGATAGCCATGGCTTCCAATCACTTCCTAAGATTGCAT
TTCTGTTCCTCACTCGTAAAAAACTTCCTCTTGATTTTCTTTGGGCCAACTTTTTCAAGGTACTATGGGGAGAATCCAGCATGATCGAAGCAGAGCGCCTGCTATTTGGT
GCAGCTCTTGATGATCCAGCAAATCAAAGATTCGTTCTTCTTTCCGATAGCTGCATACCTCTGCATAACTTCAGCCATACTTACAATTATCTGATGTCTTCTACAAAAAG
CTTTGTCGACAGGTACGTATTCGAAATGTCGTGTAGTTTTTTGAATGTTAACGAAGGCCGATATAATCCCGAGATGTTGCCCGTAATAATGCAGGAAAAATGGCGAAAGG
GTTCTCAGTGGATTACTTTGGTGAGGAAACATGCTGAAGTTGTAGTGAATGATGCCATAATCTTTCCTCTGTTTAAGAAATTTTGTAAGGTGTGGAATGAAAGTTTTAAA
GTAGTTCAATTTAGTAATGAAGCAATATTTGTAAAATCATCCCATGATCAGATTAATGATCTTTTCGCGGTTTGCAGCGATGGCCACCACCGGAACACGCCACCGGAAGG
AAAAAACCTAAACAGGAGGTTGACAAAATTTGATATTGAAAAGGAGAAGTATCATCCCAATTGCATACCAGATGAGCATTATGTGCAGAGTTTACTTTCAATAAGGGGAC
TCGAGAATGAACTCGAACGACGAACGTTGACGTACTCGACGTGGAACAGTTCTATCCCAAAAGGGGACAAAAGATCTTGGCATCCAGTTACTTTCTATTATCCAGATGCA
ACTCCTTTGAGAATCAAAGAAATAAAGGTCAGTGTGGGCAATGTTAATGCATTAGAAATCAATCACATCGACTTTGAATCCGAGCACCGAACAGAGTGGTGTCGTGTTAA
CTCGACGTATATGTCGTGCTTCTTGTTTGCAAGAAAGTTCACTCCCGGGGCGGGGTTGCGAATCTTGAAAAATGATTGA
Protein sequenceShow/hide protein sequence
MTKKKGSMTVVCPPLTAAGSKVVVVMCAMLLTLAILIFHCDEFKLIQSSNFTYQFRNNGYGDSHGFQSLPKIAFLFLTRKKLPLDFLWANFFKVLWGESSMIEAERLLFG
AALDDPANQRFVLLSDSCIPLHNFSHTYNYLMSSTKSFVDRYVFEMSCSFLNVNEGRYNPEMLPVIMQEKWRKGSQWITLVRKHAEVVVNDAIIFPLFKKFCKVWNESFK
VVQFSNEAIFVKSSHDQINDLFAVCSDGHHRNTPPEGKNLNRRLTKFDIEKEKYHPNCIPDEHYVQSLLSIRGLENELERRTLTYSTWNSSIPKGDKRSWHPVTFYYPDA
TPLRIKEIKVSVGNVNALEINHIDFESEHRTEWCRVNSTYMSCFLFARKFTPGAGLRILKND