; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative
Genome locationchr8:3306865..3307998
RNA-Seq ExpressionMoc08g04590
SyntenyMoc08g04590
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445230.1 PREDICTED: uncharacterized protein LOC103488317 [Cucumis melo]4.5e-16075.32Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP
        ML PNPLSLI ALLLC  LAI FT H TP+      DYP +FPF SLY   +N HRKIT+  P PPPP+DDD LFPLAA VNSTPSPT KLAF+FLT SP
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP

Query:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS
        LPFAPLWELFF+N+PP+ FN+YIHADPTR Y+PPFSGVFA+R+IPSKP+ RFSP+L+ AARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY TLI S
Subjt:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS

Query:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD
         KSFIEVLK+E G YDRWAARGP+VMLP+VKLADFRIGSQFW+L R+HAR+VV D+ VWSKFDLPCVR DTCYPEENYFPTLLSM D  GLV ATLTHV+
Subjt:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD

Query:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        W G FDGHPRTY ASDVGPDLIR LR ARPRYGDGG    +RIRT  GG    S  K    HPFLFARKFSADSL  LMNI+ D I KD
Subjt:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

XP_022132078.1 uncharacterized protein LOC111005037 [Momordica charantia]1.8e-222100Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW
        MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW

Query:  ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV
        ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV
Subjt:  ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV

Query:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG
        LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG
Subjt:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG

Query:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
Subjt:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

XP_022951447.1 uncharacterized protein LOC111454260 [Cucurbita moschata]1.4e-16171.82Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP-----PPPDDDDFLFPLAARVNSTPSPTPKLAFMF
        M + +PLSLICALLLC PLA+LFT++          SD+P +FP  SLY P   THRKIT+F +P     PPP++DD LFPLA+RV+ TPSPT KLAFMF
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP-----PPPDDDDFLFPLAARVNSTPSPTPKLAFMF

Query:  LTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYS
        LT SPLPFAPLWELFF+N+PP+ +N+YIHADPTR+Y+PPFSGVF+HR+IPSKP+ RF+P+L AAARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY 
Subjt:  LTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYS

Query:  TLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTAT
        TLI S KSFIEVLKNEIG YDRWAARGP+ MLPVVKL D RIGSQFW+LTR+HAR VV D++VWSKFDLPCVRWDTCYPEENYFPTLLSM DH GL+ AT
Subjt:  TLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTAT

Query:  LTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG------MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        LTHVDW G FDGHPRTY+ SDV P+LIR LR++R RYGDG       MRIRT     GR   +SSS AK +  H FLFARKFSAD+LQPLMNIS+D+IFK
Subjt:  LTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG------MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

XP_023002372.1 uncharacterized protein LOC111496232 [Cucurbita maxima]5.8e-16070.82Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIF-------PLPPPPDDDDFLFPLAARVNSTPSPTPKLAF
        M + +PLSLICALLLC PLA+LFT++          SD+P +FP  SLY P   THRKIT+F       P PPPP++DD LFPLAARV+  PSPT KLAF
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIF-------PLPPPPDDDDFLFPLAARVNSTPSPTPKLAF

Query:  MFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFT
        MFLT SPLPFAPLWELFF+N+PP+ +N+YIH DPTR+Y+PPFSGVF+HR+IPSKP+ RF+ +L AAARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FT
Subjt:  MFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFT

Query:  YSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVT
        Y TLIRS KSFIEVLKNEIG YDRWAARGP+ MLPVVKL D RIGSQFW LTR+HAR VV D++VW+KFDLPCVRWDTCYPEENYFPTLLSM D  GL+ 
Subjt:  YSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVT

Query:  ATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDG--------GMRIRTAGGRNSSST--AKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        ATLTHVDW GRFDGHPRTY+ +DV P+LIR LR+AR RYGDG        G + +T+G R+SSS+  AK +  H FLFARKFSAD+LQPLMNIS+D+IFK
Subjt:  ATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDG--------GMRIRTAGGRNSSST--AKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

XP_023537126.1 uncharacterized protein LOC111798298 [Cucurbita pepo subsp. pepo]2.1e-16272.07Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP---------PPPDDDDFLFPLAARVNSTPSPTPKL
        M + +PLSLICALLLC PLA+LFT++          SD+P +FP  SLY P   THRKIT+F +P         PPP++DD LFPLA+RV+ TPSPT KL
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP---------PPPDDDDFLFPLAARVNSTPSPTPKL

Query:  AFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFH
        AFMFLT SPLPFAPLWELFF+N+PP+ +N+YIHADPTR+Y+PPFSGVF+HR+IPSKP+ RF+P+L AAARRLLAHALLHDS+NSMFALLSPSCIPLHSF+
Subjt:  AFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFH

Query:  FTYSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGL
        FTY TLI S KSFIEVLKNEIG YDRWAARGP+ MLPVVKL D RIGSQFW+LTR+HAR VV D++VWSKFDLPCVRWDTCYPEENYFPTLLSM DH GL
Subjt:  FTYSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGL

Query:  VTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG--MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        + ATLTHVDW G FDGHPRTY+ SDV P+LIR LR++R RYGDGG  MRIRT     GR   +SSS AK +  H FLFARKFSAD+LQPLMNIS+D+IFK
Subjt:  VTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG--MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

TrEMBL top hitse value%identityAlignment
A0A1S3BC61 uncharacterized protein LOC1034883172.2e-16075.32Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP
        ML PNPLSLI ALLLC  LAI FT H TP+      DYP +FPF SLY   +N HRKIT+  P PPPP+DDD LFPLAA VNSTPSPT KLAF+FLT SP
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP

Query:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS
        LPFAPLWELFF+N+PP+ FN+YIHADPTR Y+PPFSGVFA+R+IPSKP+ RFSP+L+ AARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY TLI S
Subjt:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS

Query:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD
         KSFIEVLK+E G YDRWAARGP+VMLP+VKLADFRIGSQFW+L R+HAR+VV D+ VWSKFDLPCVR DTCYPEENYFPTLLSM D  GLV ATLTHV+
Subjt:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD

Query:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        W G FDGHPRTY ASDVGPDLIR LR ARPRYGDGG    +RIRT  GG    S  K    HPFLFARKFSADSL  LMNI+ D I KD
Subjt:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

A0A5A7VHH3 Putative Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-16075.32Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP
        ML PNPLSLI ALLLC  LAI FT H TP+      DYP +FPF SLY   +N HRKIT+  P PPPP+DDD LFPLAA VNSTPSPT KLAF+FLT SP
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTPS------DYPLLFPFPSLYFPNANTHRKITIF-PLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSP

Query:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS
        LPFAPLWELFF+N+PP+ FN+YIHADPTR Y+PPFSGVFA+R+IPSKP+ RFSP+L+ AARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY TLI S
Subjt:  LPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRS

Query:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD
         KSFIEVLK+E G YDRWAARGP+VMLP+VKLADFRIGSQFW+L R+HAR+VV D+ VWSKFDLPCVR DTCYPEENYFPTLLSM D  GLV ATLTHV+
Subjt:  NKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVD

Query:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        W G FDGHPRTY ASDVGPDLIR LR ARPRYGDGG    +RIRT  GG    S  K    HPFLFARKFSADSL  LMNI+ D I KD
Subjt:  WKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG----MRIRT-AGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

A0A6J1BV94 uncharacterized protein LOC1110050378.9e-223100Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW
        MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLW

Query:  ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV
        ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV
Subjt:  ELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV

Query:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG
        LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG
Subjt:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG

Query:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
Subjt:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

A0A6J1GHM6 uncharacterized protein LOC1114542606.7e-16271.82Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP-----PPPDDDDFLFPLAARVNSTPSPTPKLAFMF
        M + +PLSLICALLLC PLA+LFT++          SD+P +FP  SLY P   THRKIT+F +P     PPP++DD LFPLA+RV+ TPSPT KLAFMF
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIFPLP-----PPPDDDDFLFPLAARVNSTPSPTPKLAFMF

Query:  LTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYS
        LT SPLPFAPLWELFF+N+PP+ +N+YIHADPTR+Y+PPFSGVF+HR+IPSKP+ RF+P+L AAARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY 
Subjt:  LTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYS

Query:  TLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTAT
        TLI S KSFIEVLKNEIG YDRWAARGP+ MLPVVKL D RIGSQFW+LTR+HAR VV D++VWSKFDLPCVRWDTCYPEENYFPTLLSM DH GL+ AT
Subjt:  TLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTAT

Query:  LTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG------MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        LTHVDW G FDGHPRTY+ SDV P+LIR LR++R RYGDG       MRIRT     GR   +SSS AK +  H FLFARKFSAD+LQPLMNIS+D+IFK
Subjt:  LTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGG------MRIRT---AGGR---NSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

A0A6J1KJC2 uncharacterized protein LOC1114962322.8e-16070.82Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIF-------PLPPPPDDDDFLFPLAARVNSTPSPTPKLAF
        M + +PLSLICALLLC PLA+LFT++          SD+P +FP  SLY P   THRKIT+F       P PPPP++DD LFPLAARV+  PSPT KLAF
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHRTP-------SDYPLLFPFPSLYFPNANTHRKITIF-------PLPPPPDDDDFLFPLAARVNSTPSPTPKLAF

Query:  MFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFT
        MFLT SPLPFAPLWELFF+N+PP+ +N+YIH DPTR+Y+PPFSGVF+HR+IPSKP+ RF+ +L AAARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FT
Subjt:  MFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFT

Query:  YSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVT
        Y TLIRS KSFIEVLKNEIG YDRWAARGP+ MLPVVKL D RIGSQFW LTR+HAR VV D++VW+KFDLPCVRWDTCYPEENYFPTLLSM D  GL+ 
Subjt:  YSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVT

Query:  ATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDG--------GMRIRTAGGRNSSST--AKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        ATLTHVDW GRFDGHPRTY+ +DV P+LIR LR+AR RYGDG        G + +T+G R+SSS+  AK +  H FLFARKFSAD+LQPLMNIS+D+IFK
Subjt:  ATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDG--------GMRIRTAGGRNSSST--AKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC104.5e-3029.47Show/hide
Query:  LICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSP--TPKLAFMFLTTSPLPFAPLWELFFRN
        L+  + LC  + +L  LH +          PSL        RK           +++ +    A V   P P    +LAF+F+  + LP   +W+ FFR 
Subjt:  LICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSP--TPKLAFMFLTTSPLPFAPLWELFFRN

Query:  VPPERFNIYIHADP----TRQYNPPFSGVFAHRLIPSKPSHRF-SPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEVL
            RF+I++H+ P    TR      SG F +R + +     +   ++  A R LLAHA L D  N  F  +S SC+PL++F++TY  ++ S+ SF++  
Subjt:  VPPERFNIYIHADP----TRQYNPPFSGVFAHRLIPSKPSHRF-SPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEVL

Query:  KNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVR---------WD-----------TCYPEENYFPTLLSM-GD
               D  A R    M P++ + ++R GSQ+ +LTRKHA +VV DE V  +F   C R         WD            C P+E+Y  TLL+  G 
Subjt:  KNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVR---------WD-----------TCYPEENYFPTLLSM-GD

Query:  HPGLVTATLTHVDW-------KGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS
           L   ++TH  W       + R   HP TY+ SD  P L++ ++     Y +          R    T+   P   FLFARKF+  +   L+++S
Subjt:  HPGLVTATLTHVDW-------KGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS

Arabidopsis top hitse value%identityAlignment
AT3G52060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.5e-7044.95Show/hide
Query:  VNSTPSPTPKLAFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPF-SGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFAL
        ++ +P+P PK+AF+FLT S L F PLWE FF+    + +N+YIHADPT   +P   S     + IP++ + R SPTL +A RRLLA+A+L D  N  FAL
Subjt:  VNSTPSPTPKLAFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPF-SGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFHFTYSTLIRSN--KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEEN
        +S  CIPLHSF + ++ L   +  +SFIE+L +E     R+ ARG + MLP ++  DFR+GSQF++L ++HA MV+ + ++W KF LPC+  ++CYPEE+
Subjt:  LSPSCIPLHSFHFTYSTLIRSN--KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEEN

Query:  YFPTLLSMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS
        YFPTLLS+ D  G    TLT V+W G   GHP TY+AS++ P LI  LR                  R++SS         + FARKF+ +SLQPLM I+
Subjt:  YFPTLLSMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS

Query:  TDIIFKD
          +IF+D
Subjt:  TDIIFKD

AT3G52060.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.5e-7044.95Show/hide
Query:  VNSTPSPTPKLAFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPF-SGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFAL
        ++ +P+P PK+AF+FLT S L F PLWE FF+    + +N+YIHADPT   +P   S     + IP++ + R SPTL +A RRLLA+A+L D  N  FAL
Subjt:  VNSTPSPTPKLAFMFLTTSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPF-SGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFHFTYSTLIRSN--KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEEN
        +S  CIPLHSF + ++ L   +  +SFIE+L +E     R+ ARG + MLP ++  DFR+GSQF++L ++HA MV+ + ++W KF LPC+  ++CYPEE+
Subjt:  LSPSCIPLHSFHFTYSTLIRSN--KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEEN

Query:  YFPTLLSMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS
        YFPTLLS+ D  G    TLT V+W G   GHP TY+AS++ P LI  LR                  R++SS         + FARKF+ +SLQPLM I+
Subjt:  YFPTLLSMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNIS

Query:  TDIIFKD
          +IF+D
Subjt:  TDIIFKD

AT4G32290.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.5e-11052.97Show/hide
Query:  MLSPNPLSLICALLLCFPLAILFTLHR------TPSDYPLLFPFPSLYFPNA--NTHRKITIFPLPPPPDDDDFLFPLAARVNST--PSPTPKLAFMFLT
        M+SP    L+CAL LC P+A++FT+ R      +P          SLY  N   ++     +    P P +DD L  L++RVN    P  T K+AFM+LT
Subjt:  MLSPNPLSLICALLLCFPLAILFTLHR------TPSDYPLLFPFPSLYFPNA--NTHRKITIFPLPPPPDDDDFLFPLAARVNST--PSPTPKLAFMFLT

Query:  TSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTL
        TSPLPFAPLWE+FF  +    +N+Y+HADPTR+Y+PPFSGVF +R+I SKPS R +PTL AAARRLLAHALL D  N MFA++SPSC+P+ SF FTY TL
Subjt:  TSPLPFAPLWELFFRNVPPERFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTL

Query:  IRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLT
        + S KSFIE+LK+E   +DRW A G   MLP VKL +FRIGSQFW+L R+HAR+V  D R+W KF+  CVR D+CYPEE+YFPTLL+M D  G V ATLT
Subjt:  IRSNKSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLT

Query:  HVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        HVDW     GHPR YE  +V P+L+ RLR  RPRYG+ G+        N S  +K     PFLFARKFS  +L+PL+ ++  ++F D
Subjt:  HVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD

AT5G22070.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein6.8e-6637.91Show/hide
Query:  NPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARV------NSTPSPTPKLAFMFLTTSPLPFAP
        N   L  +LLLC P    F         P +FP P    P        ++ P+    DD       A         +  P+P  K+AF+FLT S L FAP
Subjt:  NPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARV------NSTPSPTPKLAFMFLTTSPLPFAP

Query:  LWELFFRNVPPERFNIYIHADPTRQYNPPFSG-VFAHRLIP-SKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSN--
        +W+ FF       +N+Y+HADP      P +G VF +  I  +K + R SPTL +A RRLLA A L D AN+ FA+LS  CIPLHSF++ YS+L  S+  
Subjt:  LWELFFRNVPPERFNIYIHADPTRQYNPPFSG-VFAHRLIP-SKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSN--

Query:  ------------------KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLL
                          +SF+E++ +E   + R+ ARG   M+P V    FR+GSQF+++TR+HA + + D  +W KF LPC R D CYPEE+YFPTLL
Subjt:  ------------------KSFIEVLKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLL

Query:  SMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK
        +M D  G    TLT V+W G   GHP TY+  +V P+LI+RLR                         +S+    + FARKF+ D L+PL+ I+  +IF+
Subjt:  SMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFK

Query:  D
        D
Subjt:  D

AT5G25330.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein7.2e-10052.25Show/hide
Query:  LSLICALLLCFPLAILFTLHRTPSDYPLLFPFPS-LYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSP--TPKLAFMFLTTSPLPFAPLWELF
        L+L   LL+C PL ++ T+        +    P+ L   N N +          P D D+ L   A++ N  PSP    KLAFMFLTT+ LP APLWELF
Subjt:  LSLICALLLCFPLAILFTLHRTPSDYPLLFPFPS-LYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSP--TPKLAFMFLTTSPLPFAPLWELF

Query:  FRNVPPER--FNIYIHADPTRQYNPPFSGVFAHRLIP-SKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV
        F      +  +N+Y+H DPT+++ P   G F +R+IP SKP++R +PTL +AARRLLAHALL D +N MF LLSPSCIPLHSF+FTY TL+ S KSFIE+
Subjt:  FRNVPPER--FNIYIHADPTRQYNPPFSGVFAHRLIP-SKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEV

Query:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG
        LK+E G Y+RWAARGP  M P V   +FRIGSQFW LTR HA MVVSD  +WSKF+  CVR D CYPEE+YFPTLL+M D  G V+AT+THVDW     G
Subjt:  LKNEIGTYDRWAARGPEVMLPVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDG

Query:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD
        HPRTY+  +V  +LI++LR ARPRYGDG            + T K     PFLFARKFS   +  LMNI+  +IF D
Subjt:  HPRTYEASDVGPDLIRRLRIARPRYGDGGMRIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCCCCTAACCCACTTTCTCTCATCTGTGCTCTGCTTCTTTGCTTCCCTTTAGCTATTCTCTTCACCCTCCACAGAACCCCCTCCGATTACCCTCTACTTTTCCC
CTTCCCCTCCCTCTATTTCCCCAACGCCAACACCCACCGTAAAATTACCATTTTCCCGCTCCCTCCTCCGCCCGATGACGACGATTTTCTGTTCCCCCTCGCCGCCCGAG
TCAACTCCACCCCATCCCCAACTCCCAAATTAGCCTTCATGTTCCTCACCACATCCCCCCTCCCTTTCGCCCCTCTTTGGGAGTTGTTCTTCAGAAACGTCCCACCCGAA
CGTTTCAATATCTACATCCACGCCGACCCGACCCGCCAATACAACCCGCCCTTCTCCGGCGTCTTCGCCCACCGCCTCATCCCCTCCAAACCGTCTCACAGATTCTCCCC
CACGCTCGCCGCCGCCGCGCGCCGCCTACTCGCCCACGCCCTCCTTCACGACTCCGCTAATTCCATGTTTGCTCTCCTCTCTCCCTCTTGCATTCCTCTCCATTCATTTC
ATTTCACTTACTCGACGCTAATTCGATCCAACAAGAGCTTCATCGAGGTTCTCAAAAATGAGATCGGCACCTACGACCGGTGGGCGGCGCGTGGTCCCGAAGTAATGCTT
CCGGTGGTTAAATTGGCGGATTTCCGGATTGGGTCGCAATTTTGGATTCTCACGCGCAAGCACGCGAGGATGGTGGTGAGTGATGAGAGGGTTTGGTCAAAGTTTGACTT
GCCCTGCGTGCGGTGGGACACGTGTTATCCTGAGGAGAATTATTTCCCCACCTTACTCAGCATGGGGGACCACCCAGGGCTGGTTACGGCTACACTTACACACGTGGACT
GGAAGGGCCGTTTCGATGGCCACCCTCGCACGTACGAGGCATCCGACGTGGGCCCCGATCTAATACGTCGTCTGAGGATCGCCCGGCCGAGATACGGCGACGGTGGAATG
AGAATTCGAACGGCCGGCGGCCGGAATTCTTCGTCGACCGCTAAATCTCACCCCCACCACCCGTTCTTGTTTGCGAGGAAATTCTCCGCCGATTCGCTCCAGCCGTTGAT
GAACATATCCACTGACATCATCTTTAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTCCCCTAACCCACTTTCTCTCATCTGTGCTCTGCTTCTTTGCTTCCCTTTAGCTATTCTCTTCACCCTCCACAGAACCCCCTCCGATTACCCTCTACTTTTCCC
CTTCCCCTCCCTCTATTTCCCCAACGCCAACACCCACCGTAAAATTACCATTTTCCCGCTCCCTCCTCCGCCCGATGACGACGATTTTCTGTTCCCCCTCGCCGCCCGAG
TCAACTCCACCCCATCCCCAACTCCCAAATTAGCCTTCATGTTCCTCACCACATCCCCCCTCCCTTTCGCCCCTCTTTGGGAGTTGTTCTTCAGAAACGTCCCACCCGAA
CGTTTCAATATCTACATCCACGCCGACCCGACCCGCCAATACAACCCGCCCTTCTCCGGCGTCTTCGCCCACCGCCTCATCCCCTCCAAACCGTCTCACAGATTCTCCCC
CACGCTCGCCGCCGCCGCGCGCCGCCTACTCGCCCACGCCCTCCTTCACGACTCCGCTAATTCCATGTTTGCTCTCCTCTCTCCCTCTTGCATTCCTCTCCATTCATTTC
ATTTCACTTACTCGACGCTAATTCGATCCAACAAGAGCTTCATCGAGGTTCTCAAAAATGAGATCGGCACCTACGACCGGTGGGCGGCGCGTGGTCCCGAAGTAATGCTT
CCGGTGGTTAAATTGGCGGATTTCCGGATTGGGTCGCAATTTTGGATTCTCACGCGCAAGCACGCGAGGATGGTGGTGAGTGATGAGAGGGTTTGGTCAAAGTTTGACTT
GCCCTGCGTGCGGTGGGACACGTGTTATCCTGAGGAGAATTATTTCCCCACCTTACTCAGCATGGGGGACCACCCAGGGCTGGTTACGGCTACACTTACACACGTGGACT
GGAAGGGCCGTTTCGATGGCCACCCTCGCACGTACGAGGCATCCGACGTGGGCCCCGATCTAATACGTCGTCTGAGGATCGCCCGGCCGAGATACGGCGACGGTGGAATG
AGAATTCGAACGGCCGGCGGCCGGAATTCTTCGTCGACCGCTAAATCTCACCCCCACCACCCGTTCTTGTTTGCGAGGAAATTCTCCGCCGATTCGCTCCAGCCGTTGAT
GAACATATCCACTGACATCATCTTTAAAGATTGA
Protein sequenceShow/hide protein sequence
MLSPNPLSLICALLLCFPLAILFTLHRTPSDYPLLFPFPSLYFPNANTHRKITIFPLPPPPDDDDFLFPLAARVNSTPSPTPKLAFMFLTTSPLPFAPLWELFFRNVPPE
RFNIYIHADPTRQYNPPFSGVFAHRLIPSKPSHRFSPTLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFHFTYSTLIRSNKSFIEVLKNEIGTYDRWAARGPEVML
PVVKLADFRIGSQFWILTRKHARMVVSDERVWSKFDLPCVRWDTCYPEENYFPTLLSMGDHPGLVTATLTHVDWKGRFDGHPRTYEASDVGPDLIRRLRIARPRYGDGGM
RIRTAGGRNSSSTAKSHPHHPFLFARKFSADSLQPLMNISTDIIFKD