CuGenDBv2

Spg035949 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035949
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: GDP-L-galactose phosphorylase 1-like
Genome location: scaffold5:38813560..38818827
RNA-Seq Expression: Spg035949
Synteny: Spg035949
Gene Ontology terms:
GO:0005737 - cytoplasm (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0080048 - GDP-D-glucose phosphorylase activity (molecular function)
InterPro domains: IPR026506 - GDP-L-galactose/GDP-D-glucose phosphorylase
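
The genome location field uses a `scaffold:start..end` form. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

scaffold, start, end = parse_location("scaffold5:38813560..38818827")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 5268 bp on scaffold5.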


Homology
GenBank top hits (e value, % identity, alignment)
KAG7019471.1 GDP-L-galactose phosphorylase 1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.8e-154; % identity: 73.25)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLAKWEE LSEG FRYDVTLSEIK IVG++KFLAQLNE+WTN SLSQYEE  KC+ GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK  EELLFCIS GEN EPKLISA+LVP SSILV+INATPVEYGHV+LLPC VNGPLQFL+DRS  LEM L +AVEINNF LCLFYE STPRTAC +FQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAMAVD FF DSL GIYVSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ+               
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+ AAASLDDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK

XP_022140026.1 GDP-L-galactose phosphorylase 1-like [Momordica charantia] (e value: 9.4e-159; % identity: 73.2)
Query:  LAGIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYE-EKSKCNQGSLF
        + GIRIPLYQFG YSPLDN S   PS V LEEQTILESLLLA+WEE LSEGLFRYDVTLS+IKVIVGRRKFLAQLNE+W ++ LS+ + +K +C+QGSL 
Subjt:  LAGIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYE-EKSKCNQGSLF

Query:  QTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACVYF
        QTN  KRQEELLFCISSGENTEP+LISAALVPN SILVIINATPVEYGHV LLPCGVNGPLQFLEDRSLEMLLRIA+EIN+  LCLFYEFSTPRTAC YF
Subjt:  QTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACVYF

Query:  QALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFS
        QALFFSSLLPVEAM +D FF DSLRGIYVSTITGYPVK L+FES++NLKKMGEVIA+ CSHLQEKCILYNLLIRDCGKKIFLFL                
Subjt:  QALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFS

Query:  LALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVASK
                                             QP G DKSS+LSPWECWGYF+F SRSEFDQATE+ALLDRLAAASLDDAEFQGVKQFCCH+ S+
Subjt:  LALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

XP_022927227.1 GDP-L-galactose phosphorylase 1-like isoform X1 [Cucurbita moschata] (e value: 1.3e-155; % identity: 73.45)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLAKWEE LSEG FRYDVTLSEIK IVG++KFLAQLN +WTN SLSQYEE  KC+ GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK  EELLFCIS GEN E KLISA+LVP SSILV+INATPVEYGHV+LLPC VNGPLQFL+DRS  LEMLLR+AVEINNF LCLFYE STPRTAC YFQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAMAVD FF DSL GIYVSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ+               
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+ AAAS DDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

XP_023520017.1 GDP-L-galactose phosphorylase 1-like [Cucurbita pepo subsp. pepo] (e value: 6.3e-155; % identity: 73.2)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ S  APS VPLEEQTILES+LLAKWEE LSEG FRYDVTLSEIK IVG++KFLAQLNE+WTN SLSQYEE  KC+ GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK QEELLFCIS GEN E KLISA+LVP SSILV+INATPVEYGHV+LLPC V+GPLQFL+DRS  LEMLLR+AVEINNF LCLFYE STPRTAC +FQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAMAVD FF DSL  IYVSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ+               
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+ AAASLDDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

XP_038894169.1 GDP-L-galactose phosphorylase 1-like isoform X1 [Benincasa hispida] (e value: 1.0e-157; % identity: 73.33)
Query:  LAGIRIPLYQFGRYSPLDNTSF---NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGS
        + GIR+PLYQFG++SPL++ SF     PS  PLEEQT+LESLLLAKWEE LSEGLFRYDVTLSEIKVIVGRRKFLAQLNE+WT+TSL QYEEK KC+ G 
Subjt:  LAGIRIPLYQFGRYSPLDNTSF---NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGS

Query:  LFQTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACV
        LFQTNWLK  EELL CISSGENTE KLISAALVP+SSILV+INATPVEYGHV LLPCGVNGPLQFL+DRSLEMLLRIAVE+NNF+LCLFYEFSTPRTAC+
Subjt:  LFQTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACV

Query:  YFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKN
        YFQALFFSSLLPVEAM  D FF DSL GIYVSTITGYPVKAL+FE+S NLKKMG+VIAE  +HLQEKCILYNLLIRDCGKKIFLFL              
Subjt:  YFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKN

Query:  FSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVA
                                               QP   DKSSILSPWECWGYF+FKSRSEFDQATE+ALL RLAAASLDDAEFQGVKQFCC VA
Subjt:  FSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVA

Query:  SKVSF
        SKV+F
Subjt:  SKVSF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CEI2 GDP-L-galactose phosphorylase 1-like (e value: 4.6e-159; % identity: 73.2)
Query:  LAGIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYE-EKSKCNQGSLF
        + GIRIPLYQFG YSPLDN S   PS V LEEQTILESLLLA+WEE LSEGLFRYDVTLS+IKVIVGRRKFLAQLNE+W ++ LS+ + +K +C+QGSL 
Subjt:  LAGIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYE-EKSKCNQGSLF

Query:  QTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACVYF
        QTN  KRQEELLFCISSGENTEP+LISAALVPN SILVIINATPVEYGHV LLPCGVNGPLQFLEDRSLEMLLRIA+EIN+  LCLFYEFSTPRTAC YF
Subjt:  QTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYEFSTPRTACVYF

Query:  QALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFS
        QALFFSSLLPVEAM +D FF DSLRGIYVSTITGYPVK L+FES++NLKKMGEVIA+ CSHLQEKCILYNLLIRDCGKKIFLFL                
Subjt:  QALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFS

Query:  LALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVASK
                                             QP G DKSS+LSPWECWGYF+F SRSEFDQATE+ALLDRLAAASLDDAEFQGVKQFCCH+ S+
Subjt:  LALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

A0A6J1EGL2 GDP-L-galactose phosphorylase 1-like isoform X1 (e value: 6.2e-156; % identity: 73.45)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLAKWEE LSEG FRYDVTLSEIK IVG++KFLAQLN +WTN SLSQYEE  KC+ GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK  EELLFCIS GEN E KLISA+LVP SSILV+INATPVEYGHV+LLPC VNGPLQFL+DRS  LEMLLR+AVEINNF LCLFYE STPRTAC YFQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAMAVD FF DSL GIYVSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ+               
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+ AAAS DDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

A0A6J1EHG1 GDP-L-galactose phosphorylase 1-like isoform X2 (e value: 8.6e-142; % identity: 68.98)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLA                    K IVG++KFLAQLN +WTN SLSQYEE  KC+ GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK  EELLFCIS GEN E KLISA+LVP SSILV+INATPVEYGHV+LLPC VNGPLQFL+DRS  LEMLLR+AVEINNF LCLFYE STPRTAC YFQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAMAVD FF DSL GIYVSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ+               
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+ AAAS DDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRL-AAASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

A0A6J1KK34 GDP-L-galactose phosphorylase 1-like isoform X2 (e value: 1.3e-137; % identity: 68.24)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLA                    K I+G++KFLAQLNE+WTN SLSQYEE S    GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK QEELLFCIS GEN E KLISAALVP SSILVIINATPVEYGHV+LLPC VNGPLQF +D S  LEMLLR++VEINNF LCLFYE STP TAC+YFQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAM VD FF DSL GI VSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ                
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLA-AASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+A AASLDDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLA-AASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

A0A6J1KLR2 GDP-L-galactose phosphorylase 1-like isoform X1 (e value: 9.2e-152; % identity: 72.7)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN
        GIR+PLYQFGR SPLD+ SF APS VPLEEQTILES+LLAKWEE LSEG FRYDVTLSEIK I+G++KFLAQLNE+WTN SLSQYEE S    GSLFQTN
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTN

Query:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ
        WLK QEELLFCIS GEN E KLISAALVP SSILVIINATPVEYGHV+LLPC VNGPLQF +D S  LEMLLR++VEINNF LCLFYE STP TAC+YFQ
Subjt:  WLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRS--LEMLLRIAVEINNFTLCLFYEFSTPRTACVYFQ

Query:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL
        ALFFSSLLPVEAM VD FF DSL GI VSTIT YPVKALLFESS NLKKMG+VIAE CSHLQEKC+LYNLLIRDCGKKIFLFLQ                
Subjt:  ALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLKGRMMKNFSL

Query:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLA-AASLDDAEFQGVKQFCCHVASK
                                               G D+SSILSPWECWGYF+FKSRSEFDQATEDALLDR+A AASLDDAEFQGVKQFCC VA K
Subjt:  ALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLA-AASLDDAEFQGVKQFCCHVASK

Query:  VSF
        VSF
Subjt:  VSF

SwissProt top hits (e value, % identity, alignment)
Q3TLS3 GDP-D-glucose phosphorylase 1 (e value: 2.7e-07; % identity: 32.05)
Query:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLK-RQEELLFCISSGENTE
        ++PS+V     +  +S L + W + L  GLFRY +   + +++ G   F+AQLN             +S   +    Q N+ K R  E+LF +      E
Subjt:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLK-RQEELLFCISSGENTE

Query:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE
        PK   A       +LV+IN +P+E+GHVLL+P     P Q L  R L  +LR+ +E
Subjt:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE

Q6ZNW5 GDP-D-glucose phosphorylase 1 (e value: 6.0e-07; % identity: 32.05)
Query:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLK-RQEELLFCISSGENTE
        NAP       Q+  ++ L + W++ +  GLFRY +   + +++ G   F+AQLN             KS        Q N+ K R  E+LF +    + E
Subjt:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLK-RQEELLFCISSGENTE

Query:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE
        P L    L     ILV+IN +P+E+GHVLL+P     P + L  R L   LR  +E
Subjt:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE

Q8HXE4 GDP-D-glucose phosphorylase 1 (e value: 2.3e-06; % identity: 31.41)
Query:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLKRQE-ELLFCISSGENTE
        NAP  +    Q+  ++ L + W++ +  GLFRY +   + +++ G   F+AQLN             KS        Q N+ K Q  E+L+ +    + E
Subjt:  NAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLKRQE-ELLFCISSGENTE

Query:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE
        P L    L     ILV+IN +P+E+GHVLL+P     P + L  R L   LR  +E
Subjt:  PKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVE

Q8RWE8 GDP-L-galactose phosphorylase 1 (e value: 2.2e-33; % identity: 29.38)
Query:  LAGIRIPLY------QFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNEN---WTNTSLSQYEEKS
        L G R+PLY      + G    + + +   P +        LESL+L +WE+    GLFRYDVT  E KVI G+  F+AQLNE        +  + ++  
Subjt:  LAGIRIPLY------QFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNEN---WTNTSLSQYEEKS

Query:  KCNQGSLFQTNWLK-RQEELLFCISSGENTEPKLISAALV--PNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE
        +   GS F  N+ K  QEELLF   +GE+ + +      +   NS  +V IN +P+EYGHVLL+P  ++   Q ++ +SL + + +A E  N    L Y 
Subjt:  KCNQGSLFQTNWLK-RQEELLFCISSGENTEPKLISAALV--PNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE

Query:  ----FSTPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQ
            F+T     ++FQA + +   P+E  A       ++ G+ +S +  YPV++LLFE   +++++ + +++ C  LQ   I +N+LI DCG++IFL  Q
Subjt:  ----FSTPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQ

Query:  TWLHNLKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDA
         +                                   +  G          V P  L+     + WE  G+ V K + +++ A+ED     LA ASL + 
Subjt:  TWLHNLKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDA

Query:  EFQGV
         F+ V
Subjt:  EFQGV

Q9FLP9 GDP-L-galactose phosphorylase 2 (e value: 1.1e-29; % identity: 29.5)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQ---GSLF
        G R+PLY           +  +P        T LESL++ +WE+    GLFRYDVT  E KVI G+  F+AQLNE              K  Q   G+ F
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQ---GSLF

Query:  QTNWLK-RQEELLFCISSGENTEPKLIS-AALVP----NSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE----F
          N+ K  QEELLF   +  N +   I   A +P    NS  +V IN +P+EYGHVLL+P  ++   Q ++ +SL + L++A E +N    L Y     F
Subjt:  QTNWLK-RQEELLFCISSGENTEPKLIS-AALVP----NSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE----F

Query:  STPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHN
        +T     ++FQA + +   P+E  A       +  G+ +S +  YPV+ LL E    +K + + +++    LQ   I +N+LI D GK+IFL  Q +   
Subjt:  STPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHN

Query:  LKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGV
                                        +  G   +T     V P         + WE  G+ V K + +++ A+E+     LA  SL +  F+ V
Subjt:  LKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGV

Arabidopsis top hits (e value, % identity, alignment)
AT4G26850.1 mannose-1-phosphate guanylyltransferase (GDP)s; GDP-galactose:mannose-1-phosphate guanylyltransferases; GDP-galactose:glucose-1-phosphate guanylyltransferases; GDP-galactose:myoinositol-1-phosphate guanylyltransferases; glucose-1-phosphate guanylyltransferase (e value: 1.6e-34; % identity: 29.38)
Query:  LAGIRIPLY------QFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNEN---WTNTSLSQYEEKS
        L G R+PLY      + G    + + +   P +        LESL+L +WE+    GLFRYDVT  E KVI G+  F+AQLNE        +  + ++  
Subjt:  LAGIRIPLY------QFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNEN---WTNTSLSQYEEKS

Query:  KCNQGSLFQTNWLK-RQEELLFCISSGENTEPKLISAALV--PNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE
        +   GS F  N+ K  QEELLF   +GE+ + +      +   NS  +V IN +P+EYGHVLL+P  ++   Q ++ +SL + + +A E  N    L Y 
Subjt:  KCNQGSLFQTNWLK-RQEELLFCISSGENTEPKLISAALV--PNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE

Query:  ----FSTPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQ
            F+T     ++FQA + +   P+E  A       ++ G+ +S +  YPV++LLFE   +++++ + +++ C  LQ   I +N+LI DCG++IFL  Q
Subjt:  ----FSTPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQ

Query:  TWLHNLKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDA
         +                                   +  G          V P  L+     + WE  G+ V K + +++ A+ED     LA ASL + 
Subjt:  TWLHNLKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDA

Query:  EFQGV
         F+ V
Subjt:  EFQGV

AT5G55120.1 galactose-1-phosphate guanylyltransferase (GDP)s; GDP-D-glucose phosphorylases; quercetin 4'-O-glucosyltransferases (e value: 8.0e-31; % identity: 29.5)
Query:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQ---GSLF
        G R+PLY           +  +P        T LESL++ +WE+    GLFRYDVT  E KVI G+  F+AQLNE              K  Q   G+ F
Subjt:  GIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKFLAQLNENWTNTSLSQYEEKSKCNQ---GSLF

Query:  QTNWLK-RQEELLFCISSGENTEPKLIS-AALVP----NSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE----F
          N+ K  QEELLF   +  N +   I   A +P    NS  +V IN +P+EYGHVLL+P  ++   Q ++ +SL + L++A E +N    L Y     F
Subjt:  QTNWLK-RQEELLFCISSGENTEPKLIS-AALVP----NSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNFTLCLFYE----F

Query:  STPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHN
        +T     ++FQA + +   P+E  A       +  G+ +S +  YPV+ LL E    +K + + +++    LQ   I +N+LI D GK+IFL  Q +   
Subjt:  STPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHN

Query:  LKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGV
                                        +  G   +T     V P         + WE  G+ V K + +++ A+E+     LA  SL +  F+ V
Subjt:  LKGRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGV
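
In the alignments above, the unlabeled line between each Query and Subjt row is the match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, identities can be counted for a single row as follows. The helper name `identity_from_midline` is hypothetical, and it assumes the match line is passed at full width with its leading label padding removed and trailing spaces intact. The % identity reported per hit covers the whole alignment, so one row will generally not reproduce it.

```python
def identity_from_midline(midline):
    """Return (identities, columns, percent identity) for one match-line row."""
    columns = len(midline)
    # Letters mark identical residues; '+' and spaces are not identities.
    identities = sum(1 for ch in midline if ch.isalpha())
    percent = 100.0 * identities / columns if columns else 0.0
    return identities, columns, percent

# Final row of the XP_038894169.1 alignment (query SKVSF, match line SKV+F).
print(identity_from_midline("SKV+F"))
```

Summing identities and columns over every row of a hit before dividing gives a figure comparable to the reported value.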


Sequences
CDS sequence
CACGCGTTGCTGCCCAATACGTCGAGCCAATTCAACCATTGCTCTGCCAGAGGGAGATCGAGAGCGAGTGAACAGGGATGGCGTAAGGAAATTGGTTGGCGTTTCTTTCC
TGATTTGAGAACGATTCGGCGACTTGTTTGCATCGGTTGGTTGGTGAAGGCTCCTTATCTTCCTTCCATCCAAAAGGCATATGCAATTTGCTTACCATATTTTCTTTCAA
ATCTGAATTATTCTTCAGAGTTACTCCAAGACTTTTACAGGGCTGGTCCTGTGAGGGCTGGCTTCCATTGTTCAATCACGGCTTTAAGGCAAGAGTCTTGTGTTAAAATT
AGAAGGGGCAAAGAATACATGCAAGCTTTCCCTAAGTTCTTGATTGCATCCCCTTCATTTCACGGTGGGCGTGGGGCCCTGCCCTCAGCTGGTGGGCACCCAGCAGATCT
TACCTTCCTTGCTGGTATTAGAATTCCCCTTTATCAATTTGGTAGATATTCTCCTCTGGATAATACTTCCTTCAATGCGCCTTCAAGTGTTCCTCTGGAAGAACAAACAA
TATTGGAGTCCCTACTTCTTGCCAAGTGGGAAGAGGGATTGTCTGAAGGTCTTTTTAGATATGATGTGACTCTATCTGAAATTAAGGTAATAGTGGGCAGGAGAAAGTTC
CTTGCTCAGCTGAATGAAAACTGGACCAATACTTCTCTGTCGCAGTATGAAGAGAAAAGTAAATGCAACCAAGGAAGTCTATTTCAAACCAATTGGTTGAAACGGCAGGA
GGAATTGCTCTTCTGCATTTCAAGTGGTGAAAATACTGAACCCAAGCTTATTTCTGCAGCTCTAGTGCCCAACAGTTCCATCCTTGTTATTATCAACGCTACTCCTGTGG
AGTATGGCCATGTTCTCTTGTTGCCCTGTGGTGTCAATGGTCCACTTCAGTTCCTGGAGGATAGGTCCTTGGAAATGCTTTTGAGGATTGCTGTTGAAATTAATAATTTC
ACCCTCTGCTTGTTCTATGAGTTTTCTACTCCCAGGACTGCTTGTGTGTATTTTCAGGCACTGTTCTTTTCAAGCCTTTTACCCGTGGAGGCCATGGCTGTTGATGCCTT
CTTTTGTGATAGCTTGAGGGGAATTTATGTTTCTACTATCACAGGTTACCCTGTTAAGGCCCTCTTATTTGAGAGCAGCTGGAACTTGAAGAAAATGGGGGAGGTTATTG
CCGAGACATGTTCTCATTTACAGGAGAAATGCATTCTGTATAATCTGCTGATTCGTGATTGTGGCAAGAAGATTTTCTTATTTCTTCAGACTTGGCTCCACAATCTAAAG
GGTAGGATGATGAAGAATTTCTCTCTTGCCTTACCTGTCGATGGTATTAGGATTACTATGCATAGCGTACTTCTATATCTTGTCTCATTAGAACTCAAAGGACATAATTA
TACAACCTATAAAACATATCCTGTTCAGCCACCAGGGTTGGACAAATCTTCCATTCTTTCACCTTGGGAATGTTGGGGTTACTTTGTGTTCAAATCAAGGTCTGAATTTG
ACCAGGCAACTGAAGATGCCCTGCTCGATCGGCTTGCTGCTGCTTCCCTTGATGATGCAGAATTTCAGGGCGTGAAGCAATTCTGCTGTCATGTTGCAAGTAAAGTTTCT
TTCTGA
mRNA sequence
CACGCGTTGCTGCCCAATACGTCGAGCCAATTCAACCATTGCTCTGCCAGAGGGAGATCGAGAGCGAGTGAACAGGGATGGCGTAAGGAAATTGGTTGGCGTTTCTTTCC
TGATTTGAGAACGATTCGGCGACTTGTTTGCATCGGTTGGTTGGTGAAGGCTCCTTATCTTCCTTCCATCCAAAAGGCATATGCAATTTGCTTACCATATTTTCTTTCAA
ATCTGAATTATTCTTCAGAGTTACTCCAAGACTTTTACAGGGCTGGTCCTGTGAGGGCTGGCTTCCATTGTTCAATCACGGCTTTAAGGCAAGAGTCTTGTGTTAAAATT
AGAAGGGGCAAAGAATACATGCAAGCTTTCCCTAAGTTCTTGATTGCATCCCCTTCATTTCACGGTGGGCGTGGGGCCCTGCCCTCAGCTGGTGGGCACCCAGCAGATCT
TACCTTCCTTGCTGGTATTAGAATTCCCCTTTATCAATTTGGTAGATATTCTCCTCTGGATAATACTTCCTTCAATGCGCCTTCAAGTGTTCCTCTGGAAGAACAAACAA
TATTGGAGTCCCTACTTCTTGCCAAGTGGGAAGAGGGATTGTCTGAAGGTCTTTTTAGATATGATGTGACTCTATCTGAAATTAAGGTAATAGTGGGCAGGAGAAAGTTC
CTTGCTCAGCTGAATGAAAACTGGACCAATACTTCTCTGTCGCAGTATGAAGAGAAAAGTAAATGCAACCAAGGAAGTCTATTTCAAACCAATTGGTTGAAACGGCAGGA
GGAATTGCTCTTCTGCATTTCAAGTGGTGAAAATACTGAACCCAAGCTTATTTCTGCAGCTCTAGTGCCCAACAGTTCCATCCTTGTTATTATCAACGCTACTCCTGTGG
AGTATGGCCATGTTCTCTTGTTGCCCTGTGGTGTCAATGGTCCACTTCAGTTCCTGGAGGATAGGTCCTTGGAAATGCTTTTGAGGATTGCTGTTGAAATTAATAATTTC
ACCCTCTGCTTGTTCTATGAGTTTTCTACTCCCAGGACTGCTTGTGTGTATTTTCAGGCACTGTTCTTTTCAAGCCTTTTACCCGTGGAGGCCATGGCTGTTGATGCCTT
CTTTTGTGATAGCTTGAGGGGAATTTATGTTTCTACTATCACAGGTTACCCTGTTAAGGCCCTCTTATTTGAGAGCAGCTGGAACTTGAAGAAAATGGGGGAGGTTATTG
CCGAGACATGTTCTCATTTACAGGAGAAATGCATTCTGTATAATCTGCTGATTCGTGATTGTGGCAAGAAGATTTTCTTATTTCTTCAGACTTGGCTCCACAATCTAAAG
GGTAGGATGATGAAGAATTTCTCTCTTGCCTTACCTGTCGATGGTATTAGGATTACTATGCATAGCGTACTTCTATATCTTGTCTCATTAGAACTCAAAGGACATAATTA
TACAACCTATAAAACATATCCTGTTCAGCCACCAGGGTTGGACAAATCTTCCATTCTTTCACCTTGGGAATGTTGGGGTTACTTTGTGTTCAAATCAAGGTCTGAATTTG
ACCAGGCAACTGAAGATGCCCTGCTCGATCGGCTTGCTGCTGCTTCCCTTGATGATGCAGAATTTCAGGGCGTGAAGCAATTCTGCTGTCATGTTGCAAGTAAAGTTTCT
TTCTGA
Protein sequence
HALLPNTSSQFNHCSARGRSRASEQGWRKEIGWRFFPDLRTIRRLVCIGWLVKAPYLPSIQKAYAICLPYFLSNLNYSSELLQDFYRAGPVRAGFHCSITALRQESCVKI
RRGKEYMQAFPKFLIASPSFHGGRGALPSAGGHPADLTFLAGIRIPLYQFGRYSPLDNTSFNAPSSVPLEEQTILESLLLAKWEEGLSEGLFRYDVTLSEIKVIVGRRKF
LAQLNENWTNTSLSQYEEKSKCNQGSLFQTNWLKRQEELLFCISSGENTEPKLISAALVPNSSILVIINATPVEYGHVLLLPCGVNGPLQFLEDRSLEMLLRIAVEINNF
TLCLFYEFSTPRTACVYFQALFFSSLLPVEAMAVDAFFCDSLRGIYVSTITGYPVKALLFESSWNLKKMGEVIAETCSHLQEKCILYNLLIRDCGKKIFLFLQTWLHNLK
GRMMKNFSLALPVDGIRITMHSVLLYLVSLELKGHNYTTYKTYPVQPPGLDKSSILSPWECWGYFVFKSRSEFDQATEDALLDRLAAASLDDAEFQGVKQFCCHVASKVS
F
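
The protein sequence corresponds to a frame-1 translation of the CDS under the standard genetic code. The following is a minimal sketch of that translation. The `translate` function and the `CODON_TABLE` name are illustrative, not part of CuGenDBv2, and the code assumes the standard nuclear code with translation stopping at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("CACGCGTTGCTGCCCAATACG"))
print(translate("GTTTCTTTCTGA"))
```

The two fragments correspond to the start (HALLPNT) and end (VSF) of the protein sequence shown above.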