; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009920 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009920
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationscaffold7:4940354..4941862
RNA-Seq ExpressionSpg009920
SyntenySpg009920
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573893.1 hypothetical protein SDJN03_27780, partial [Cucurbita argyrosperma subsp. sororia]2.9e-23982.09Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS
        MDDLH     +D PLPN   S S K RPS       Y SSILIL+L+IS FAFTKTDH+K+QSL        TLFQ+LI+ LN SRN KQ  VPNP   S
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS

Query:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP
        TSS+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLES+ FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCP
Subjt:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP

Query:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF
        PGVYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK F
Subjt:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF

Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW
        VFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTN YHT SDFGNKILDFVE+SGIQ+P SGWAPV+V+DTHIAQTDLPRVYKAADAFVLPSRGEGW
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW

Query:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH
        GRPLVEAM+MSLPVIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSI+KL+VLMREVMTN DEAK KGRRAREDMVRRFSPDVVA+IV  
Subjt:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH

Query:  HIQNVFHD
        HIQ +F +
Subjt:  HIQNVFHD

KAG7012958.1 hypothetical protein SDJN02_25712, partial [Cucurbita argyrosperma subsp. argyrosperma]7.6e-24082.28Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS
        MDDLH     +D PLPN   S S K RPS       Y SSILIL+L+IS FAFTKTDH+K+QSL        TLFQ+LIN LN SRN KQ  VPNP   S
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS

Query:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP
        TSS+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLES+ FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCP
Subjt:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP

Query:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF
        PGVYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK F
Subjt:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF

Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW
        VFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTN YHT SDFGNKILDFVE+SGIQ+P SGWAPV+V+DTHIAQTDLPRVYKAADAFVLPSRGEGW
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW

Query:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH
        GRPLVEAM+MSLPVIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSI+KL+VLMREVMTN DEAK KGRRAREDMVRRFSPDVVA+IV  
Subjt:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH

Query:  HIQNVFHD
        HIQ +F +
Subjt:  HIQNVFHD

XP_022945089.1 uncharacterized protein LOC111449431 [Cucurbita moschata]8.4e-23981.89Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS
        MDDLH     +D PLPN   S S K RPS       Y SSILIL+L+IS FAFTK DH+K+QSL        TLFQ+LI+ LN SRN KQ  VPNP   S
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS

Query:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP
        TSS+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLES+ FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCP
Subjt:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP

Query:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF
        PGVYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK F
Subjt:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF

Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW
        VFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTN YHT SDFGNKILDFVE+SGIQ+P SGWAPV+V+DTHIAQTDLPRVYKAADAFVLPSRGEGW
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW

Query:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH
        GRPLVEAM+MSLPVIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KGRRAREDMVRRFSPDVVA+IV  
Subjt:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH

Query:  HIQNVFHD
        HIQ +F +
Subjt:  HIQNVFHD

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]4.7e-24283.2Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSPI---YFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTS
        MDDLH     +D PLP+  +S S K RPS +   Y SSILIL+L+IS F FTKTDH+K+QSL        TLFQKLI+ LN SRN KQ  V NP   STS
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSPI---YFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTS

Query:  SRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPG
        S+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLESV FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCPPG
Subjt:  SRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPG

Query:  VYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRFVF
        VYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK FVF
Subjt:  VYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRFVF

Query:  LSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGR
        LSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTN YHT SDFGNKILDFVENSGIQKP SGWAPVYV+DTHIAQTDLP+VYKAADAFVLPSRGEGWGR
Subjt:  LSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGR

Query:  PLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHI
        PLVEAM+MSLPVIATNWSG T+FLTDENSYPL VE+MSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAKAKGRRAREDMVRRFSPDVVA+IVH HI
Subjt:  PLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHI

Query:  QNVFHD
        Q +F +
Subjt:  QNVFHD

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]1.1e-24182.32Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS
        MDDLH     +D PLPN   S S K RPS       Y SSILIL+L+IS FAFTKTDH+K+QSL        TLFQ+LI+ LN SRN KQ  VPNP   S
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS

Query:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP
        TSS+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLES+ FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCP
Subjt:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP

Query:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL--EVSLEKR
        PGVYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG  N+  EVSLEK 
Subjt:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL--EVSLEKR

Query:  FVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG
        FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTN YHT +DFGNKILDFVE+SGIQ+P SGWAPV+V+DTHIAQTDLPRVYKAADAFVLPSRGEG
Subjt:  FVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG

Query:  WGRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVH
        WGRPLVEAM+M+LPVIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KGRRAREDMVRRFSPDVVA+IVH
Subjt:  WGRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVH

Query:  HHIQNVFHD
         HIQ +FH+
Subjt:  HHIQNVFHD

TrEMBL top hitse value%identityAlignment
A0A1S3BFB1 uncharacterized protein LOC1034893731.8e-22678.4Show/hide
Query:  HQQSSDP-PLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPF
        H+ +  P P PNQ   FK   SPI+FSSILIL+LAISFFAF KT+ YK+QS             KL NLL   + S Q P  NPS       CVLWMAPF
Subjt:  HQQSSDP-PLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPF

Query:  LSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        LSGGGYSSEAWSYILAL  H+ NP FRL I+QHGDLESV FWEGLP+S RNLAIELHRT CRMNET+V+CHSEPGAWNPPLFETLPCPPG Y+ FKSVIG
Subjt:  LSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLE---VSLEKRFVFLSIFKWEF
        RTMFETDRV+ EHVNRCN MD+VWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKY+PFSLESVGTLVLG NN E   +  +KRFVFLSIFKWEF
Subjt:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLE---VSLEKRFVFLSIFKWEF

Query:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
        RKGWDLLLEAYLKEFSKKD VGL+LLTN YHT SDFGNKILDFVENS +Q P+SGWAPVYV+D HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
Subjt:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM

Query:  SLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR
        SLPVIATNWSGPT+FLTDENSYPLPVERMSEVKE PFKGH+WAEPSISKLQVLMREV  N +EAK KGRRAREDM+ RFSPD+VADIVH  I+N+FH+KR
Subjt:  SLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR

A0A5D3CDB1 Group 1 family glycosyltransferase1.8e-22678.4Show/hide
Query:  HQQSSDP-PLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPF
        H+ +  P P PNQ   FK   SPI+FSSILIL+LAISFFAF KT+ YK+QS             KL NLL   + S Q P  NPS       CVLWMAPF
Subjt:  HQQSSDP-PLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPF

Query:  LSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        LSGGGYSSEAWSYILAL  H+ NP FRL I+QHGDLESV FWEGLP+S RNLAIELHRT CRMNET+V+CHSEPGAWNPPLFETLPCPPG Y+ FKSVIG
Subjt:  LSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLE---VSLEKRFVFLSIFKWEF
        RTMFETDRV+ EHVNRCN MD+VWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKY+PFSLESVGTLVLG NN E   +  +KRFVFLSIFKWEF
Subjt:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLE---VSLEKRFVFLSIFKWEF

Query:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
        RKGWDLLLEAYLKEFSKKD VGL+LLTN YHT SDFGNKILDFVENS +Q P+SGWAPVYV+D HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
Subjt:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM

Query:  SLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR
        SLPVIATNWSGPT+FLTDENSYPLPVERMSEVKE PFKGH+WAEPSISKLQVLMREV  N +EAK KGRRAREDM+ RFSPD+VADIVH  I+N+FH+KR
Subjt:  SLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR

A0A6J1D8R8 uncharacterized protein LOC1110186574.1e-23179.68Show/hide
Query:  HQQSSDPPLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPFL
        H ++  PP P+   S KFR S I   SILIL++AISFF FTKTDH+KTQSL LL        QK I  LNP  NS  IP P P        CVLWMAPFL
Subjt:  HQQSSDPPLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHLRNP-EFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        SGGGYSSEAWSYILALH H++ P EFRLAI+QHGDLES+ FWEGLPDS R+LAI+LH T+CRMNET+V+CHSEPGAWNPPLFETLPCPPGVYQ FK+VIG
Subjt:  SGGGYSSEAWSYILALHDHLRNP-EFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKG
        RTMFETDRV+PEHVNRC  MD++WVPSEFHVSTFVKSGVDPSKIVK+VQPIDVNFFDPLKYRPFSL S+GTLVLG+ ++E+ L+  FVFLSIFKWEFRKG
Subjt:  RTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP
        WDLLLEAYLKEFSKKD VGL+LLTN YH+  DFGNKILDFVENS IQKP SGWAPVYVIDTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLP
Subjt:  WDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP

Query:  VIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR
        VIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSI KLQ LMREV TN DEAKAKGR AR+DMVR+FSPD+VADIV+HHIQNVFHDKR
Subjt:  VIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR

A0A6J1G004 uncharacterized protein LOC1114494314.1e-23981.89Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS
        MDDLH     +D PLPN   S S K RPS       Y SSILIL+L+IS FAFTK DH+K+QSL        TLFQ+LI+ LN SRN KQ  VPNP   S
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSP-----IYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSS

Query:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP
        TSS+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLES+ FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCP
Subjt:  TSSRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCP

Query:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF
        PGVYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK F
Subjt:  PGVYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRF

Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW
        VFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTN YHT SDFGNKILDFVE+SGIQ+P SGWAPV+V+DTHIAQTDLPRVYKAADAFVLPSRGEGW
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGW

Query:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH
        GRPLVEAM+MSLPVIATNWSG T+FLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KGRRAREDMVRRFSPDVVA+IV  
Subjt:  GRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHH

Query:  HIQNVFHD
        HIQ +F +
Subjt:  HIQNVFHD

A0A6J1HWY2 uncharacterized protein LOC1114676052.3e-24283.2Show/hide
Query:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSPI---YFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTS
        MDDLH     +D PLP+  +S S K RPS +   Y SSILIL+L+IS F FTKTDH+K+QSL        TLFQKLI+ LN SRN KQ  V NP   STS
Subjt:  MDDLHRHQQSSDPPLPN--QSPSFKFRPSPI---YFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTS

Query:  SRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPG
        S+CVLWMAPFLSGGGYSSEAWSYILALHDH+RNP FRLAI+QHGDLESV FWEGLPDS +NLAIELHRT+CR+NETIVVCHSEPGAWNPPLFET PCPPG
Subjt:  SRCVLWMAPFLSGGGYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPG

Query:  VYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRFVF
        VYQNFKSVIGRTMFETDRVS EHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG  N+ EVSLEK FVF
Subjt:  VYQNFKSVIGRTMFETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNL-EVSLEKRFVF

Query:  LSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGR
        LSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTN YHT SDFGNKILDFVENSGIQKP SGWAPVYV+DTHIAQTDLP+VYKAADAFVLPSRGEGWGR
Subjt:  LSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGR

Query:  PLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHI
        PLVEAM+MSLPVIATNWSG T+FLTDENSYPL VE+MSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAKAKGRRAREDMVRRFSPDVVA+IVH HI
Subjt:  PLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHI

Query:  QNVFHD
        Q +F +
Subjt:  QNVFHD

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase1.6e-0629.03Show/hide
Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLT---NSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRG
        V L++ +    KG+DLL++ +     ++    L+L     N     +   N++ + V++ G++  V+          ++A  DLP +Y+AAD FVL SR 
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLT---NSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRG

Query:  EGWGRPLVEAMAMSLPVIATNWSG
        E +G   +EAMA   P + T   G
Subjt:  EGWGRPLVEAMAMSLPVIATNWSG

D6Z995 D-inositol 3-phosphate glycosyltransferase1.9e-0426.26Show/hide
Query:  DPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILD
        DP KI  V   +D+  F P        ++     LG +  E       V   + + +  KG D+LL A  +    K GV L ++  +  +G      I+ 
Subjt:  DPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILD

Query:  FVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFLTDENS
          E  G+   V+ W              L +VY+AAD   +PS  E +G   +EA A   PV+A    G    + D+ S
Subjt:  FVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFLTDENS

Q53JI9 Probable sucrose-phosphate synthase 55.6e-0441.07Show/hide
Query:  HIAQTDLPRVYKAA----DAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFL
        H  QTD+P +Y+ A      F+ P+  E +G  ++EA A  LPV+AT   GP D L
Subjt:  HIAQTDLPRVYKAA----DAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFL

Arabidopsis top hitse value%identityAlignment
AT1G52420.1 UDP-Glycosyltransferase superfamily protein1.5e-0433.75Show/hide
Query:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL
        ++L F+ NSG       W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  L V+ T+  G  + +
Subjt:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL

AT3G10630.1 UDP-Glycosyltransferase superfamily protein1.0e-18162.8Show/hide
Query:  DPPLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHP--TLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPFLSGG
        D P       +KF  + +Y SSIL L+L+I    FT TD YK QSL    + +   +  Q L+   + +  SK   + NP+SS  +  CVLWMAPFLS G
Subjt:  DPPLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHP--TLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPFLSGG

Query:  GYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGRTMF
        GYSSEAWSY+L+L +HL NP FR+ I+ HGDLESV FW GL    + +AIE++R +CR NETIVVCHSEPGAW PPLFETLPCPP  Y++F SVIGRTMF
Subjt:  GYSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGRTMF

Query:  ETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKGWDLL
        ETDRV+PEHV RCN MD VWVP++FHVS+FV+SGVD SK+VK+VQP+DV FFDP KY+P  L +VG LVLG+      ++  FVFLS+FKWE RKGWD+L
Subjt:  ETDRVSPEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKGWDLL

Query:  LEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIAT
        L+AYL EFS +D V L+LLTN+YH+ SDFGNKILDFVE   I++P +G+  VYVID HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLPVI T
Subjt:  LEAYLKEFSKKDGVGLYLLTNSYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIAT

Query:  NWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDK
        NWSGPT++LT+ N YPL VE MSEVKEGPF+GH WAEPS+ KL+VLMR VM+N DEAK KG+R R+DMV+ F+P+VVA +V   I  +F +K
Subjt:  NWSGPTDFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDK

AT3G15940.1 UDP-Glycosyltransferase superfamily protein8.9e-0533.75Show/hide
Query:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL
        ++L F+ N+G       W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  + +
Subjt:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL

AT3G15940.2 UDP-Glycosyltransferase superfamily protein8.9e-0533.75Show/hide
Query:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL
        ++L F+ N+G       W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  + +
Subjt:  KILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGPTDFL

AT4G10120.1 Sucrose-phosphate synthase family protein2.0e-0430.34Show/hide
Query:  HIAQTDLPRVYKAA----DAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFLTD-ENSYPLPVERMSEVKEGPFK----GHLWAE
        H  Q+++P +Y+ A      F+ P+  E +G  L+EA A  LP++AT   GP D +    N   +       + +   K     HLWAE
Subjt:  HIAQTDLPRVYKAA----DAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFLTD-ENSYPLPVERMSEVKEGPFK----GHLWAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATCTCCACCGCCACCAACAATCCTCAGATCCACCACTTCCAAATCAATCTCCTTCTTTCAAATTCCGACCTTCTCCGATTTACTTTTCATCCATTCTCATTCT
CATTCTAGCAATTTCCTTCTTCGCTTTCACCAAAACAGATCACTACAAAACCCAATCCTTAAATCTCCTTCTCTCATCTCACCCAACCCTTTTTCAAAAACTCATCAATC
TCCTCAACCCATCTCGAAATTCCAAACAAATCCCAGTTCCTAATCCGTCTTCGTCTTCGACTTCCTCTCGATGTGTTCTTTGGATGGCCCCATTTCTTTCCGGTGGAGGG
TACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATCTCAGAAACCCGGAATTTCGATTGGCCATTCAGCAACATGGTGATCTAGAATCCGTTGCTTTCTG
GGAGGGCTTACCAGATTCTGCGAGGAATTTGGCCATTGAACTTCACAGAACAGAATGTAGAATGAATGAAACTATTGTTGTTTGTCATAGTGAGCCTGGTGCTTGGAATC
CTCCATTGTTTGAAACTCTGCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAGACAGACAGGGTAAGTCCAGAACATGTAAAT
AGGTGTAATGGGATGGATTTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCCATTGA
TGTGAACTTCTTTGATCCACTGAAATATAGGCCATTTAGTCTTGAATCTGTAGGAACTCTAGTTTTAGGTGCCAATAACTTGGAAGTAAGCTTAGAGAAGAGATTTGTGT
TTCTGAGTATCTTTAAATGGGAATTCAGGAAGGGTTGGGATCTGTTGTTGGAAGCCTATTTGAAAGAATTCTCCAAGAAAGATGGAGTGGGGTTGTACTTGTTGACAAAT
TCTTACCATACTGGTAGTGATTTTGGGAACAAGATTTTGGATTTTGTAGAAAACTCAGGCATACAAAAGCCAGTTTCTGGTTGGGCTCCTGTGTATGTGATTGATACTCA
TATAGCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTGCTGCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTCGAAGCGATGGCAATGTCGT
TGCCGGTGATCGCCACCAACTGGTCGGGGCCGACGGACTTTTTGACTGATGAGAATAGCTATCCATTGCCGGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTCAAA
GGGCATCTGTGGGCTGAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTAATGACCAATTTTGATGAAGCTAAGGCCAAAGGACGACGGGCGAGGGAGGACAT
GGTCAGACGATTCTCGCCCGACGTCGTTGCGGATATTGTTCATCATCATATACAAAATGTTTTTCATGACAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATCTCCACCGCCACCAACAATCCTCAGATCCACCACTTCCAAATCAATCTCCTTCTTTCAAATTCCGACCTTCTCCGATTTACTTTTCATCCATTCTCATTCT
CATTCTAGCAATTTCCTTCTTCGCTTTCACCAAAACAGATCACTACAAAACCCAATCCTTAAATCTCCTTCTCTCATCTCACCCAACCCTTTTTCAAAAACTCATCAATC
TCCTCAACCCATCTCGAAATTCCAAACAAATCCCAGTTCCTAATCCGTCTTCGTCTTCGACTTCCTCTCGATGTGTTCTTTGGATGGCCCCATTTCTTTCCGGTGGAGGG
TACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATCTCAGAAACCCGGAATTTCGATTGGCCATTCAGCAACATGGTGATCTAGAATCCGTTGCTTTCTG
GGAGGGCTTACCAGATTCTGCGAGGAATTTGGCCATTGAACTTCACAGAACAGAATGTAGAATGAATGAAACTATTGTTGTTTGTCATAGTGAGCCTGGTGCTTGGAATC
CTCCATTGTTTGAAACTCTGCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAGACAGACAGGGTAAGTCCAGAACATGTAAAT
AGGTGTAATGGGATGGATTTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCCATTGA
TGTGAACTTCTTTGATCCACTGAAATATAGGCCATTTAGTCTTGAATCTGTAGGAACTCTAGTTTTAGGTGCCAATAACTTGGAAGTAAGCTTAGAGAAGAGATTTGTGT
TTCTGAGTATCTTTAAATGGGAATTCAGGAAGGGTTGGGATCTGTTGTTGGAAGCCTATTTGAAAGAATTCTCCAAGAAAGATGGAGTGGGGTTGTACTTGTTGACAAAT
TCTTACCATACTGGTAGTGATTTTGGGAACAAGATTTTGGATTTTGTAGAAAACTCAGGCATACAAAAGCCAGTTTCTGGTTGGGCTCCTGTGTATGTGATTGATACTCA
TATAGCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTGCTGCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTCGAAGCGATGGCAATGTCGT
TGCCGGTGATCGCCACCAACTGGTCGGGGCCGACGGACTTTTTGACTGATGAGAATAGCTATCCATTGCCGGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTCAAA
GGGCATCTGTGGGCTGAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTAATGACCAATTTTGATGAAGCTAAGGCCAAAGGACGACGGGCGAGGGAGGACAT
GGTCAGACGATTCTCGCCCGACGTCGTTGCGGATATTGTTCATCATCATATACAAAATGTTTTTCATGACAAGAGATGA
Protein sequenceShow/hide protein sequence
MDDLHRHQQSSDPPLPNQSPSFKFRPSPIYFSSILILILAISFFAFTKTDHYKTQSLNLLLSSHPTLFQKLINLLNPSRNSKQIPVPNPSSSSTSSRCVLWMAPFLSGGG
YSSEAWSYILALHDHLRNPEFRLAIQQHGDLESVAFWEGLPDSARNLAIELHRTECRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGRTMFETDRVSPEHVN
RCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYRPFSLESVGTLVLGANNLEVSLEKRFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTN
SYHTGSDFGNKILDFVENSGIQKPVSGWAPVYVIDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTDFLTDENSYPLPVERMSEVKEGPFK
GHLWAEPSISKLQVLMREVMTNFDEAKAKGRRAREDMVRRFSPDVVADIVHHHIQNVFHDKR