; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018725 (gene) of Snake gourd v1 genome

Gene IDTan0018725
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationLG04:6221192..6223085
RNA-Seq ExpressionTan0018725
SyntenyTan0018725
Gene Ontology termsGO:0006144 - purine nucleobase metabolic process (biological process)
GO:0019628 - urate catabolic process (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004846 - urate oxidase activity (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573893.1 hypothetical protein SDJN03_27780, partial [Cucurbita argyrosperma subsp. sororia]2.8e-24283.01Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS
        M DLH      DRPLP        KSRPS+      Y SSILILLLSISLF FTKTDH+KSQSLK       TLFQ+LI+ LN + N KQT +PNP S+S
Subjt:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS

Query:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET
        S      QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLESIDFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET
Subjt:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET

Query:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL
         PCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y  FSLESVGTLVLG KN+E V+L
Subjt:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL

Query:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR
        EK FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTNPYHTDSDFGNKILDFVE+S IQ+P SGWAPV+VVDTHIAQTDLPRVYKAADAFVLPSR
Subjt:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR

Query:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD
        GEGWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSI+KL+VLMREVMTN DEAK KG+RAREDMVRRFSPD+VA+
Subjt:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD

Query:  IVHHHIQKIFHE
        IV  HIQ+IF E
Subjt:  IVHHHIQKIFHE

KAG7012958.1 hypothetical protein SDJN02_25712, partial [Cucurbita argyrosperma subsp. argyrosperma]7.3e-24383.2Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS
        M DLH      DRPLP        KSRPS+      Y SSILILLLSISLF FTKTDH+KSQSLK       TLFQ+LIN LN + N KQT +PNP S+S
Subjt:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS

Query:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET
        S      QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLESIDFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET
Subjt:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET

Query:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL
         PCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y  FSLESVGTLVLG KN+E V+L
Subjt:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL

Query:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR
        EK FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTNPYHTDSDFGNKILDFVE+S IQ+P SGWAPV+VVDTHIAQTDLPRVYKAADAFVLPSR
Subjt:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR

Query:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD
        GEGWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSI+KL+VLMREVMTN DEAK KG+RAREDMVRRFSPD+VA+
Subjt:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD

Query:  IVHHHIQKIFHE
        IV  HIQ+IF E
Subjt:  IVHHHIQKIFHE

XP_022945089.1 uncharacterized protein LOC111449431 [Cucurbita moschata]8.1e-24282.81Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS
        M DLH      DRPLP        KSRPS+      Y SSILILLLSISLF FTK DH+KSQSLK       TLFQ+LI+ LN + N KQT +PNP S+S
Subjt:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS

Query:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET
        S      QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLESIDFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET
Subjt:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET

Query:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL
         PCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQP+DVNFFDPL Y  FSLESVGTLVLG KN+E V+L
Subjt:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL

Query:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR
        EK FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTNPYHTDSDFGNKILDFVE+S IQ+P SGWAPV+VVDTHIAQTDLPRVYKAADAFVLPSR
Subjt:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR

Query:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD
        GEGWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KG+RAREDMVRRFSPD+VA+
Subjt:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD

Query:  IVHHHIQKIFHE
        IV  HIQ+IF E
Subjt:  IVHHHIQKIFHE

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]2.1e-24584.12Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSAI---YLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSS
        M DLH      DRPLP        KSRPS++   Y SSILILLLSISLFTFTKTDH+KSQSLK       TLFQKLI+ LN + N KQT + NP S+SS 
Subjt:  MGDLHDNQQSADRPLP--------KSRPSAI---YLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSS

Query:  SSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLP
             QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLES+DFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET P
Subjt:  SSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLP

Query:  CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEK
        CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y  FSLESVGTLVLG KN+  V+LEK
Subjt:  CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEK

Query:  EFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGE
         FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTNPYHTDSDFGNKILDFVENS IQKP SGWAPVYVVDTHIAQTDLP+VYKAADAFVLPSRGE
Subjt:  EFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGE

Query:  GWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIV
        GWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VE+MSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAKAKG+RAREDMVRRFSPD+VA+IV
Subjt:  GWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIV

Query:  HHHIQKIFHE
        H HIQ+IF E
Subjt:  HHHIQKIFHE

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]7.3e-24383.04Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS
        M DLH      D+PLP        KSRPS+      Y SSILILLLSISLF FTKTDH+KSQSLK       TLFQ+LI+ LN + N KQT +PNP S+S
Subjt:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS

Query:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET
        S      QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLESIDFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET
Subjt:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET

Query:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNL-EIVN
         PCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y  FSLESVGTLVLG KN+ E V+
Subjt:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNL-EIVN

Query:  LEKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPS
        LEK FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTNPYHTD+DFGNKILDFVE+S IQ+P SGWAPV+VVDTHIAQTDLPRVYKAADAFVLPS
Subjt:  LEKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPS

Query:  RGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVA
        RGEGWGRPLVEAM+M+LPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KG+RAREDMVRRFSPD+VA
Subjt:  RGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVA

Query:  DIVHHHIQKIFHE
        +IVH HIQ+IFHE
Subjt:  DIVHHHIQKIFHE

TrEMBL top hitse value%identityAlignment
A0A1S3BFB1 uncharacterized protein LOC1034893732.4e-22378.2Show/hide
Query:  DRPLP--------KSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPF
        DRP P        K   S I+ SSILILLL+IS F F KT+ YKSQS             KL NLL     S Q P  NPS           CVLWMAPF
Subjt:  DRPLP--------KSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPF

Query:  LSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        LSGGGYSSEAWSY+LAL  H+TNP FRL I QHGDLES+DFWEGLP+SVRNLAIELHRT+CRMNET+V+CHSEPGAWNPPLFETLPCPPG Y+ FKSVIG
Subjt:  LSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL--EKEFVFLSIFKWEF
        RTMFETDRV+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKYK FSLESVGTLVLG  N E V L  +K FVFLSIFKWEF
Subjt:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL--EKEFVFLSIFKWEF

Query:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
        RKGWDLLLEAYLKEFSKKD VGL+LLTNPYHT+SDFGNKILDFVENSD+Q P+SGWAPVYVVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
Subjt:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM

Query:  SLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR
        SLPVIATNWSG TEFLTDENSYPLPVERMSEVKE PFKGH+WAEPSISKLQVLMREV  N +EAK KG+RAREDM+ RFSPDIVADIVH  I+ IFHEKR
Subjt:  SLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR

A0A5D3CDB1 Group 1 family glycosyltransferase2.4e-22378.2Show/hide
Query:  DRPLP--------KSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPF
        DRP P        K   S I+ SSILILLL+IS F F KT+ YKSQS             KL NLL     S Q P  NPS           CVLWMAPF
Subjt:  DRPLP--------KSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPF

Query:  LSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        LSGGGYSSEAWSY+LAL  H+TNP FRL I QHGDLES+DFWEGLP+SVRNLAIELHRT+CRMNET+V+CHSEPGAWNPPLFETLPCPPG Y+ FKSVIG
Subjt:  LSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL--EKEFVFLSIFKWEF
        RTMFETDRV+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKYK FSLESVGTLVLG  N E V L  +K FVFLSIFKWEF
Subjt:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL--EKEFVFLSIFKWEF

Query:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
        RKGWDLLLEAYLKEFSKKD VGL+LLTNPYHT+SDFGNKILDFVENSD+Q P+SGWAPVYVVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM
Subjt:  RKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAM

Query:  SLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR
        SLPVIATNWSG TEFLTDENSYPLPVERMSEVKE PFKGH+WAEPSISKLQVLMREV  N +EAK KG+RAREDM+ RFSPDIVADIVH  I+ IFHEKR
Subjt:  SLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR

A0A6J1D8R8 uncharacterized protein LOC1110186574.2e-22879.72Show/hide
Query:  HDNQQSADRPLPKSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPFL
        H N Q  +    K R SAI   SILILL++IS FTFTKTDH+K+QSLKLL        QK I  LNP PNS   P P P            CVLWMAPFL
Subjt:  HDNQQSADRPLPKSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPFL

Query:  SGGGYSSEAWSYVLALHDHLTNP-EFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG
        SGGGYSSEAWSY+LALH H+  P EFRLAIEQHGDLESIDFWEGLPDSVR+LAI+LH T CRMNET+V+CHSEPGAWNPPLFETLPCPPGVYQ FK+VIG
Subjt:  SGGGYSSEAWSYVLALHDHLTNP-EFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIG

Query:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEKEFVFLSIFKWEFRK
        RTMFETDRV+ EHVNRC  MD++WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKY+ FSL S+GTLVLG+K++E + L+  FVFLSIFKWEFRK
Subjt:  RTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEKEFVFLSIFKWEFRK

Query:  GWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSL
        GWDLLLEAYLKEFSKKD VGL+LLTNPYH+D DFGNKILDFVENSDIQKP SGWAPVYV+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSL
Subjt:  GWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSL

Query:  PVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR
        PVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSI KLQ LMREV TN DEAKAKG+ AR+DMVR+FSPDIVADIV+HHIQ +FH+KR
Subjt:  PVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKR

A0A6J1G004 uncharacterized protein LOC1114494313.9e-24282.81Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS
        M DLH      DRPLP        KSRPS+      Y SSILILLLSISLF FTK DH+KSQSLK       TLFQ+LI+ LN + N KQT +PNP S+S
Subjt:  MGDLHDNQQSADRPLP--------KSRPSA-----IYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSS

Query:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET
        S      QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLESIDFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET
Subjt:  SSSSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFET

Query:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL
         PCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQP+DVNFFDPL Y  FSLESVGTLVLG KN+E V+L
Subjt:  LPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNL

Query:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR
        EK FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGV L+LLTNPYHTDSDFGNKILDFVE+S IQ+P SGWAPV+VVDTHIAQTDLPRVYKAADAFVLPSR
Subjt:  EKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSR

Query:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD
        GEGWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAK KG+RAREDMVRRFSPD+VA+
Subjt:  GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVAD

Query:  IVHHHIQKIFHE
        IV  HIQ+IF E
Subjt:  IVHHHIQKIFHE

A0A6J1HWY2 uncharacterized protein LOC1114676051.0e-24584.12Show/hide
Query:  MGDLHDNQQSADRPLP--------KSRPSAI---YLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSS
        M DLH      DRPLP        KSRPS++   Y SSILILLLSISLFTFTKTDH+KSQSLK       TLFQKLI+ LN + N KQT + NP S+SS 
Subjt:  MGDLHDNQQSADRPLP--------KSRPSAI---YLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSS

Query:  SSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLP
             QCVLWMAPFLSGGGYSSEAWSY+LALHDH+ NP FRLAIEQHGDLES+DFWEGLPDSV+NLAIELHRTKCR+NETIVVCHSEPGAWNPPLFET P
Subjt:  SSSSNQCVLWMAPFLSGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLP

Query:  CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEK
        CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCN MDFVWVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y  FSLESVGTLVLG KN+  V+LEK
Subjt:  CPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEK

Query:  EFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGE
         FVFLSIFKWEFRKGWDLLLEAYLKEFSK DGVGL+LLTNPYHTDSDFGNKILDFVENS IQKP SGWAPVYVVDTHIAQTDLP+VYKAADAFVLPSRGE
Subjt:  EFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGE

Query:  GWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIV
        GWGRPLVEAM+MSLPVIATNWSGQTEFLTDENSYPL VE+MSEVKEGPFKGHLWAEPSISKL+VLMREVMTN DEAKAKG+RAREDMVRRFSPD+VA+IV
Subjt:  GWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIV

Query:  HHHIQKIFHE
        H HIQ+IF E
Subjt:  HHHIQKIFHE

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase7.9e-0628.23Show/hide
Query:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLT---NPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRG
        V L++ +    KG+DLL++ +     ++    L+L     N    ++   N++ + V++  ++  V+          ++A  DLP +Y+AAD FVL SR 
Subjt:  VFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLT---NPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRG

Query:  EGWGRPLVEAMAMSLPVIATNWSG
        E +G   +EAMA   P + T   G
Subjt:  EGWGRPLVEAMAMSLPVIATNWSG

Q9R9N2 Lipopolysaccharide core biosynthesis mannosyltransferase LpsB4.3e-0444.9Show/hide
Query:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT
        T++P  Y+A D FV P R EG+G   +EAMA  +PV+AT+    +E +T
Subjt:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT

Arabidopsis top hitse value%identityAlignment
AT1G52420.1 UDP-Glycosyltransferase superfamily protein5.8e-0433.75Show/hide
Query:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ NS        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  L V+ T+  G  E +
Subjt:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G10630.1 UDP-Glycosyltransferase superfamily protein1.8e-18363.86Show/hide
Query:  DRPLPKSR-----PSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHT--TLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPFL
        D+P  +S+      + +Y SSIL LLLSI L  FT TD YK QSL+   + +   +  Q L+   + TP SK   L NP+SS+        CVLWMAPFL
Subjt:  DRPLPKSR-----PSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHT--TLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPFL

Query:  SGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGR
        S GGYSSEAWSYVL+L +HLTNP FR+ IE HGDLES++FW GL    + +AIE++R +CR NETIVVCHSEPGAW PPLFETLPCPP  Y++F SVIGR
Subjt:  SGGGYSSEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEKEFVFLSIFKWEFRKG
        TMFETDRV+ EHV RCN MD VWVP++FHVS+FV+SGVD SK+VKIVQP+DV FFDP KYK   L +VG LVLG+       ++  FVFLS+FKWE RKG
Subjt:  TMFETDRVSQEHVNRCNGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEKEFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP
        WD+LL+AYL EFS +D V L+LLTN YH+DSDFGNKILDFVE  +I++P +G+  VYV+D HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLP
Subjt:  WDLLLEAYLKEFSKKDGVGLYLLTNPYHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP

Query:  VIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKRR
        VI TNWSG TE+LT+ N YPL VE MSEVKEGPF+GH WAEPS+ KL+VLMR VM+N DEAK KG+R R+DMV+ F+P++VA +V   I +IF EK R
Subjt:  VIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKRR

AT3G15940.1 UDP-Glycosyltransferase superfamily protein3.4e-0433.75Show/hide
Query:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G15940.2 UDP-Glycosyltransferase superfamily protein3.4e-0433.75Show/hide
Query:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT5G01220.1 sulfoquinovosyldiacylglycerol 22.6e-0428.12Show/hide
Query:  DIQKPVSGWAPVYVVDTHIAQTD-LPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPS
        D++K  +G   V+   T   Q D L + Y + D FV+PS  E  G  ++EAM+  LPV+A    G  + + ++           E K G        E  
Subjt:  DIQKPVSGWAPVYVVDTHIAQTD-LPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKGHLWAEPS

Query:  ISKLQVLMREVMTNFDEAKAKGQRARED
        ++KL+ L+ +  T     +  G+ ARE+
Subjt:  ISKLQVLMREVMTNFDEAKAKGQRARED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGATCTTCACGACAACCAACAATCAGCAGATCGACCACTTCCCAAATCTCGCCCTTCTGCGATCTACCTTTCATCTATTCTCATTCTCCTTCTATCAATTTCCCT
CTTCACTTTCACCAAAACAGATCATTACAAATCCCAATCCCTAAAACTCCTTCTCTCATCTCACACAACCCTTTTTCAAAAACTCATCAATCTTCTCAATCCAACTCCAA
ATTCTAAACAAACCCCACTTCCTAATCCGTCTTCGTCTTCGTCTTCGTCTTCGTCTTCCAATCAATGTGTTCTTTGGATGGCCCCATTTCTCTCCGGTGGAGGGTACAGT
TCAGAAGCTTGGTCCTACGTTTTAGCCCTTCATGATCATCTAACAAACCCTGAATTTCGTTTGGCCATTGAGCAACATGGTGATCTTGAATCCATTGATTTCTGGGAGGG
CTTACCCGATTCTGTGAGGAATTTGGCCATTGAACTTCACAGAACAAAATGCAGAATGAATGAAACTATTGTGGTTTGTCATAGTGAGCCTGGTGCTTGGAATCCTCCAT
TGTTTGAAACTTTGCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTCATTGGCAGAACCATGTTTGAAACTGATAGGGTGAGTCAAGAACATGTGAATCGATGT
AATGGAATGGATTTTGTTTGGGTTCCTTCTGAATTTCATGTTTCTACGTTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAATTGTTCAACCCATTGATGTGAA
TTTCTTTGATCCTCTGAAATATAAGTCATTTAGTCTTGAATCTGTAGGAACACTGGTTTTAGGAGCCAAAAACTTGGAAATAGTAAACTTAGAGAAGGAATTTGTGTTTC
TGAGTATCTTTAAGTGGGAATTCAGGAAGGGTTGGGATCTGTTGTTGGAAGCTTATTTGAAAGAATTCTCCAAGAAAGATGGAGTGGGGTTGTATTTGTTGACAAATCCT
TACCATACTGATAGTGATTTTGGGAACAAGATTTTGGATTTTGTAGAAAATTCAGACATACAAAAGCCAGTTTCTGGTTGGGCTCCTGTGTATGTGGTAGATACTCATAT
AGCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTCGAAGCGATGGCAATGTCGTTGC
CGGTGATCGCCACCAACTGGTCGGGGCAAACGGAGTTTTTGACGGATGAGAATAGCTATCCATTGCCGGTTGAGAGAATGAGTGAGGTAAAGGAAGGGCCTTTCAAGGGC
CATCTGTGGGCTGAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTAATGACCAATTTTGATGAAGCTAAGGCCAAAGGACAACGGGCAAGGGAGGACATGGT
TAGGCGATTCTCGCCCGACATCGTTGCCGATATTGTTCATCATCATATACAAAAGATTTTTCATGAGAAGAGACGATAG
mRNA sequenceShow/hide mRNA sequence
GTCGGTTTCGATCGGTCCAATCGATTTTCGGTGATAAAATCCACTTCGTCTCTCTGCTTCTCTTCTGTAAATTTTCGGTTAATCTGAAATCCATCGCCATGGGCGATCTT
CACGACAACCAACAATCAGCAGATCGACCACTTCCCAAATCTCGCCCTTCTGCGATCTACCTTTCATCTATTCTCATTCTCCTTCTATCAATTTCCCTCTTCACTTTCAC
CAAAACAGATCATTACAAATCCCAATCCCTAAAACTCCTTCTCTCATCTCACACAACCCTTTTTCAAAAACTCATCAATCTTCTCAATCCAACTCCAAATTCTAAACAAA
CCCCACTTCCTAATCCGTCTTCGTCTTCGTCTTCGTCTTCGTCTTCCAATCAATGTGTTCTTTGGATGGCCCCATTTCTCTCCGGTGGAGGGTACAGTTCAGAAGCTTGG
TCCTACGTTTTAGCCCTTCATGATCATCTAACAAACCCTGAATTTCGTTTGGCCATTGAGCAACATGGTGATCTTGAATCCATTGATTTCTGGGAGGGCTTACCCGATTC
TGTGAGGAATTTGGCCATTGAACTTCACAGAACAAAATGCAGAATGAATGAAACTATTGTGGTTTGTCATAGTGAGCCTGGTGCTTGGAATCCTCCATTGTTTGAAACTT
TGCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTCATTGGCAGAACCATGTTTGAAACTGATAGGGTGAGTCAAGAACATGTGAATCGATGTAATGGAATGGAT
TTTGTTTGGGTTCCTTCTGAATTTCATGTTTCTACGTTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAATTGTTCAACCCATTGATGTGAATTTCTTTGATCC
TCTGAAATATAAGTCATTTAGTCTTGAATCTGTAGGAACACTGGTTTTAGGAGCCAAAAACTTGGAAATAGTAAACTTAGAGAAGGAATTTGTGTTTCTGAGTATCTTTA
AGTGGGAATTCAGGAAGGGTTGGGATCTGTTGTTGGAAGCTTATTTGAAAGAATTCTCCAAGAAAGATGGAGTGGGGTTGTATTTGTTGACAAATCCTTACCATACTGAT
AGTGATTTTGGGAACAAGATTTTGGATTTTGTAGAAAATTCAGACATACAAAAGCCAGTTTCTGGTTGGGCTCCTGTGTATGTGGTAGATACTCATATAGCTCAAACTGA
TTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTCGAAGCGATGGCAATGTCGTTGCCGGTGATCGCCA
CCAACTGGTCGGGGCAAACGGAGTTTTTGACGGATGAGAATAGCTATCCATTGCCGGTTGAGAGAATGAGTGAGGTAAAGGAAGGGCCTTTCAAGGGCCATCTGTGGGCT
GAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTAATGACCAATTTTGATGAAGCTAAGGCCAAAGGACAACGGGCAAGGGAGGACATGGTTAGGCGATTCTC
GCCCGACATCGTTGCCGATATTGTTCATCATCATATACAAAAGATTTTTCATGAGAAGAGACGATAGATCACACATGTTTATAACCAAGTTTTCATTCCTCTTGTTTGTG
TTATGCTCTGTTTTGATTGAGATGAGCTGAAGGCTAACATTGTAGATTTGTGTAGTAGTGAATATTAGAAGTAGTACCGAATTGAAGCCTCAGTTGAAAGAATTTACATC
GACAAAGCATTTGTTAAATGTTTGGAACTGTAGCATTATAATAATGTTGATGGTTAATAAAATCCTATACAAATATTGAAATATAATATTATCCTATGATATTTCATACC
ATTATAATAAACTCACTAGCACCA
Protein sequenceShow/hide protein sequence
MGDLHDNQQSADRPLPKSRPSAIYLSSILILLLSISLFTFTKTDHYKSQSLKLLLSSHTTLFQKLINLLNPTPNSKQTPLPNPSSSSSSSSSSNQCVLWMAPFLSGGGYS
SEAWSYVLALHDHLTNPEFRLAIEQHGDLESIDFWEGLPDSVRNLAIELHRTKCRMNETIVVCHSEPGAWNPPLFETLPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRC
NGMDFVWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYKSFSLESVGTLVLGAKNLEIVNLEKEFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDGVGLYLLTNP
YHTDSDFGNKILDFVENSDIQKPVSGWAPVYVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFKG
HLWAEPSISKLQVLMREVMTNFDEAKAKGQRAREDMVRRFSPDIVADIVHHHIQKIFHEKRR