; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000471 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000471
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionuridylate kinase
Genome locationchr08:18458996..18462809
RNA-Seq ExpressionPI0000471
SyntenyPI0000471
Gene Ontology termsGO:0006225 - UDP biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0044210 - 'de novo' CTP biosynthetic process (biological process)
GO:0046940 - nucleoside monophosphate phosphorylation (biological process)
GO:0005829 - cytosol (cellular component)
GO:0033862 - UMP kinase activity (molecular function)
InterPro domainsIPR001048 - Aspartate/glutamate/uridylate kinase
IPR015963 - Uridylate kinase, bacteria
IPR036393 - Acetylglutamate kinase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148645.1 uncharacterized protein LOC101221565 [Cucumis sativus]8.9e-17697.07Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSLIPTLSFDTISSSASYSSSSFSSQ+LIPFKPH LGLKTDNPTSNGSLI HSSAR LGS S PLKRSMKHQIS ISP GMT+SEASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

XP_008441028.1 PREDICTED: uridylate kinase [Cucumis melo]8.9e-17697.07Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPH LGLKTDNPTSNGS IVHS AR LGSNS PLKRSMKHQIS ISP GMTLSEASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSM+AST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

XP_022132364.1 uncharacterized protein LOC111005242 [Momordica charantia]1.9e-16290.72Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTD--NPT--SNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPS
        MAIPTSLIP LSF+ ISSSASYSSSSF SQ L PFKPH   LK D  NPT  SN  L+V  SARE+GSNSDP+  SMKHQISSI+PSGMTL+EASMSMPS
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTD--NPT--SNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPS

Query:  YKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ
         KWRRVLLKVSGEALAGDRLQNIDPKVTM IAREVA+VTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ
Subjt:  YKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI
        TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        TLCQENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSMV S+
Subjt:  TLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

XP_023517300.1 uncharacterized protein LOC111781100 [Cucurbita pepo subsp. pepo]2.6e-15989.44Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSL P LSFD++ SSAS SSSS S Q L PFKPH  GLK + PTSNGSL+V+ SARE+GS+SDPL  SMKHQISS++P+GM L+EASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPR+NP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

XP_038880866.1 uridylate kinase [Benincasa hispida]6.4e-17495.89Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSL+PT SFDTISSSASYSSSSFS QFL PFKPH  GLK DNPTSNGSL+V+SSARE+GSNSDPL RSMKHQISSISPSGMTLSEASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSMV ST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

TrEMBL top hitse value%identityAlignment
A0A0A0KHT3 AA_kinase domain-containing protein4.3e-17697.07Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSLIPTLSFDTISSSASYSSSSFSSQ+LIPFKPH LGLKTDNPTSNGSLI HSSAR LGS S PLKRSMKHQIS ISP GMT+SEASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

A0A1S3B213 uridylate kinase4.3e-17697.07Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPH LGLKTDNPTSNGS IVHS AR LGSNS PLKRSMKHQIS ISP GMTLSEASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSM+AST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

A0A6J1BSV5 uncharacterized protein LOC1110052429.4e-16390.72Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTD--NPT--SNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPS
        MAIPTSLIP LSF+ ISSSASYSSSSF SQ L PFKPH   LK D  NPT  SN  L+V  SARE+GSNSDP+  SMKHQISSI+PSGMTL+EASMSMPS
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTD--NPT--SNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPS

Query:  YKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ
         KWRRVLLKVSGEALAGDRLQNIDPKVTM IAREVA+VTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ
Subjt:  YKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI
        TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        TLCQENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSMV S+
Subjt:  TLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

A0A6J1GEU0 uncharacterized protein LOC1114532831.3e-15989.74Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIP SL+P  SF      ASYSSSSF  Q   PFKPH  GLK DNPTSNGSL+V  SARE+GSNSDPL  SMKHQISS++PSGMTL+EASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAG GNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDP+QN NA LLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNS V ST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

A0A6J1IMJ4 uncharacterized protein LOC1114787663.7e-15989.74Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR
        MAIP SLIP  SF      ASYSSSSF  Q   PFKPH  GLKTDNPTSNGSL+V  S RELGSNSDPL  SMKHQISS++PSGMTL+EASMSMPSYKWR
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAG GNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDP+QN NA LLE L+YQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNS V ST
Subjt:  ENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST

SwissProt top hitse value%identityAlignment
P74457 Uridylate kinase8.1e-7965.24Show/hide
Query:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA
        ++RVLLK+SGEAL GD    IDP V   IA+E+  V + G+++AIVVGGGNIFRG   A ++G+DR++ADYIGM+ATVMNA+ LQ  +E + IPTRV TA
Subjt:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA

Query:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL
          M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ NPNA    TLTY  V ++DL VMD TAI L
Subjt:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL

Query:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIG
        C++NNIP+++F+L  P NI +AIKGE VGTL+G
Subjt:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIG

Q10Y48 Uridylate kinase5.6e-8064.96Show/hide
Query:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA
        ++RVLLK+SGEAL G     IDP V   IA+EVA V   GI++AIVVGGGNIFRG   A S G+DR++ADY+GM+ATVMNAI LQ  +E +G+PTRVQTA
Subjt:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA

Query:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL
          M E+AEPYIRRRA+RHLEKGRVV+F AG+GNPFFTTDT AALR AEI AEV+ KAT VDGVYD DP +N  A   E+L+Y EV + DL VMD TAI L
Subjt:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL

Query:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIGG
        C+ENNIP++VFNL+   NI KA+ GE++GT++GG
Subjt:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIGG

Q2JJE2 Uridylate kinase6.6e-8164.96Show/hide
Query:  KWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQT
        K+RR+LLK+SGEAL G+R   IDP+V  +IA EVA+V R G++VAIVVGGGNI+RG   A + G+D++SADY+GMLATV+NA+ LQ  +E  GIPTRVQT
Subjt:  KWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQT

Query:  AFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAIT
        A  M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ +P A   + L+YQ+V ++DL VMD TAI 
Subjt:  AFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAIT

Query:  LCQENNIPVVVFNLTKPDNITKAIKGERVGTLIG
        LC+EN +P+VVF+LT P NI + ++GE +GT IG
Subjt:  LCQENNIPVVVFNLTKPDNITKAIKGERVGTLIG

Q2JS42 Uridylate kinase1.7e-8164.83Show/hide
Query:  KWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQT
        K+RR+LLK+SGEAL G+R   IDP+V  +IA EVA+V R G++VAIVVGGGNI+RG   A + G+D++SADY+GMLATV+NA+ LQ  +E  GIPTRVQT
Subjt:  KWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQT

Query:  AFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAIT
        A  M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ +P A   + L+YQ+V ++DL VMD TAI 
Subjt:  AFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAIT

Query:  LCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGT
        LC+EN +P+VVF+LT P NI + ++GE +GT IG T
Subjt:  LCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGT

Q3MFI4 Uridylate kinase1.1e-7864.68Show/hide
Query:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA
        +RRVLLK+SGEAL G+    IDP+V   IA+E+A V   G+++AIVVGGGNIFRG   A S+G+DR++ADYIGM+ATVMNA+ LQ ++E IG+ TRVQTA
Subjt:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTA

Query:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL
          M E+AEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP   PNA    +LTY  V ++DL VMD TAI L
Subjt:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITL

Query:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGT
        C+ENNIP++VF+LT   NI +A+ GE +GTL+GG+
Subjt:  CQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGT

Arabidopsis top hitse value%identityAlignment
AT2G39800.1 delta1-pyrroline-5-carboxylate synthase 17.2e-0633.33Show/hide
Query:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNITKAI
        F+  D+ AAL   E+ A++L+  ++V+G+Y   P  +PN+ L+ T   +    E+T  D S +    MTA      N     IPV++ +    +NI K +
Subjt:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNITKAI

Query:  KGERVGTL
        +G RVGTL
Subjt:  KGERVGTL

AT2G39800.2 delta1-pyrroline-5-carboxylate synthase 17.2e-0633.33Show/hide
Query:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNITKAI
        F+  D+ AAL   E+ A++L+  ++V+G+Y   P  +PN+ L+ T   +    E+T  D S +    MTA      N     IPV++ +    +NI K +
Subjt:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNITKAI

Query:  KGERVGTL
        +G RVGTL
Subjt:  KGERVGTL

AT3G10030.1 aspartate/glutamate/uridylate kinase family protein3.7e-6346.49Show/hide
Query:  HWLGLKTDNPTSNGSLIVHSSARELGSNS---DPLKRSMKHQISSISPSGMTLSEASMSMPSYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAV
        HW   K        SL +  ++ E  S S   + +K + ++ + + +P     +  S +  + +WRRV+LK+SG ALA     NIDPKV   IAREVA  
Subjt:  HWLGLKTDNPTSNGSLIVHSSARELGSNS---DPLKRSMKHQISSISPSGMTLSEASMSMPSYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAV

Query:  TRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGN
         RLG+EVAIVVG  N F GS+W  ++GLDR++A +I M+A+VMN+  LQ+++E IG+  R+QTA  +  V EPY R+RA RHL+KGRVVIF    A  GN
Subjt:  TRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGN

Query:  PFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNITKAIKGERVGTLI
        P  ++D +AALR  +INAE ++K TNVDGVYD    Q+ N    E +++Q++ S+ L+ MD  A+  C+EN+IPVVVFN  +  NITKA+ GE+VGTLI
Subjt:  PFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNITKAIKGERVGTLI

AT3G10030.2 aspartate/glutamate/uridylate kinase family protein9.2e-5444.53Show/hide
Query:  HWLGLKTDNPTSNGSLIVHSSARELGSNS---DPLKRSMKHQISSISPSGMTLSEASMSMPSYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAV
        HW   K        SL +  ++ E  S S   + +K + ++ + + +P     +  S +  + +WRRV+LK+SG ALA     NIDPKV   IAREVA  
Subjt:  HWLGLKTDNPTSNGSLIVHSSARELGSNS---DPLKRSMKHQISSISPSGMTLSEASMSMPSYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAV

Query:  TRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGN
         RLG+EVAIVVG  N F GS+W  ++GLDR++A +I M+A+VMN+  LQ+++E IG+  R+QTA  +  V EPY R+RA RHL+KGRVVIF    A  GN
Subjt:  TRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGN

Query:  PFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQENNIP
        P  ++D +AALR  +INAE ++K TNVDGVYD    Q+ N    E +++Q++ S+ L+ MD  A+  C+EN+IP
Subjt:  PFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQENNIP

AT3G18680.1 Amino acid kinase family protein3.8e-12472.54Show/hide
Query:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTL-----SEASMSMP
        MAIP   +P  S   IS+S+S S +SF     +P         ++   S   LI  SS+    + S P   +     +  S +G +      S    S P
Subjt:  MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTL-----SEASMSMP

Query:  SYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRV
          KWRRVLLKVSGEALAGD  QNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGS+WAG SGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRV
Subjt:  SYKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRV

Query:  QTAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTA
        QTAFRMSEVAEPYIRRRA+RHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEV+LKATNVDGV+DDDP++NPNA LL++LTYQEVTSKDLSVMDMTA
Subjt:  QTAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTA

Query:  ITLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST
        ITLCQENNIPVVVFNL++P NI KAIKGERVGTLIGGTWNS+V +T
Subjt:  ITLCQENNIPVVVFNLTKPDNITKAIKGERVGTLIGGTWNSMVAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTCCCACTTCCCTCATTCCCACTTTGTCTTTCGATACTATCTCTAGCTCTGCTTCTTATTCTTCTTCCTCCTTTTCATCGCAATTTCTTATTCCCTTCAAGCC
CCATTGGCTTGGTTTAAAGACGGATAACCCCACCTCAAATGGGTCGCTCATTGTTCACTCCTCCGCTCGCGAATTGGGTTCCAATTCTGACCCTTTGAAACGGAGCATGA
AGCATCAGATATCATCTATCTCTCCTAGTGGGATGACACTAAGTGAAGCTTCCATGTCCATGCCATCGTATAAATGGCGAAGAGTATTGCTTAAAGTAAGTGGTGAAGCA
CTTGCTGGTGATCGGTTGCAGAATATTGATCCAAAGGTTACTATGGCTATTGCGAGGGAGGTTGCAGCTGTAACCCGTCTGGGCATTGAGGTTGCTATAGTAGTTGGTGG
GGGTAACATTTTCCGTGGATCCTCGTGGGCCGGAAGTAGTGGTTTGGACCGTTCATCTGCTGATTACATTGGAATGTTGGCAACAGTCATGAATGCTATATTTCTTCAAG
CCACAATGGAGAGTATAGGCATTCCTACACGAGTGCAAACAGCATTTCGCATGTCCGAGGTTGCAGAGCCATATATTCGACGTAGGGCTGTGAGGCACTTGGAAAAAGGA
AGAGTTGTGATCTTTGCAGCTGGTACAGGCAATCCGTTTTTCACTACAGATACTGCAGCAGCCCTCCGTTGCGCAGAAATTAATGCTGAAGTTCTGCTGAAAGCGACAAA
TGTTGATGGGGTTTATGACGATGATCCGAGGCAAAACCCAAACGCATGTCTACTCGAGACTCTTACGTACCAGGAGGTGACTTCAAAGGACCTTTCGGTGATGGACATGA
CAGCCATCACTCTATGCCAGGAAAACAACATTCCAGTTGTTGTCTTCAATCTAACAAAACCAGACAACATCACAAAAGCTATAAAGGGCGAGAGAGTCGGAACGTTGATC
GGCGGAACATGGAACTCAATGGTAGCAAGTACATGA
mRNA sequenceShow/hide mRNA sequence
TCTGTGTATATATATAAAAGCGGGAGATATTTCCTCTAAGAAAATGATGTTATCCGTTTTCCGGTGATAACCACTTCAAACAAAAACAAACACAAGCTCCAATTCCTCCT
CTTGGTTCGAACAACAAGGAGCTGAGTGGGAGACAGAACCCTAAACCCACTTTCACTTTGGCATTCGGTAGCCGATAGAGACGAGAAAGGCGGCGTTTTTGGGTGGTGGA
GGCAATCGTCCTTCGTTCTTCGTTTCTTTTAATGGCAATTCCCACTTCCCTCATTCCCACTTTGTCTTTCGATACTATCTCTAGCTCTGCTTCTTATTCTTCTTCCTCCT
TTTCATCGCAATTTCTTATTCCCTTCAAGCCCCATTGGCTTGGTTTAAAGACGGATAACCCCACCTCAAATGGGTCGCTCATTGTTCACTCCTCCGCTCGCGAATTGGGT
TCCAATTCTGACCCTTTGAAACGGAGCATGAAGCATCAGATATCATCTATCTCTCCTAGTGGGATGACACTAAGTGAAGCTTCCATGTCCATGCCATCGTATAAATGGCG
AAGAGTATTGCTTAAAGTAAGTGGTGAAGCACTTGCTGGTGATCGGTTGCAGAATATTGATCCAAAGGTTACTATGGCTATTGCGAGGGAGGTTGCAGCTGTAACCCGTC
TGGGCATTGAGGTTGCTATAGTAGTTGGTGGGGGTAACATTTTCCGTGGATCCTCGTGGGCCGGAAGTAGTGGTTTGGACCGTTCATCTGCTGATTACATTGGAATGTTG
GCAACAGTCATGAATGCTATATTTCTTCAAGCCACAATGGAGAGTATAGGCATTCCTACACGAGTGCAAACAGCATTTCGCATGTCCGAGGTTGCAGAGCCATATATTCG
ACGTAGGGCTGTGAGGCACTTGGAAAAAGGAAGAGTTGTGATCTTTGCAGCTGGTACAGGCAATCCGTTTTTCACTACAGATACTGCAGCAGCCCTCCGTTGCGCAGAAA
TTAATGCTGAAGTTCTGCTGAAAGCGACAAATGTTGATGGGGTTTATGACGATGATCCGAGGCAAAACCCAAACGCATGTCTACTCGAGACTCTTACGTACCAGGAGGTG
ACTTCAAAGGACCTTTCGGTGATGGACATGACAGCCATCACTCTATGCCAGGAAAACAACATTCCAGTTGTTGTCTTCAATCTAACAAAACCAGACAACATCACAAAAGC
TATAAAGGGCGAGAGAGTCGGAACGTTGATCGGCGGAACATGGAACTCAATGGTAGCAAGTACATGAAGGTGATTATTGTATGGGAAATCCATTTTGTAAGAGCTCATTA
GTAGGGATTTGACAGGAGATTTAGGGCTATCTCTATATCTTTTCAAATTGATAGCTGTTTTGAATTTATGAGATGATTTATTTTGTAAATAACTTAGTGAAACTGAAACT
GGAAGTTCAACTCTTCATTGATTGTTGAATGGACATCAGATTTTGTTTTTGGAACAATTTCGCATTCACTGTCTGTAAGGTTTATTTTGCCTGCAGTCTGAATTTTATAT
CTCCATTTGCTTGTTGAAGGTATTAAAATCTTTCTACTTTTCTTTTTCCTTTT
Protein sequenceShow/hide protein sequence
MAIPTSLIPTLSFDTISSSASYSSSSFSSQFLIPFKPHWLGLKTDNPTSNGSLIVHSSARELGSNSDPLKRSMKHQISSISPSGMTLSEASMSMPSYKWRRVLLKVSGEA
LAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESIGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKG
RVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRQNPNACLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNITKAIKGERVGTLI
GGTWNSMVAST