; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013642 (gene) of Snake gourd v1 genome

Gene IDTan0013642
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionuridylate kinase
Genome locationLG03:69701490..69705742
RNA-Seq ExpressionTan0013642
SyntenyTan0013642
Gene Ontology termsGO:0006225 - UDP biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0044210 - 'de novo' CTP biosynthetic process (biological process)
GO:0046940 - nucleoside monophosphate phosphorylation (biological process)
GO:0005829 - cytosol (cellular component)
GO:0033862 - UMP kinase activity (molecular function)
InterPro domainsIPR001048 - Aspartate/glutamate/uridylate kinase
IPR015963 - Uridylate kinase, bacteria
IPR036393 - Acetylglutamate kinase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027137.1 pyrH [Cucurbita argyrosperma subsp. argyrosperma]8.1e-16993.55Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSFD+M SSAS SSSS SLQS SPFKPHCHGLKM TPTSNGSLVV+CSAREMGSSSDPLNGS+KHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKP NISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

XP_022962708.1 uncharacterized protein LOC111463118 [Cucurbita moschata]8.1e-16993.55Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSFD+M  SAS SSSS SLQS SPFKPHC+GLKM TPTSNGSLVV+CSAREMGSSSDPLNGSMKHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

XP_023003069.1 uncharacterized protein LOC111496785 [Cucurbita maxima]4.0e-16893.55Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSF +M SSAS SSSS SLQS S FKPHCHGLKM TPTSNGSLVV+CSAREMGSSSDPLNGSMKHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

XP_023517300.1 uncharacterized protein LOC111781100 [Cucurbita pepo subsp. pepo]3.3e-17094.13Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSFD+M SSAS SSSS SLQS SPFKPHCHGLKM TPTSNGSLVV+CSAREMGSSSDPLNGSMKHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

XP_038880866.1 uridylate kinase [Benincasa hispida]1.7e-17193.84Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL+P  SFDT+SSSASYSSSSFSLQ  +PFKPHCHGLKM+ PTSNGSLVV+ SAREMGS+SDPLN SMKHQISS++PSG+TL+EASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPR+NPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

TrEMBL top hitse value%identityAlignment
A0A0A0KHT3 AA_kinase domain-containing protein1.0e-16490.91Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSLIP LSFDT+SSSASYSSSSFS Q   PFKPHC GLK + PTSNGSL+   SAR +GS+S PL  SMKHQIS ++P G+T++EASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPR+NPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNI+KAIKGERVGTLIGGTWNSMVAST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

A0A6J1BSV5 uncharacterized protein LOC1110052425.9e-16590.14Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKME----TPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPS
        MAIPTSLIPCLSF+ +SSSASYSSSSF  QS  PFKPHC  LKM+    T  SN  L+V CSARE+GS+SDP+NGSMKHQISS+APSG+TLNEASMSMPS
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKME----TPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPS

Query:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ
         KWRRVLLKVSGEALAGDRLQNIDPKVTM IAREVA+VTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQ
Subjt:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI
        TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPR+NPNA LLETLTYQEVTSKDLSVMDMTAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMV S+
Subjt:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

A0A6J1GEU0 uncharacterized protein LOC1114532831.4e-16692.38Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIP SL+PC SF      ASYSSSSF LQS +PFKPH HGLKM+ PTSNGSLVVSCSAREMGS+SDPLNGSMKHQISSMAPSG+TLNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAG SGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAG GNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDP++N NARLLETLTYQEVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNS V ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

A0A6J1HFK1 uncharacterized protein LOC1114631183.9e-16993.55Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSFD+M  SAS SSSS SLQS SPFKPHC+GLKM TPTSNGSLVV+CSAREMGSSSDPLNGSMKHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

A0A6J1KS97 uncharacterized protein LOC1114967851.9e-16893.55Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR
        MAIPTSL PCLSF +M SSAS SSSS SLQS S FKPHCHGLKM TPTSNGSLVV+CSAREMGSSSDPLNGSMKHQISSMAP+G+ LNEASMSMPS KWR
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWR

Query:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR
        RVLLKVSGEALAGD LQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMES+GIPTRVQTAFR
Subjt:  RVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFR

Query:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ
        MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNP A LLETLTY EVTSKDLSVMDMTAITLCQ
Subjt:  MSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQ

Query:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        ENNIPVVVFNLTKPDNISKAIKGERVGTLIGG  NSMV ST
Subjt:  ENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST

SwissProt top hitse value%identityAlignment
P74457 Uridylate kinase1.2e-7965.11Show/hide
Query:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ
        + ++RVLLK+SGEAL GD    IDP V   IA+E+  V + G+++AIVVGGGNIFRG   A ++G+DR++ADYIGM+ATVMNA+ LQ  +E + IPTRV 
Subjt:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI
        TA  M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ NPNAR   TLTY  V ++DL VMD TAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIG
         LC++NNIP+++F+L  P NI +AIKGE VGTL+G
Subjt:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIG

Q10Y48 Uridylate kinase1.1e-8065.38Show/hide
Query:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTA
        ++RVLLK+SGEAL G     IDP V   IA+EVA V   GI++AIVVGGGNIFRG   A S G+DR++ADY+GM+ATVMNAI LQ  +E VG+PTRVQTA
Subjt:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTA

Query:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITL
          M E+AEPYIRRRA+RHLEKGRVV+F AG+GNPFFTTDT AALR AEI AEV+ KAT VDGVYD DP +N  A+  E+L+Y EV + DL VMD TAI L
Subjt:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITL

Query:  CQENNIPVVVFNLTKPDNISKAIKGERVGTLIGG
        C+ENNIP++VFNL+   NI KA+ GE++GT++GG
Subjt:  CQENNIPVVVFNLTKPDNISKAIKGERVGTLIGG

Q2JJE2 Uridylate kinase4.6e-8265.11Show/hide
Query:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ
        +K+RR+LLK+SGEAL G+R   IDP+V  +IA EVA+V R G++VAIVVGGGNI+RG   A + G+D++SADY+GMLATV+NA+ LQ  +E  GIPTRVQ
Subjt:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI
        TA  M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ +P AR  + L+YQ+V ++DL VMD TAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIG
         LC+EN +P+VVF+LT P NI + ++GE +GT IG
Subjt:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIG

Q2JS42 Uridylate kinase1.6e-8264.98Show/hide
Query:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ
        +K+RR+LLK+SGEAL G+R   IDP+V  +IA EVA+V R G++VAIVVGGGNI+RG   A + G+D++SADY+GMLATV+NA+ LQ  +E  GIPTRVQ
Subjt:  VKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQ

Query:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI
        TA  M EVAEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP+ +P AR  + L+YQ+V ++DL VMD TAI
Subjt:  TAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAI

Query:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGT
         LC+EN +P+VVF+LT P NI + ++GE +GT IG T
Subjt:  TLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGT

Q8YXK5 Uridylate kinase6.2e-7964.26Show/hide
Query:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTA
        +RRVLLK+SGEAL G+    IDP+V   IA+E+A V   G+++AIVVGGGNIFRG   A S+G+DR++ADYIGM+ATVMNA+ LQ ++E +G+ TRVQTA
Subjt:  WRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTA

Query:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITL
          M E+AEPYIRRRA+RHLEKGRVVIF AG+GNPFFTTDT AALR AEI+AEV+ KAT VDGVYD DP   PNA+   +LTY  V ++DL VMD TAI L
Subjt:  FRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITL

Query:  CQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGT
        C+ENNIP++VF+LT   NI +A+ GE +GTL+GG+
Subjt:  CQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGT

Arabidopsis top hitse value%identityAlignment
AT2G39800.1 delta1-pyrroline-5-carboxylate synthase 11.4e-0633.33Show/hide
Query:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNISKAI
        F+  D+ AAL   E+ A++L+  ++V+G+Y   P  +PN++L+ T   +    E+T  D S +    MTA      N     IPV++ +    +NI K +
Subjt:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNISKAI

Query:  KGERVGTL
        +G RVGTL
Subjt:  KGERVGTL

AT2G39800.2 delta1-pyrroline-5-carboxylate synthase 11.4e-0633.33Show/hide
Query:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNISKAI
        F+  D+ AAL   E+ A++L+  ++V+G+Y   P  +PN++L+ T   +    E+T  D S +    MTA      N     IPV++ +    +NI K +
Subjt:  FFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQ----EVTSKDLSVM---DMTAITLCQEN----NIPVVVFNLTKPDNISKAI

Query:  KGERVGTL
        +G RVGTL
Subjt:  KGERVGTL

AT3G10030.1 aspartate/glutamate/uridylate kinase family protein2.7e-6145.73Show/hide
Query:  KMETPTSNGSLVVSCSAREMGSSS--DPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIE
        KME    N   +   S  +   SS  + +  + ++ + +  P     N  S +  + +WRRV+LK+SG ALA     NIDPKV   IAREVA   RLG+E
Subjt:  KMETPTSNGSLVVSCSAREMGSSS--DPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIE

Query:  VAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGNPFFTTD
        VAIVVG  N F GS+W  ++GLDR++A +I M+A+VMN+  LQ+++E +G+  R+QTA  +  V EPY R+RA RHL+KGRVVIF    A  GNP  ++D
Subjt:  VAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGNPFFTTD

Query:  TAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNISKAIKGERVGTLI
         +AALR  +INAE ++K TNVDGVY  D     +    E +++Q++ S+ L+ MD  A+  C+EN+IPVVVFN  +  NI+KA+ GE+VGTLI
Subjt:  TAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNISKAIKGERVGTLI

AT3G10030.2 aspartate/glutamate/uridylate kinase family protein1.7e-5244.03Show/hide
Query:  KMETPTSNGSLVVSCSAREMGSSS--DPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIE
        KME    N   +   S  +   SS  + +  + ++ + +  P     N  S +  + +WRRV+LK+SG ALA     NIDPKV   IAREVA   RLG+E
Subjt:  KMETPTSNGSLVVSCSAREMGSSS--DPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIE

Query:  VAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGNPFFTTD
        VAIVVG  N F GS+W  ++GLDR++A +I M+A+VMN+  LQ+++E +G+  R+QTA  +  V EPY R+RA RHL+KGRVVIF    A  GNP  ++D
Subjt:  VAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIF---AAGTGNPFFTTD

Query:  TAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQENNIP
         +AALR  +INAE ++K TNVDGVY  D     +    E +++Q++ S+ L+ MD  A+  C+EN+IP
Subjt:  TAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQENNIP

AT3G18680.1 Amino acid kinase family protein7.4e-12873.5Show/hide
Query:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGS----LVVSCS---AREMGSSSDPLNGSMKHQISSMAPSGVTLNEASM-
        MAIP  L  C    T        SSS S  SF P       L+  T  SN +    +++SCS   + + GSS D +NG+     SS+          S  
Subjt:  MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGS----LVVSCS---AREMGSSSDPLNGSMKHQISSMAPSGVTLNEASM-

Query:  --SMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVG
          S P +KWRRVLLKVSGEALAGD  QNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGS+WAG SGLDRSSADYIGMLATVMNAIFLQATMES+G
Subjt:  --SMPSVKWRRVLLKVSGEALAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVG

Query:  IPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSV
        IPTRVQTAFRMSEVAEPYIRRRA+RHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEV+LKATNVDGV+DDDP+RNPNARLL++LTYQEVTSKDLSV
Subjt:  IPTRVQTAFRMSEVAEPYIRRRAVRHLEKGRVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSV

Query:  MDMTAITLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST
        MDMTAITLCQENNIPVVVFNL++P NI+KAIKGERVGTLIGGTWNS+V +T
Subjt:  MDMTAITLCQENNIPVVVFNLTKPDNISKAIKGERVGTLIGGTWNSMVAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTCCCACGTCCCTCATTCCCTGTTTGTCTTTCGACACCATGTCCAGTTCTGCTTCTTATTCTTCCTCGTCGTTTTCCTTGCAATCTTTCAGTCCCTTCAAGCC
CCATTGCCATGGTTTAAAGATGGAGACTCCCACCTCCAATGGGTCGCTCGTTGTTAGCTGTTCAGCTCGCGAAATGGGTTCCAGCTCTGACCCTTTGAACGGGAGCATGA
AGCATCAGATATCATCTATGGCTCCTAGTGGGGTGACACTAAATGAAGCTTCCATGTCCATGCCATCAGTTAAATGGCGAAGAGTATTGCTTAAAGTAAGCGGCGAAGCA
CTTGCGGGTGACCGATTGCAGAATATAGATCCAAAGGTTACCATGGCAATTGCGAGGGAGGTTGCAGCTGTAACCCGTCTAGGCATCGAGGTTGCTATAGTAGTTGGTGG
GGGTAACATTTTCCGTGGATCTTCGTGGGCTGGAAGTAGTGGTTTGGACCGTTCCTCTGCTGATTACATTGGGATGTTGGCAACAGTCATGAATGCTATATTTCTTCAAG
CCACAATGGAGAGTGTAGGCATTCCTACACGAGTGCAGACTGCGTTTCGCATGTCAGAGGTTGCAGAGCCATATATTCGACGTAGGGCTGTGAGGCACTTGGAGAAAGGA
AGAGTTGTGATCTTTGCAGCTGGTACAGGCAATCCGTTTTTCACTACAGACACTGCAGCAGCCCTCCGTTGTGCAGAAATAAACGCTGAAGTGCTGCTGAAAGCGACAAA
TGTTGACGGGGTTTACGACGATGATCCAAGGCGAAACCCGAATGCACGTCTACTCGAGACTCTTACGTACCAGGAGGTGACTTCGAAGGACCTTTCAGTGATGGACATGA
CTGCCATTACTCTATGCCAGGAAAATAACATTCCCGTTGTTGTCTTCAATCTAACGAAACCGGATAACATCTCGAAAGCCATAAAGGGCGAGAGAGTCGGGACGTTGATT
GGTGGAACATGGAACTCAATGGTAGCAAGTACATGA
mRNA sequenceShow/hide mRNA sequence
GAAACGTGGCCACATCGGGAGCGCACTCTTCCCCTCCAAGTTCATCTTCAACGTATACAAAAGCGCGAGAGAGATATTTCCTTCAAGAAAATGATGTTATCCATTTCCCT
CTGATAATCACTACAAACCAACAGACGGACACACACAAAAGCTCCAACTCCCGTCTCGATTTCATACATAGAACCCTAAACCCCTCTCACAATTCGGCAGACGAGAAAAA
GGCGGCCTTAAGTTCTGGGGGTGACAATCATTCTCTTCCTCCTCCTTTCATGGCAATTCCCACGTCCCTCATTCCCTGTTTGTCTTTCGACACCATGTCCAGTTCTGCTT
CTTATTCTTCCTCGTCGTTTTCCTTGCAATCTTTCAGTCCCTTCAAGCCCCATTGCCATGGTTTAAAGATGGAGACTCCCACCTCCAATGGGTCGCTCGTTGTTAGCTGT
TCAGCTCGCGAAATGGGTTCCAGCTCTGACCCTTTGAACGGGAGCATGAAGCATCAGATATCATCTATGGCTCCTAGTGGGGTGACACTAAATGAAGCTTCCATGTCCAT
GCCATCAGTTAAATGGCGAAGAGTATTGCTTAAAGTAAGCGGCGAAGCACTTGCGGGTGACCGATTGCAGAATATAGATCCAAAGGTTACCATGGCAATTGCGAGGGAGG
TTGCAGCTGTAACCCGTCTAGGCATCGAGGTTGCTATAGTAGTTGGTGGGGGTAACATTTTCCGTGGATCTTCGTGGGCTGGAAGTAGTGGTTTGGACCGTTCCTCTGCT
GATTACATTGGGATGTTGGCAACAGTCATGAATGCTATATTTCTTCAAGCCACAATGGAGAGTGTAGGCATTCCTACACGAGTGCAGACTGCGTTTCGCATGTCAGAGGT
TGCAGAGCCATATATTCGACGTAGGGCTGTGAGGCACTTGGAGAAAGGAAGAGTTGTGATCTTTGCAGCTGGTACAGGCAATCCGTTTTTCACTACAGACACTGCAGCAG
CCCTCCGTTGTGCAGAAATAAACGCTGAAGTGCTGCTGAAAGCGACAAATGTTGACGGGGTTTACGACGATGATCCAAGGCGAAACCCGAATGCACGTCTACTCGAGACT
CTTACGTACCAGGAGGTGACTTCGAAGGACCTTTCAGTGATGGACATGACTGCCATTACTCTATGCCAGGAAAATAACATTCCCGTTGTTGTCTTCAATCTAACGAAACC
GGATAACATCTCGAAAGCCATAAAGGGCGAGAGAGTCGGGACGTTGATTGGTGGAACATGGAACTCAATGGTAGCAAGTACATGAAGGTGATGTTTAAATGGGCCATCCA
TTCTGTAAGAGCTGCTCATTAGTAGGGATTTGACAGGAGATTTAGAGCTATCATCTCTATATCTTTTCACATTGATAGCTGTTTTGAAACTTTAAAATTTTGAATTTTAT
GGTCTTAAGTTATGAGAAGATTGATTCTGTAAATAACTTAGTGAAAGCAGATCCCTGTTGAATGTACTTCAGATTTTTTTGGGAACAATTGCACATTCAGTATCTGTCAA
ATGTAATATTAGCCTGCAGTCTAAATTTTAGATATCCATTAGTTTGTTGAACATATTGAAACCTTTCAACTTTGAATTGGCCAAGGAAGTGTGTGGTTACTTGGTCGTCA
ATGTGTTGTTTTAGAAGTTTGCTGACTAA
Protein sequenceShow/hide protein sequence
MAIPTSLIPCLSFDTMSSSASYSSSSFSLQSFSPFKPHCHGLKMETPTSNGSLVVSCSAREMGSSSDPLNGSMKHQISSMAPSGVTLNEASMSMPSVKWRRVLLKVSGEA
LAGDRLQNIDPKVTMAIAREVAAVTRLGIEVAIVVGGGNIFRGSSWAGSSGLDRSSADYIGMLATVMNAIFLQATMESVGIPTRVQTAFRMSEVAEPYIRRRAVRHLEKG
RVVIFAAGTGNPFFTTDTAAALRCAEINAEVLLKATNVDGVYDDDPRRNPNARLLETLTYQEVTSKDLSVMDMTAITLCQENNIPVVVFNLTKPDNISKAIKGERVGTLI
GGTWNSMVAST