CuGenDBv2

Cucsat.G19606 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G19606
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: RNA pseudouridine synthase 1
Genome location: ctg4:3297779..3300388
RNA-Seq Expression: Cucsat.G19606
Synteny: Cucsat.G19606
Gene Ontology terms:
GO:0001522 - pseudouridine synthesis (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
InterPro domains:
IPR006145 - Pseudouridine synthase, RsuA/RluA-like
IPR006224 - Pseudouridine synthase, RluC/RluD, conserved site
IPR020103 - Pseudouridine synthase, catalytic domain superfamily
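
The Genome location field above uses a `contig:start..end` notation. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span covered by the gene can be computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers for illustration and are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return contig, start, end


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


contig, start, end = parse_location("ctg4:3297779..3300388")
print(contig, span_length(start, end))
```

Under that assumption the gene spans 2610 bp on ctg4, which is longer than the mRNA sequence listed in this record.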


Homology
GenBank top hits (e value, % identity, alignment)
KAG6586224.1 RNA pseudouridine synthase 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.64e-201; % identity: 79.58)
Query:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
        MAL ++L   FSI  K+F +FP  K F FHRF+ V+NSM DSS+PVTG      ENYPVPLSPPLPAISKNLELARAM A SKSSL++LSA DV+YEDEW
Subjt:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW

Query:  LIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKS
        LIAVNKPQG+YCE+VLA+VP  LGDSA AG     IK SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDH VSKSYIAFCVG++PKWKKI VKS
Subjt:  LIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKS

Query:  GHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEE------TIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLH
        GHGRSKFGVWRVYAAADVGRSLPGGSVVRDMET FEVLSVNG+NT EEL + R+ EEE      TIV  +KSL+DID+ KDEILIRARPRSGRTHQIRLH
Subjt:  GHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEE------TIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLH

Query:  CQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
         QYLGIPI GDVKYEGVTEWN KIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWA+QALQPQQ EVNS  S K MP
Subjt:  CQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP

KAG7021056.1 RNA pseudouridine synthase 1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.50e-206; % identity: 80.69)
Query:  ITKMALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYE
        ITKMAL +++ P FSI  K+F +FP  K F FHRF+ V+NSM DSS+PVTG      ENYPVPLSPPLPAISKNLELARAM A SKSSL++LSA DVIYE
Subjt:  ITKMALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYE

Query:  DEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKIN
        DEWLIAVNKPQG+YCE+VLA+VPR LGDSA AG     IK SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDH VSKSYIAFCVG++PKWKKI 
Subjt:  DEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKIN

Query:  VKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEE----TIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRL
        VKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMET FEVLSVNG+NT EEL + R+ EEE    TIV  +KSL+DID+ KDEILIRARPRSGRTHQIRL
Subjt:  VKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEE----TIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRL

Query:  HCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
        H QYLGIPI GDVKYEGVTEWN KIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWA+QALQPQQ EVNS  S K MP
Subjt:  HCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP

XP_004144814.3 RNA pseudouridine synthase 1 [Cucumis sativus] (e value: 2.64e-266; % identity: 100)
Query:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
        MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
Subjt:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW

Query:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS
        LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS
Subjt:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS

Query:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
        KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
Subjt:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD

Query:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
        VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
Subjt:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP

XP_008453651.1 PREDICTED: RNA pseudouridine synthase 1 [Cucumis melo] (e value: 1.55e-226; % identity: 95.43)
Query:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL
        MTDSSEPVTGNCPST ENYPVPLSPPLPAISKNLELARAM ASSKSSL+ALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLL DSANAGIKT+LPEL
Subjt:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL

Query:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK
        HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKI+VKSGHGRSKFGVWRVYAAADVGRSLPGGS+VRDMETYFEVLSVNGK
Subjt:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK

Query:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ
        NTMEELQKFR+ EEETIVVHTKSLVDIDS KDE+LIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWN K YDSHELHAESLYFVHP+TGIPLKLQ
Subjt:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ

Query:  APLPSWASQALQPQQHEVNSPQSFKTMP
        APLPSWASQALQPQQ EVNSPQSFKTMP
Subjt:  APLPSWASQALQPQQHEVNSPQSFKTMP

XP_038889468.1 RNA pseudouridine synthase 1 [Benincasa hispida] (e value: 6.24e-226; % identity: 88.86)
Query:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
        MAL ISL P FS   KTFTEFP  K FC  RFKS+S SM DSSEPVTGNCPST ENYPVPLSPPLPAISKNLELARAM ASSKSSL+ALSANDVIYEDEW
Subjt:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW

Query:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS
        LIAVNKPQGIYCE+VLAAVPRLL DSA AGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVG++PKWKKIN+KSGHGRS
Subjt:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS

Query:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
        KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLS+NG+N MEE QKF + +EET+VV TKSLVDID  KDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
Subjt:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD

Query:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSP
        VKYEGVTEWN KIYD HELHAESLYFVHP+TGIPLKLQAPLPSWASQA QPQQ EVNSP
Subjt:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJY6 PseudoU_synth_2 domain-containing protein (e value: 1.28e-266; % identity: 100)
Query:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
        MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW
Subjt:  MALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEW

Query:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS
        LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS
Subjt:  LIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRS

Query:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
        KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD
Subjt:  KFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGD

Query:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
        VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
Subjt:  VKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP

A0A1S3BW87 RNA pseudouridine synthase 1 (e value: 7.50e-227; % identity: 95.43)
Query:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL
        MTDSSEPVTGNCPST ENYPVPLSPPLPAISKNLELARAM ASSKSSL+ALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLL DSANAGIKT+LPEL
Subjt:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL

Query:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK
        HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKI+VKSGHGRSKFGVWRVYAAADVGRSLPGGS+VRDMETYFEVLSVNGK
Subjt:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK

Query:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ
        NTMEELQKFR+ EEETIVVHTKSLVDIDS KDE+LIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWN K YDSHELHAESLYFVHP+TGIPLKLQ
Subjt:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ

Query:  APLPSWASQALQPQQHEVNSPQSFKTMP
        APLPSWASQALQPQQ EVNSPQSFKTMP
Subjt:  APLPSWASQALQPQQHEVNSPQSFKTMP

A0A5A7TX25 RNA pseudouridine synthase 1 (e value: 7.50e-227; % identity: 95.43)
Query:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL
        MTDSSEPVTGNCPST ENYPVPLSPPLPAISKNLELARAM ASSKSSL+ALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLL DSANAGIKT+LPEL
Subjt:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL

Query:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK
        HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKI+VKSGHGRSKFGVWRVYAAADVGRSLPGGS+VRDMETYFEVLSVNGK
Subjt:  HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK

Query:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ
        NTMEELQKFR+ EEETIVVHTKSLVDIDS KDE+LIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWN K YDSHELHAESLYFVHP+TGIPLKLQ
Subjt:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQ

Query:  APLPSWASQALQPQQHEVNSPQSFKTMP
        APLPSWASQALQPQQ EVNSPQSFKTMP
Subjt:  APLPSWASQALQPQQHEVNSPQSFKTMP

A0A6J1DQ12 RNA pseudouridine synthase 1 isoform X2 (e value: 5.80e-200; % identity: 85.15)
Query:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKT
        M D S PVTGN P+T +NYP+PLSPPLPAISKNLELARAM ASSKSSL+ALSA DVI+EDEWLIAVNKPQG+YCE+VLA+VPRLLGDSA AG     I+ 
Subjt:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG-----IKT

Query:  SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVL
        SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDH VSKSYIAFCVG++PKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVL
Subjt:  SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVL

Query:  SVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGI
        SVNG++TM+EL + R+ EEETIVV +KSL+DID+HKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGV EWN  IYDSHELHAESL+FVHPVTGI
Subjt:  SVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGI

Query:  PLKLQAPLPSWASQALQPQQHEVNSPQSFK
        PLKLQAPLPSWASQAL  QQ EV+SP + K
Subjt:  PLKLQAPLPSWASQALQPQQHEVNSPQSFK

A0A6J1DT56 RNA pseudouridine synthase 1 isoform X1 (e value: 2.79e-198; % identity: 82.65)
Query:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG--------
        M D S PVTGN P+T +NYP+PLSPPLPAISKNLELARAM ASSKSSL+ALSA DVI+EDEWLIAVNKPQG+YCE+VLA+VPRLLGDSA AG        
Subjt:  MTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAG--------

Query:  -------IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVV
               I+ SLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDH VSKSYIAFCVG++PKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVV
Subjt:  -------IKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVV

Query:  RDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAES
        RDMETYFEVLSVNG++TM+EL + R+ EEETIVV +KSL+DID+HKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGV EWN  IYDSHELHAES
Subjt:  RDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAES

Query:  LYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFK
        L+FVHPVTGIPLKLQAPLPSWASQAL  QQ EV+SP + K
Subjt:  LYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFK

SwissProt top hits (e value, % identity, alignment)
Q2QNM3 RNA pseudouridine synthase 1 (e value: 2.7e-99; % identity: 61.36)
Query:  YPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITK
        YPVP+SPP PA SK++EL RAM AS++S+ Y  S+  V++EDEWL  V+KP G+YC+ +L+A+P     +A  G + + P LHLANRLDRDTSG+MVITK
Subjt:  YPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITK

Query:  SHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIV
         +KVA KLVKAFT+H V K+Y+A C+G  P W+KI + SGHGRSK G WRVYA +DVGRSLPGGSVVRDM T FEVL +NGK    E   F   E E+I 
Subjt:  SHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIV

Query:  VHTKSLVDIDSHKDE----ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQ
        V  K+  D+ S  DE    IL+RA P+SGRTHQIRLHCQYLG PIRGDVKY GV EWN   YD H LHAESL FVHPVTG+P+  ++PLPSWA++
Subjt:  VHTKSLVDIDSHKDE----ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQ

Q68XB2 Ribosomal large subunit pseudouridine synthase C (e value: 1.8e-18; % identity: 30.62)
Query:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINV
        +IYED+ LIA+NKP  +  +    +   L  DSA   +     +  L +RLD++TSG+++I K++  + KL  AF +  V K Y A   G   K   I V
Subjt:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINV

Query:  KSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYL
        KS  G+SK    R++   D+                    S NGK                 + + K L  ++++    LI   P +GR HQ+RLH Q L
Subjt:  KSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYL

Query:  GIPIRGDVKY--EGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQAL
        G PI GD KY  + V  +++ ++    LHA ++Y    + G  +KL+A LP + ++ L
Subjt:  GIPIRGDVKY--EGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQAL

Q7XA65 RNA pseudouridine synthase 1 (e value: 2.7e-96; % identity: 60.54)
Query:  NYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVIT
        NYP P+S P P ISK++EL RAM ASSKSSL+ L+ +D++YEDE+L+AVNKP+G+YCE VL + P+++ DS++        E HLANRLDRDTSGVM+IT
Subjt:  NYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVIT

Query:  KSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNG-KNTMEELQKFRRGEEET
        KSHKVA+KLVKAFT+H + KSYIA C+G+SP W+++ V SGHGRSK G WRVYAA DVGR LPGGS VRDMET FEV+SVN  KN   EL+         
Subjt:  KSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNG-KNTMEELQKFRRGEEET

Query:  IVVHTKSLVDIDSHKDE--ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWAS
        IV   +  +      D+  +++RA PRSGRTHQIRLHCQYLGIPIRGDVKY GV EWN + ++ HELHAE L   HPVTG  + ++APLP WA+
Subjt:  IVVHTKSLVDIDSHKDE--ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWAS

Q87S65 Ribosomal large subunit pseudouridine synthase D (e value: 1.1e-17; % identity: 27.78)
Query:  DVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIK-----TSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPK
        D++YED+ +I +NKP+  +  +  A  P   G   NA +        +P   + +RLD+DT+G+MV+ K+    ++LV+A     +++ Y A  +G    
Subjt:  DVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIK-----TSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPK

Query:  WKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIR
          K++   G   +K     + A A +G         +   T++ V            + FR         HT+             IR R  +GRTHQIR
Subjt:  WKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIR

Query:  LHCQYLGIPIRGDVKYEG--------VTEWNEKI--YDSHELHAESLYFVHPVTGIPLKLQAPLPS---WASQALQPQQHEVNSPQSF
        +H  YL  P+ GD  Y G          E  + I  +D   LHA  L F HP+TG  L+  AP+P      ++AL+    E   P  F
Subjt:  LHCQYLGIPIRGDVKYEG--------VTEWNEKI--YDSHELHAESLYFVHPVTGIPLKLQAPLPS---WASQALQPQQHEVNSPQSF

Q9ZDR7 Ribosomal large subunit pseudouridine synthase C (e value: 2.5e-17; % identity: 28.68)
Query:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINV
        +IYED+ LIA+NKP  +  +    +   L  DSA   +     +  L +RLD++TSG+++I K++  + KL  AF +  V K Y A   G  P      V
Subjt:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINV

Query:  KSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYL
        KS  G+SK                      R +     + S +GK                 + + K L  ++++    LI   P +GR HQ+RLH Q L
Subjt:  KSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYL

Query:  GIPIRGDVKY--EGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQAL
        G PI GD KY  + +  +++ ++    LHA ++Y    + G  +KL+A LP + ++ +
Subjt:  GIPIRGDVKY--EGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWASQAL

Arabidopsis top hits (e value, % identity, alignment)
AT1G56345.1 Pseudouridine synthase family protein (e value: 1.9e-97; % identity: 60.54)
Query:  NYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVIT
        NYP P+S P P ISK++EL RAM ASSKSSL+ L+ +D++YEDE+L+AVNKP+G+YCE VL + P+++ DS++        E HLANRLDRDTSGVM+IT
Subjt:  NYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVIT

Query:  KSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNG-KNTMEELQKFRRGEEET
        KSHKVA+KLVKAFT+H + KSYIA C+G+SP W+++ V SGHGRSK G WRVYAA DVGR LPGGS VRDMET FEV+SVN  KN   EL+         
Subjt:  KSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNG-KNTMEELQKFRRGEEET

Query:  IVVHTKSLVDIDSHKDE--ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWAS
        IV   +  +      D+  +++RA PRSGRTHQIRLHCQYLGIPIRGDVKY GV EWN + ++ HELHAE L   HPVTG  + ++APLP WA+
Subjt:  IVVHTKSLVDIDSHKDE--ILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAESLYFVHPVTGIPLKLQAPLPSWAS

AT1G76050.2 Pseudouridine synthase family protein (e value: 3.3e-12; % identity: 26.34)
Query:  LANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT-SPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK
        + +RLD+ T+G++V+ K     + L + F  HT+ + Y++   G  SP   +I +  G   S     R+  AA     +PGG              V G 
Subjt:  LANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT-SPKWKKINVKSGHGRSKFGVWRVYAAADVGRSLPGGSVVRDMETYFEVLSVNGK

Query:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEW-------------NEKI------YDSHEL
               +++             +++  +     L+  R  +GRTHQIR H +Y+G+P+ GD  Y G                  E+I       D   L
Subjt:  NTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEW-------------NEKI------YDSHEL

Query:  HAESLYFVHPVTGIPLKLQAPLPS
        HA  L F HP TG  +K   P PS
Subjt:  HAESLYFVHPVTGIPLKLQAPLPS

AT3G19440.1 Pseudouridine synthase family protein (e value: 4.8e-11; % identity: 29.74)
Query:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL-------------HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAF
        V+Y+D  +I +NKP G             L     +GIKTS+ EL              L +RLDRD SG++V+ ++   A+ L   F + T        
Subjt:  VIYEDEWLIAVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPEL-------------HLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAF

Query:  CVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVG--------RSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEI
          G S    K NVKS        + R Y A  +G         S P   VV D +   E ++VN    +   Q             T+  V   S     
Subjt:  CVGTSPKWKKINVKSGHGRSKFGVWRVYAAADVG--------RSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEI

Query:  LIRARPRSGRTHQIRLHC-QYLGIPIRGDVKY
         +  RP +GR HQ+R+HC + LG PI GD KY
Subjt:  LIRARPRSGRTHQIRLHC-QYLGIPIRGDVKY

AT3G52260.1 Pseudouridine synthase family protein (e value: 2.4e-07; % identity: 26.36)
Query:  DVIYEDEWLIAVNKPQGIYC--------ENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT
        +V+YED+ LIA+NKP G+            VL  +    G + +       P     +RL R TSG+++  K+    +KL   F + T         VG+
Subjt:  DVIYEDEWLIAVNKPQGIYC--------ENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT

Query:  SPKWKKINVKSGHGRSKFGVWRVYAAA-----DVGRSLPGGSVVR--DMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRAR
              ++ + G GR    ++R  A       +V    P G VVR   +     V S  GK    ++    R                D  K+  L++  
Subjt:  SPKWKKINVKSGHGRSKFGVWRVYAAA-----DVGRSLPGGSVVR--DMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRAR

Query:  PRSGRTHQIRLHCQYLGIPI
         +SGR HQIR+H  Y+G P+
Subjt:  PRSGRTHQIRLHCQYLGIPI

AT3G52260.2 Pseudouridine synthase family protein (e value: 9.9e-09; % identity: 26.99)
Query:  DVIYEDEWLIAVNKPQGIYC--------ENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT
        +V+YED+ LIA+NKP G+            VL  +    G + +       P     +RL R TSG+++  K+    +KL   F + T         VG+
Subjt:  DVIYEDEWLIAVNKPQGIYC--------ENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGT

Query:  SPKWKKINVKSGHGRSKFGVWRVYAAA-----DVGRSLPGGSVVR--DMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRAR
              ++ + G GR    ++R  A       +V    P G VVR   +     V S  GK    ++    R                D  K+  L++  
Subjt:  SPKWKKINVKSGHGRSKFGVWRVYAAA-----DVGRSLPGGSVVR--DMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRAR

Query:  PRSGRTHQIRLHCQYLGIPIRGDVKY
         +SGR HQIR+H  Y+G P+ GD  Y
Subjt:  PRSGRTHQIRLHCQYLGIPIRGDVKY


Sequences
CDS sequence
CTCTTTCTAATTTCTATTACAAAAATGGCCCTCTTCATCTCACTTCATCCCAATTTCTCAATTTCCCCCAAAACCTTCACCGAATTCCCATTCACAAAATCCTTCTGTTT
CCATAGATTCAAATCCGTCAGCAATAGCATGACTGACTCATCGGAACCGGTGACCGGAAACTGCCCATCAACCGCAGAGAATTATCCGGTGCCCTTATCGCCTCCACTCC
CCGCCATATCCAAGAACTTAGAGCTCGCCAGAGCCATGGTCGCCTCCTCTAAATCCAGCCTCTATGCTTTATCGGCTAACGATGTTATTTACGAAGACGAGTGGCTCATT
GCCGTCAATAAACCTCAAGGAATCTACTGTGAAAATGTTTTGGCTGCTGTTCCTCGTCTTCTCGGTGATTCGGCTAACGCTGGGATCAAAACCAGTTTGCCAGAGCTTCA
TCTTGCTAACCGCCTTGATCGCGATACAAGTGGGGTTATGGTTATAACGAAGTCACATAAAGTTGCTTCTAAATTAGTGAAGGCATTTACTGACCACACGGTTTCAAAAT
CATATATAGCCTTCTGTGTTGGCACATCTCCAAAATGGAAAAAAATCAACGTCAAATCTGGTCATGGAAGGTCAAAGTTTGGGGTCTGGCGTGTCTATGCTGCAGCTGAT
GTGGGTCGTTCCTTACCTGGGGGTTCCGTAGTCAGAGATATGGAAACATATTTTGAAGTATTGTCTGTAAATGGAAAAAACACTATGGAGGAATTACAAAAATTCAGGAG
AGGTGAAGAAGAAACTATTGTAGTCCATACAAAATCTTTGGTGGACATTGATAGTCACAAGGACGAGATCTTGATAAGAGCACGACCTCGCAGTGGAAGAACTCATCAAA
TCCGTCTGCACTGCCAATATCTCGGAATTCCTATAAGAGGGGATGTGAAATATGAGGGTGTTACAGAGTGGAATGAAAAAATTTACGATAGCCATGAGCTTCATGCTGAA
AGCTTGTATTTTGTGCACCCTGTTACAGGTATTCCTCTCAAACTTCAAGCTCCTTTACCATCATGGGCCAGTCAAGCGTTGCAGCCCCAACAACATGAAGTGAATTCTCC
ACAGTCCTTCAAAACTATGCCTTAA
mRNA sequence
CTCTTTCTAATTTCTATTACAAAAATGGCCCTCTTCATCTCACTTCATCCCAATTTCTCAATTTCCCCCAAAACCTTCACCGAATTCCCATTCACAAAATCCTTCTGTTT
CCATAGATTCAAATCCGTCAGCAATAGCATGACTGACTCATCGGAACCGGTGACCGGAAACTGCCCATCAACCGCAGAGAATTATCCGGTGCCCTTATCGCCTCCACTCC
CCGCCATATCCAAGAACTTAGAGCTCGCCAGAGCCATGGTCGCCTCCTCTAAATCCAGCCTCTATGCTTTATCGGCTAACGATGTTATTTACGAAGACGAGTGGCTCATT
GCCGTCAATAAACCTCAAGGAATCTACTGTGAAAATGTTTTGGCTGCTGTTCCTCGTCTTCTCGGTGATTCGGCTAACGCTGGGATCAAAACCAGTTTGCCAGAGCTTCA
TCTTGCTAACCGCCTTGATCGCGATACAAGTGGGGTTATGGTTATAACGAAGTCACATAAAGTTGCTTCTAAATTAGTGAAGGCATTTACTGACCACACGGTTTCAAAAT
CATATATAGCCTTCTGTGTTGGCACATCTCCAAAATGGAAAAAAATCAACGTCAAATCTGGTCATGGAAGGTCAAAGTTTGGGGTCTGGCGTGTCTATGCTGCAGCTGAT
GTGGGTCGTTCCTTACCTGGGGGTTCCGTAGTCAGAGATATGGAAACATATTTTGAAGTATTGTCTGTAAATGGAAAAAACACTATGGAGGAATTACAAAAATTCAGGAG
AGGTGAAGAAGAAACTATTGTAGTCCATACAAAATCTTTGGTGGACATTGATAGTCACAAGGACGAGATCTTGATAAGAGCACGACCTCGCAGTGGAAGAACTCATCAAA
TCCGTCTGCACTGCCAATATCTCGGAATTCCTATAAGAGGGGATGTGAAATATGAGGGTGTTACAGAGTGGAATGAAAAAATTTACGATAGCCATGAGCTTCATGCTGAA
AGCTTGTATTTTGTGCACCCTGTTACAGGTATTCCTCTCAAACTTCAAGCTCCTTTACCATCATGGGCCAGTCAAGCGTTGCAGCCCCAACAACATGAAGTGAATTCTCC
ACAGTCCTTCAAAACTATGCCTTAA
Protein sequence
LFLISITKMALFISLHPNFSISPKTFTEFPFTKSFCFHRFKSVSNSMTDSSEPVTGNCPSTAENYPVPLSPPLPAISKNLELARAMVASSKSSLYALSANDVIYEDEWLI
AVNKPQGIYCENVLAAVPRLLGDSANAGIKTSLPELHLANRLDRDTSGVMVITKSHKVASKLVKAFTDHTVSKSYIAFCVGTSPKWKKINVKSGHGRSKFGVWRVYAAAD
VGRSLPGGSVVRDMETYFEVLSVNGKNTMEELQKFRRGEEETIVVHTKSLVDIDSHKDEILIRARPRSGRTHQIRLHCQYLGIPIRGDVKYEGVTEWNEKIYDSHELHAE
SLYFVHPVTGIPLKLQAPLPSWASQALQPQQHEVNSPQSFKTMP
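
The listed protein sequence begins with LFLISITK ahead of the first methionine, which is consistent with reading the listed CDS from its first base rather than from the first ATG. The sketch below shows one way to check this by translating a nucleotide string in frame 0 with the standard genetic code. It is an illustration only: `translate` and `CODON_TABLE` are hypothetical names, and using the standard nuclear code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_at_stop=True):
    """Translate a DNA string in frame 0; unknown codons become 'X'."""
    seq = "".join(nucleotides.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break  # stop codon reached; do not include it in the output
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the CDS sequence listed above.
print(translate("CTCTTTCTAATTTCTATTACAAAAATGGCC"))
print(translate("ATGCCTTAA"))
```

Passing the full CDS sequence (with or without its line breaks) to `translate` allows a direct comparison with the protein sequence listed in this record.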