; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024227 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024227
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPseudouridine synthase family protein
Genome locationtig00001047:4295232..4322572
RNA-Seq ExpressionSgr024227
SyntenySgr024227
Gene Ontology termsGO:0001522 - pseudouridine synthesis (biological process)
GO:0006364 - rRNA processing (biological process)
GO:0006367 - transcription initiation from RNA polymerase II promoter (biological process)
GO:0042742 - defense response to bacterium (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0012505 - endomembrane system (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
InterPro domainsIPR006145 - Pseudouridine synthase, RsuA/RluA-like
IPR006224 - Pseudouridine synthase, RluC/RluD, conserved site
IPR011039 - Transcription Factor IIF, Rap30/Rap74, interaction
IPR020103 - Pseudouridine synthase, catalytic domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR040450 - TFIIF beta subunit, HTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602326.1 General transcription factor IIF subunit 2, partial [Cucurbita argyrosperma subsp. sororia]6.1e-27474.82Show/hide
Query:  MDLDDKAIDRCSTRKVQSNNNAVVATDFLETNKADRAMWLLKCPQAVTRALSNSPDGPSCPVAKVIVSVDPLQSNDDDGSSSTEVYEDV-------LLWS
        MD DDK IDR S+RK+ SN++ V  TDFLETNKADRAMWLLKCPQ VTRALSNSPD PS PVAKVIVSVDP+QSNDDD SSSTE   ++        L +
Subjt:  MDLDDKAIDRCSTRKVQSNNNAVVATDFLETNKADRAMWLLKCPQAVTRALSNSPDGPSCPVAKVIVSVDPLQSNDDDGSSSTEVYEDV-------LLWS

Query:  YSLNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVT-GHMRPMPGMDVVSFGAAEKKKVVSKGSET
        YSLNMSTDFIPMSVFS+S+QGKFTVEGKILNKFDMKPHDQNL+ YGKLCRERTHKSMTKSRQIQVIDHVT GHMRPMPGMDV+SFGAAEKKKVVSKGSE 
Subjt:  YSLNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVT-GHMRPMPGMDVVSFGAAEKKKVVSKGSET

Query:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPE---------------------------------------------------------------
        KRLRKERGELEKI+FRLFERQPYWTSKQLIQETDQPE                                                               
Subjt:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPE---------------------------------------------------------------

Query:  --------------------------------------VRLFKVQMDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFE
                                               RL KV+MDNFLGGFTE       DI RCPFLRNINEPT+FSFSSS+AFPMPVRGAKGPIFE
Subjt:  --------------------------------------VRLFKVQMDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFE

Query:  DGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEW
        DGPNFDMAFRLFHGRDGVVPLS RSMHP+SVE KP+PSQFNPLAAKAATISLSSFGPGGPFSFDSFS+KW+NQKKKFESSKKESSS+GGNS HEAV NEW
Subjt:  DGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEW

Query:  LQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAA
        LQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPA+VAARAALA+TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAA
Subjt:  LQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAA

Query:  VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQDSLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGP
        VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLK VAS+KLTLQDS+ +ST  PVVTVKNGHCGDIESWNPVT+LQVAGP
Subjt:  VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQDSLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGP

OMO71569.1 Pseudouridine synthase, RsuA/RluB/C/D/E/F [Corchorus capsularis]3.2e-28367.25Show/hide
Query:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLA
        MD+F  G  EDS     DILRCPFLRNINEPTNFSFSS++ FPMPVRG KGPIFEDGPNFDMAFRLFHGRDGVVPLSE+S +  +  E + +P +FNPLA
Subjt:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLA

Query:  AKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAAR
        AKAATISLSSFGPGGPFSFD+FS KW N+K K + SKKESSSQGGNS HEA+GNEWLQ G CPIAKSYR VS V+PLVAK  Q PPGMK++CPPA+VAAR
Subjt:  AKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAAR

Query:  AALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVA
        AALA+TAFAKNLRPQ LP KVL IG+LGMAANVPLGIWREHTEKFSPSWF AVHAAVPFIAMLRKS+LMPK+AMAFTI AS+LGQVIGSRAER R+K VA
Subjt:  AALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVA

Query:  SKKLTLQD---SLGESTLLPVVTVKNGHCGDIESWNPVT--SLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDS
        +K+L ++    S+  ++ L VV VKNG+CG    W   T   L+ +  + R  L             RT      +  G+PWP+LNDGLFY DVV  SD 
Subjt:  SKKLTLQD---SLGESTLLPVVTVKNGHCGDIESWNPVT--SLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDS

Query:  GSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPS
                         S   LIEFYSSKYKSSAPL+GWLQRIQN QI++DG+V+ DP TILR GSE+VY RLPW+EPD P+ LEVL+ED ++IALNKPS
Subjt:  GSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPS

Query:  GLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTG
        GLQVLPGG+FQQRTVLTQL W  R Q+ S+ S+    VPVHRLGRGTSGILLCAKT++A+T+LAAYFADGT  V  N    ME G++R I+KIYRALVTG
Subjt:  GLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTG

Query:  IIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSH
        I+ +D+V I+QP+G ++YPGVA+GLYVA+ +GKPA SKV VLERD Q   TLVQVEI+SGRPHQIRIHLS IGHPLLGDPLY+AGGQP C   ++VD+S+
Subjt:  IIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSH

Query:  AEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
        A+DGGYQ+P  PVPGDCGYYLHAH++V  HP  NE + +
Subjt:  AEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

RXH94290.1 hypothetical protein DVH24_023974 [Malus domestica]0.0e+0063.83Show/hide
Query:  LNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTG-HMRPMPGMDV--VSFGAAEKKKVVSKGSET
        +++  D +PMSVFS+SSQG  +VEG+IL KFDMKPH +NLE YGKLCRERT K M KSR+IQV+++  G HMRP PGM +  +S GA+EKKK+ +KGS+ 
Subjt:  LNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTG-HMRPMPGMDV--VSFGAAEKKKVVSKGSET

Query:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPEVRLFKV----------------------QMDNFLG--GFTEDSAPFNQDILRCPFLRNINEPT
        KR RK+RGE+E I+F LFE++P  T +QLIQET+QPE  L  +                      +MD+F      ++D  P   DIL+CPFLRNINEPT
Subjt:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPEVRLFKV----------------------QMDNFLG--GFTEDSAPFNQDILRCPFLRNINEPT

Query:  NFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKK
        NFSF+SS+AFP+P RGAKGPIFEDGPNFDMAFRLFHG DGVVPLS +S +H   VE +P+P  FNPLAAKAATISLSSFG GGPFSFD+F +KWKNQ+KK
Subjt:  NFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKK

Query:  FESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAAN
          SSKK+S+S+GGNS HEAVGNEWLQ GNCPIAKSYRAVS VIPLVAKA QPPPGMK +CPPAIVAARAAL+RTAFAKNLRPQPLPAKVL IG +GMAAN
Subjt:  FESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAAN

Query:  VPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLK-TVASKKLTL----------------------QDS
        VPLGIWREHT+KFS SWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK  VASK   L                      + S
Subjt:  VPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLK-TVASKKLTL----------------------QDS

Query:  LGESTL----LPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPL
         G +T     L V+  K GHC  IE WN V SLQVA P +        S    SR+ +       Q  G+PWPDLNDGLFYNDVV++ D           
Subjt:  LGESTL----LPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPL

Query:  LAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVF
               S   LIEFYSSKY +SAPL+GWLQRI++ QIT+DG+V+ DP+TILR+GS++ Y RLPW+EPD PYLLEVL+ED ++IALNKPSGLQVLP G+F
Subjt:  LAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVF

Query:  QQRTVLTQLQWWSRKQSSSVS-EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSID
        QQRTVLTQL W + ++ S +S +  HPVP HRLGRGTSGILLCAKT+ A+TQLAAY ADGT  +    N  ME  T RKI+KIYRALVTGI+ +D+V + 
Subjt:  QQRTVLTQLQWWSRKQSSSVS-EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSID

Query:  QPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPI
        QP+G ++YPGVA+GLYVA+ +GKPA SKV VLER+    +TLVQVEIQSGRPHQIRIHLS IGHPLLGDPLYV GGQP C   DLVD+S A+DGG+++P 
Subjt:  QPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPI

Query:  KPVPGDCGYYLHAHQVVFHHPRTNECIAV
         PVPGDCGY LHAHQV   HP +NE I V
Subjt:  KPVPGDCGYYLHAHQVVFHHPRTNECIAV

RYR36534.1 hypothetical protein Ahy_A09g041495 isoform B [Arachis hypogaea]2.9e-26360.71Show/hide
Query:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS
        S   N+D ILRCPFLRNINEPTNFSFSS +AF MPVR AKGPIFEDGPNFD+AFRLFHG DGVVPLSERS    + V+ +   SQFNPLAAKAATISLSS
Subjt:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS

Query:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK
        FG GG F FD+FSEKW NQK+K +SSKKE SSQ G S HEA GN+WLQ GNCPIAKSYRAVSNV+PLVAK +Q PPGMK++CPPA+VAARAALARTAFAK
Subjt:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK

Query:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--
        NLRPQ LP KVLAIG+LGMAANVPLG+WREHT+KFSPSWFAAVHAAVPFIAMLRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK +A++KL++ D  
Subjt:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--

Query:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVA------RNPL------------------------LTIVSRLSHSRSIR
                       S   S  L +V  K  HCG+   WN V SLQ+AG  +      +N +                        L IV    ++ SI 
Subjt:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVA------RNPL------------------------LTIVSRLSHSRSIR

Query:  TM--------LGRYFQA---------------------------------IGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYS
        ++        L R+  +                                  G PWP+ NDGL Y+DVV   D G                    LI+FYS
Subjt:  TM--------LGRYFQA---------------------------------IGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYS

Query:  SKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQS
        SKYKSSAPL+GWLQRI+N+QIT+D  V+TDP+ +LR+GS++VY RLPWKEPD PY+LEVL+ED ++IALNKPSGLQVLPGG+FQQRTVLTQL+W + KQS
Subjt:  SKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQS

Query:  S-SVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV
        +    + LH VPVHRLGRGTSG+LLCAKT+LA+ +LA+YFADGT  +   R+   E G   KIAKIYRALV+GI+ +D+V I+QP+G ++YP VA GLYV
Subjt:  S-SVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV

Query:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVV
        A+ SGKPA S V VLER  QG  TLVQV+I+SGRPHQIRIHLS IGHPLLGDPLY  GGQP C  CD++++  A DGGY++P KPVPGDCGYYLHAHQ++
Subjt:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVV

Query:  FHHPRTNECIAV
          HP TNE I +
Subjt:  FHHPRTNECIAV

RYR36535.1 hypothetical protein Ahy_A09g041495 isoform C [Arachis hypogaea]9.2e-24659.32Show/hide
Query:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS
        S   N+D ILRCPFLRNINEPTNFSFSS +AF MPVR AKGPIFEDGPNFD+AFRLFHG DGVVPLSERS    + V+ +   SQFNPLAAKAATISLSS
Subjt:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS

Query:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK
        FG GG F FD+FSEKW NQK+K +SSKKE SSQ G S HEA GN+WLQ GNCPIAKSYRAVSNV+PLVAK +Q PPGMK++CPPA+VAARAALARTAFAK
Subjt:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK

Query:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--
        NLRPQ LP KVLAIG+LGMAANVPLG+WREHT+KFSPSWFAAVHAAVPFIAMLRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK +A++KL++ D  
Subjt:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--

Query:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------
                       S   S  L +V  K  HCG+   WN V SLQ+AG  +     T ++   +  S+   +  +  FQ I I                
Subjt:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------

Query:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----
            D    L YN  +    S    S+Q  LL   ++ S   + E      K             +++ + G                  LQ++Q     
Subjt:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----

Query:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR
         +IT+D  V+TDP+ +LR+GS++VY RLPWKEPD PY+LEVL+ED ++IALNKPSGLQVLPGG+FQQRTVLTQL+W + KQS+    + LH VPVHRLGR
Subjt:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR

Query:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD
        GTSG+LLCAKT+LA+ +LA+YFADGT  +   R+   E G   KIAKIYRALV+GI+ +D+V I+QP+G ++YP VA GLYVA+ SGKPA S V VLER 
Subjt:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD

Query:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
         QG  TLVQV+I+SGRPHQIRIHLS IGHPLLGDPLY  GGQP C  CD++++  A DGGY++P KPVPGDCGYYLHAHQ++  HP TNE I +
Subjt:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

TrEMBL top hitse value%identityAlignment
A0A1R3HMH9 Pseudouridine synthase, RsuA/RluB/C/D/E/F1.6e-28367.25Show/hide
Query:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLA
        MD+F  G  EDS     DILRCPFLRNINEPTNFSFSS++ FPMPVRG KGPIFEDGPNFDMAFRLFHGRDGVVPLSE+S +  +  E + +P +FNPLA
Subjt:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLA

Query:  AKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAAR
        AKAATISLSSFGPGGPFSFD+FS KW N+K K + SKKESSSQGGNS HEA+GNEWLQ G CPIAKSYR VS V+PLVAK  Q PPGMK++CPPA+VAAR
Subjt:  AKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAAR

Query:  AALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVA
        AALA+TAFAKNLRPQ LP KVL IG+LGMAANVPLGIWREHTEKFSPSWF AVHAAVPFIAMLRKS+LMPK+AMAFTI AS+LGQVIGSRAER R+K VA
Subjt:  AALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVA

Query:  SKKLTLQD---SLGESTLLPVVTVKNGHCGDIESWNPVT--SLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDS
        +K+L ++    S+  ++ L VV VKNG+CG    W   T   L+ +  + R  L             RT      +  G+PWP+LNDGLFY DVV  SD 
Subjt:  SKKLTLQD---SLGESTLLPVVTVKNGHCGDIESWNPVT--SLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDS

Query:  GSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPS
                         S   LIEFYSSKYKSSAPL+GWLQRIQN QI++DG+V+ DP TILR GSE+VY RLPW+EPD P+ LEVL+ED ++IALNKPS
Subjt:  GSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPS

Query:  GLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTG
        GLQVLPGG+FQQRTVLTQL W  R Q+ S+ S+    VPVHRLGRGTSGILLCAKT++A+T+LAAYFADGT  V  N    ME G++R I+KIYRALVTG
Subjt:  GLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTG

Query:  IIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSH
        I+ +D+V I+QP+G ++YPGVA+GLYVA+ +GKPA SKV VLERD Q   TLVQVEI+SGRPHQIRIHLS IGHPLLGDPLY+AGGQP C   ++VD+S+
Subjt:  IIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSH

Query:  AEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
        A+DGGYQ+P  PVPGDCGYYLHAH++V  HP  NE + +
Subjt:  AEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

A0A445BCX3 PseudoU_synth_2 domain-containing protein4.5e-24659.32Show/hide
Query:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS
        S   N+D ILRCPFLRNINEPTNFSFSS +AF MPVR AKGPIFEDGPNFD+AFRLFHG DGVVPLSERS    + V+ +   SQFNPLAAKAATISLSS
Subjt:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS

Query:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK
        FG GG F FD+FSEKW NQK+K +SSKKE SSQ G S HEA GN+WLQ GNCPIAKSYRAVSNV+PLVAK +Q PPGMK++CPPA+VAARAALARTAFAK
Subjt:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK

Query:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--
        NLRPQ LP KVLAIG+LGMAANVPLG+WREHT+KFSPSWFAAVHAAVPFIAMLRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK +A++KL++ D  
Subjt:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--

Query:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------
                       S   S  L +V  K  HCG+   WN V SLQ+AG  +     T ++   +  S+   +  +  FQ I I                
Subjt:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------

Query:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----
            D    L YN  +    S    S+Q  LL   ++ S   + E      K             +++ + G                  LQ++Q     
Subjt:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----

Query:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR
         +IT+D  V+TDP+ +LR+GS++VY RLPWKEPD PY+LEVL+ED ++IALNKPSGLQVLPGG+FQQRTVLTQL+W + KQS+    + LH VPVHRLGR
Subjt:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR

Query:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD
        GTSG+LLCAKT+LA+ +LA+YFADGT  +   R+   E G   KIAKIYRALV+GI+ +D+V I+QP+G ++YP VA GLYVA+ SGKPA S V VLER 
Subjt:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD

Query:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
         QG  TLVQV+I+SGRPHQIRIHLS IGHPLLGDPLY  GGQP C  CD++++  A DGGY++P KPVPGDCGYYLHAHQ++  HP TNE I +
Subjt:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

A0A445BCX7 PseudoU_synth_2 domain-containing protein1.4e-26360.71Show/hide
Query:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS
        S   N+D ILRCPFLRNINEPTNFSFSS +AF MPVR AKGPIFEDGPNFD+AFRLFHG DGVVPLSERS    + V+ +   SQFNPLAAKAATISLSS
Subjt:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS

Query:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK
        FG GG F FD+FSEKW NQK+K +SSKKE SSQ G S HEA GN+WLQ GNCPIAKSYRAVSNV+PLVAK +Q PPGMK++CPPA+VAARAALARTAFAK
Subjt:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK

Query:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--
        NLRPQ LP KVLAIG+LGMAANVPLG+WREHT+KFSPSWFAAVHAAVPFIAMLRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK +A++KL++ D  
Subjt:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--

Query:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVA------RNPL------------------------LTIVSRLSHSRSIR
                       S   S  L +V  K  HCG+   WN V SLQ+AG  +      +N +                        L IV    ++ SI 
Subjt:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVA------RNPL------------------------LTIVSRLSHSRSIR

Query:  TM--------LGRYFQA---------------------------------IGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYS
        ++        L R+  +                                  G PWP+ NDGL Y+DVV   D G                    LI+FYS
Subjt:  TM--------LGRYFQA---------------------------------IGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYS

Query:  SKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQS
        SKYKSSAPL+GWLQRI+N+QIT+D  V+TDP+ +LR+GS++VY RLPWKEPD PY+LEVL+ED ++IALNKPSGLQVLPGG+FQQRTVLTQL+W + KQS
Subjt:  SKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQS

Query:  S-SVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV
        +    + LH VPVHRLGRGTSG+LLCAKT+LA+ +LA+YFADGT  +   R+   E G   KIAKIYRALV+GI+ +D+V I+QP+G ++YP VA GLYV
Subjt:  S-SVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV

Query:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVV
        A+ SGKPA S V VLER  QG  TLVQV+I+SGRPHQIRIHLS IGHPLLGDPLY  GGQP C  CD++++  A DGGY++P KPVPGDCGYYLHAHQ++
Subjt:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVV

Query:  FHHPRTNECIAV
          HP TNE I +
Subjt:  FHHPRTNECIAV

A0A445BCZ1 PseudoU_synth_2 domain-containing protein4.5e-24659.32Show/hide
Query:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS
        S   N+D ILRCPFLRNINEPTNFSFSS +AF MPVR AKGPIFEDGPNFD+AFRLFHG DGVVPLSERS    + V+ +   SQFNPLAAKAATISLSS
Subjt:  SAPFNQD-ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSM-HPQSVEFKPSPSQFNPLAAKAATISLSS

Query:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK
        FG GG F FD+FSEKW NQK+K +SSKKE SSQ G S HEA GN+WLQ GNCPIAKSYRAVSNV+PLVAK +Q PPGMK++CPPA+VAARAALARTAFAK
Subjt:  FGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAK

Query:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--
        NLRPQ LP KVLAIG+LGMAANVPLG+WREHT+KFSPSWFAAVHAAVPFIAMLRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK +A++KL++ D  
Subjt:  NLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD--

Query:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------
                       S   S  L +V  K  HCG+   WN V SLQ+AG  +     T ++   +  S+   +  +  FQ I I                
Subjt:  ---------------SLGESTLLPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRY--FQAIGIPWP-------------

Query:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----
            D    L YN  +    S    S+Q  LL   ++ S   + E      K             +++ + G                  LQ++Q     
Subjt:  ----DLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYK-------------SSAPLEG-----------------WLQRIQN----

Query:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR
         +IT+D  V+TDP+ +LR+GS++VY RLPWKEPD PY+LEVL+ED ++IALNKPSGLQVLPGG+FQQRTVLTQL+W + KQS+    + LH VPVHRLGR
Subjt:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSS-SVSEGLHPVPVHRLGR

Query:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD
        GTSG+LLCAKT+LA+ +LA+YFADGT  +   R+   E G   KIAKIYRALV+GI+ +D+V I+QP+G ++YP VA GLYVA+ SGKPA S V VLER 
Subjt:  GTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERD

Query:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
         QG  TLVQV+I+SGRPHQIRIHLS IGHPLLGDPLY  GGQP C  CD++++  A DGGY++P KPVPGDCGYYLHAHQ++  HP TNE I +
Subjt:  EQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

A0A498JFH5 Uncharacterized protein0.0e+0063.83Show/hide
Query:  LNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTG-HMRPMPGMDV--VSFGAAEKKKVVSKGSET
        +++  D +PMSVFS+SSQG  +VEG+IL KFDMKPH +NLE YGKLCRERT K M KSR+IQV+++  G HMRP PGM +  +S GA+EKKK+ +KGS+ 
Subjt:  LNMSTDFIPMSVFSDSSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTG-HMRPMPGMDV--VSFGAAEKKKVVSKGSET

Query:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPEVRLFKV----------------------QMDNFLG--GFTEDSAPFNQDILRCPFLRNINEPT
        KR RK+RGE+E I+F LFE++P  T +QLIQET+QPE  L  +                      +MD+F      ++D  P   DIL+CPFLRNINEPT
Subjt:  KRLRKERGELEKILFRLFERQPYWTSKQLIQETDQPEVRLFKV----------------------QMDNFLG--GFTEDSAPFNQDILRCPFLRNINEPT

Query:  NFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKK
        NFSF+SS+AFP+P RGAKGPIFEDGPNFDMAFRLFHG DGVVPLS +S +H   VE +P+P  FNPLAAKAATISLSSFG GGPFSFD+F +KWKNQ+KK
Subjt:  NFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERS-MHPQSVEFKPSPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKK

Query:  FESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAAN
          SSKK+S+S+GGNS HEAVGNEWLQ GNCPIAKSYRAVS VIPLVAKA QPPPGMK +CPPAIVAARAAL+RTAFAKNLRPQPLPAKVL IG +GMAAN
Subjt:  FESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAFAKNLRPQPLPAKVLAIGLLGMAAN

Query:  VPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLK-TVASKKLTL----------------------QDS
        VPLGIWREHT+KFS SWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK  VASK   L                      + S
Subjt:  VPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLK-TVASKKLTL----------------------QDS

Query:  LGESTL----LPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPL
         G +T     L V+  K GHC  IE WN V SLQVA P +        S    SR+ +       Q  G+PWPDLNDGLFYNDVV++ D           
Subjt:  LGESTL----LPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPL

Query:  LAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVF
               S   LIEFYSSKY +SAPL+GWLQRI++ QIT+DG+V+ DP+TILR+GS++ Y RLPW+EPD PYLLEVL+ED ++IALNKPSGLQVLP G+F
Subjt:  LAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVF

Query:  QQRTVLTQLQWWSRKQSSSVS-EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSID
        QQRTVLTQL W + ++ S +S +  HPVP HRLGRGTSGILLCAKT+ A+TQLAAY ADGT  +    N  ME  T RKI+KIYRALVTGI+ +D+V + 
Subjt:  QQRTVLTQLQWWSRKQSSSVS-EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSID

Query:  QPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPI
        QP+G ++YPGVA+GLYVA+ +GKPA SKV VLER+    +TLVQVEIQSGRPHQIRIHLS IGHPLLGDPLYV GGQP C   DLVD+S A+DGG+++P 
Subjt:  QPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPI

Query:  KPVPGDCGYYLHAHQVVFHHPRTNECIAV
         PVPGDCGY LHAHQV   HP +NE I V
Subjt:  KPVPGDCGYYLHAHQVVFHHPRTNECIAV

SwissProt top hitse value%identityAlignment
O67638 Uncharacterized RNA pseudouridine synthase aq_17583.4e-2528.15Show/hide
Query:  ITMDGQVITDPETILRIGSEVVYRRLPWKEP----DTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVSEGLHPVPVHRLG
        + ++G+ +  P   LR G  V+   +P  EP         + +++ED ++  + KP GL V P   +   T++  L  +  K  SS+     P  VHRL 
Subjt:  ITMDGQVITDPETILRIGSEVVYRRLPWKEP----DTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVSEGLHPVPVHRLG

Query:  RGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLER
        + T+G+++ AK   A   L   F +                  RK  K Y+ LV G++  D  +ID P+ R  +P V +  +     GK A ++ +VL+R
Subjt:  RGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLER

Query:  DEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAVAVDVK
         E+ + TL++V+I +GR HQIR+H +SIGHP+LGD  Y                   +     K +  + G+C   L A+ + F+HP   E +   ++  
Subjt:  DEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAVAVDVK

Query:  KT
        +T
Subjt:  KT

P33640 Ribosomal large subunit pseudouridine synthase D1.7e-2432.55Show/hide
Query:  SSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYR-----RLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQ
        S + L GW   I++ ++T+DG V+  P  I+  G+++V       +  W   D    LE+++ED  ++ ++KP+GL V P    Q  T+L  L +     
Subjt:  SSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYR-----RLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQ

Query:  SSSVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV
        ++    G+    VHRL + T+G+++ AKT  A T+L A                      R +++IY A+V G+I     +ID P+GR    GV +    
Subjt:  SSSVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYV

Query:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLY
           +GK A S   VLER      T  +V++++GR HQIR+H+S IG+PL+GDP+Y
Subjt:  ATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLY

P72970 Uncharacterized RNA pseudouridine synthase slr15927.0e-4737.2Show/hide
Query:  LIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQW
        +++FYS +Y  S   + W +RI+  QI +DG+ + DP   L+ G  + Y R PW EP+ P  L VLH D +L A++KPSGL VLPGG F   TVL QL+ 
Subjt:  LIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQW

Query:  WSRKQSSSVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVA
           ++S          P+HRLG GTSG+LL  ++ L R +L+  F +                      KIYR ++      ++    Q +G++ YP + 
Subjt:  WSRKQSSSVSEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVA

Query:  QGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQP-NCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYL
          +Y A+S GKPA S   +L R  +  +T ++VEI++GRPHQIRIHL+S+G+PLLGD LY  GG P NC                       P D GY L
Subjt:  QGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQP-NCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYL

Query:  HAHQVVFHHPRTNECIAVAVDVKKTSPI
        H++Q+ F H  T + I      K T+P+
Subjt:  HAHQVVFHHPRTNECIAVAVDVKKTSPI

Q0DST9 RNA pseudouridine synthase 52.7e-11557.45Show/hide
Query:  AIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWK
        + G PWP+ N+G+ Y D  + +D+G+ ++                LIEFYS+ YKSSAPL GW++RI++ QIT+DG+V TDP+ ILR GS++VY RLPW+
Subjt:  AIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWK

Query:  EPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-----SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTF
        EP  P+LL+VL+ED ++IALNKPSGLQVLP G+FQQRTVL QLQ    K  SS          HPVPVHRLGRGTSG+LLCAKT+LA+ QLAAYFA+G  
Subjt:  EPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSV-----SEGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTF

Query:  CVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSI
           ++R+ T +    RKI+K YRALVTGI+ +DEV I QP+G ++YPGVA+GLY A SSGKPA SKV VLER +    TL+QVEI SGRPHQIRIHL+ I
Subjt:  CVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSI

Query:  GHPLLGDPLYVAGGQPNCHHCDLVD--QSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV
        GHPL+ DPLY  GGQP     +      S A DGGY++P++PVPGDCGY+LHAH +V  HP TNE I +
Subjt:  GHPLLGDPLYVAGGQPNCHHCDLVD--QSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAV

Q5M721 RNA pseudouridine synthase 52.1e-12865.54Show/hide
Query:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPW
        Q IG+PWP+LNDGL Y D + SS+S  L+++                 EFY +KYKSSAPL GW+QRIQN QI +DG+V+ DP T+LR GS++VY RLPW
Subjt:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPW

Query:  KEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCV
        KEPDTPY LEVL+ED +LIALNKPSGLQVLPGG+FQQRTVLTQLQW   K  S +   E  HPVPVHRLGRGTSGILLCAKT+LA+T+LAAYFA+GT  V
Subjt:  KEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCV

Query:  KRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGH
          + N   E GT RK++KIYRAL  GI+ +DEV I QP+G ++YPGVAQGLYVA+  GKPAFSKV+VLERD +   +LV+VEIQSGRPHQIRIHL+ +GH
Subjt:  KRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGH

Query:  PLLGDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQV
        PL+GDPLYVAGGQP C   DLVD   + AEDGGY++P + VPGDCGY+LHAHQV
Subjt:  PLLGDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQV

Arabidopsis top hitse value%identityAlignment
AT3G52260.2 Pseudouridine synthase family protein1.5e-12965.54Show/hide
Query:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPW
        Q IG+PWP+LNDGL Y D + SS+S  L+++                 EFY +KYKSSAPL GW+QRIQN QI +DG+V+ DP T+LR GS++VY RLPW
Subjt:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPW

Query:  KEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCV
        KEPDTPY LEVL+ED +LIALNKPSGLQVLPGG+FQQRTVLTQLQW   K  S +   E  HPVPVHRLGRGTSGILLCAKT+LA+T+LAAYFA+GT  V
Subjt:  KEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLGRGTSGILLCAKTRLARTQLAAYFADGTFCV

Query:  KRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGH
          + N   E GT RK++KIYRAL  GI+ +DEV I QP+G ++YPGVAQGLYVA+  GKPAFSKV+VLERD +   +LV+VEIQSGRPHQIRIHL+ +GH
Subjt:  KRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRTLVQVEIQSGRPHQIRIHLSSIGH

Query:  PLLGDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQV
        PL+GDPLYVAGGQP C   DLVD   + AEDGGY++P + VPGDCGY+LHAHQV
Subjt:  PLLGDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQV

AT3G52260.3 Pseudouridine synthase family protein8.6e-11756.36Show/hide
Query:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWL------------------------------QRIQN
        Q IG+PWP+LNDGL Y D + SS+S  L+++                 EFY +KYKSSAPL G+L                              +   +
Subjt:  QAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKYKSSAPLEGWL------------------------------QRIQN

Query:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLG
          I +DG+V+ DP T+LR GS++VY RLPWKEPDTPY LEVL+ED +LIALNKPSGLQVLPGG+FQQRTVLTQLQW   K  S +   E  HPVPVHRLG
Subjt:  EQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVS--EGLHPVPVHRLG

Query:  RGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLER
        RGTSGILLCAKT+LA+T+LAAYFA+GT  V  + N   E GT RK++KIYRAL  GI+ +DEV I QP+G ++YPGVAQGLYVA+  GKPAFSKV+VLER
Subjt:  RGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLER

Query:  DEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLL-----------------GDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQ
        D +   +LV+VEIQSGRPHQIRIHL+ +GHPL+                 GDPLYVAGGQP C   DLVD   + AEDGGY++P + VPGDCGY+LHAHQ
Subjt:  DEQGKRTLVQVEIQSGRPHQIRIHLSSIGHPLL-----------------GDPLYVAGGQPNCHHCDLVDQ--SHAEDGGYQKPIKPVPGDCGYYLHAHQ

Query:  V
        V
Subjt:  V

AT4G25030.1 unknown protein1.2e-12167.43Show/hide
Query:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAA
        MDN  G   E+      +ILRCPFLRNINEPTN SFSSS+ FP+P R  KGPIFEDGPNFD AFRLFHG+DGVVPLS+ +   ++   KP P  F+PLAA
Subjt:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARA
        KAATISLSSFG GGPF FD+FS+ +KNQKKK +SSK    ++GGN  HEA+G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKF+CP AIV ARA
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARA

Query:  ALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVAS
        A+++T FAKNLRPQPLPAKVL IG+LGMA NVPLG+WREHTEKFS SWF A+HAAVPFI +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA 
Subjt:  ALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVAS

Query:  KKLTLQ----DSLGESTLLPVVTVKNGHCGD--IESWNPVTSLQVAGPVA
        KKLTL+     S+    +       +G CGD  +  WNP+  L VA PV+
Subjt:  KKLTLQ----DSLGESTLLPVVTVKNGHCGD--IESWNPVTSLQVAGPVA

AT4G25030.2 unknown protein1.2e-12167.43Show/hide
Query:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAA
        MDN  G   E+      +ILRCPFLRNINEPTN SFSSS+ FP+P R  KGPIFEDGPNFD AFRLFHG+DGVVPLS+ +   ++   KP P  F+PLAA
Subjt:  MDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARA
        KAATISLSSFG GGPF FD+FS+ +KNQKKK +SSK    ++GGN  HEA+G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKF+CP AIV ARA
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARA

Query:  ALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVAS
        A+++T FAKNLRPQPLPAKVL IG+LGMA NVPLG+WREHTEKFS SWF A+HAAVPFI +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA 
Subjt:  ALARTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVAS

Query:  KKLTLQ----DSLGESTLLPVVTVKNGHCGD--IESWNPVTSLQVAGPVA
        KKLTL+     S+    +       +G CGD  +  WNP+  L VA PV+
Subjt:  KKLTLQ----DSLGESTLLPVVTVKNGHCGD--IESWNPVTSLQVAGPVA

AT5G45410.1 unknown protein2.3e-10964.38Show/hide
Query:  EDSAPFNQD--ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAAKAATISL
        +   PF  +  I +CPFLRNIN+PTN SF SS++FP+PV+G KGPIFEDGP FD AF+LFHG+DG+VPLS  +   +  E      QFNPLA K ATISL
Subjt:  EDSAPFNQD--ILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQFNPLAAKAATISL

Query:  SSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAF
        S+FGPGGPF F  FSEKWK Q+KK + SK + S  G +S+HEAVG+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK+RCP  IVAARAAL++TA 
Subjt:  SSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALARTAF

Query:  AKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD
         K+LRPQPLP K+LAI L+GMAANVPLG+WREHT+KFSP+WF AVHAAVPFIAMLRKS+LMPK+AMA TI AS+LGQVIGSRAER RLK VA K + +  
Subjt:  AKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQD

Query:  SLGESTLLPVVT-VKNGHCG
         +      P  + +  GHCG
Subjt:  SLGESTLLPVVT-VKNGHCG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTCGACGACAAAGCCATAGACCGTTGCAGTACTAGAAAAGTCCAAAGCAATAACAACGCCGTCGTCGCGACCGACTTCTTGGAGACTAACAAGGCGGACAGAGC
AATGTGGCTCTTGAAATGCCCGCAAGCGGTAACTCGCGCCCTTTCCAATTCGCCGGATGGCCCTTCTTGCCCGGTCGCCAAGGTCATTGTTTCCGTGGACCCTCTGCAAT
CCAACGATGACGATGGCTCTTCATCTACTGAGGTATATGAGGACGTCCTGCTATGGTCCTACTCCTTGAATATGTCCACGGACTTCATTCCGATGTCCGTATTTTCCGAT
TCATCACAAGGGAAGTTTACAGTCGAAGGGAAAATACTGAACAAATTTGACATGAAGCCCCATGATCAAAATCTTGAGAGGTATGGGAAACTCTGTCGTGAAAGAACTCA
CAAGTCTATGACCAAAAGCAGACAGATTCAGGTGATTGATCACGTCACAGGACATATGAGGCCTATGCCAGGGATGGATGTGGTGTCTTTTGGAGCTGCAGAGAAGAAAA
AAGTGGTGAGCAAGGGATCGGAAACAAAGAGGTTGAGAAAGGAGCGTGGTGAATTGGAGAAGATTCTTTTTAGGCTCTTTGAAAGACAACCATACTGGACGTCAAAACAA
TTGATTCAAGAGACTGACCAACCTGAAGTTAGGTTGTTCAAGGTACAAATGGATAATTTCCTTGGAGGCTTTACAGAGGACTCTGCACCTTTTAATCAGGATATCCTAAG
ATGTCCATTCTTGAGGAATATTAATGAGCCAACCAATTTCTCATTTTCCTCGTCCATCGCATTCCCAATGCCTGTACGTGGAGCTAAAGGCCCAATTTTTGAAGATGGTC
CCAATTTTGACATGGCATTTAGGCTTTTCCATGGTCGAGATGGAGTGGTTCCACTCTCTGAAAGATCTATGCATCCACAGAGTGTAGAATTCAAGCCATCCCCATCACAG
TTCAATCCTTTAGCTGCAAAGGCAGCCACTATTAGTCTCTCATCCTTCGGACCTGGAGGGCCCTTCAGCTTTGATTCATTTTCAGAGAAGTGGAAAAATCAGAAGAAAAA
ATTTGAGTCATCCAAGAAAGAGTCTTCTTCGCAGGGTGGAAATTCACAGCATGAAGCAGTGGGCAATGAGTGGCTGCAAATGGGGAACTGCCCAATTGCAAAGTCGTACC
GTGCTGTTAGCAATGTCATACCACTTGTTGCAAAAGCCCTTCAACCCCCACCAGGTATGAAATTTAGATGCCCGCCTGCAATAGTTGCAGCAAGAGCAGCTTTAGCACGG
ACTGCATTTGCAAAGAACCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATCGGCCTACTAGGCATGGCAGCAAACGTACCTTTAGGAATATGGAGAGAACACAC
TGAAAAATTCTCACCCTCTTGGTTTGCTGCTGTTCATGCAGCTGTTCCATTTATAGCTATGCTGAGGAAGTCTATCCTCATGCCCAAGTCAGCCATGGCGTTTACCATTG
CAGCATCAGTTTTAGGGCAGGTTATCGGCTCAAGGGCAGAGCGACTCCGGCTCAAGACAGTGGCTTCAAAAAAATTAACCCTCCAAGATTCTCTGGGCGAATCCACCCTT
TTGCCGGTGGTTACTGTGAAGAATGGTCATTGTGGCGACATAGAGAGCTGGAATCCAGTTACCAGTCTTCAGGTAGCAGGACCAGTGGCCCGGAATCCACTGTTGACAAT
CGTTTCTCGGCTTTCACATTCAAGGTCAATCCGTACAATGCTCGGAAGGTATTTTCAGGCTATCGGAATTCCATGGCCCGATCTCAACGACGGCTTGTTCTACAACGATG
TCGTTAAATCCTCCGACTCCGGTTCTCTTTCTTCTATTCAACTTCCGCTTCTCGCAGAAGAAATGGTCATTTCACGGAAGGCTTTGATCGAGTTCTATTCTAGCAAATAT
AAGAGCTCGGCTCCATTGGAGGGCTGGTTGCAGCGAATTCAGAACGAACAGATAACAATGGATGGTCAAGTTATCACAGATCCCGAAACAATTCTCAGGATTGGTTCAGA
AGTAGTCTATCGTCGACTTCCATGGAAAGAACCGGATACTCCATATTTACTTGAGGTTTTACACGAAGACGGCGAATTGATTGCTTTAAATAAACCGTCTGGCTTGCAAG
TATTACCTGGCGGTGTTTTCCAGCAACGGACAGTTTTGACGCAGCTTCAATGGTGGTCAAGAAAGCAGAGTTCCTCCGTCTCGGAAGGGTTGCATCCTGTCCCTGTCCAT
CGCCTAGGAAGGGGGACTTCAGGAATATTATTATGTGCAAAAACAAGGCTCGCTAGGACACAACTTGCTGCATATTTTGCTGATGGAACATTTTGTGTTAAGAGAAACAG
AAACAACACTATGGAGGAAGGTACAACGAGGAAAATTGCAAAAATATACCGTGCACTTGTAACTGGAATAATTCACGATGATGAGGTGAGCATTGACCAGCCAGTTGGAA
GGATGCAATATCCAGGGGTTGCTCAGGGCCTGTATGTTGCTACTTCCTCAGGGAAACCGGCTTTCAGTAAAGTCTATGTTCTTGAGAGAGACGAACAAGGAAAACGCACT
TTGGTCCAGGTGGAAATTCAGTCTGGACGACCCCACCAAATCCGCATCCATCTTTCTTCAATAGGACATCCTCTATTGGGTGATCCTCTCTATGTTGCTGGAGGCCAACC
AAACTGCCATCATTGTGATCTAGTAGATCAGAGTCATGCAGAAGACGGAGGATACCAAAAGCCTATAAAACCTGTGCCTGGAGACTGTGGCTACTACTTGCATGCACATC
AGGTGGTCTTTCATCACCCAAGAACCAATGAGTGCATAGCGGTGGCGGTGGATGTCAAAAAGACTAGTCCTATCCTTTTACTTCATTCCCCGAATTCTGAAGACAAACTA
ACTTGTTATATATGCCATGGCACTCTCCACTTGATATCCATCGTAGCTAACGCCTGTAGAGATGTCAATCGTTTTATAATCGCCAGTCTGTTAATCTACCGCTGCAGCAA
CTTACCTCCGAGATCTGAGAATCGCCATTTGATTTCTGCTCATTGTCGACAGGTTGCTGCTGAATTGGGGAGGCAGATCTCAAGTAATGATCAAATGCAGGATATCTGGA
CAGTAGGACGGTGGGATCGGGCAAAGATGAAAGCCCAGTGCGTATTCTGTTCCCGTCCCAAATTCCATAACTTCTTGAGGGACCATCCTCCTGGCCCGGAGAGCTGCAAT
GAGGTGACGCGAGAATTTATGGGTGCTGAAGAACTATCAGCTCTAGAACTTCCAAATGAGGTACACCATAGTCGATTGCAGCATTTCCAACTGATGCTGCGGCTGGAGCA
GAAATGCCTTGTTCTTTCGCCGTTTCATCAGCAGTCGCCAACACAACTGCACAATTCACAGACTGGCAGTCATCATCGAAAGCTGAATGAGAGCTATCCTTCCTATTGCT
TGCATCGGACACCACAGCATCCGCAGCAGCTATATTATGACAAACTGCTTATTCATGGACTCCATGGGATTAGATTCCCCACCGTTCATACCCGTCACAGACTTCGGCAA
GTCTTCAACCATCTGAGAGCCAACCTTGTTGAAAGGAAAGGCGTCGCCTTTAGTACCTACACCAGCTTCACCATCGCCTCCGTTAACATCGCAGTCATATCCATTTGCTT
TATCATCATTTTCTGGTGGCATTTTCTTTGCAGTACATATCAAATCCAACCCAAGGCACTTCCGTGCTTTGCTAAAGAAGACCTTGCACTGAGTACTTATGAGCGCGAGC
AGTCATCGCTGAAGCAGCACCCAACATGTCAAGAGAAGCGGCATTTGTTTCAGGATTCCATTTCTTCCCTGTTGTCACCAAATAGAAAGGAGGCAATTTTCCCGAAATCT
TTCCCAAAACATTGTAACTTCTCCATAAAAACATCCTTCTCTTCGGAGGTCCATGGGTTGATCATCGTCCTTTCCTTCTCGACAGCACACGGATCCTCAACTACTCCGTT
ATTAGAGATGAACCTTGAGGCCGTCTTATCCTTCGGGTCCAAATTTAATGCAGGCATCTTCAAGGTCCTCCGGTGCTGCTTAACCTGGGAAGAATCAGATAGCTGCTTGC
TCACGTGCTTAAGAATCTCCGTGTTAGAAACGGGGTTCAGATTTCCTGCTGCAAGGCCAGTGAAGGCATACGGAATCGGATGGAAGAGCGGTTCTTCTGATAACCACTGT
AACTTGTCCACAAACTCAACTGATATTTTTTCTGAGATTTTGCACGTAACTTCCTTACAGGAGGCACATGCAAACTCTCCTTCCACAAGGACTGGCAACAGCCTACTTAA
AGCTTCAGACGCCCTACATGCAGATTCTCTATTGGATGCCAGAATTAAATTGAACACACTATTTCCCTCGTCAGCAATTTCATACATATCACTAGAAATGGGTTCTAATG
CAATAGCTTCACTCTTTGTAAGCATGCTATTGTCAACAGAGGCCACAACATTATCTTCGACACTGCAACCAGATATTGTTAGCTTCTCTCCCATGATAACAGATTCCTCT
GCATCCAACCTCCCAGACCTAATGATATCCTTTTCCTTGACATCAGCATGCAGTTCTTCCAAGTCACTTGTGGAAAGTGCCATCTTTTTTGAAATAGTATTTTTGGAAGT
AACAACTGGCAGGGTAGCAGCTGGAGGGATCATATCAGAGACATCATTGTGCTCTTCAAAGTATTTATCCCCCACCAGCAAAGAACTGGATGATAAGGGAAAAGAAATGT
TACCCTTACTTTCAGATCTCAACCCCTTGAGTTCATTTTCAATATCACTTGATAAACACCATAATGAAGTGAAAAAGTGGCCCACAAAAACAAAAACACAAAAGAGGACT
GGCTTACCTGGCGAGGAAGCACAAACAACAGAAGAAGGGGTCGCGGGTGAGGTACAATCGAAAAACCCTGAACCTCTAGGGCCTTTTTCAATCAAGCTGGAATTTAGAGA
ATGGGTTGATTCTGCATTAACATTTGTAAAGGCAGTACCGTCAGGAACTTCTACCTTCTTTTTTTCATACTTAGCAAGTCCCTCACCCCATCCTAGCCTTGGCTTCTTCC
TGACAGTTGTCTCATCACATGGCAGAGAAGACACCGCGCACTCAGTAGGATCAGCTGAAGGAGACAGATTTTGCGACACATTTTTCAACACTGTCTCAGACTTTGTCTCG
TTAGAATCTAAAGCATCCATGCTCTTTGTGCTACTGGAATGGCCGGAAGTTGAAGTGCGTGAAGATAAGCCACCAGAACGTGTCCATTTAAGAGGCTTCCACCCTGAGGA
ACTCAATGAGTACTCCCGATCACGCGGCATGAAAAAGAGGCCAATAGCATGGGCACCCTCAAAGAATAACCGTTTTCAACGCACAGCACACGGAGCGCGCCGAATAGCAC
CGCTGGAAATCAATTCAACTATTTTGGCAACCATGTCAAACAAGACGTCTCATGAACGAAACTTCTTTCACATAAAAGATCCTAAACTATATCTCGCTAAGGCAAACGAC
AGAGAGAAAAAGAAAATCAACTCGAGAAAGCGATCGCTTCTGCCTCCTCTCACTGGACGCGCAATAACGAGTATATGCCGCTACGGCAGCCACGACGAGCCACACGGATA
A
mRNA sequenceShow/hide mRNA sequence
ATGGACCTCGACGACAAAGCCATAGACCGTTGCAGTACTAGAAAAGTCCAAAGCAATAACAACGCCGTCGTCGCGACCGACTTCTTGGAGACTAACAAGGCGGACAGAGC
AATGTGGCTCTTGAAATGCCCGCAAGCGGTAACTCGCGCCCTTTCCAATTCGCCGGATGGCCCTTCTTGCCCGGTCGCCAAGGTCATTGTTTCCGTGGACCCTCTGCAAT
CCAACGATGACGATGGCTCTTCATCTACTGAGGTATATGAGGACGTCCTGCTATGGTCCTACTCCTTGAATATGTCCACGGACTTCATTCCGATGTCCGTATTTTCCGAT
TCATCACAAGGGAAGTTTACAGTCGAAGGGAAAATACTGAACAAATTTGACATGAAGCCCCATGATCAAAATCTTGAGAGGTATGGGAAACTCTGTCGTGAAAGAACTCA
CAAGTCTATGACCAAAAGCAGACAGATTCAGGTGATTGATCACGTCACAGGACATATGAGGCCTATGCCAGGGATGGATGTGGTGTCTTTTGGAGCTGCAGAGAAGAAAA
AAGTGGTGAGCAAGGGATCGGAAACAAAGAGGTTGAGAAAGGAGCGTGGTGAATTGGAGAAGATTCTTTTTAGGCTCTTTGAAAGACAACCATACTGGACGTCAAAACAA
TTGATTCAAGAGACTGACCAACCTGAAGTTAGGTTGTTCAAGGTACAAATGGATAATTTCCTTGGAGGCTTTACAGAGGACTCTGCACCTTTTAATCAGGATATCCTAAG
ATGTCCATTCTTGAGGAATATTAATGAGCCAACCAATTTCTCATTTTCCTCGTCCATCGCATTCCCAATGCCTGTACGTGGAGCTAAAGGCCCAATTTTTGAAGATGGTC
CCAATTTTGACATGGCATTTAGGCTTTTCCATGGTCGAGATGGAGTGGTTCCACTCTCTGAAAGATCTATGCATCCACAGAGTGTAGAATTCAAGCCATCCCCATCACAG
TTCAATCCTTTAGCTGCAAAGGCAGCCACTATTAGTCTCTCATCCTTCGGACCTGGAGGGCCCTTCAGCTTTGATTCATTTTCAGAGAAGTGGAAAAATCAGAAGAAAAA
ATTTGAGTCATCCAAGAAAGAGTCTTCTTCGCAGGGTGGAAATTCACAGCATGAAGCAGTGGGCAATGAGTGGCTGCAAATGGGGAACTGCCCAATTGCAAAGTCGTACC
GTGCTGTTAGCAATGTCATACCACTTGTTGCAAAAGCCCTTCAACCCCCACCAGGTATGAAATTTAGATGCCCGCCTGCAATAGTTGCAGCAAGAGCAGCTTTAGCACGG
ACTGCATTTGCAAAGAACCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATCGGCCTACTAGGCATGGCAGCAAACGTACCTTTAGGAATATGGAGAGAACACAC
TGAAAAATTCTCACCCTCTTGGTTTGCTGCTGTTCATGCAGCTGTTCCATTTATAGCTATGCTGAGGAAGTCTATCCTCATGCCCAAGTCAGCCATGGCGTTTACCATTG
CAGCATCAGTTTTAGGGCAGGTTATCGGCTCAAGGGCAGAGCGACTCCGGCTCAAGACAGTGGCTTCAAAAAAATTAACCCTCCAAGATTCTCTGGGCGAATCCACCCTT
TTGCCGGTGGTTACTGTGAAGAATGGTCATTGTGGCGACATAGAGAGCTGGAATCCAGTTACCAGTCTTCAGGTAGCAGGACCAGTGGCCCGGAATCCACTGTTGACAAT
CGTTTCTCGGCTTTCACATTCAAGGTCAATCCGTACAATGCTCGGAAGGTATTTTCAGGCTATCGGAATTCCATGGCCCGATCTCAACGACGGCTTGTTCTACAACGATG
TCGTTAAATCCTCCGACTCCGGTTCTCTTTCTTCTATTCAACTTCCGCTTCTCGCAGAAGAAATGGTCATTTCACGGAAGGCTTTGATCGAGTTCTATTCTAGCAAATAT
AAGAGCTCGGCTCCATTGGAGGGCTGGTTGCAGCGAATTCAGAACGAACAGATAACAATGGATGGTCAAGTTATCACAGATCCCGAAACAATTCTCAGGATTGGTTCAGA
AGTAGTCTATCGTCGACTTCCATGGAAAGAACCGGATACTCCATATTTACTTGAGGTTTTACACGAAGACGGCGAATTGATTGCTTTAAATAAACCGTCTGGCTTGCAAG
TATTACCTGGCGGTGTTTTCCAGCAACGGACAGTTTTGACGCAGCTTCAATGGTGGTCAAGAAAGCAGAGTTCCTCCGTCTCGGAAGGGTTGCATCCTGTCCCTGTCCAT
CGCCTAGGAAGGGGGACTTCAGGAATATTATTATGTGCAAAAACAAGGCTCGCTAGGACACAACTTGCTGCATATTTTGCTGATGGAACATTTTGTGTTAAGAGAAACAG
AAACAACACTATGGAGGAAGGTACAACGAGGAAAATTGCAAAAATATACCGTGCACTTGTAACTGGAATAATTCACGATGATGAGGTGAGCATTGACCAGCCAGTTGGAA
GGATGCAATATCCAGGGGTTGCTCAGGGCCTGTATGTTGCTACTTCCTCAGGGAAACCGGCTTTCAGTAAAGTCTATGTTCTTGAGAGAGACGAACAAGGAAAACGCACT
TTGGTCCAGGTGGAAATTCAGTCTGGACGACCCCACCAAATCCGCATCCATCTTTCTTCAATAGGACATCCTCTATTGGGTGATCCTCTCTATGTTGCTGGAGGCCAACC
AAACTGCCATCATTGTGATCTAGTAGATCAGAGTCATGCAGAAGACGGAGGATACCAAAAGCCTATAAAACCTGTGCCTGGAGACTGTGGCTACTACTTGCATGCACATC
AGGTGGTCTTTCATCACCCAAGAACCAATGAGTGCATAGCGGTGGCGGTGGATGTCAAAAAGACTAGTCCTATCCTTTTACTTCATTCCCCGAATTCTGAAGACAAACTA
ACTTGTTATATATGCCATGGCACTCTCCACTTGATATCCATCGTAGCTAACGCCTGTAGAGATGTCAATCGTTTTATAATCGCCAGTCTGTTAATCTACCGCTGCAGCAA
CTTACCTCCGAGATCTGAGAATCGCCATTTGATTTCTGCTCATTGTCGACAGGTTGCTGCTGAATTGGGGAGGCAGATCTCAAGTAATGATCAAATGCAGGATATCTGGA
CAGTAGGACGGTGGGATCGGGCAAAGATGAAAGCCCAGTGCGTATTCTGTTCCCGTCCCAAATTCCATAACTTCTTGAGGGACCATCCTCCTGGCCCGGAGAGCTGCAAT
GAGGTGACGCGAGAATTTATGGGTGCTGAAGAACTATCAGCTCTAGAACTTCCAAATGAGGTACACCATAGTCGATTGCAGCATTTCCAACTGATGCTGCGGCTGGAGCA
GAAATGCCTTGTTCTTTCGCCGTTTCATCAGCAGTCGCCAACACAACTGCACAATTCACAGACTGGCAGTCATCATCGAAAGCTGAATGAGAGCTATCCTTCCTATTGCT
TGCATCGGACACCACAGCATCCGCAGCAGCTATATTATGACAAACTGCTTATTCATGGACTCCATGGGATTAGATTCCCCACCGTTCATACCCGTCACAGACTTCGGCAA
GTCTTCAACCATCTGAGAGCCAACCTTGTTGAAAGGAAAGGCGTCGCCTTTAGTACCTACACCAGCTTCACCATCGCCTCCGTTAACATCGCAGTCATATCCATTTGCTT
TATCATCATTTTCTGGTGGCATTTTCTTTGCAGTACATATCAAATCCAACCCAAGGCACTTCCGTGCTTTGCTAAAGAAGACCTTGCACTGAGTACTTATGAGCGCGAGC
AGTCATCGCTGAAGCAGCACCCAACATGTCAAGAGAAGCGGCATTTGTTTCAGGATTCCATTTCTTCCCTGTTGTCACCAAATAGAAAGGAGGCAATTTTCCCGAAATCT
TTCCCAAAACATTGTAACTTCTCCATAAAAACATCCTTCTCTTCGGAGGTCCATGGGTTGATCATCGTCCTTTCCTTCTCGACAGCACACGGATCCTCAACTACTCCGTT
ATTAGAGATGAACCTTGAGGCCGTCTTATCCTTCGGGTCCAAATTTAATGCAGGCATCTTCAAGGTCCTCCGGTGCTGCTTAACCTGGGAAGAATCAGATAGCTGCTTGC
TCACGTGCTTAAGAATCTCCGTGTTAGAAACGGGGTTCAGATTTCCTGCTGCAAGGCCAGTGAAGGCATACGGAATCGGATGGAAGAGCGGTTCTTCTGATAACCACTGT
AACTTGTCCACAAACTCAACTGATATTTTTTCTGAGATTTTGCACGTAACTTCCTTACAGGAGGCACATGCAAACTCTCCTTCCACAAGGACTGGCAACAGCCTACTTAA
AGCTTCAGACGCCCTACATGCAGATTCTCTATTGGATGCCAGAATTAAATTGAACACACTATTTCCCTCGTCAGCAATTTCATACATATCACTAGAAATGGGTTCTAATG
CAATAGCTTCACTCTTTGTAAGCATGCTATTGTCAACAGAGGCCACAACATTATCTTCGACACTGCAACCAGATATTGTTAGCTTCTCTCCCATGATAACAGATTCCTCT
GCATCCAACCTCCCAGACCTAATGATATCCTTTTCCTTGACATCAGCATGCAGTTCTTCCAAGTCACTTGTGGAAAGTGCCATCTTTTTTGAAATAGTATTTTTGGAAGT
AACAACTGGCAGGGTAGCAGCTGGAGGGATCATATCAGAGACATCATTGTGCTCTTCAAAGTATTTATCCCCCACCAGCAAAGAACTGGATGATAAGGGAAAAGAAATGT
TACCCTTACTTTCAGATCTCAACCCCTTGAGTTCATTTTCAATATCACTTGATAAACACCATAATGAAGTGAAAAAGTGGCCCACAAAAACAAAAACACAAAAGAGGACT
GGCTTACCTGGCGAGGAAGCACAAACAACAGAAGAAGGGGTCGCGGGTGAGGTACAATCGAAAAACCCTGAACCTCTAGGGCCTTTTTCAATCAAGCTGGAATTTAGAGA
ATGGGTTGATTCTGCATTAACATTTGTAAAGGCAGTACCGTCAGGAACTTCTACCTTCTTTTTTTCATACTTAGCAAGTCCCTCACCCCATCCTAGCCTTGGCTTCTTCC
TGACAGTTGTCTCATCACATGGCAGAGAAGACACCGCGCACTCAGTAGGATCAGCTGAAGGAGACAGATTTTGCGACACATTTTTCAACACTGTCTCAGACTTTGTCTCG
TTAGAATCTAAAGCATCCATGCTCTTTGTGCTACTGGAATGGCCGGAAGTTGAAGTGCGTGAAGATAAGCCACCAGAACGTGTCCATTTAAGAGGCTTCCACCCTGAGGA
ACTCAATGAGTACTCCCGATCACGCGGCATGAAAAAGAGGCCAATAGCATGGGCACCCTCAAAGAATAACCGTTTTCAACGCACAGCACACGGAGCGCGCCGAATAGCAC
CGCTGGAAATCAATTCAACTATTTTGGCAACCATGTCAAACAAGACGTCTCATGAACGAAACTTCTTTCACATAAAAGATCCTAAACTATATCTCGCTAAGGCAAACGAC
AGAGAGAAAAAGAAAATCAACTCGAGAAAGCGATCGCTTCTGCCTCCTCTCACTGGACGCGCAATAACGAGTATATGCCGCTACGGCAGCCACGACGAGCCACACGGATA
A
Protein sequenceShow/hide protein sequence
MDLDDKAIDRCSTRKVQSNNNAVVATDFLETNKADRAMWLLKCPQAVTRALSNSPDGPSCPVAKVIVSVDPLQSNDDDGSSSTEVYEDVLLWSYSLNMSTDFIPMSVFSD
SSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGHMRPMPGMDVVSFGAAEKKKVVSKGSETKRLRKERGELEKILFRLFERQPYWTSKQ
LIQETDQPEVRLFKVQMDNFLGGFTEDSAPFNQDILRCPFLRNINEPTNFSFSSSIAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSERSMHPQSVEFKPSPSQ
FNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAIVAARAALAR
TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKTVASKKLTLQDSLGESTL
LPVVTVKNGHCGDIESWNPVTSLQVAGPVARNPLLTIVSRLSHSRSIRTMLGRYFQAIGIPWPDLNDGLFYNDVVKSSDSGSLSSIQLPLLAEEMVISRKALIEFYSSKY
KSSAPLEGWLQRIQNEQITMDGQVITDPETILRIGSEVVYRRLPWKEPDTPYLLEVLHEDGELIALNKPSGLQVLPGGVFQQRTVLTQLQWWSRKQSSSVSEGLHPVPVH
RLGRGTSGILLCAKTRLARTQLAAYFADGTFCVKRNRNNTMEEGTTRKIAKIYRALVTGIIHDDEVSIDQPVGRMQYPGVAQGLYVATSSGKPAFSKVYVLERDEQGKRT
LVQVEIQSGRPHQIRIHLSSIGHPLLGDPLYVAGGQPNCHHCDLVDQSHAEDGGYQKPIKPVPGDCGYYLHAHQVVFHHPRTNECIAVAVDVKKTSPILLLHSPNSEDKL
TCYICHGTLHLISIVANACRDVNRFIIASLLIYRCSNLPPRSENRHLISAHCRQVAAELGRQISSNDQMQDIWTVGRWDRAKMKAQCVFCSRPKFHNFLRDHPPGPESCN
EVTREFMGAEELSALELPNEVHHSRLQHFQLMLRLEQKCLVLSPFHQQSPTQLHNSQTGSHHRKLNESYPSYCLHRTPQHPQQLYYDKLLIHGLHGIRFPTVHTRHRLRQ
VFNHLRANLVERKGVAFSTYTSFTIASVNIAVISICFIIIFWWHFLCSTYQIQPKALPCFAKEDLALSTYEREQSSLKQHPTCQEKRHLFQDSISSLLSPNRKEAIFPKS
FPKHCNFSIKTSFSSEVHGLIIVLSFSTAHGSSTTPLLEMNLEAVLSFGSKFNAGIFKVLRCCLTWEESDSCLLTCLRISVLETGFRFPAARPVKAYGIGWKSGSSDNHC
NLSTNSTDIFSEILHVTSLQEAHANSPSTRTGNSLLKASDALHADSLLDARIKLNTLFPSSAISYISLEMGSNAIASLFVSMLLSTEATTLSSTLQPDIVSFSPMITDSS
ASNLPDLMISFSLTSACSSSKSLVESAIFFEIVFLEVTTGRVAAGGIISETSLCSSKYLSPTSKELDDKGKEMLPLLSDLNPLSSFSISLDKHHNEVKKWPTKTKTQKRT
GLPGEEAQTTEEGVAGEVQSKNPEPLGPFSIKLEFREWVDSALTFVKAVPSGTSTFFFSYLASPSPHPSLGFFLTVVSSHGREDTAHSVGSAEGDRFCDTFFNTVSDFVS
LESKASMLFVLLEWPEVEVREDKPPERVHLRGFHPEELNEYSRSRGMKKRPIAWAPSKNNRFQRTAHGARRIAPLEINSTILATMSNKTSHERNFFHIKDPKLYLAKAND
REKKKINSRKRSLLPPLTGRAITSICRYGSHDEPHG