; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020930 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020930
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF547
Genome locationtig00153577:576349..585485
RNA-Seq ExpressionSgr020930
SyntenySgr020930
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139551.1 uncharacterized protein LOC101221529 [Cucumis sativus]5.2e-12949.75Show/hide
Query:  HLQVSPPNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQ
        H+   PP        PHSP SQFLKKKLSGQRK+EELEREVLMLQKLL+QEEK+HEILEGV+KQQNGSA+GISNLLPPKVKE+LAELAMVESEIARLEIQ
Subjt:  HLQVSPPNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQ

Query:  ITQLQKDLKTEQQQ----------QQQHTSN------------------------KAIKGDYCALN-HFNLEDKTDQHL--RHRKESTHPLLLEEVLKVN
        ITQLQKDLK EQQQ          QQ  T+N                        KAIKGDY  LN HF L+   +  L  R  K+S HPL     +K++
Subjt:  ITQLQKDLKTEQQQ----------QQQHTSN------------------------KAIKGDYCALN-HFNLEDKTDQHL--RHRKESTHPLLLEEVLKVN

Query:  TERVPRKSGLLVASSPLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREV
           V RKSGLLVASSPLRDPRHPSPK     PL   L      +  A   I   HPN  S           ++I +    + + +   ++ ++   +  +
Subjt:  TERVPRKSGLLVASSPLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREV

Query:  RSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGS
              S+         S    ++E+     + A + LR              P+   P + L          K        P  +K++ L     K   
Subjt:  RSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGS

Query:  KAIELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCG
        + +    +    ++ +  C+ + ++      S P KLA LMNKA +N+GGNTINAQAI+HYILRKP+  + ++D+KEA+VRKLYGLESSEPNVTF LCCG
Subjt:  KAIELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCG

Query:  TRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFE
        TRSSPAVRIYSGE VG ELERSKLEYLQAS+VVTSS+R+AVPELLVR+  +F +AD+  +VEWVCHQLPT+G+LRKSMV+CFRGHPKT PT+ TLPYDFE
Subjt:  TRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFE

Query:  FQYLLAL
        FQYLL L
Subjt:  FQYLLAL

XP_008462917.1 PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo]1.7e-12749.83Show/hide
Query:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQ--
        PHSP SQFLKKKLSGQRK+EELEREVLMLQKLL+QEEKVHEILEG+NKQQNGSA+GISNLLPPKVKE+LAELAMVESEIARLEIQITQL+KDLK EQQ  
Subjt:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQ--

Query:  --------QQQQHTSN-------------------------KAIKGDYCALNHFNLEDKTDQHL--RHRKESTHPLLLEEVLKVNTERVPRKSGLLVASS
                +QQ  T+N                         KAIKGDY   +HF L++  +  L  R  K+S HPL     +K++   V RKSGLLVASS
Subjt:  --------QQQQHTSN-------------------------KAIKGDYCALNHFNLEDKTDQHL--RHRKESTHPLLLEEVLKVNTERVPRKSGLLVASS

Query:  PLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPGLQK
        PLRDPRHPSPK     PL   L      +  A   I   HPN  S           ++I +    + + +   ++ ++   +  +      S+       
Subjt:  PLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPGLQK

Query:  LPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI-LDQ
          S    ++E+     + A + LR              P+   P + L          K        P  +K++ L     K   + +    +    ++ 
Subjt:  LPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI-LDQ

Query:  HVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAV
        +  C+ + ++      S P KLA LMNKA VNIGGNTINAQAI+HYILRKP+  + ++D+KEA+VRKLYGLESSEPNVTF LCCGTRSSPAVRIYSGE V
Subjt:  HVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAV

Query:  GAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLLAL
         AELERSKLEYLQAS+VVTSS+R+AVPELL+R+  +F +AD+  +VEWVCHQLPT+G+LRKS+V+CFRGHPKT PT+ TL YDFEFQYLL L
Subjt:  GAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLLAL

XP_022152890.1 uncharacterized protein LOC111020513 [Momordica charantia]6.8e-12151.4Show/hide
Query:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQ
        PHSP SQFLKKKLSGQRKREELEREVLMLQKLL+QEEKVHEILEGVNKQQNGSAL ISNLLPPKVKE+LAELAMVESEIARLE+QITQLQKDLKTEQQQ 
Subjt:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQ

Query:  QQ-------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKV----PLYF----
         +             H  +KAIKGDY ALN+ N +     H+      THP  L +   V    V RKSGLLVASSPLRDPRHPSPK     PL      
Subjt:  QQ-------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKV----PLYF----

Query:  ---ILATRAESIRHAASKIHPNSYSGRGKHPELAAQQAIREHNEVLELHICETAK--NIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVE
           I     E+++H      PN  S           ++I +    + + +  T++   ++ +G      H  +S+    ++       L L   + Q+  
Subjt:  ---ILATRAESIRHAASKIHPNSYSGRGKHPELAAQQAIREHNEVLELHICETAK--NIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVE

Query:  ATRSLRYLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPR
                      P+   P + L    + F  + +  K    PS     +K++ L     +   + +    +    ++ +  C+ H ++      S P 
Subjt:  ATRSLRYLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPR

Query:  KLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTS
        KLAALMNKATVNIGGN+INA AIEHYIL+KP   + ++++KE VV KLYGLES E N TFGLCCGTRSSPAVRIYSGEAV AELERSKLEYLQAS+VVTS
Subjt:  KLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTS

Query:  SRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFR-GHPKTPPTVQTLPYDFEFQYLLAL
        SRR+AVPELLVR+  +F     A  EWVCHQLPT+G+LRKSMV+CFR GH   PPT+ TLPYDFEFQYLL L
Subjt:  SRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFR-GHPKTPPTVQTLPYDFEFQYLLAL

XP_023525206.1 uncharacterized protein LOC111788877 isoform X1 [Cucurbita pepo subsp. pepo]2.2e-11949.34Show/hide
Query:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        MA    PHSP SQF+KKK+SGQRK+EELEREVLMLQKLLHQEEKVHEIL G+N QQN S L IS N LPPKVKELLAELAMVESEI RLEIQI +L+KDL
Subjt:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG
        K+E+Q+  +                                    H  +KAIKGDY ALNHF L+ K ++ LR +K STH  L  E LK+  ERVPRKSG
Subjt:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG

Query:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK
        LLV  SPLR+P+HPSPK              R   S +H  P  Y           +  HP   ++  I+  N +         + ++ +   E+     
Subjt:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK

Query:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL
        IS     L    S    ++E+     +   + LR              P+   P + L          K        P   K++ L         K +  
Subjt:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL

Query:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP
          +    ++ +  C+ + ++      S P KLAAL+NKA VNIGGNTINAQAIEH+ILRKP  ++N++++KEAVVRKLYGLESSEPNVTF LCCGTRSSP
Subjt:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP

Query:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF
        AVRIYSGEAV AELERSKLEYLQAS+VVTSSRR+AVPELLVR+  +F     AAD+ A+VEWVC+QLPT+G+LRKSMV+CFRGH KTPPTV+TLPYDFEF
Subjt:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF

Query:  QYLL
        QYLL
Subjt:  QYLL

XP_038894153.1 uncharacterized protein LOC120082868 [Benincasa hispida]1.7e-12752.31Show/hide
Query:  PSSLHLQVSPPNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIAR
        PS  H+   PP        PHSP SQFLKKK+SGQRK+EELEREVLMLQKLL+QEEKVHEILEGVNKQQNGSALG+SNLLPPKVKE+LAELAMVESEIAR
Subjt:  PSSLHLQVSPPNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIAR

Query:  LEIQITQLQKDLKTEQQQ----------QQQHTSN-------------------------KAIKGDYCALNHFNLED-KTDQHLRHRKESTHPLLLEEVL
        LEIQITQLQKDLKTEQQ           +Q  T+N                         KAIKGDY ALNHF L++ K  +      +  H LL E  L
Subjt:  LEIQITQLQKDLKTEQQQ----------QQQHTSN-------------------------KAIKGDYCALNHFNLED-KTDQHLRHRKESTHPLLLEEVL

Query:  KVNTERVPRKSGLLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIHPNSYSG----RGKHPELAAQQAIREHNEV-LELHICETAKNIKNNGARE
            ERV RKSGLLVASSPLRDPRHPSPK            S+     K  P         +  HP   ++  ++  N V + L        ++ +G   
Subjt:  KVNTERVPRKSGLLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIHPNSYSG----RGKHPELAAQQAIREHNEV-LELHICETAKNIKNNGARE

Query:  VRSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLRY--LRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAI
           H+        L     R    L SS +   E  +   Y         P+   P + L          K        P  +K++ L     K   + +
Subjt:  VRSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLRY--LRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAI

Query:  ELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDD-KEAVVRKLYGLESSEPNVTFGLCCGTR
            +    ++ +  C+ + ++      S P KLA LMNKA VNIGGNTINAQAIEHYILRK  +SSNKEDD KEAVVRKLYGLESSEPNVTF LCCGTR
Subjt:  ELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDD-KEAVVRKLYGLESSEPNVTFGLCCGTR

Query:  SSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF--AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF
        SSPAVRIYSGEAV AELERSKLEYLQAS+VVT+SRR+AVPELLVR+  +F  AAD+ A+VEWVCHQLPT+G+LRKSMV+CFR HPKT PT+ TLPYDFEF
Subjt:  SSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF--AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF

Query:  QYLLAL
        QYLL L
Subjt:  QYLLAL

TrEMBL top hitse value%identityAlignment
A0A1S3CHZ3 uncharacterized protein LOC103501181 isoform X18.1e-12849.83Show/hide
Query:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQ--
        PHSP SQFLKKKLSGQRK+EELEREVLMLQKLL+QEEKVHEILEG+NKQQNGSA+GISNLLPPKVKE+LAELAMVESEIARLEIQITQL+KDLK EQQ  
Subjt:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQ--

Query:  --------QQQQHTSN-------------------------KAIKGDYCALNHFNLEDKTDQHL--RHRKESTHPLLLEEVLKVNTERVPRKSGLLVASS
                +QQ  T+N                         KAIKGDY   +HF L++  +  L  R  K+S HPL     +K++   V RKSGLLVASS
Subjt:  --------QQQQHTSN-------------------------KAIKGDYCALNHFNLEDKTDQHL--RHRKESTHPLLLEEVLKVNTERVPRKSGLLVASS

Query:  PLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPGLQK
        PLRDPRHPSPK     PL   L      +  A   I   HPN  S           ++I +    + + +   ++ ++   +  +      S+       
Subjt:  PLRDPRHPSPKV----PLYFILATRAESIRHAASKI---HPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPGLQK

Query:  LPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI-LDQ
          S    ++E+     + A + LR              P+   P + L          K        P  +K++ L     K   + +    +    ++ 
Subjt:  LPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI-LDQ

Query:  HVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAV
        +  C+ + ++      S P KLA LMNKA VNIGGNTINAQAI+HYILRKP+  + ++D+KEA+VRKLYGLESSEPNVTF LCCGTRSSPAVRIYSGE V
Subjt:  HVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAV

Query:  GAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLLAL
         AELERSKLEYLQAS+VVTSS+R+AVPELL+R+  +F +AD+  +VEWVCHQLPT+G+LRKS+V+CFRGHPKT PT+ TL YDFEFQYLL L
Subjt:  GAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLLAL

A0A6J1DF87 uncharacterized protein LOC1110205133.3e-12151.4Show/hide
Query:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQ
        PHSP SQFLKKKLSGQRKREELEREVLMLQKLL+QEEKVHEILEGVNKQQNGSAL ISNLLPPKVKE+LAELAMVESEIARLE+QITQLQKDLKTEQQQ 
Subjt:  PHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQ

Query:  QQ-------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKV----PLYF----
         +             H  +KAIKGDY ALN+ N +     H+      THP  L +   V    V RKSGLLVASSPLRDPRHPSPK     PL      
Subjt:  QQ-------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKV----PLYF----

Query:  ---ILATRAESIRHAASKIHPNSYSGRGKHPELAAQQAIREHNEVLELHICETAK--NIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVE
           I     E+++H      PN  S           ++I +    + + +  T++   ++ +G      H  +S+    ++       L L   + Q+  
Subjt:  ---ILATRAESIRHAASKIHPNSYSGRGKHPELAAQQAIREHNEVLELHICETAK--NIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVE

Query:  ATRSLRYLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPR
                      P+   P + L    + F  + +  K    PS     +K++ L     +   + +    +    ++ +  C+ H ++      S P 
Subjt:  ATRSLRYLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPR

Query:  KLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTS
        KLAALMNKATVNIGGN+INA AIEHYIL+KP   + ++++KE VV KLYGLES E N TFGLCCGTRSSPAVRIYSGEAV AELERSKLEYLQAS+VVTS
Subjt:  KLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTS

Query:  SRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFR-GHPKTPPTVQTLPYDFEFQYLLAL
        SRR+AVPELLVR+  +F     A  EWVCHQLPT+G+LRKSMV+CFR GH   PPT+ TLPYDFEFQYLL L
Subjt:  SRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFR-GHPKTPPTVQTLPYDFEFQYLLAL

A0A6J1G939 uncharacterized protein LOC111452055 isoform X15.8e-11848.84Show/hide
Query:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        MA    PHSP SQ++KKK+SGQRK+EELEREVLMLQKLLHQEEKVHEIL+G+  QQN S L IS N LPPKVKELLAELAMVESEI RLEIQIT+LQKDL
Subjt:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG
        K+E+Q+  +                                    H  +KAIKGDY ALN F L+ K ++ LR +K STH  L  E LK+  ERVPRKSG
Subjt:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG

Query:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK
        LLV  SPLR+P+HPSPK              R   S +H  P  Y           +  HP   ++  I+  N +         + ++ +   E+     
Subjt:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK

Query:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL
        IS     L    S    ++E+     +   + LR              P+   P + L          K        P   K++ L         K +  
Subjt:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL

Query:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP
          +    ++ +  C+ + ++      S P KLAAL+NKA +NIGGNTINAQAIEH+ILRKP   +N++++KEAVVRKLYGLESS+PNVTF LCCGTRSSP
Subjt:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP

Query:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF
        AVRIYSGEAV AELERSKLEYLQAS+VVTSSRR+AVPELLVR+  +F     AAD+ A+VEWVC+QLPT+G+LRKSMV+CFRGH KTPPTV+TLPYDFEF
Subjt:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF

Query:  QYLL
        QYLL
Subjt:  QYLL

A0A6J1G9N0 uncharacterized protein LOC111452055 isoform X26.4e-11749.01Show/hide
Query:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        MA    PHSP SQ++KKK+SGQRK+EELEREVLMLQKLLHQEEKVHEIL+G+  QQN S L IS N LPPKVKELLAELAMVESEI RLEIQIT+LQKDL
Subjt:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG
        K+E+Q+  +                                    H  +KAIKGDY ALN F L+ K ++ LR +K STH  L  E LK+  ERVPRKSG
Subjt:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG

Query:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK
        LLV  SPLR+P+HPSPK              R   S +H  P  Y           +  HP   ++  I+  N +         + ++ +   E+     
Subjt:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH--PNSYSG---------RGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFK

Query:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL
        IS     L    S    ++E+     +   + LR              P+   P + L          K        P   KV  L         K +  
Subjt:  ISAFLPGLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIEL

Query:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP
          +    ++ +  C+ + ++      S P KLAAL+NKA +NIGGNTINAQAIEH+ILRKP   +N++++KEAVVRKLYGLESS+PNVTF LCCGTRSSP
Subjt:  PTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSP

Query:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF
        AVRIYSGEAV AELERSKLEYLQAS+VVTSSRR+AVPELLVR+  +F     AAD+ A+VEWVC+QLPT+G+LRKSMV+CFRGH KTPPTV+TLPYDFEF
Subjt:  AVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF-----AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEF

Query:  QYLL
        QYLL
Subjt:  QYLL

A0A6J1KFI3 uncharacterized protein LOC1114933971.5e-11848.91Show/hide
Query:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        MA    PHSP SQ++KKK+SGQRK+EELEREVLMLQKLLHQEEKVHEIL G+  QQN S L IS N LPPKVKELL ELAMVESEI RLEIQI +LQKDL
Subjt:  MATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGIS-NLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG
        K+E+Q+  +                                    H  +KAIKGDY ALNHF L+ K ++ LR +K +    L  E LK+  ERVPRKSG
Subjt:  KTEQQQQQQ------------------------------------HTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSG

Query:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH-----PNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLP
        LLV  SPLR+P+HPSPK         R  S  H  S+ +     P   + +  HP   ++  I+  N +         + ++ +   E+     IS    
Subjt:  LLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIH-----PNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLP

Query:  GLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI
         L    S    ++E+     +   + LR              P+   P + L          K        P   K++ L         K +    +   
Subjt:  GLQKLPSRERLKLESSATQRVEATRSLR------YLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI

Query:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYS
         ++ +  C+ + ++      S P KLAAL+NKA VNIGGNTINAQAIEH+ILRKP+ ++N++ +KEAVVRKLYGLESSEPNVTF LCCGTRSSPAVRIYS
Subjt:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYS

Query:  GEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF------AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLL
        GEAV AELERSKLEYLQAS+VVTSSRR+AVPELLVR+  +F      AAD+ A+VEWVC+QLPT+G+LRKSMV+CFRGH KTPPTV+TLPYDFEFQYLL
Subjt:  GEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDF------AADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39690.1 Protein of unknown function, DUF5475.3e-3943.46Show/hide
Query:  CLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKE-DDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGA
        C+ HA++      S   +L  LMNKA++N+GG  +NA AIEH++LR P    +   D+KE ++R  YGL  SEPNVTF LC G+ SSPA+R+Y+ + V  
Subjt:  CLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKE-DDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGA

Query:  ELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLLAL
        +L R+++EYL+AS+ V+S ++I VP+LL  +  DFA D+ +L+EW+  QLP +G L+  +++C +   K P    V+   Y  EF+YLL+L
Subjt:  ELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLLAL

AT2G39690.1 Protein of unknown function, DUF5473.4e-0138.67Show/hide
Query:  LQKLLHQEEKVHEILEGVNKQQNGSALGISNL-LPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQQQ
        L++ L +EE V   L         S   +S+L LPP+  EL+ ELA+VE+EI  L+ +I +L+  L +EQ+Q Q+
Subjt:  LQKLLHQEEKVHEILEGVNKQQNGSALGISNL-LPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQQQ

AT2G39690.2 Protein of unknown function, DUF5478.2e-4043.75Show/hide
Query:  CLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKE--DDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVG
        C+ HA++      S   +L  LMNKA++N+GG  +NA AIEH++LR P    +K+  D+KE ++R  YGL  SEPNVTF LC G+ SSPA+R+Y+ + V 
Subjt:  CLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKE--DDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVG

Query:  AELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLLAL
         +L R+++EYL+AS+ V+S ++I VP+LL  +  DFA D+ +L+EW+  QLP +G L+  +++C +   K P    V+   Y  EF+YLL+L
Subjt:  AELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLLAL

AT4G37080.1 Protein of unknown function, DUF5478.8e-3425.71Show/hide
Query:  SMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        S    AV +S  S  + ++ + + K+ +L ++V  L++ L QEE VH  LE    +  G+   + + LP    ELLAE+A++E E+ RLE Q+   ++ L
Subjt:  SMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQHTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPL-----------------LLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVP
            Q+    +S + ++       + N   ++ +H R +  S H                   +    L  + + V  +SG  V S      +  SPK  
Subjt:  KTEQQQQQQHTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPL-----------------LLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVP

Query:  LYFILATRAESIRHAASKIHPNS------------------YSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPG-LQKL
        L  +  T+   +R   ++   N+                   S + K P +  + A  +H+E  +L + +   +          S  +      G +   
Subjt:  LYFILATRAESIRHAASKIHPNS------------------YSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPG-LQKL

Query:  PSRERLKLESSATQRVEATRSLRYLRQRRVDPK---GYWPLQKLG-HLHINF-------HGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI
         S + LK   +   R+ +++ +       +DP      W  ++LG + H +         G +I+    +H    ++K L  + +      +    +   
Subjt:  PSRERLKLESSATQRVEATRSLRYLRQRRVDPK---GYWPLQKLG-HLHINF-------HGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI

Query:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKP----VLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAV
         ++ +  C+ +A++      + P  + ALM KAT+ +GG+++NA  IEH+ILR P             +E      +GLE SEP VTF L CG+ SSPAV
Subjt:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKP----VLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAV

Query:  RIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLL
        R+Y+   V  ELE +K +YLQAS+ ++ + ++ +P++L    LDFA D+ +L++WVC QLP    LR+    C     K      VQ +PYDF F+ LL
Subjt:  RIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLL

AT4G37080.2 Protein of unknown function, DUF5478.8e-3425.71Show/hide
Query:  SMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDL
        S    AV +S  S  + ++ + + K+ +L ++V  L++ L QEE VH  LE    +  G+   + + LP    ELLAE+A++E E+ RLE Q+   ++ L
Subjt:  SMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDL

Query:  KTEQQQQQQHTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPL-----------------LLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVP
            Q+    +S + ++       + N   ++ +H R +  S H                   +    L  + + V  +SG  V S      +  SPK  
Subjt:  KTEQQQQQQHTSNKAIKGDYCALNHFNLEDKTDQHLRHRKESTHPL-----------------LLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVP

Query:  LYFILATRAESIRHAASKIHPNS------------------YSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPG-LQKL
        L  +  T+   +R   ++   N+                   S + K P +  + A  +H+E  +L + +   +          S  +      G +   
Subjt:  LYFILATRAESIRHAASKIHPNS------------------YSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREVRSHFKISAFLPG-LQKL

Query:  PSRERLKLESSATQRVEATRSLRYLRQRRVDPK---GYWPLQKLG-HLHINF-------HGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI
         S + LK   +   R+ +++ +       +DP      W  ++LG + H +         G +I+    +H    ++K L  + +      +    +   
Subjt:  PSRERLKLESSATQRVEATRSLRYLRQRRVDPK---GYWPLQKLG-HLHINF-------HGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGI

Query:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKP----VLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAV
         ++ +  C+ +A++      + P  + ALM KAT+ +GG+++NA  IEH+ILR P             +E      +GLE SEP VTF L CG+ SSPAV
Subjt:  -LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKP----VLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAV

Query:  RIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLL
        R+Y+   V  ELE +K +YLQAS+ ++ + ++ +P++L    LDFA D+ +L++WVC QLP    LR+    C     K      VQ +PYDF F+ LL
Subjt:  RIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTP--PTVQTLPYDFEFQYLL

AT5G60720.1 Protein of unknown function, DUF5474.5e-7031.87Show/hide
Query:  PNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQ---NGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQ
        P S++   VPHS  +    KK +GQ+K+EE+E+EV ML+++L QEEK  EILE V K Q   + S+L +   LPPK+KEL+ EL++VE EI+RLEIQI+ 
Subjt:  PNSMATHAVPHSPTSQFLKKKLSGQRKREELEREVLMLQKLLHQEEKVHEILEGVNKQQ---NGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQ

Query:  LQKDLKTEQQQ-------------------------------------------------------------------------QQQHTSNKAIKGDYCA
        LQ +LK EQ +                                                                         +  H  NKAIKGDY A
Subjt:  LQKDLKTEQQQ-------------------------------------------------------------------------QQQHTSNKAIKGDYCA

Query:  LNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIHPNSYSGRGK-----HPELA
        +  F    K+++ +   ++  H  +  E  KV      +K   + + SPLR+PR+ SP  P    +A    S+      +        G+     HP   
Subjt:  LNHFNLEDKTDQHLRHRKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIHPNSYSGRGK-----HPELA

Query:  AQQAIREHNEV-LELHICETAKNIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLRYLRQRRVDPKGYWPLQK-----LGHLH--
        A+  ++  N + + L        ++  G     ++F +S+              +++++ +   ++   + Y   R+ DP G + ++      +G     
Subjt:  AQQAIREHNEV-LELHICETAKNIKNNGAREVRSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLRYLRQRRVDPKGYWPLQK-----LGHLH--

Query:  INFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYIL
        + F  S +  KC    S     +K++ L         K +    +    ++    C+ H ++      +  R  + + NKAT+N+GG  I+A  IEH IL
Subjt:  INFHGSKIHFKCQLHPSH----KKVKGLDEQFAKSGSKAIELPTETGI-LDQHVQCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYIL

Query:  RKPVLSSNKED-DKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAA--------
        RK   S+  +D  +E ++RKLYG+E+++PN+TF L CGTRSSPAVRIY+GE V  ELE+SKLEYLQAS+VVT+++RI +PELL+++  DF          
Subjt:  RKPVLSSNKED-DKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYLQASMVVTSSRRIAVPELLVRNGLDFAA--------

Query:  ---DVNALVEWVCHQLPTAGTLRKSMVDCFRG----HPKTPPTVQTLPYDFEFQYLLAL
            + +LV+WVC+QLPT+G+LRKSMVDCF+        +   V+ +PYDFEFQYLLA+
Subjt:  ---DVNALVEWVCHQLPTAGTLRKSMVDCFRG----HPKTPPTVQTLPYDFEFQYLLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCACTTCCGACTCCAACTCTCACCCTCTGCCTGATCAGTTTCTGAACCCAAACCCAAAAGCCATCGCCATCATCATCCTCATCCTCCCCCTGAAAGCTCCTCTCCA
TCTTCCATCACCCTTTCTTTTCCAAATAAAACCTCCCCACCACATCACAGAGCCGCAGCAACCACCGCCACCGCCACCGCCACCGCCACCGCCACCACCCCCATCTTCTC
TTCACCTCCAGGTTTCGCCGCCTAACTCCATGGCTACCCACGCCGTTCCTCACTCTCCCACCTCCCAATTCCTGAAGAAGAAACTTAGTGGGCAAAGAAAGAGGGAGGAG
CTTGAAAGAGAGGTATTAATGCTTCAGAAGTTGCTCCATCAAGAAGAAAAGGTGCATGAGATTTTAGAAGGTGTTAATAAACAGCAAAATGGTTCAGCACTTGGCATTTC
AAACTTGCTTCCTCCCAAGGTGAAGGAGCTGTTAGCAGAGCTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAA
CTGAGCAGCAGCAACAACAACAACATACTTCAAACAAGGCCATCAAGGGAGATTACTGTGCTCTCAATCACTTCAACTTGGAGGACAAAACTGATCAACATCTTAGACAT
AGGAAAGAGAGTACTCATCCTCTTCTTCTTGAGGAGGTTCTTAAAGTCAATACTGAAAGAGTTCCTAGAAAGAGTGGGCTTCTAGTTGCTTCATCTCCGTTGAGAGACCC
CCGTCACCCTTCTCCAAAGGTACCGTTATATTTCATTTTGGCAACGAGAGCGGAATCCATCAGACATGCCGCCTCCAAAATCCATCCCAATTCTTATTCAGGCAGAGGAA
AACATCCAGAATTGGCAGCCCAACAAGCTATCAGAGAGCATAATGAAGTGCTTGAACTTCATATATGTGAGACTGCTAAGAACATCAAGAACAATGGAGCTAGAGAAGTC
AGGTCCCATTTCAAGATCTCTGCATTCCTCCCTGGGCTCCAGAAGCTTCCGAGTCGAGAACGGCTTAAACTCGAGTCTTCTGCTACACAAAGAGTCGAGGCAACAAGATC
CTTACGGTATCTTCGACAACGAAGAGTCGATCCCAAGGGATATTGGCCCTTACAAAAACTTGGTCATCTTCACATCAACTTCCATGGATCCAAAATCCATTTCAAGTGCC
AGCTCCATCCCTCTCATAAGAAAGTTAAGGGTCTTGATGAGCAATTTGCAAAGAGTGGATCTAAGGCCATTGAGTTACCAACAGAAACTGGCATTTTGGATCAACATGTA
CAATGCTTGTATCATGCATGGATTTCTCCAATATGGTGTGCCTCCTTCCCCAGAAAATTAGCTGCTCTGATGAACAAGGCAACGGTTAACATCGGAGGCAACACCATAAA
CGCACAAGCCATAGAACATTACATTCTAAGAAAACCAGTACTGTCTTCTAACAAGGAGGACGACAAAGAAGCCGTGGTTCGGAAGCTTTACGGCCTGGAATCGTCGGAAC
CTAACGTCACTTTTGGCCTCTGCTGCGGAACTCGGTCTTCTCCGGCGGTGAGAATATACAGCGGCGAGGCGGTGGGCGCAGAGCTGGAGAGATCGAAGCTGGAATATCTG
CAGGCGTCGATGGTGGTGACGAGCTCCAGAAGGATCGCCGTGCCGGAGCTGCTGGTTCGGAACGGGCTGGATTTCGCGGCGGACGTGAACGCGCTGGTGGAGTGGGTCTG
CCACCAGCTGCCGACCGCCGGAACTCTGAGGAAATCCATGGTGGACTGCTTCAGAGGCCACCCCAAGACGCCGCCCACCGTCCAGACTCTGCCTTACGATTTCGAGTTTC
AGTATCTCTTGGCTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCACTTCCGACTCCAACTCTCACCCTCTGCCTGATCAGTTTCTGAACCCAAACCCAAAAGCCATCGCCATCATCATCCTCATCCTCCCCCTGAAAGCTCCTCTCCA
TCTTCCATCACCCTTTCTTTTCCAAATAAAACCTCCCCACCACATCACAGAGCCGCAGCAACCACCGCCACCGCCACCGCCACCGCCACCGCCACCACCCCCATCTTCTC
TTCACCTCCAGGTTTCGCCGCCTAACTCCATGGCTACCCACGCCGTTCCTCACTCTCCCACCTCCCAATTCCTGAAGAAGAAACTTAGTGGGCAAAGAAAGAGGGAGGAG
CTTGAAAGAGAGGTATTAATGCTTCAGAAGTTGCTCCATCAAGAAGAAAAGGTGCATGAGATTTTAGAAGGTGTTAATAAACAGCAAAATGGTTCAGCACTTGGCATTTC
AAACTTGCTTCCTCCCAAGGTGAAGGAGCTGTTAGCAGAGCTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAA
CTGAGCAGCAGCAACAACAACAACATACTTCAAACAAGGCCATCAAGGGAGATTACTGTGCTCTCAATCACTTCAACTTGGAGGACAAAACTGATCAACATCTTAGACAT
AGGAAAGAGAGTACTCATCCTCTTCTTCTTGAGGAGGTTCTTAAAGTCAATACTGAAAGAGTTCCTAGAAAGAGTGGGCTTCTAGTTGCTTCATCTCCGTTGAGAGACCC
CCGTCACCCTTCTCCAAAGGTACCGTTATATTTCATTTTGGCAACGAGAGCGGAATCCATCAGACATGCCGCCTCCAAAATCCATCCCAATTCTTATTCAGGCAGAGGAA
AACATCCAGAATTGGCAGCCCAACAAGCTATCAGAGAGCATAATGAAGTGCTTGAACTTCATATATGTGAGACTGCTAAGAACATCAAGAACAATGGAGCTAGAGAAGTC
AGGTCCCATTTCAAGATCTCTGCATTCCTCCCTGGGCTCCAGAAGCTTCCGAGTCGAGAACGGCTTAAACTCGAGTCTTCTGCTACACAAAGAGTCGAGGCAACAAGATC
CTTACGGTATCTTCGACAACGAAGAGTCGATCCCAAGGGATATTGGCCCTTACAAAAACTTGGTCATCTTCACATCAACTTCCATGGATCCAAAATCCATTTCAAGTGCC
AGCTCCATCCCTCTCATAAGAAAGTTAAGGGTCTTGATGAGCAATTTGCAAAGAGTGGATCTAAGGCCATTGAGTTACCAACAGAAACTGGCATTTTGGATCAACATGTA
CAATGCTTGTATCATGCATGGATTTCTCCAATATGGTGTGCCTCCTTCCCCAGAAAATTAGCTGCTCTGATGAACAAGGCAACGGTTAACATCGGAGGCAACACCATAAA
CGCACAAGCCATAGAACATTACATTCTAAGAAAACCAGTACTGTCTTCTAACAAGGAGGACGACAAAGAAGCCGTGGTTCGGAAGCTTTACGGCCTGGAATCGTCGGAAC
CTAACGTCACTTTTGGCCTCTGCTGCGGAACTCGGTCTTCTCCGGCGGTGAGAATATACAGCGGCGAGGCGGTGGGCGCAGAGCTGGAGAGATCGAAGCTGGAATATCTG
CAGGCGTCGATGGTGGTGACGAGCTCCAGAAGGATCGCCGTGCCGGAGCTGCTGGTTCGGAACGGGCTGGATTTCGCGGCGGACGTGAACGCGCTGGTGGAGTGGGTCTG
CCACCAGCTGCCGACCGCCGGAACTCTGAGGAAATCCATGGTGGACTGCTTCAGAGGCCACCCCAAGACGCCGCCCACCGTCCAGACTCTGCCTTACGATTTCGAGTTTC
AGTATCTCTTGGCTCTCTGA
Protein sequenceShow/hide protein sequence
MPTSDSNSHPLPDQFLNPNPKAIAIIILILPLKAPLHLPSPFLFQIKPPHHITEPQQPPPPPPPPPPPPPPSSLHLQVSPPNSMATHAVPHSPTSQFLKKKLSGQRKREE
LEREVLMLQKLLHQEEKVHEILEGVNKQQNGSALGISNLLPPKVKELLAELAMVESEIARLEIQITQLQKDLKTEQQQQQQHTSNKAIKGDYCALNHFNLEDKTDQHLRH
RKESTHPLLLEEVLKVNTERVPRKSGLLVASSPLRDPRHPSPKVPLYFILATRAESIRHAASKIHPNSYSGRGKHPELAAQQAIREHNEVLELHICETAKNIKNNGAREV
RSHFKISAFLPGLQKLPSRERLKLESSATQRVEATRSLRYLRQRRVDPKGYWPLQKLGHLHINFHGSKIHFKCQLHPSHKKVKGLDEQFAKSGSKAIELPTETGILDQHV
QCLYHAWISPIWCASFPRKLAALMNKATVNIGGNTINAQAIEHYILRKPVLSSNKEDDKEAVVRKLYGLESSEPNVTFGLCCGTRSSPAVRIYSGEAVGAELERSKLEYL
QASMVVTSSRRIAVPELLVRNGLDFAADVNALVEWVCHQLPTAGTLRKSMVDCFRGHPKTPPTVQTLPYDFEFQYLLAL