; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019143 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019143
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153285:1003432..1013453
RNA-Seq ExpressionSgr019143
SyntenySgr019143
Gene Ontology termsGO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0031323 - regulation of cellular metabolic process (biological process)
GO:0051171 - regulation of nitrogen compound metabolic process (biological process)
GO:0060255 - regulation of macromolecule metabolic process (biological process)
GO:0071480 - cellular response to gamma radiation (biological process)
GO:0080090 - regulation of primary metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0140513 - nuclear protein-containing complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
InterPro domainsIPR036420 - BRCT domain superfamily
IPR034732 - Extended PHD (ePHD) domain
IPR032867 - DYW domain
IPR017907 - Zinc finger, RING-type, conserved site
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR002885 - Pentatricopeptide repeat
IPR001841 - Zinc finger, RING-type
IPR001357 - BRCT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594433.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0064.78Show/hide
Query:  MRSNQPSFLYSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNT
        MRSNQ  +L ST  +   TFPENHLSF+LRKCISLLQLCGSSQ KLKQIHAFS+RHGVPP NPD  K+LIFSLVS+SAPMSYA+RIF QIQAPNIFTWNT
Subjt:  MRSNQPSFLYSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNT

Query:  MIRGHAESENPRPAVELYCQMH-TSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAW
        M+RG +ESENPRPAVELY QMH  SSI PDTHTFPFL KAVAKLMD R+GEGIHSI +RNGFDSLLFVQNSLVHMYSVFG AESAY+VFEF+ ERDLVAW
Subjt:  MIRGHAESENPRPAVELYCQMH-TSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAW

Query:  NSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVS
        NSVINGFALNGMANEALTLF+EMGS GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGL QNPHASNALLDLYSKCGNI  A+KVFDEM E+SVVS
Subjt:  NSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVS

Query:  WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPN
        WTSLIVGLAVNGLGNEAL+ FGELERKGLKPSEITFVGVLYACSHCGMVDEGF+YFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAY+YIRN+ +PPN
Subjt:  WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPN

Query:  AVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM
        AVIWRTLLG+CT+HGHLELGE+ARA+IL LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM
Subjt:  AVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM

Query:  LAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLE
        L KITE LKIEGY+PRTVNVLADIEEEEKETALSHHTEKVAIAF LVNT P TPIRIMKNLRVCADCHLAIKLISKVFEREI+V     F       H  
Subjt:  LAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLE

Query:  KMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFA
        K G       CLSLLNSAASLGCNHVFCY CIEKSMKS SNCPVCKVP+RRREVRPAPHMDNLV+IYKSMEAASG+NIF++QNLSSAKLS          
Subjt:  KMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFA

Query:  IPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYKK-------------------------------PSGRILLL---------PLKSAIAL
            DG+ QVEGD  GSKR NAE +E  AY Q+   K   + +K                               P+    L+         P KSA+A 
Subjt:  IPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYKK-------------------------------PSGRILLL---------PLKSAIAL

Query:  EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKC--VASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQVE
        E KGQPVLSPFFWLRERD EDEKSNQQSD++QPTDSM M+VLSFSD+KDSLDESPSK     V   P   + L    +     R   PELC SPFKLQVE
Subjt:  EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKC--VASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQVE

Query:  DNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLM----------------------------CPLRKVTVRRIEWEYETYPK--RETECLSQKQ
        D ARTEIALLAAAP EEPR QN NG SN +GGIPD + +                               LRK   R  E   + Y     ETECL QKQ
Subjt:  DNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLM----------------------------CPLRKVTVRRIEWEYETYPK--RETECLSQKQ

Query:  EHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRV---------------------------
        EH V+NS  +LK+ TKRSK+KMH GIDAN  TLENVP DPINL TPN   ENFGT+TS   EVEKVSQFP++                            
Subjt:  EHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRV---------------------------

Query:  ---------------------------------ARMADPARKR---------TLV-----------------------------LMQVSQFPEKSPKNGG
                                         +RM    RK+         T++                             + +VSQFPEK+ KNG 
Subjt:  ---------------------------------ARMADPARKR---------TLV-----------------------------LMQVSQFPEKSPKNGG

Query:  ARRAQRVVDCRRQSKKQKLDSWTTNC--SNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSE--
          + QR+V  RR+SKKQKL S          FNQNQ D CAIP LTT P  IATSTD+ R HEKQ+KSSS CI TSEYDN+ QEK V AQ N  ++SE  
Subjt:  ARRAQRVVDCRRQSKKQKLDSWTTNC--SNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSE--

Query:  --------------------------NLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC
                                     C+F         +G ++ +   KPI TDD+KNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC
Subjt:  --------------------------NLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC

Query:  SCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQ------------------------GSDIKNG----------KAAVL
          CGNKGAALGCYEK CRKSFHVPCAK MPQCQWDTENFVMLCPLH DSKLPSQ                          +I N           K  + 
Subjt:  SCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQ------------------------GSDIKNG----------KAAVL

Query:  LSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFR
         SALTTAERE V EFQRLSGV VLQKWDDSVTHIIASTDENEACKRT KILMGILKGKWVL ++WI+ACIQAM  IEE+RFEITLDV GIRDGP+LGR R
Subjt:  LSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFR

Query:  VLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAK
        VLNN                        K+FSGFKFFF ADF+PSYKGYLQQLVTAAGGT+LLRKPVSSNQN PCSSP  +
Subjt:  VLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAK

KAG7026439.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0063.36Show/hide
Query:  MRSNQPSFLYSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNT
        MRSNQ  +L ST  +   TFPENHLSF+LRKCISLLQLCGSSQ KLKQIHAFS+RHGVPP NPD  K+LIFSLVS+SAPMSYA+RIF QIQAPNIFTWNT
Subjt:  MRSNQPSFLYSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNT

Query:  MIRGHAESENPRPAVELYCQMH-TSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAW
        M+RG AESENPRPAVELY QMH  SSI PDTHTFPFL KAVAKLMD R+GEGIHSI +RNGFDSLLFVQNSLVHMYSVFG AESAY+VFEF+ ERDLVAW
Subjt:  MIRGHAESENPRPAVELYCQMH-TSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAW

Query:  NSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVS
        NSVINGFALNGMANEALTLF+EMGS GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGL QNPHASNALLDLYSKCGNI  A+KVFDEM E+SVVS
Subjt:  NSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVS

Query:  WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPN
        WTSLIVGLAVNGLGNEAL+ FGELERKGLKPSEITFVGVLYACSHCGMVDEGF+YFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAY+YIRN+ +PPN
Subjt:  WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPN

Query:  AVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM
        AVIWRTLLG+CT+HGHLELGE+ARA+IL LEPKH GDYVLLSNLYASERRWLDVQ+VRRTMLMKGV+KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM
Subjt:  AVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAM

Query:  LAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLE
        L KITE LKIEGY+PRTVNVLADIEEEEKETALSHHTEKVAIAF LVNT P TPIRIMKNLRVCADCHLAIKLISK               S G+ G   
Subjt:  LAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLE

Query:  KMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFA
        +M         L   +    +  +      CIEKSMKS SNCPVCKVP+RRREVRPAPHMDNLV+IYKSMEAASG+NIF++QNLSSAKLS          
Subjt:  KMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFA

Query:  IPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYKK-------------------------------PSGRILLL---------PLKSAIAL
            DG+ QVEGD  GSKR NAE +E  AY Q+   K   + +K                               P+    L+         P KSA+A 
Subjt:  IPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYKK-------------------------------PSGRILLL---------PLKSAIAL

Query:  EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKC--VASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQVE
        E KGQPVLSPF WLRERD EDEKSNQQSD++QPTDSM M+VLSFSD+KDSLDESPSK     V   P   + L    +     R   PELC SPFKLQVE
Subjt:  EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKC--VASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQVE

Query:  DNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLM----------------------------CPLRKVTVRRIEWEYETYPK--RETECLSQKQ
        D ARTEIALLAAAP EEPR QN NG SN +GGIPD + +                               LRK   R  E   + Y     ETECL QKQ
Subjt:  DNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLM----------------------------CPLRKVTVRRIEWEYETYPK--RETECLSQKQ

Query:  EHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRV---------------------------
        EH V+NS  +LK+ TKRSK+KMH GIDAN  TLENVP DPINL TPN   ENFGT+TS   EVEKVSQFP++                            
Subjt:  EHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRV---------------------------

Query:  ---------------------------------ARMADPARKR---------TLV-----------------------------LMQVSQFPEKSPKNGG
                                         +RM    RK+         T++                             + +VSQFPEK+ KNG 
Subjt:  ---------------------------------ARMADPARKR---------TLV-----------------------------LMQVSQFPEKSPKNGG

Query:  ARRAQRVVDCRRQSKKQKLDSWTTNC--SNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSE--
        A + QR+V  RR+SKKQKL S          FNQNQ D CAIP LTT P  IATSTD+ R HEKQ+KSSS CI TSEYDN+ QEK V AQ N  +LSE  
Subjt:  ARRAQRVVDCRRQSKKQKLDSWTTNC--SNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSE--

Query:  --------------------------NLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC
                                     C+F         +G ++ +   KPI TDD+KNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC
Subjt:  --------------------------NLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKC

Query:  SCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQ------------------------GSDIKNG----------KAAVL
          CGNKGAALGCYEK CRKSFHVPCAK MPQCQWDTENFVMLCPLH DSKLPSQ                          +I N           K  + 
Subjt:  SCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQ------------------------GSDIKNG----------KAAVL

Query:  LSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFR
         SALTTAERE V EFQRLSGV VLQKWDDSVTHIIASTDENEACKRT KILMGILKGKWVL ++WI+ACIQAM  IEE+RFEITLDV GIRDGP+LGR R
Subjt:  LSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFR

Query:  VLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAK
        VLNN                        K+FSGFKFFF ADF+PSYKGYLQQLVTAAGGT+LLRKPVSSNQN PCSSP  +
Subjt:  VLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAK

RXH98919.1 hypothetical protein DVH24_011244 [Malus domestica]0.0e+0046.98Show/hide
Query:  YSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESE
        +ST    + T P+N +  +L+KCI+LLQ C SS+ KL+QIHAFS+RHGVP +NPDMGK+LIFS+VSLSAP+SYA +IF QI+ PN+FTWNTMIRG+AE +
Subjt:  YSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESE

Query:  NPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALN
        NP P ++LY QMH SS+ PDTHT+PFLLKAVAKLMDV  GE IHS+ +RNGF+SL+FV+N+L+H+Y+  GH ESA++VFE + ERDLVAWNSVINGFALN
Subjt:  NPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALN

Query:  GMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAV
        G  NEALT+FR+M  EGVEPDGFTMVSLLSAC E+G +ALG R+HVYMLKVGLT N HA+NALLDLY+KCG+IR+A KVF+ M+E+SVVSWT+L+VGLAV
Subjt:  GMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAV

Query:  NGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGS
        NG G EALELF EL+ +GL P+EITFVGVLYACSHCGMVDEGF YFR MKE YGI PRIEH+GC+VDLL RAG V+ AYEYI N+PM PNAVIWRTLLG+
Subjt:  NGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGS

Query:  CTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKI
        CT+HG L LGE ARA+I  LEP HSGDYVLLSNLYASE RW DVQ VRRTML +GVRKTPGYS+VEL+N +YEF MGDRSHPQSEE YAMLA+IT LLK 
Subjt:  CTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKI

Query:  EGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLF-------ISMGDLGHLEKM-
        +GY+P T NVLADIEEEEKE ALS+H+EK+A+AF L+NT+P  PIRI KNLRVCADCHLA KLISKV++REI+V     F        S  D    E+  
Subjt:  EGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLF-------ISMGDLGHLEKM-

Query:  ---------------------------------------------GRELKCP-------ICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYR
                                                     G+  + P        C      ++ L C  +   ACI KSMKS SNCPVCK+PYR
Subjt:  ---------------------------------------------GRELKCP-------ICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYR

Query:  RREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK---KPSG
        RREVRPAPHMDNLVSIYKSME ASGINIFVTQN  S K S              DG +Q E D +  +    E  +     + LR KG  K K   K SG
Subjt:  RREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK---KPSG

Query:  RILLLP----------------------------------------LKSAIALEEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDI
         I + P                                        LK    L EKG+PVLSPFFWL ++D   E  +Q S  +Q TD  + +V +FSDI
Subjt:  RILLLP----------------------------------------LKSAIALEEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDI

Query:  KDSLDES------PSKRKCVASHPMTWISLIVKCLNGLRELFPELCSSPFK-----LQVEDNARTEIAL--------------LAAAPYEEPRNQNPNG-
        KDS DE+      P +    ++H        +     +    PEL  SP K     +QV ++    + L              L+A      RN+   G 
Subjt:  KDSLDES------PSKRKCVASHPMTWISLIVKCLNGLRELFPELCSSPFK-----LQVEDNARTEIAL--------------LAAAPYEEPRNQNPNG-

Query:  --------SSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQKQE-----HDVNNSFGSLKSGTKRSKKKMHFGIDANMGTL-ENVPADPI
                +S  N    +        + +T R+    + T P  +    S+  E         N+  SL    KRSKK+ H  +     TL +N   D  
Subjt:  --------SSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQKQE-----HDVNNSFGSLKSGTKRSKKKMHFGIDANMGTL-ENVPADPI

Query:  NLGTPNGDLENFGTKTSASLEVEKVSQ--------FPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCR--------RQSKKQKLDSWT
                      K      V    Q          + V+ +   A K T   + +   P  + K   +R   R +  +        +++KK K+ S  
Subjt:  NLGTPNGDLENFGTKTSASLEVEKVSQ--------FPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCR--------RQSKKQKLDSWT

Query:  TNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQA---NCSQLSENLQCSF----------WKNGPLLQ
               + N  D+      T              G++  EKS   C + +    + +   +        C  +   +QC+F             G    
Subjt:  TNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQA---NCSQLSENLQCSF----------WKNGPLLQ

Query:  WKPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSK
         KP+  D     KV+HAH  C EWAPNVYF+ DTAINLEAEL+RSR+IKCSCC  KGAALGCYEKSCR+SFHV CAK MPQC+WDT+NFVMLCP+H  SK
Subjt:  WKPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSK

Query:  LPSQGSDIK------------------------------------NGKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTL
        LP++  + +                                    + K  +  S+LT AERE+VA F+RLSG+ VL+ WD +VTH+IASTDEN ACKRTL
Subjt:  LPSQGSDIK------------------------------------NGKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTL

Query:  KILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKG
        K+LMGIL+GKW+L ++WI AC +AM  ++E+ +EI +D++GIRDGPRLGR R+ N                         K+F GFKF+F+ DF+PSYKG
Subjt:  KILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKG

Query:  YLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAKFSSFTVLS
        YLQ LV AAGGT+L RKPV   Q    S+ + K  ++ + S
Subjt:  YLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAKFSSFTVLS

RYQ81897.1 hypothetical protein Ahy_B10g100490 isoform B [Arachis hypogaea]0.0e+0047.88Show/hide
Query:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA
        S ++T  +N +  ++ KCI+LLQ   SS  KL+QIHAFS+RHGVP  NPDMGK+LIF++VSLSAPMSYA  +F  +  PN+FTWNTMIRG+AES++P PA
Subjt:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA

Query:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE
        +  Y +M  S + PDTHT+PFLLKAVAK ++VR GE IHS+ +R GF+SL+FVQN L+HMY+V G  ESA++VFE ++ERDLVAWNSVINGFALNG  +E
Subjt:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE

Query:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN
        ALTLFREM  EG+EPDGFT+VSLLSAC ELGA+ LG RVHV++LKVGLT+N H +N+LLD Y+KCG+IR+A +VF EM +++VVS+TSLIVGLAVNG G 
Subjt:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN

Query:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG
        EALE F E+E + +KPSEITFVGVLYACSHCGM++EGFNYFRRMK+EYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+PM PN+VIWRTLLG+CT+HG
Subjt:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG

Query:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP
        HL LGE+AR+ +L LEPKHSGDYVLLSN+YA+ERRW DVQ VRR+ML  GVRKTPGYS+VEL NRVYEF MGDRSHPQS++ YA+L KITELLK+EGY+P
Subjt:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP

Query:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL
         T NVLADIEEEEKE ALS+H+EKVAIAF ++NT P TPIR+MKNLRVCADCH+AIKLISKV++REI++     F       H  + G            
Subjt:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL

Query:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-
            S  C   +C+ CI KSMKS S+CPVCK+P+  REVRP+P   NLV+IYK ME +SG+N+F+TQN   +K S+            E+   Q + DC 
Subjt:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-

Query:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE
                     KR   + +     N     K  +  KK    P   +   P K+  +                          E+  PVLSPFFWLRE
Subjt:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE

Query:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE
         +  +  S Q    +   +S      SFSD+KDS DE+P+K         T   L    L     R   PEL SSP KLQ      + +N      A  E
Subjt:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE

Query:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE
        +    ++ Y E  N     S N+   + DV+   +  P R         +Y+   ++  E  +Q+Q  +  +S   +   + RS   +    D +    E
Subjt:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE

Query:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN
           A   NLG  N   +   F T+T +++ V         V R  +       V M    +   S K       Q +   C+ ++S KQKL+        
Subjt:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN

Query:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--
            QNQ DD  + G  +A S ++  TD+N                        R  +K + SS                    C      D+  +EK  
Subjt:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--

Query:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL
           HV    NC   +E  QC F         +GP++ +   KP++ +    SKV+H+H NC EWAPNVYFD D AINLEAE+SRSRRIKC  CG KGAAL
Subjt:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL

Query:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE
        GCYEKSCR+SFHVPCAK    C+WD ENFVMLCPLHT   LP + S  K                                K  +  SAL+  ER+ V+E
Subjt:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE

Query:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC
        FQR++ VTVL+KWD SVTH+IASTDEN+ACKRTLK+LMGIL+GKW+L I+WIKAC++ M  ++E+ +EI LD+HGI DGP LGR RVLN           
Subjt:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC

Query:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
                      K+F GFKF+F+ DF+PSYKGYLQ LVTAAGG +L RKPVS + N
Subjt:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

RYQ81898.1 hypothetical protein Ahy_B10g100490 isoform A [Arachis hypogaea]0.0e+0049.04Show/hide
Query:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA
        S ++T  +N +  ++ KCI+LLQ   SS  KL+QIHAFS+RHGVP  NPDMGK+LIF++VSLSAPMSYA  +F  +  PN+FTWNTMIRG+AES++P PA
Subjt:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA

Query:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE
        +  Y +M  S + PDTHT+PFLLKAVAK ++VR GE IHS+ +R GF+SL+FVQN L+HMY+V G  ESA++VFE ++ERDLVAWNSVINGFALNG  +E
Subjt:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE

Query:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN
        ALTLFREM  EG+EPDGFT+VSLLSAC ELGA+ LG RVHV++LKVGLT+N H +N+LLD Y+KCG+IR+A +VF EM +++VVS+TSLIVGLAVNG G 
Subjt:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN

Query:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG
        EALE F E+E + +KPSEITFVGVLYACSHCGM++EGFNYFRRMK+EYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+PM PN+VIWRTLLG+CT+HG
Subjt:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG

Query:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP
        HL LGE+AR+ +L LEPKHSGDYVLLSN+YA+ERRW DVQ VRR+ML  GVRKTPGYS+VEL NRVYEF MGDRSHPQS++ YA+L KITELLK+EGY+P
Subjt:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP

Query:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL
         T NVLADIEEEEKE ALS+H+EKVAIAF ++NT P TPIR+MKNLRVCADCH+AIKLISK   R          I+M +   LE+MGREL+CPICLSLL
Subjt:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL

Query:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-
        NS ASL C+HVFC  CI KSMKS S+CPVCK+P+  REVRP+P   NLV+IYK ME +SG+N+F+TQN   +K S+            E+   Q + DC 
Subjt:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-

Query:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE
                     KR   + +     N     K  +  KK    P   +   P K+  +                          E+  PVLSPFFWLRE
Subjt:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE

Query:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE
         +  +  S Q    +   +S      SFSD+KDS DE+P+K         T   L    L     R   PEL SSP KLQ      + +N      A  E
Subjt:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE

Query:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE
        +    ++ Y E  N     S N+   + DV+   +  P R         +Y+   ++  E  +Q+Q  +  +S   +   + RS   +    D +    E
Subjt:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE

Query:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN
           A   NLG  N   +   F T+T +++ V         V R  +       V M    +   S K       Q +   C+ ++S KQKL+        
Subjt:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN

Query:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--
            QNQ DD  + G  +A S ++  TD+N                        R  +K + SS                    C      D+  +EK  
Subjt:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--

Query:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL
           HV    NC   +E  QC F         +GP++ +   KP++ +    SKV+H+H NC EWAPNVYFD D AINLEAE+SRSRRIKC  CG KGAAL
Subjt:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL

Query:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE
        GCYEKSCR+SFHVPCAK    C+WD ENFVMLCPLHT   LP + S  K                                K  +  SAL+  ER+ V+E
Subjt:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE

Query:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC
        FQR++ VTVL+KWD SVTH+IASTDEN+ACKRTLK+LMGIL+GKW+L I+WIKAC++ M  ++E+ +EI LD+HGI DGP LGR RVLN           
Subjt:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC

Query:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
                      K+F GFKF+F+ DF+PSYKGYLQ LVTAAGG +L RKPVS + N
Subjt:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

TrEMBL top hitse value%identityAlignment
A0A371H1H8 Pentatricopeptide repeat-containing protein (Fragment)0.0e+0046.99Show/hide
Query:  SFLY---STARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIR
        SFL+    T+    ++F  N    VL KCI+LLQ CGSS+ KL+QIHAFS+RHGVP  NPDMGK+LIF++VSLSAPMSYA  +F  I  PN+FTWNTMIR
Subjt:  SFLY---STARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIR

Query:  GHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVI
        G+AES+NPRPA+  Y QM  S + PDTHT+PFLLKA++KL+++R GE IHS+ IRNGF+SLLFVQNSL+H+Y+  G  ESAY+VFE + ERDLVAWNSVI
Subjt:  GHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVI

Query:  NGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSL
        NGFALNG  NEALTLFREMG EGVEPDGFT+VSLLSAC ELGA+ LG RVHVY+LKVGL +N H +N+LLD Y+KCG IR+A  VF EM +++ VSWTSL
Subjt:  NGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSL

Query:  IVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIW
        IVGLAVNG G EALELF E+E  GL PSEITFVGVLYACSHCGM+DEGF+YFRRMKEEYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+P+ PNAVIW
Subjt:  IVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIW

Query:  RTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKI
        RTLLG+CT+HGHL LGE+AR+ +L LEPKHSGDYVLLSNLYASERRW DVQ +RR+ML  GV+KTPGYSLVEL NRVYEF MGDRSHPQS++ YA+L KI
Subjt:  RTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKI

Query:  TELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGR
        TELLK+EGY+PRT NVLADIEEE+KE ALS+H+EKVA+AF L+NT+P TPIR+MKNLRVCADCH+AIKL+SK++ REI++     F       H  + G 
Subjt:  TELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGR

Query:  ELKCPICLSLLNSAASLGCN-------HV-----FC-YACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLS-
              C  L       G N       H+     FC +  ++ S        V ++     +VRPAPHMD+LVSIYKSMEAASGIN+FVTQN+   KLS 
Subjt:  ELKCPICLSLLNSAASLGCN-------HV-----FC-YACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLS-

Query:  EFLSTLFFFAIPPEDGDKQVEGDCN------GSKRLNAEITEITAYNQKLRR--------------KGHIKYKK----PSGRILLLPLK-----------
        +++           DG+KQ EGD +      G+ R      + T   +KL++              K     KK    P   +   PLK           
Subjt:  EFLSTLFFFAIPPEDGDKQVEGDCN------GSKRLNAEITEITAYNQKLRR--------------KGHIKYKK----PSGRILLLPLK-----------

Query:  --------SAIALE-----EKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLNG----
                S +  E     EK  PVLSPFFWLRE +++ EK +Q +D +Q  D +  +  SFSD+KDS DE+PS    VA          V   +     
Subjt:  --------SAIALE-----EKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLNG----

Query:  --LRELFPELCSSPFKLQVEDNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQK---QEHDVNNS
           R   PEL  SP K+QV D             YE+  NQ    ++++         L      +    I+ E      R  + L +     +   ++ 
Subjt:  --LRELFPELCSSPFKLQVEDNARTEIALLAAAPYEEPRNQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQK---QEHDVNNS

Query:  FGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEV--EKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSP------K
           +K  TKRS+K         +G L+N              ++     ++ SLEV  E+   +  + + +   +R+   V    S  P           
Subjt:  FGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEV--EKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSP------K

Query:  NGGARRAQR-------VVDCRRQSKK--QKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKS---------------------S
        +GG    +        +  C+++++K   + ++  +       QN      + G  ++  ++ T  +    + KQ KS                     S
Subjt:  NGGARRAQR-------VVDCRRQSKK--QKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKS---------------------S

Query:  SVCIITSE-------------------YDNIIQEKHVDAQ-----ANCSQLSENLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVE
        S CI  ++                    ++I +EKH           C    +  QC F         +GP++ +   KP+  D    S+V H H NC E
Subjt:  SVCIITSE-------------------YDNIIQEKHVDAQ-----ANCSQLSENLQCSFW-------KNGPLLQW---KPINTDDIKNSKVVHAHWNCVE

Query:  WAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGS------------DIKN-
        WAPNVYFDGD AINLEAE+SRSRRIKC       AALGCYEKSCR+SFHV CAK   QC+WDT+NFVMLCPLH  S LP +GS            + KN 
Subjt:  WAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGS------------DIKN-

Query:  -----------------GKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIE
                          K  +  SAL+  ER  V+EF+R+S VTV+  WD SVTH+IASTDEN AC+RTLK+L+GIL+GKW+L I+WIK+C++ M  ++
Subjt:  -----------------GKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIE

Query:  EQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQ
        E+ +EI +D+HGIRDGPRLGR RVLN                         K+F G+KF+F+ DF+PSYKGYLQ LV AAGG +L RKPVS +Q
Subjt:  EQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQ

A0A444WWR5 Uncharacterized protein0.0e+0047.88Show/hide
Query:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA
        S ++T  +N +  ++ KCI+LLQ   SS  KL+QIHAFS+RHGVP  NPDMGK+LIF++VSLSAPMSYA  +F  +  PN+FTWNTMIRG+AES++P PA
Subjt:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA

Query:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE
        +  Y +M  S + PDTHT+PFLLKAVAK ++VR GE IHS+ +R GF+SL+FVQN L+HMY+V G  ESA++VFE ++ERDLVAWNSVINGFALNG  +E
Subjt:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE

Query:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN
        ALTLFREM  EG+EPDGFT+VSLLSAC ELGA+ LG RVHV++LKVGLT+N H +N+LLD Y+KCG+IR+A +VF EM +++VVS+TSLIVGLAVNG G 
Subjt:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN

Query:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG
        EALE F E+E + +KPSEITFVGVLYACSHCGM++EGFNYFRRMK+EYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+PM PN+VIWRTLLG+CT+HG
Subjt:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG

Query:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP
        HL LGE+AR+ +L LEPKHSGDYVLLSN+YA+ERRW DVQ VRR+ML  GVRKTPGYS+VEL NRVYEF MGDRSHPQS++ YA+L KITELLK+EGY+P
Subjt:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP

Query:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL
         T NVLADIEEEEKE ALS+H+EKVAIAF ++NT P TPIR+MKNLRVCADCH+AIKLISKV++REI++     F       H  + G            
Subjt:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL

Query:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-
            S  C   +C+ CI KSMKS S+CPVCK+P+  REVRP+P   NLV+IYK ME +SG+N+F+TQN   +K S+            E+   Q + DC 
Subjt:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-

Query:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE
                     KR   + +     N     K  +  KK    P   +   P K+  +                          E+  PVLSPFFWLRE
Subjt:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE

Query:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE
         +  +  S Q    +   +S      SFSD+KDS DE+P+K         T   L    L     R   PEL SSP KLQ      + +N      A  E
Subjt:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE

Query:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE
        +    ++ Y E  N     S N+   + DV+   +  P R         +Y+   ++  E  +Q+Q  +  +S   +   + RS   +    D +    E
Subjt:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE

Query:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN
           A   NLG  N   +   F T+T +++ V         V R  +       V M    +   S K       Q +   C+ ++S KQKL+        
Subjt:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN

Query:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--
            QNQ DD  + G  +A S ++  TD+N                        R  +K + SS                    C      D+  +EK  
Subjt:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--

Query:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL
           HV    NC   +E  QC F         +GP++ +   KP++ +    SKV+H+H NC EWAPNVYFD D AINLEAE+SRSRRIKC  CG KGAAL
Subjt:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL

Query:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE
        GCYEKSCR+SFHVPCAK    C+WD ENFVMLCPLHT   LP + S  K                                K  +  SAL+  ER+ V+E
Subjt:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE

Query:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC
        FQR++ VTVL+KWD SVTH+IASTDEN+ACKRTLK+LMGIL+GKW+L I+WIKAC++ M  ++E+ +EI LD+HGI DGP LGR RVLN           
Subjt:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC

Query:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
                      K+F GFKF+F+ DF+PSYKGYLQ LVTAAGG +L RKPVS + N
Subjt:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

A0A444WWS0 Uncharacterized protein0.0e+0049.04Show/hide
Query:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA
        S ++T  +N +  ++ KCI+LLQ   SS  KL+QIHAFS+RHGVP  NPDMGK+LIF++VSLSAPMSYA  +F  +  PN+FTWNTMIRG+AES++P PA
Subjt:  SVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPA

Query:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE
        +  Y +M  S + PDTHT+PFLLKAVAK ++VR GE IHS+ +R GF+SL+FVQN L+HMY+V G  ESA++VFE ++ERDLVAWNSVINGFALNG  +E
Subjt:  VELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANE

Query:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN
        ALTLFREM  EG+EPDGFT+VSLLSAC ELGA+ LG RVHV++LKVGLT+N H +N+LLD Y+KCG+IR+A +VF EM +++VVS+TSLIVGLAVNG G 
Subjt:  ALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGN

Query:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG
        EALE F E+E + +KPSEITFVGVLYACSHCGM++EGFNYFRRMK+EYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+PM PN+VIWRTLLG+CT+HG
Subjt:  EALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHG

Query:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP
        HL LGE+AR+ +L LEPKHSGDYVLLSN+YA+ERRW DVQ VRR+ML  GVRKTPGYS+VEL NRVYEF MGDRSHPQS++ YA+L KITELLK+EGY+P
Subjt:  HLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIP

Query:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL
         T NVLADIEEEEKE ALS+H+EKVAIAF ++NT P TPIR+MKNLRVCADCH+AIKLISK   R          I+M +   LE+MGREL+CPICLSLL
Subjt:  RTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLL

Query:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-
        NS ASL C+HVFC  CI KSMKS S+CPVCK+P+  REVRP+P   NLV+IYK ME +SG+N+F+TQN   +K S+            E+   Q + DC 
Subjt:  NSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDC-

Query:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE
                     KR   + +     N     K  +  KK    P   +   P K+  +                          E+  PVLSPFFWLRE
Subjt:  ----------NGSKRLNAEITEITAYNQKLRRKGHIKYKK----PSGRILLLPLKSAIA------------------------LEEKGQPVLSPFFWLRE

Query:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE
         +  +  S Q    +   +S      SFSD+KDS DE+P+K         T   L    L     R   PEL SSP KLQ      + +N      A  E
Subjt:  RDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDN------ARTE

Query:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE
        +    ++ Y E  N     S N+   + DV+   +  P R         +Y+   ++  E  +Q+Q  +  +S   +   + RS   +    D +    E
Subjt:  IALLAAAPYEEPRNQNPNGSSNQNGGIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLE

Query:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN
           A   NLG  N   +   F T+T +++ V         V R  +       V M    +   S K       Q +   C+ ++S KQKL+        
Subjt:  NVPADPINLGTPNGDLEN--FGTKTSASLEVEKVSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVV-DCR-RQSKKQKLDSWTTNCSN

Query:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--
            QNQ DD  + G  +A S ++  TD+N                        R  +K + SS                    C      D+  +EK  
Subjt:  F-FNQNQHDDCAIPGLTTAPSAIATSTDQN------------------------RGHEKQEKSSSV------------------CIITSEYDNIIQEK--

Query:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL
           HV    NC   +E  QC F         +GP++ +   KP++ +    SKV+H+H NC EWAPNVYFD D AINLEAE+SRSRRIKC  CG KGAAL
Subjt:  ---HVDAQANCSQLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAAL

Query:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE
        GCYEKSCR+SFHVPCAK    C+WD ENFVMLCPLHT   LP + S  K                                K  +  SAL+  ER+ V+E
Subjt:  GCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAE

Query:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC
        FQR++ VTVL+KWD SVTH+IASTDEN+ACKRTLK+LMGIL+GKW+L I+WIKAC++ M  ++E+ +EI LD+HGI DGP LGR RVLN           
Subjt:  FQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDC

Query:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
                      K+F GFKF+F+ DF+PSYKGYLQ LVTAAGG +L RKPVS + N
Subjt:  RIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

A0A445BCG2 Uncharacterized protein0.0e+0046.98Show/hide
Query:  LQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPF
        +Q   SS  KL+QIHAFS+RHGVP  NPDMGK+LIF++VS SAPMSYA  +F  +  PN+FTWNTMIRG+AES++P PA+  Y +M  S + PDTHT+PF
Subjt:  LQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPF

Query:  LLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMV
        LLKAVAK ++VR GE IHS+ +R GF+SL+FVQN L+HMY+V G  ESA++VFE ++ERDLVAWNSVINGFALNG  +EALTLFREM  EG+EPDGFT+V
Subjt:  LLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMV

Query:  SLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITF
        SLLSAC ELGA+ LG RVHV++LKVGLT+N H +N+LLD Y+KCG+IR+A +VF EM E++VVS+TSLIVGLAVNG G EALE F E+E + +KPSEITF
Subjt:  SLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITF

Query:  VGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSG
        VGVLYACSHCGM++EGFNYFRRMK+EYGI+PRIEH+GC+VDLL RAG V+ AYEYI+N+P+ PN+VIWRTLLG+CT+HGHL LGE+AR+ +L LEPKHSG
Subjt:  VGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSG

Query:  DYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHH
        DYVLLSN+YA+ERRW DVQ VRR+ML  GVRKTPGYS+VEL NRVYEF MGDRSHP+S++ YA+L KITELLK+EGY+P T NVLADIEEEEKE ALS+H
Subjt:  DYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHH

Query:  TEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSM
        +EKVAIAF ++NT P TPIR++KNLRVCADCH+AIKLISKV++REI++           L H                     S  C   +C+ CI KSM
Subjt:  TEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSM

Query:  KSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRR
        KS S+CPVCK+P+  REVRP+P   NLV+IYK ME +SG+N+F+TQN   +K S+            E+   Q + DC    R +A  +      ++   
Subjt:  KSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRR

Query:  KGHIKYKK--PSGRILLLPLKSAIAL----------------------------------------EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPT
        K  +  KK  P      LP K  + +                                         E+  PVLSPFFWLRE +  +  S Q    +   
Subjt:  KGHIKYKK--PSGRILLLPLKSAIAL----------------------------------------EEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPT

Query:  DSMAMDVLSFSDIKDSLDESPSKRKCVAS-HPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDNARTEIALLAAAPYEEPRNQNPNGSSN-
        +S      SFSD+KDS DE+P+K          T   L    L     R   PEL SSP KLQ      + +N       L  A  E  RN++ + + N 
Subjt:  DSMAMDVLSFSDIKDSLDESPSKRKCVAS-HPMTWISLIVKCLN--GLRELFPELCSSPFKLQ------VEDNARTEIALLAAAPYEEPRNQNPNGSSN-

Query:  -----QNG-GIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRS---------KKKMHFGIDANMGTLENVPADP-
             QNG  + DV+   +  P R         +Y+   ++  E  +Q+Q  +  +S   +   + RS         +K+    +       + V  D  
Subjt:  -----QNG-GIPDVV--FLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRS---------KKKMHFGIDANMGTLENVPADP-

Query:  ----INLGTPNGDLENFGTKTSASLEVEKVSQF---------------PKRVARMADPARKRTLVLMQ------------VSQFPEKSPKNGGARRAQRV
            I++   NG++ N    +  S   ++ ++F                +++  + D A + + +  Q            ++Q  +K+ K    R+ +  
Subjt:  ----INLGTPNGDLENFGTKTSASLEVEKVSQF---------------PKRVARMADPARKRTLVLMQ------------VSQFPEKSPKNGGARRAQRV

Query:  VDCR--------RQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSENLQ
        V  +        R +KK KL       S+  +Q +  +   P  +T  S      D       +EK S++             KHV    NC   +E  Q
Subjt:  VDCR--------RQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQANCSQLSENLQ

Query:  CSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSM
        C F         +GP++ +   KP+  +    SKV+H+H NC EWAPNVYF+ D AINLEAE+SRSRRIKC  CG KGAALGCYEKSCR+SFHVPCAK  
Subjt:  CSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSM

Query:  PQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTH
          C+WD ENFVMLCPLHT   LP + S  K                                K  +  SAL+  ER+ V+EFQR++ VTVL+KWD SVTH
Subjt:  PQCQWDTENFVMLCPLHTDSKLPSQGSDIKN------------------------------GKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTH

Query:  IIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSG
        +IASTDEN+ACKRTLK+LMGIL+GKW+L I+WIKAC++ M  ++E  +EI LD+HGI DGP LGR RVLN                         K+  G
Subjt:  IIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSG

Query:  FKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
        FKF+F+ DF+PSYKGYLQ LVTAAGG +L RKPVS + N
Subjt:  FKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

A0A498JSQ9 Uncharacterized protein0.0e+0046.98Show/hide
Query:  YSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESE
        +ST    + T P+N +  +L+KCI+LLQ C SS+ KL+QIHAFS+RHGVP +NPDMGK+LIFS+VSLSAP+SYA +IF QI+ PN+FTWNTMIRG+AE +
Subjt:  YSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESE

Query:  NPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALN
        NP P ++LY QMH SS+ PDTHT+PFLLKAVAKLMDV  GE IHS+ +RNGF+SL+FV+N+L+H+Y+  GH ESA++VFE + ERDLVAWNSVINGFALN
Subjt:  NPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALN

Query:  GMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAV
        G  NEALT+FR+M  EGVEPDGFTMVSLLSAC E+G +ALG R+HVYMLKVGLT N HA+NALLDLY+KCG+IR+A KVF+ M+E+SVVSWT+L+VGLAV
Subjt:  GMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAV

Query:  NGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGS
        NG G EALELF EL+ +GL P+EITFVGVLYACSHCGMVDEGF YFR MKE YGI PRIEH+GC+VDLL RAG V+ AYEYI N+PM PNAVIWRTLLG+
Subjt:  NGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGS

Query:  CTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKI
        CT+HG L LGE ARA+I  LEP HSGDYVLLSNLYASE RW DVQ VRRTML +GVRKTPGYS+VEL+N +YEF MGDRSHPQSEE YAMLA+IT LLK 
Subjt:  CTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKI

Query:  EGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLF-------ISMGDLGHLEKM-
        +GY+P T NVLADIEEEEKE ALS+H+EK+A+AF L+NT+P  PIRI KNLRVCADCHLA KLISKV++REI+V     F        S  D    E+  
Subjt:  EGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLF-------ISMGDLGHLEKM-

Query:  ---------------------------------------------GRELKCP-------ICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYR
                                                     G+  + P        C      ++ L C  +   ACI KSMKS SNCPVCK+PYR
Subjt:  ---------------------------------------------GRELKCP-------ICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYR

Query:  RREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK---KPSG
        RREVRPAPHMDNLVSIYKSME ASGINIFVTQN  S K S              DG +Q E D +  +    E  +     + LR KG  K K   K SG
Subjt:  RREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK---KPSG

Query:  RILLLP----------------------------------------LKSAIALEEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDI
         I + P                                        LK    L EKG+PVLSPFFWL ++D   E  +Q S  +Q TD  + +V +FSDI
Subjt:  RILLLP----------------------------------------LKSAIALEEKGQPVLSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDI

Query:  KDSLDES------PSKRKCVASHPMTWISLIVKCLNGLRELFPELCSSPFK-----LQVEDNARTEIAL--------------LAAAPYEEPRNQNPNG-
        KDS DE+      P +    ++H        +     +    PEL  SP K     +QV ++    + L              L+A      RN+   G 
Subjt:  KDSLDES------PSKRKCVASHPMTWISLIVKCLNGLRELFPELCSSPFK-----LQVEDNARTEIAL--------------LAAAPYEEPRNQNPNG-

Query:  --------SSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQKQE-----HDVNNSFGSLKSGTKRSKKKMHFGIDANMGTL-ENVPADPI
                +S  N    +        + +T R+    + T P  +    S+  E         N+  SL    KRSKK+ H  +     TL +N   D  
Subjt:  --------SSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQKQE-----HDVNNSFGSLKSGTKRSKKKMHFGIDANMGTL-ENVPADPI

Query:  NLGTPNGDLENFGTKTSASLEVEKVSQ--------FPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCR--------RQSKKQKLDSWT
                      K      V    Q          + V+ +   A K T   + +   P  + K   +R   R +  +        +++KK K+ S  
Subjt:  NLGTPNGDLENFGTKTSASLEVEKVSQ--------FPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCR--------RQSKKQKLDSWT

Query:  TNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQA---NCSQLSENLQCSF----------WKNGPLLQ
               + N  D+      T              G++  EKS   C + +    + +   +        C  +   +QC+F             G    
Subjt:  TNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYDNIIQEKHVDAQA---NCSQLSENLQCSF----------WKNGPLLQ

Query:  WKPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSK
         KP+  D     KV+HAH  C EWAPNVYF+ DTAINLEAEL+RSR+IKCSCC  KGAALGCYEKSCR+SFHV CAK MPQC+WDT+NFVMLCP+H  SK
Subjt:  WKPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSK

Query:  LPSQGSDIK------------------------------------NGKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTL
        LP++  + +                                    + K  +  S+LT AERE+VA F+RLSG+ VL+ WD +VTH+IASTDEN ACKRTL
Subjt:  LPSQGSDIK------------------------------------NGKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTL

Query:  KILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKG
        K+LMGIL+GKW+L ++WI AC +AM  ++E+ +EI +D++GIRDGPRLGR R+ N                         K+F GFKF+F+ DF+PSYKG
Subjt:  KILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKG

Query:  YLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAKFSSFTVLS
        YLQ LV AAGGT+L RKPV   Q    S+ + K  ++ + S
Subjt:  YLQQLVTAAGGTVLLRKPVSSNQNIPCSSPTAKFSSFTVLS

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.0e-21063.35Show/hide
Query:  VLRKCISLLQLCG-SSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSA--PMSYASRIFHQIQAP-NIFTWNTMIRGHAESENPRPAVELYCQMHT
        ++ KCI+LLQ  G SS  KL+QIHAFS+RHGV  ++ ++GK+LIF LVSL +  PMSYA ++F +I+ P N+F WNT+IRG+AE  N   A  LY +M  
Subjt:  VLRKCISLLQLCG-SSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSA--PMSYASRIFHQIQAP-NIFTWNTMIRGHAESENPRPAVELYCQMHT

Query:  SSIL-PDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREM
        S ++ PDTHT+PFL+KAV  + DVR+GE IHS+ IR+GF SL++VQNSL+H+Y+  G   SAY+VF+ + E+DLVAWNSVINGFA NG   EAL L+ EM
Subjt:  SSIL-PDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREM

Query:  GSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGE
         S+G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGLT+N H+SN LLDLY++CG + +A  +FDEM +K+ VSWTSLIVGLAVNG G EA+ELF  
Subjt:  GSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGE

Query:  LE-RKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEV
        +E  +GL P EITFVG+LYACSHCGMV EGF YFRRM+EEY I PRIEH GC+VDLL RAG+V+ AYEYI+++PM PN VIWRTLLG+CTVHG  +L E 
Subjt:  LE-RKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEV

Query:  ARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLA
        AR QIL LEP HSGDYVLLSN+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L ++T  L+ EGY+P+  NV  
Subjt:  ARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLA

Query:  DIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV
        D+EEEEKE A+ +H+EK+AIAF L++T  R+PI ++KNLRVCADCHLAIKL+SKV+ REI+V
Subjt:  DIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV

Q8RXD4 Protein BREAST CANCER SUSCEPTIBILITY 1 homolog9.3e-12433.98Show/hide
Query:  MGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEF
        M D  HLE+MGRELKCPICLSL NSA SL CNHVFC ACI KSMK D+ CPVCK+PY RRE+R APHMD+LVSIYK+ME ASGI +FV+QN  S      
Subjt:  MGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEF

Query:  LSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK-------------------KPSG----RILLLPLKSAIALEEKGQPV--
                  P D +KQV          +A + + +  N++  RKG    +                   KPS     R+ LL   SA +L +  + V  
Subjt:  LSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK-------------------KPSG----RILLLPLKSAIALEEKGQPV--

Query:  --------------------------LSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKR---------KCVASHPMTWISLI
                                  LSPFFWLR+ D + E S+Q+++ +Q   +  ++V SFSD+ DS  ESPSK              S    W    
Subjt:  --------------------------LSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKR---------KCVASHPMTWISLI

Query:  VKCLNGLRELFPELCSSPFKLQVEDNARTEIAL---------LAAAPYEEPR---------------NQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIE
               R   PE+  SP K +V    R EI L         +A++  +  +               +Q  N  S+    I +          +    + 
Subjt:  VKCLNGLRELFPELCSSPFKLQVEDNARTEIAL---------LAAAPYEEPR---------------NQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIE

Query:  WEYETYPKRETECLSQ--KQEHDVN-NSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRVARMADP
         +     KR T   +Q  + + D+N ++    K GTKR +            ++++ PA PI    PN    + GT+      V K  Q          P
Subjt:  WEYETYPKRETECLSQ--KQEHDVN-NSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRVARMADP

Query:  ARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCRRQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYD
         ++       + +   KS  +   +        +   KK KLDS     S+   Q   +     GL             N+G +KQ+  ++      + D
Subjt:  ARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCRRQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYD

Query:  NIIQEKHVDAQANCS-------------QLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSR
        + +Q     +  N S              L++   C+F +       +G +  +   +P++ D    SKV+H H NC EWAPNVYF+  T +NL+ EL+R
Subjt:  NIIQEKHVDAQANCS-------------QLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSR

Query:  SRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIK-------------------NGKA------------
        SRRI CSCCG KGAALGCY KSC+ SFHV CAK +P+C+WD   FVMLCPL    KLP + ++ K                   +GKA            
Subjt:  SRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIK-------------------NGKA------------

Query:  -----AVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRD
              +  S LT  E+  +AEF  LSGVT+ + WD +VTH+IAS +EN ACKRTLK +M IL+GKW+L I WIKAC++   ++ E+ +EIT+DVHGIR+
Subjt:  -----AVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRD

Query:  GPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
        GP LGR R L                          K+F+G KF+ + DF  +YKGYLQ L+ AAGGT+L R+PVSS+ N
Subjt:  GPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN

Q9CA54 Pentatricopeptide repeat-containing protein At1g746301.8e-12236.85Show/hide
Query:  LSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQ-MHT
        ++  +  C+SLL  C + +  L QIH   +++GV   +   GK ++   +S+S  + YA R+      P+ F +NT++RG++ES+ P  +V ++ + M  
Subjt:  LSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQ-MHT

Query:  SSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVIN-------------------
          + PD+ +F F++KAV     +R G  +H  A+++G +S LFV  +L+ MY   G  E A +VF+ + + +LVAWN+VI                    
Subjt:  SSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVIN-------------------

Query:  -------------------------------------------GFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG
                                                   G A NG  NE+   FRE+   G+ P+  ++  +LSAC + G+   G+ +H ++ K G
Subjt:  -------------------------------------------GFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG

Query:  LTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEK-SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKE
         +     +NAL+D+YS+CGN+  A  VF+ M+EK  +VSWTS+I GLA++G G EA+ LF E+   G+ P  I+F+ +L+ACSH G+++EG +YF  MK 
Subjt:  LTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEK-SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKE

Query:  EYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTM
         Y I P IEH+GC+VDL  R+GK++ AY++I  +P+PP A++WRTLLG+C+ HG++EL E  + ++  L+P +SGD VLLSN YA+  +W DV ++R++M
Subjt:  EYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTM

Query:  LMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIE-GYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKN
        +++ ++KT  +SLVE+   +Y+F  G++      E +  L +I   LK E GY P   + L D+EEEEKE  +S H+EK+A+AF L   S    IRI+KN
Subjt:  LMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIE-GYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKN

Query:  LRVCADCHLAIKLISKVFEREIIVSYSLLFISMGD
        LR+C DCH  +KL SKV+  EI+V     F S  D
Subjt:  LRVCADCHLAIKLISKVFEREIIVSYSLLFISMGD

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.3e-12142.03Show/hide
Query:  ASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAE
        A ++F +I   ++ +WN MI G+AE+ N + A+EL+  M  +++ PD  T   ++ A A+   + +G  +H     +GF S L + N+L+ +YS  G  E
Subjt:  ASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAE

Query:  SAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--VGLTQNPHASNALLDLYSKCG
        +A  +FE +  +D+++WN++I G+    +  EAL LF+EM   G  P+  TM+S+L AC  LGA+ +G  +HVY+ K   G+T       +L+D+Y+KCG
Subjt:  SAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--VGLTQNPHASNALLDLYSKCG

Query:  NIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCR
        +I  A +VF+ +  KS+ SW ++I G A++G  + + +LF  + + G++P +ITFVG+L ACSH GM+D G + FR M ++Y + P++EH+GC++DLL  
Subjt:  NIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCR

Query:  AGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV
        +G  ++A E I  + M P+ VIW +LL +C +HG++ELGE     ++ +EP++ G YVLLSN+YAS  RW +V   R  +  KG++K PG S +E+ + V
Subjt:  AGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV

Query:  YEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFERE
        +EFI+GD+ HP++ E Y ML ++  LL+  G++P T  VL ++EEE KE AL HH+EK+AIAF L++T P T + I+KNLRVC +CH A KLISK+++RE
Subjt:  YEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFERE

Query:  II
        II
Subjt:  II

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233303.0e-12241.88Show/hide
Query:  RIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESA
        R+F  +   ++ ++NT+I G+A+S     A+ +  +M T+ + PD+ T   +L   ++ +DV  G+ IH   IR G DS +++ +SLV MY+     E +
Subjt:  RIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESA

Query:  YQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRD
         +VF  +  RD ++WNS++ G+  NG  NEAL LFR+M +  V+P      S++ AC  L  + LG+++H Y+L+ G   N   ++AL+D+YSKCGNI+ 
Subjt:  YQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRD

Query:  AVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKV
        A K+FD M     VSWT++I+G A++G G+EA+ LF E++R+G+KP+++ FV VL ACSH G+VDE + YF  M + YG+   +EH+  + DLL RAGK+
Subjt:  AVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKV

Query:  RDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFI
         +AY +I  + + P   +W TLL SC+VH +LEL E    +I  ++ ++ G YVL+ N+YAS  RW ++  +R  M  KG+RK P  S +E+KN+ + F+
Subjt:  RDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFI

Query:  MGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV
         GDRSHP  ++    L  + E ++ EGY+  T  VL D++EE K   L  H+E++A+AF ++NT P T IR+ KN+R+C DCH+AIK ISK+ EREIIV
Subjt:  MGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV

Arabidopsis top hitse value%identityAlignment
AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-12336.85Show/hide
Query:  LSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQ-MHT
        ++  +  C+SLL  C + +  L QIH   +++GV   +   GK ++   +S+S  + YA R+      P+ F +NT++RG++ES+ P  +V ++ + M  
Subjt:  LSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQ-MHT

Query:  SSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVIN-------------------
          + PD+ +F F++KAV     +R G  +H  A+++G +S LFV  +L+ MY   G  E A +VF+ + + +LVAWN+VI                    
Subjt:  SSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVIN-------------------

Query:  -------------------------------------------GFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG
                                                   G A NG  NE+   FRE+   G+ P+  ++  +LSAC + G+   G+ +H ++ K G
Subjt:  -------------------------------------------GFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG

Query:  LTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEK-SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKE
         +     +NAL+D+YS+CGN+  A  VF+ M+EK  +VSWTS+I GLA++G G EA+ LF E+   G+ P  I+F+ +L+ACSH G+++EG +YF  MK 
Subjt:  LTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEK-SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKE

Query:  EYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTM
         Y I P IEH+GC+VDL  R+GK++ AY++I  +P+PP A++WRTLLG+C+ HG++EL E  + ++  L+P +SGD VLLSN YA+  +W DV ++R++M
Subjt:  EYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTM

Query:  LMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIE-GYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKN
        +++ ++KT  +SLVE+   +Y+F  G++      E +  L +I   LK E GY P   + L D+EEEEKE  +S H+EK+A+AF L   S    IRI+KN
Subjt:  LMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIE-GYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKN

Query:  LRVCADCHLAIKLISKVFEREIIVSYSLLFISMGD
        LR+C DCH  +KL SKV+  EI+V     F S  D
Subjt:  LRVCADCHLAIKLISKVFEREIIVSYSLLFISMGD

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-12341.88Show/hide
Query:  RIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESA
        R+F  +   ++ ++NT+I G+A+S     A+ +  +M T+ + PD+ T   +L   ++ +DV  G+ IH   IR G DS +++ +SLV MY+     E +
Subjt:  RIFHQIQAPNIFTWNTMIRGHAESENPRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESA

Query:  YQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRD
         +VF  +  RD ++WNS++ G+  NG  NEAL LFR+M +  V+P      S++ AC  L  + LG+++H Y+L+ G   N   ++AL+D+YSKCGNI+ 
Subjt:  YQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRD

Query:  AVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKV
        A K+FD M     VSWT++I+G A++G G+EA+ LF E++R+G+KP+++ FV VL ACSH G+VDE + YF  M + YG+   +EH+  + DLL RAGK+
Subjt:  AVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKV

Query:  RDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFI
         +AY +I  + + P   +W TLL SC+VH +LEL E    +I  ++ ++ G YVL+ N+YAS  RW ++  +R  M  KG+RK P  S +E+KN+ + F+
Subjt:  RDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFI

Query:  MGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV
         GDRSHP  ++    L  + E ++ EGY+  T  VL D++EE K   L  H+E++A+AF ++NT P T IR+ KN+R+C DCH+AIK ISK+ EREIIV
Subjt:  MGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-21263.35Show/hide
Query:  VLRKCISLLQLCG-SSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSA--PMSYASRIFHQIQAP-NIFTWNTMIRGHAESENPRPAVELYCQMHT
        ++ KCI+LLQ  G SS  KL+QIHAFS+RHGV  ++ ++GK+LIF LVSL +  PMSYA ++F +I+ P N+F WNT+IRG+AE  N   A  LY +M  
Subjt:  VLRKCISLLQLCG-SSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSA--PMSYASRIFHQIQAP-NIFTWNTMIRGHAESENPRPAVELYCQMHT

Query:  SSIL-PDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREM
        S ++ PDTHT+PFL+KAV  + DVR+GE IHS+ IR+GF SL++VQNSL+H+Y+  G   SAY+VF+ + E+DLVAWNSVINGFA NG   EAL L+ EM
Subjt:  SSIL-PDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREM

Query:  GSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGE
         S+G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGLT+N H+SN LLDLY++CG + +A  +FDEM +K+ VSWTSLIVGLAVNG G EA+ELF  
Subjt:  GSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGE

Query:  LE-RKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEV
        +E  +GL P EITFVG+LYACSHCGMV EGF YFRRM+EEY I PRIEH GC+VDLL RAG+V+ AYEYI+++PM PN VIWRTLLG+CTVHG  +L E 
Subjt:  LE-RKGLKPSEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEV

Query:  ARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLA
        AR QIL LEP HSGDYVLLSN+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L ++T  L+ EGY+P+  NV  
Subjt:  ARAQILLLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLA

Query:  DIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV
        D+EEEEKE A+ +H+EK+AIAF L++T  R+PI ++KNLRVCADCHLAIKL+SKV+ REI+V
Subjt:  DIEEEEKETALSHHTEKVAIAFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-17665.84Show/hide
Query:  LMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACV
        + DVR+GE IHS+ IR+GF SL++VQNSL+H+Y+  G   SAY+VF+ + E+DLVAWNSVINGFA NG   EAL L+ EM S+G++PDGFT+VSLLSAC 
Subjt:  LMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFREMGSEGVEPDGFTMVSLLSACV

Query:  ELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELE-RKGLKPSEITFVGVLYA
        ++GA+ LG+RVHVYM+KVGLT+N H+SN LLDLY++CG + +A  +FDEM +K+ VSWTSLIVGLAVNG G EA+ELF  +E  +GL P EITFVG+LYA
Subjt:  ELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELE-RKGLKPSEITFVGVLYA

Query:  CSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLS
        CSHCGMV EGF YFRRM+EEY I PRIEH GC+VDLL RAG+V+ AYEYI+++PM PN VIWRTLLG+CTVHG  +L E AR QIL LEP HSGDYVLLS
Subjt:  CSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLLS

Query:  NLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAI
        N+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L ++T  L+ EGY+P+  NV  D+EEEEKE A+ +H+EK+AI
Subjt:  NLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAI

Query:  AFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV
        AF L++T  R+PI ++KNLRVCADCHLAIKL+SKV+ REI+V
Subjt:  AFTLVNTSPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIV

AT4G21070.1 breast cancer susceptibility16.6e-12533.98Show/hide
Query:  MGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEF
        M D  HLE+MGRELKCPICLSL NSA SL CNHVFC ACI KSMK D+ CPVCK+PY RRE+R APHMD+LVSIYK+ME ASGI +FV+QN  S      
Subjt:  MGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMDNLVSIYKSMEAASGINIFVTQNLSSAKLSEF

Query:  LSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK-------------------KPSG----RILLLPLKSAIALEEKGQPV--
                  P D +KQV          +A + + +  N++  RKG    +                   KPS     R+ LL   SA +L +  + V  
Subjt:  LSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYK-------------------KPSG----RILLLPLKSAIALEEKGQPV--

Query:  --------------------------LSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKR---------KCVASHPMTWISLI
                                  LSPFFWLR+ D + E S+Q+++ +Q   +  ++V SFSD+ DS  ESPSK              S    W    
Subjt:  --------------------------LSPFFWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKR---------KCVASHPMTWISLI

Query:  VKCLNGLRELFPELCSSPFKLQVEDNARTEIAL---------LAAAPYEEPR---------------NQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIE
               R   PE+  SP K +V    R EI L         +A++  +  +               +Q  N  S+    I +          +    + 
Subjt:  VKCLNGLRELFPELCSSPFKLQVEDNARTEIAL---------LAAAPYEEPR---------------NQNPNGSSNQNGGIPDVVFLMCPLRKVTVRRIE

Query:  WEYETYPKRETECLSQ--KQEHDVN-NSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRVARMADP
         +     KR T   +Q  + + D+N ++    K GTKR +            ++++ PA PI    PN    + GT+      V K  Q          P
Subjt:  WEYETYPKRETECLSQ--KQEHDVN-NSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEKVSQFPKRVARMADP

Query:  ARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCRRQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYD
         ++       + +   KS  +   +        +   KK KLDS     S+   Q   +     GL             N+G +KQ+  ++      + D
Subjt:  ARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCRRQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIITSEYD

Query:  NIIQEKHVDAQANCS-------------QLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSR
        + +Q     +  N S              L++   C+F +       +G +  +   +P++ D    SKV+H H NC EWAPNVYF+  T +NL+ EL+R
Subjt:  NIIQEKHVDAQANCS-------------QLSENLQCSFWK-------NGPLLQW---KPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSR

Query:  SRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIK-------------------NGKA------------
        SRRI CSCCG KGAALGCY KSC+ SFHV CAK +P+C+WD   FVMLCPL    KLP + ++ K                   +GKA            
Subjt:  SRRIKCSCCGNKGAALGCYEKSCRKSFHVPCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIK-------------------NGKA------------

Query:  -----AVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRD
              +  S LT  E+  +AEF  LSGVT+ + WD +VTH+IAS +EN ACKRTLK +M IL+GKW+L I WIKAC++   ++ E+ +EIT+DVHGIR+
Subjt:  -----AVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIKACIQAMGHIEEQRFEITLDVHGIRD

Query:  GPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN
        GP LGR R L                          K+F+G KF+ + DF  +YKGYLQ L+ AAGGT+L R+PVSS+ N
Subjt:  GPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCTCAAACCAACCATCATTCCTCTACTCCACCGCCCGCTCCGTCGCTTCAACTTTCCCGGAAAATCACCTCTCTTTCGTTCTCAGAAAATGCATCTCTCTCCTACA
ACTCTGCGGCTCCTCCCAATTCAAGCTGAAGCAAATCCACGCCTTCTCCGTCAGGCACGGCGTCCCACCCACCAACCCCGACATGGGCAAGTATCTCATCTTCTCCCTCG
TGTCCCTCTCGGCTCCCATGTCCTACGCGAGCCGAATTTTCCATCAGATTCAAGCACCCAATATCTTCACATGGAACACCATGATTAGAGGACATGCCGAGAGCGAGAAT
CCGAGGCCCGCCGTGGAGTTGTATTGCCAAATGCACACGTCTTCGATTCTGCCTGATACGCATACTTTCCCCTTTCTTTTGAAGGCTGTTGCTAAGTTAATGGATGTTAG
AGTAGGCGAGGGGATTCACTCGATTGCCATTAGAAATGGGTTCGATTCATTGCTTTTTGTTCAGAATTCGTTGGTTCATATGTATTCGGTTTTTGGGCATGCCGAGAGTG
CATACCAGGTGTTTGAGTTTGTGCTTGAAAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCCAACGAAGCTCTGACCCTTTTCAGA
GAAATGGGTTCCGAGGGCGTGGAGCCAGATGGGTTCACCATGGTTAGTTTGTTATCTGCTTGTGTTGAGCTTGGCGCTATGGCCTTGGGGGAGAGGGTTCATGTGTACAT
GTTGAAGGTTGGTTTAACACAGAACCCGCATGCTAGCAATGCCCTTCTTGATCTCTATTCCAAATGTGGGAACATTAGAGATGCAGTGAAGGTGTTTGATGAAATGGAAG
AAAAAAGTGTGGTTTCTTGGACCTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGCAATGAAGCACTTGAGCTATTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCT
AGTGAGATCACATTTGTTGGAGTCTTGTATGCTTGTAGCCATTGTGGGATGGTTGATGAAGGCTTCAATTACTTTAGAAGGATGAAAGAAGAATACGGCATCTTGCCAAG
AATAGAGCACCATGGGTGTATTGTTGATTTGCTGTGCAGGGCGGGCAAGGTCAGAGATGCTTATGAGTATATCCGAAACTTGCCGATGCCACCCAATGCTGTCATATGGA
GGACCTTACTGGGATCATGCACAGTCCATGGGCATTTGGAACTGGGTGAGGTTGCAAGAGCTCAAATCCTACTCTTAGAACCGAAACATAGTGGCGACTATGTCCTTCTC
TCGAACCTTTATGCATCGGAGCGACGTTGGCTGGACGTGCAAAACGTGAGGAGGACAATGCTTATGAAAGGTGTGAGGAAAACTCCCGGGTATAGCCTTGTCGAGTTGAA
AAATCGTGTTTATGAATTTATCATGGGTGATAGATCTCATCCTCAAAGTGAGGAGACATATGCAATGCTAGCAAAGATCACGGAGTTGTTGAAAATTGAAGGCTACATTC
CTCGCACGGTTAATGTTCTTGCCGATATAGAAGAGGAAGAAAAGGAGACGGCTTTGTCTCATCATACGGAGAAAGTTGCAATTGCTTTTACGTTAGTTAACACGTCACCA
AGAACTCCCATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCAATCAAACTCATATCCAAGGTTTTTGAACGAGAGATCATCGTATCCTATTCTTT
ATTGTTTATCTCAATGGGGGATCTCGGCCACCTGGAGAAGATGGGAAGAGAGCTCAAATGCCCAATTTGTTTGAGTCTTTTAAACTCTGCCGCTTCACTGGGATGCAACC
ACGTATTCTGCTATGCATGTATCGAGAAATCGATGAAATCGGATTCAAACTGCCCAGTTTGTAAGGTGCCTTATCGGCGAAGAGAAGTTCGTCCTGCTCCACACATGGAT
AATTTGGTGAGCATTTACAAAAGCATGGAAGCTGCTTCGGGAATTAATATATTTGTTACTCAGAACTTGTCTTCTGCTAAATTATCTGAATTTCTAAGCACGCTCTTTTT
CTTTGCTATTCCACCAGAAGATGGAGACAAACAAGTTGAAGGTGATTGCAATGGTTCAAAACGGCTTAATGCAGAAATCACTGAGATCACAGCTTATAATCAAAAACTTC
GAAGAAAGGGCCACATAAAGTACAAAAAACCAAGCGGAAGAATTCTGCTTCTTCCCCTAAAAAGTGCAATTGCTCTGGAAGAGAAAGGCCAGCCAGTTCTTTCACCTTTC
TTTTGGTTGAGAGAAAGAGATGAAGAAGATGAAAAGTCGAATCAGCAGTCTGATTTGGAGCAACCTACGGACTCAATGGCAATGGATGTTCTTTCCTTCAGTGATATCAA
GGATTCACTGGATGAAAGCCCTTCAAAGAGGAAGTGTGTGGCAAGCCATCCTATGACTTGGATCTCTTTGATAGTGAAATGTTTGAATGGACTCAGAGAGCTGTTCCCCG
AACTTTGTTCGAGTCCCTTTAAACTGCAGGTTGAAGACAATGCTAGAACTGAAATAGCATTATTAGCAGCAGCACCTTATGAAGAACCTAGGAATCAAAATCCAAATGGA
AGTTCTAATCAAAATGGTGGTATCCCGGACGTGGTGTTCCTGATGTGCCCCCTCCGGAAGGTAACAGTACGAAGGATCGAATGGGAGTACGAAACTTACCCGAAGAGGGA
AACTGAATGTTTGAGTCAAAAGCAGGAACACGATGTTAATAACAGCTTTGGTAGTTTGAAAAGTGGTACCAAAAGAAGCAAGAAAAAAATGCACTTTGGTATTGATGCAA
ATATGGGGACTCTTGAAAATGTTCCTGCCGATCCCATTAATTTGGGAACTCCAAACGGTGATCTTGAGAATTTTGGAACCAAGACATCAGCTTCTCTAGAAGTGGAAAAG
GTTAGTCAATTCCCGAAAAGAGTTGCAAGAATGGCAGATCCAGCAAGAAAGCGCACTTTGGTATTGATGCAAGTTAGTCAATTCCCTGAAAAGAGTCCCAAGAATGGCGG
AGCCCGCAGAGCCCAGAGGGTGGTCGACTGCCGCAGGCAGTCTAAGAAACAAAAGCTGGATTCATGGACGACAAACTGCTCGAACTTCTTCAATCAGAATCAGCACGATG
ATTGTGCTATTCCTGGTTTGACCACCGCACCTTCTGCAATTGCTACATCTACTGATCAAAATAGGGGGCATGAGAAACAAGAGAAAAGTTCCTCTGTTTGCATAATAACT
AGTGAATATGATAACATCATCCAAGAGAAGCATGTTGATGCTCAGGCAAATTGCAGTCAGCTCTCTGAAAATCTTCAATGCAGTTTCTGGAAGAATGGTCCACTACTTCA
ATGGAAGCCGATTAATACAGATGACATTAAAAACTCGAAGGTCGTACATGCACACTGGAATTGTGTTGAATGGGCGCCCAATGTTTACTTTGATGGCGACACTGCGATTA
ACCTTGAAGCTGAGCTCAGTCGAAGTCGGAGAATTAAATGTAGTTGCTGTGGAAATAAGGGGGCTGCTCTTGGGTGTTATGAGAAGAGTTGTCGCAAGAGCTTTCACGTT
CCTTGTGCAAAGTCGATGCCTCAATGTCAATGGGATACTGAAAATTTTGTGATGTTATGCCCTCTTCATACTGATTCTAAACTGCCAAGCCAAGGTTCGGACATCAAGAA
CGGAAAAGCAGCTGTGCTCCTAAGCGCTCTCACCACGGCAGAGAGGGAAACTGTTGCTGAATTTCAGAGATTATCAGGAGTTACAGTGCTACAAAAATGGGATGATAGCG
TTACACATATTATTGCATCAACAGATGAAAATGAGGCATGTAAAAGAACCCTCAAAATTTTGATGGGTATTTTGAAGGGAAAATGGGTATTGGGTATTCAATGGATTAAG
GCTTGTATACAAGCCATGGGACATATAGAGGAACAGCGCTTTGAGATTACTCTCGACGTCCACGGAATCAGAGACGGCCCTCGACTTGGAAGATTTAGAGTCCTGAACAA
TTATCAAGTAAATATTGATATAGGCGATTGTAGAATTGCACAGTTAACAGTTTGTCTTCTTTTTGCAGCAACCAAAATTTTTTCTGGGTTCAAGTTCTTCTTTATAGCGG
ATTTTGTACCTTCATACAAAGGATATCTCCAACAACTTGTAACTGCAGCTGGAGGAACTGTTCTGCTCAGAAAACCAGTTTCAAGCAACCAAAACATCCCTTGTTCTTCA
CCTACTGCCAAGTTTTCATCATTTACAGTCTTGAGCTTCCTGATCAATGCGATCTGGCCGAAAAGACAACATTCTCAGTCGCAGGCGTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCTCAAACCAACCATCATTCCTCTACTCCACCGCCCGCTCCGTCGCTTCAACTTTCCCGGAAAATCACCTCTCTTTCGTTCTCAGAAAATGCATCTCTCTCCTACA
ACTCTGCGGCTCCTCCCAATTCAAGCTGAAGCAAATCCACGCCTTCTCCGTCAGGCACGGCGTCCCACCCACCAACCCCGACATGGGCAAGTATCTCATCTTCTCCCTCG
TGTCCCTCTCGGCTCCCATGTCCTACGCGAGCCGAATTTTCCATCAGATTCAAGCACCCAATATCTTCACATGGAACACCATGATTAGAGGACATGCCGAGAGCGAGAAT
CCGAGGCCCGCCGTGGAGTTGTATTGCCAAATGCACACGTCTTCGATTCTGCCTGATACGCATACTTTCCCCTTTCTTTTGAAGGCTGTTGCTAAGTTAATGGATGTTAG
AGTAGGCGAGGGGATTCACTCGATTGCCATTAGAAATGGGTTCGATTCATTGCTTTTTGTTCAGAATTCGTTGGTTCATATGTATTCGGTTTTTGGGCATGCCGAGAGTG
CATACCAGGTGTTTGAGTTTGTGCTTGAAAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCCAACGAAGCTCTGACCCTTTTCAGA
GAAATGGGTTCCGAGGGCGTGGAGCCAGATGGGTTCACCATGGTTAGTTTGTTATCTGCTTGTGTTGAGCTTGGCGCTATGGCCTTGGGGGAGAGGGTTCATGTGTACAT
GTTGAAGGTTGGTTTAACACAGAACCCGCATGCTAGCAATGCCCTTCTTGATCTCTATTCCAAATGTGGGAACATTAGAGATGCAGTGAAGGTGTTTGATGAAATGGAAG
AAAAAAGTGTGGTTTCTTGGACCTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGCAATGAAGCACTTGAGCTATTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCT
AGTGAGATCACATTTGTTGGAGTCTTGTATGCTTGTAGCCATTGTGGGATGGTTGATGAAGGCTTCAATTACTTTAGAAGGATGAAAGAAGAATACGGCATCTTGCCAAG
AATAGAGCACCATGGGTGTATTGTTGATTTGCTGTGCAGGGCGGGCAAGGTCAGAGATGCTTATGAGTATATCCGAAACTTGCCGATGCCACCCAATGCTGTCATATGGA
GGACCTTACTGGGATCATGCACAGTCCATGGGCATTTGGAACTGGGTGAGGTTGCAAGAGCTCAAATCCTACTCTTAGAACCGAAACATAGTGGCGACTATGTCCTTCTC
TCGAACCTTTATGCATCGGAGCGACGTTGGCTGGACGTGCAAAACGTGAGGAGGACAATGCTTATGAAAGGTGTGAGGAAAACTCCCGGGTATAGCCTTGTCGAGTTGAA
AAATCGTGTTTATGAATTTATCATGGGTGATAGATCTCATCCTCAAAGTGAGGAGACATATGCAATGCTAGCAAAGATCACGGAGTTGTTGAAAATTGAAGGCTACATTC
CTCGCACGGTTAATGTTCTTGCCGATATAGAAGAGGAAGAAAAGGAGACGGCTTTGTCTCATCATACGGAGAAAGTTGCAATTGCTTTTACGTTAGTTAACACGTCACCA
AGAACTCCCATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCAATCAAACTCATATCCAAGGTTTTTGAACGAGAGATCATCGTATCCTATTCTTT
ATTGTTTATCTCAATGGGGGATCTCGGCCACCTGGAGAAGATGGGAAGAGAGCTCAAATGCCCAATTTGTTTGAGTCTTTTAAACTCTGCCGCTTCACTGGGATGCAACC
ACGTATTCTGCTATGCATGTATCGAGAAATCGATGAAATCGGATTCAAACTGCCCAGTTTGTAAGGTGCCTTATCGGCGAAGAGAAGTTCGTCCTGCTCCACACATGGAT
AATTTGGTGAGCATTTACAAAAGCATGGAAGCTGCTTCGGGAATTAATATATTTGTTACTCAGAACTTGTCTTCTGCTAAATTATCTGAATTTCTAAGCACGCTCTTTTT
CTTTGCTATTCCACCAGAAGATGGAGACAAACAAGTTGAAGGTGATTGCAATGGTTCAAAACGGCTTAATGCAGAAATCACTGAGATCACAGCTTATAATCAAAAACTTC
GAAGAAAGGGCCACATAAAGTACAAAAAACCAAGCGGAAGAATTCTGCTTCTTCCCCTAAAAAGTGCAATTGCTCTGGAAGAGAAAGGCCAGCCAGTTCTTTCACCTTTC
TTTTGGTTGAGAGAAAGAGATGAAGAAGATGAAAAGTCGAATCAGCAGTCTGATTTGGAGCAACCTACGGACTCAATGGCAATGGATGTTCTTTCCTTCAGTGATATCAA
GGATTCACTGGATGAAAGCCCTTCAAAGAGGAAGTGTGTGGCAAGCCATCCTATGACTTGGATCTCTTTGATAGTGAAATGTTTGAATGGACTCAGAGAGCTGTTCCCCG
AACTTTGTTCGAGTCCCTTTAAACTGCAGGTTGAAGACAATGCTAGAACTGAAATAGCATTATTAGCAGCAGCACCTTATGAAGAACCTAGGAATCAAAATCCAAATGGA
AGTTCTAATCAAAATGGTGGTATCCCGGACGTGGTGTTCCTGATGTGCCCCCTCCGGAAGGTAACAGTACGAAGGATCGAATGGGAGTACGAAACTTACCCGAAGAGGGA
AACTGAATGTTTGAGTCAAAAGCAGGAACACGATGTTAATAACAGCTTTGGTAGTTTGAAAAGTGGTACCAAAAGAAGCAAGAAAAAAATGCACTTTGGTATTGATGCAA
ATATGGGGACTCTTGAAAATGTTCCTGCCGATCCCATTAATTTGGGAACTCCAAACGGTGATCTTGAGAATTTTGGAACCAAGACATCAGCTTCTCTAGAAGTGGAAAAG
GTTAGTCAATTCCCGAAAAGAGTTGCAAGAATGGCAGATCCAGCAAGAAAGCGCACTTTGGTATTGATGCAAGTTAGTCAATTCCCTGAAAAGAGTCCCAAGAATGGCGG
AGCCCGCAGAGCCCAGAGGGTGGTCGACTGCCGCAGGCAGTCTAAGAAACAAAAGCTGGATTCATGGACGACAAACTGCTCGAACTTCTTCAATCAGAATCAGCACGATG
ATTGTGCTATTCCTGGTTTGACCACCGCACCTTCTGCAATTGCTACATCTACTGATCAAAATAGGGGGCATGAGAAACAAGAGAAAAGTTCCTCTGTTTGCATAATAACT
AGTGAATATGATAACATCATCCAAGAGAAGCATGTTGATGCTCAGGCAAATTGCAGTCAGCTCTCTGAAAATCTTCAATGCAGTTTCTGGAAGAATGGTCCACTACTTCA
ATGGAAGCCGATTAATACAGATGACATTAAAAACTCGAAGGTCGTACATGCACACTGGAATTGTGTTGAATGGGCGCCCAATGTTTACTTTGATGGCGACACTGCGATTA
ACCTTGAAGCTGAGCTCAGTCGAAGTCGGAGAATTAAATGTAGTTGCTGTGGAAATAAGGGGGCTGCTCTTGGGTGTTATGAGAAGAGTTGTCGCAAGAGCTTTCACGTT
CCTTGTGCAAAGTCGATGCCTCAATGTCAATGGGATACTGAAAATTTTGTGATGTTATGCCCTCTTCATACTGATTCTAAACTGCCAAGCCAAGGTTCGGACATCAAGAA
CGGAAAAGCAGCTGTGCTCCTAAGCGCTCTCACCACGGCAGAGAGGGAAACTGTTGCTGAATTTCAGAGATTATCAGGAGTTACAGTGCTACAAAAATGGGATGATAGCG
TTACACATATTATTGCATCAACAGATGAAAATGAGGCATGTAAAAGAACCCTCAAAATTTTGATGGGTATTTTGAAGGGAAAATGGGTATTGGGTATTCAATGGATTAAG
GCTTGTATACAAGCCATGGGACATATAGAGGAACAGCGCTTTGAGATTACTCTCGACGTCCACGGAATCAGAGACGGCCCTCGACTTGGAAGATTTAGAGTCCTGAACAA
TTATCAAGTAAATATTGATATAGGCGATTGTAGAATTGCACAGTTAACAGTTTGTCTTCTTTTTGCAGCAACCAAAATTTTTTCTGGGTTCAAGTTCTTCTTTATAGCGG
ATTTTGTACCTTCATACAAAGGATATCTCCAACAACTTGTAACTGCAGCTGGAGGAACTGTTCTGCTCAGAAAACCAGTTTCAAGCAACCAAAACATCCCTTGTTCTTCA
CCTACTGCCAAGTTTTCATCATTTACAGTCTTGAGCTTCCTGATCAATGCGATCTGGCCGAAAAGACAACATTCTCAGTCGCAGGCGTTGTGA
Protein sequenceShow/hide protein sequence
MRSNQPSFLYSTARSVASTFPENHLSFVLRKCISLLQLCGSSQFKLKQIHAFSVRHGVPPTNPDMGKYLIFSLVSLSAPMSYASRIFHQIQAPNIFTWNTMIRGHAESEN
PRPAVELYCQMHTSSILPDTHTFPFLLKAVAKLMDVRVGEGIHSIAIRNGFDSLLFVQNSLVHMYSVFGHAESAYQVFEFVLERDLVAWNSVINGFALNGMANEALTLFR
EMGSEGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLTQNPHASNALLDLYSKCGNIRDAVKVFDEMEEKSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKP
SEITFVGVLYACSHCGMVDEGFNYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNLPMPPNAVIWRTLLGSCTVHGHLELGEVARAQILLLEPKHSGDYVLL
SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYIPRTVNVLADIEEEEKETALSHHTEKVAIAFTLVNTSP
RTPIRIMKNLRVCADCHLAIKLISKVFEREIIVSYSLLFISMGDLGHLEKMGRELKCPICLSLLNSAASLGCNHVFCYACIEKSMKSDSNCPVCKVPYRRREVRPAPHMD
NLVSIYKSMEAASGINIFVTQNLSSAKLSEFLSTLFFFAIPPEDGDKQVEGDCNGSKRLNAEITEITAYNQKLRRKGHIKYKKPSGRILLLPLKSAIALEEKGQPVLSPF
FWLRERDEEDEKSNQQSDLEQPTDSMAMDVLSFSDIKDSLDESPSKRKCVASHPMTWISLIVKCLNGLRELFPELCSSPFKLQVEDNARTEIALLAAAPYEEPRNQNPNG
SSNQNGGIPDVVFLMCPLRKVTVRRIEWEYETYPKRETECLSQKQEHDVNNSFGSLKSGTKRSKKKMHFGIDANMGTLENVPADPINLGTPNGDLENFGTKTSASLEVEK
VSQFPKRVARMADPARKRTLVLMQVSQFPEKSPKNGGARRAQRVVDCRRQSKKQKLDSWTTNCSNFFNQNQHDDCAIPGLTTAPSAIATSTDQNRGHEKQEKSSSVCIIT
SEYDNIIQEKHVDAQANCSQLSENLQCSFWKNGPLLQWKPINTDDIKNSKVVHAHWNCVEWAPNVYFDGDTAINLEAELSRSRRIKCSCCGNKGAALGCYEKSCRKSFHV
PCAKSMPQCQWDTENFVMLCPLHTDSKLPSQGSDIKNGKAAVLLSALTTAERETVAEFQRLSGVTVLQKWDDSVTHIIASTDENEACKRTLKILMGILKGKWVLGIQWIK
ACIQAMGHIEEQRFEITLDVHGIRDGPRLGRFRVLNNYQVNIDIGDCRIAQLTVCLLFAATKIFSGFKFFFIADFVPSYKGYLQQLVTAAGGTVLLRKPVSSNQNIPCSS
PTAKFSSFTVLSFLINAIWPKRQHSQSQAL