; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G193910 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G193910
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCiama_Chr10:28842705..28860474
RNA-Seq ExpressionCaUC10G193910
SyntenyCaUC10G193910
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6531962.1 hypothetical protein ZIOFF_005799 [Zingiber officinale]4.8e-29443.15Show/hide
Query:  FGTTAALSDRYIFFEGNNLATWPSSCIYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNEL
        F   + + + Y+  +  +   +  +  ++ S  +SS  GA +  +++  EDGF ++DE  P+   +E  +   +NEE  T G  +D + D + +      
Subjt:  FGTTAALSDRYIFFEGNNLATWPSSCIYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNEL

Query:  DLPEVETELAEKISTKRASSELFHAINGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVH
                     S K     L  AI  AP  SV  V  KWV EG  + R  +S  MLNLRKRR++ KALQF +WLEA+ +++ +E DYASRL++I K+ 
Subjt:  DLPEVETELAEKISTKRASSELFHAINGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVH

Query:  GLHKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGL
        GL KAE YI KIP S +GEV+YRTLLANCV  NN++KAEEVFN++KDLG P+T F CNQLL+L+KR D++KM + L +MEKENIKP+L T+++LID KG 
Subjt:  GLHKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGL

Query:  SNDMLGMEQVVDTMKAEGVEIDVNVLSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAW
        +ND+ GMEQ++DTMKAE ++ D+ +   +AK+Y S GL  KA+ +LKEME  + + +    ++LLPLY  L   DDV R+WEIC+ + R++E +AAI AW
Subjt:  SNDMLGMEQVVDTMKAEGVEIDVNVLSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAW

Query:  GKLKNVPEAEKIFDRVVKTWKKLSPKQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYM
        GKL N+ EAE++F+ ++K   K+S + Y T+L VYA+NK L+KGKD  K+M D GC IGP +WDA++ LY E GE+EKADS L +A Q+++ KP ++SYM
Subjt:  GKLKNVPEAEKIFDRVVKTWKKLSPKQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYM

Query:  VIMDHYARRGDVHNAEKIFHNMRLWGYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVN------------------
         ++D Y+ +GD+ NAEKIF  +R  GYV RL+ +Q L+QAY NAK PAYG ++RMKA+N+ PN+A+  +L   DAL  T ++                  
Subjt:  VIMDHYARRGDVHNAEKIFHNMRLWGYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVN------------------

Query:  -----------------------------------RGYRVRTSYVFGKLEV---------------------PYSWEGNVA----GFGTTAALSDR----
                                              R  +S  +G   +                     P S +        G GT  + S R    
Subjt:  -----------------------------------RGYRVRTSYVFGKLEV---------------------PYSWEGNVA----GFGTTAALSDR----

Query:  CIFFEGNNLAT----------WPSSGI--------------------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELT
        C  F G+ L+              SGI                    ++    LSS  GA+ S ++++ ED FSELDE      P ED    D+  E   
Subjt:  CIFFEGNNLAT----------WPSSGI--------------------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELT

Query:  CGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQ
           E + ++ +  G  +          +  +   KR  S L  AI+ AP  S+ IVL+KWV +G  + +  +S AML LR+ R++ KALQF EWLE + +
Subjt:  CGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQ

Query:  LDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEK
        +DF+ERDYAS LDL+ K++G  KAE Y+ KIP S QGEV+YRTLL NCV+ANNV+KAEEVFNK+KDLG P+T F CNQLL+L+KR+D++K+ADVL++MEK
Subjt:  LDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEK

Query:  ENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLW
        ENIKPSL TY++L+D KG ++D++GME+++DTMK EG++ D  +  +  K+Y+  GL ++A+A+LK+ME  + + +  A ++LLP+Y  L   ++V R+W
Subjt:  ENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLW

Query:  KICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADS
        +IC+ + R++EC+ AI AWGKL N+ +AE++F+ ++K W K+S K Y  +L VY++NK+L+KGK+  K+M+D GC IGP  WDA+VKLYVEAGE+EKA S
Subjt:  KICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADS

Query:  FLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
         L KA Q+   KPLF SYM + + Y+++GD+HNAEKIF  +R  GY  R   +Q L+QAY+NAK PAYG ++RMKADN+FPNKA+  +L   DA  K+ +
Subjt:  FLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

KAG6536249.1 hypothetical protein ZIOFF_001300 [Zingiber officinale]2.5e-29543.91Show/hide
Query:  YISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHAIN
        ++ S  +SS  GA +  +++  EDGF ++DE  P+   +E  +   +NEE  T G  +D + D + +                   S K     L  AI 
Subjt:  YISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHAIN

Query:  GAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLLA
         AP  SV  V  KWV EG  + R  +S  MLNLRKRR++ KALQF +WLEA+ +++ +E DYASR+++I K+ GL KAE YI KIP S +GEV+YRTLLA
Subjt:  GAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLLA

Query:  NCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNVLS
        NCV  NN++KAEEVFN++KDLG P+T F CNQLL+L+KR D++KM + L +MEKENIKP+L TY++LID KG +ND+ GMEQ++DTMKAE V+ D+ +  
Subjt:  NCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNVLS

Query:  ILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSPKQ
         +AK+Y S GL  KA+ +LKEME  + + +    ++LLPLY  L   DDV R+WEIC+ + R++E +AAI AWGKL N+ EAE++F+ ++K   K+S + 
Subjt:  ILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSPKQ

Query:  YSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGY
        Y T+L VYA+NK L+KGKD  K+M D GC IGP +WDA++ LY E GE+EKADS L +A Q+++ KP ++SYM ++D Y+ +GD+ NAEKIF  +R  GY
Subjt:  YSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGY

Query:  VARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVN-----------------------------------------RGYR
        V RL+ +Q L+QAY NAK PAYG R+RMKA+N+ PN+A+  +L   DAL  T ++                                           + 
Subjt:  VARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVN-----------------------------------------RGYR

Query:  VRTSYVFGKLEVPYSW----EGNVAGFGTTAALSDRCIF-------------------------------------FEGNNLAT----------WPSSGI
           + V+  LE   SW       +    T+ A+S  C F                                     F G+ L+              SGI
Subjt:  VRTSYVFGKLEVPYSW----EGNVAGFGTTAALSDRCIF-------------------------------------FEGNNLAT----------WPSSGI

Query:  --------------------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETE
                            ++    LSS  GA+ S ++++ ED FSELDE      P ED   +D+  E      E + ++ +  G  +          
Subjt:  --------------------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETE

Query:  LAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERY
        +  +   KR  S L  AI+ AP  S+  VL+KWV +G  + +  +S AML LR+ R++ KALQF EWLE + ++DF+ERDYAS LDL+ K++G  KAE Y
Subjt:  LAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERY

Query:  IAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGME
        + KIP S QGEV+YRTL  NCV+ANNV+KAEEVFNK+KDLG P+T F CNQLL+L+KR+D++K+ADVL++MEKENIKPSL TY++L+D KG ++D++GME
Subjt:  IAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGME

Query:  QVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPE
        +++DTMKAEG++ D  +  +  K+Y   GL ++A+A+LK+ME  + + +  A ++LLP+Y  L   ++V R+W+IC+ + R++EC+ AI AWGKL N+ +
Subjt:  QVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPE

Query:  AEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYAR
        AE++F+ ++K W K+S K Y  +L VY++NK+L+KGK+  K+M+D GC IGP  WDA+VKLYVEAGE+EKA S L KA Q    KPLF SYM + + Y++
Subjt:  AEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYAR

Query:  RGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
        +GD+HNAEKIF  +R  GY  R   +Q L+QAY+NAK PAYG ++RMKADN+FPNKA+  +L   DA  K+ +
Subjt:  RGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

XP_023905439.1 uncharacterized protein LOC112017200 [Quercus suber]0.0e+0050.25Show/hide
Query:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEI-DDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHA
        I++ S  L S A A +S +   +EDG  E  ET  +   +ED    ++  EELT    I DDD  D + G             + E+   K+AS EL  A
Subjt:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEI-DDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHA

Query:  INGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTL
        I  AP  SV  VLDKW++EG +++R  +S  +++LRKRRM+GK LQFSEWLE + Q D  E+D+A+RLDLIAKVHG+ KAE+YI  IPK F+GE++YRTL
Subjt:  INGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTL

Query:  LANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNV
        L  CV   N+KKAE VF K+++L FP+T  ACNQ+++L+KR DK ++AD LLLMEKEN+KPSLFTY++L+D KG SND++GMEQ+ +TMKAEGV  D++V
Subjt:  LANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNV

Query:  LSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSP
        L++LAKHY SGGL+DKAK +LKE+E+   K S    R+LLPLY  L + DDV R+W+ CE +P++ ECMAAI AWGKL  V +AE +F+ +++ WKKLS 
Subjt:  LSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSP

Query:  KQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLW
        + YS +L VY ++K+LTKGK+ VK+M D G  +GPL+WDA+V+LY+ AG+V+KADS L KAVQK++ +PLF +Y V+M+ YA RGD+HN EK+FH M+  
Subjt:  KQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLW

Query:  GYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVNR---------------------------------GYRVRTSYV
        GY  RL+  + LI+AYVNAK P YG RERM+A+N+FPNKA A +LAQ D  RK   +R                                  Y +  S V
Subjt:  GYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVNR---------------------------------GYRVRTSYV

Query:  F-GKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGS
        F  + +   +  G+ A     + L             T    GI++ S  L S A A +S +  ++EDG  E  ET  +   +ED    ++  EELT   
Subjt:  F-GKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGS

Query:  EIDDDD--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQL
         I DDD  D + G             + E+   K+AS EL  AI+ AP  SV  VLDKW++EG +++R  +S  +++LR+RRM+GK LQFSEWLE + Q 
Subjt:  EIDDDD--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQL

Query:  DFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKE
        D  E+D+A+RLDLIAKVHG+ KAE+YI  IPK F+GE++YRTLL  CV   N+KKAE VF K+++L FP+T  ACNQ+++L+KRLDK ++AD+LLLMEKE
Subjt:  DFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKE

Query:  NIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWK
        N+KPSLFTY++L+D KG SND++GMEQ+ +TMKAEGV  D +VL++LAKHY SGGL+D+AK +LKE+E+   K S    R+LLP+Y  L + D+V R+WK
Subjt:  NIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWK

Query:  ICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSF
         CE +P++ ECMAAI AWGKL  + +AE +F+ +++ WKKLS + YS +L+VY ++K+LTKGKE VK+M D G  +GPL WDA+V+LY+ AG+V+KADS 
Subjt:  ICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSF

Query:  LVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRK
        L KAVQKN+ +PLF +Y V+M+ YA RGD+HN EK+FH M+  GY  R    + LI+AY+NAK P YG +ERM+A+N+FPNKA A++LAQ D FRK
Subjt:  LVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRK

XP_030941952.1 uncharacterized protein LOC115966970 [Quercus lobata]0.0e+0049.96Show/hide
Query:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHAI
        I++ SH L S+A A +SG+   +E G  E  ET  +   +EDS A ++  E+L   SE     DD+ D  +           + E    K+AS EL  AI
Subjt:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHAI

Query:  NGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLL
          A   SV  VL KW  EG +++R   S  ++ LRK +M+GKALQFSEWLE + Q D  E+DYA RLDLIAKVHG+  AE+YI KIP  F+GE++Y  LL
Subjt:  NGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLL

Query:  ANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNVL
          CV+  N++KAE VF KM++L FP+T  ACNQ+++L+KR DKR++ D LLLME+EN+KPSLFTY++L+DAKG S+D++GME + + MKAEGV  D+ VL
Subjt:  ANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNVL

Query:  SILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSPK
        ++LAKHY SGGL+DKAK +LKE+E+   K S    R LLPLY  L   D+V R+W+ CE +  + E +AAI AWGKL  V EAE +F+ +++ WKKLS +
Subjt:  SILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSPK

Query:  QYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWG
         Y+ +L VY ++K+LTKG++ VK+M DSGC IGPL+WDA+V++Y+ AG+VEKADS L KA Q+ + +PLF +YM +M+ Y+  GD+HN EK+F+ M+   
Subjt:  QYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWG

Query:  YVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRK---TAVNRGY---RVRTSYVFGKL--------EVPYSWEGNV---AGF
        +  +L+ F+ LI+AY+NAK+P YG RERMKA+N+FPNKA A  LAQVDA R+     + R Y   +  +S +   L        +  Y  E +    A  
Subjt:  YVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRK---TAVNRGY---RVRTSYVFGKL--------EVPYSWEGNV---AGF

Query:  GTTAAL-SDR-CIFFEGNNLATWPSS-----------GIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDD
        GT++ +  DR  I    + L+T   S           GI++ S  L S A A +S +  ++EDG  E  ET  +   +ED  A ++  EEL     I DD
Subjt:  GTTAAL-SDR-CIFFEGNNLATWPSS-----------GIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDD

Query:  D--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIER
        D  D + G             + E+   K+AS EL  AI+ AP  SV  VLDKW++EG +++R  +S  +++LR+RRM+GK LQFSEWLE + Q D  E+
Subjt:  D--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIER

Query:  DYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPS
        D+A+RLDLIAKVHG+ KAE+YI  IPK+F+GE++YRTLL  CV   N+KKAE VF K+++L FP+T  ACNQ+++L+KRLDKR++AD+LLLMEKEN+KPS
Subjt:  DYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPS

Query:  LFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPN
        LFTY++L+D KG SND++GMEQ+ +TMKAEGV  D +VL++LAKHY SGGL+D+AKA+LKE+E+   K S    R++LP+Y  L + D+V R+WK CE +
Subjt:  LFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPN

Query:  PRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAV
        P + ECMAAI AWGKL  + +AE +F+ +++ WKKLS + YS +L+VY ++K+LTKGKE VKQM D G  +GPL WDA+V+L++ AG+VEKADS L KAV
Subjt:  PRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAV

Query:  QKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRK
        QKN+ +PLF++Y V+M+ YA++GD+HN EK+FH M+  GY  R    + LIQAY+NAK P YG +ERMKADN+FPNKA A++LAQ D FRK
Subjt:  QKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRK

XP_042959442.1 uncharacterized protein LOC122294618 [Carya illinoinensis]9.9e-30849.25Show/hide
Query:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSE-IDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHA
        I++ S  L S A  E SG +  +ED F E  ET  +   +  S A ++  E L   +  +DDD  D + G            E  E    K+A  EL+ A
Subjt:  IYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSE-IDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHA

Query:  INGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTL
        I  +P  SV  +LDKWV EG ++ +  +S  ++ LR+RRM+ KALQF EWLE + Q+   E+DYA+RLDLI KV G+ KAE+YI +IP SF+GE++YRTL
Subjt:  INGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYASRLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTL

Query:  LANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNV
        LA CV   N+KKAE VF K+++L  P+  FA NQ++LL+KR DKRK+AD L LMEKEN+KPSL TYK+L+D KG SND++GMEQ+ +TMK EGV  DV+V
Subjt:  LANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDVNV

Query:  LSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSP
        L++LAKHY SGGLK KA+ +LKE+E+   KGS    R LL LY  L   ++V R+W+ C+  PR+ EC+AAI AWGKL  V E E +F+ +++ WK LS 
Subjt:  LSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSP

Query:  KQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLW
        ++Y  +L VY D K++TKGK+ VK+M DSGC +GPL+WDA+V+L +  GE+EKADS L KA Q+++++  F++Y+V+M+ YA+RGD+HNAEK+FH M+  
Subjt:  KQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLW

Query:  GYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVD------------------------------ALRKTA---VNRGYRVRTSYV
        GY  RL+ F  LIQAY+NAK P YG  ERMKA+N+FPNKA   +LAQ D                               LR+ A     +GY +  S +
Subjt:  GYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVD------------------------------ALRKTA---VNRGYRVRTSYV

Query:  FGKLEVPYSWEGNVAGFGTTAA-----LSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEEL
        FG      S   NV G G +       +S + +  E     T     I++ +  L S A  E SG +  +ED F E  ET+ +   +  S A ++  E L
Subjt:  FGKLEVPYSWEGNVAGFGTTAA-----LSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEEL

Query:  TCGSEIDDDD--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEA
           +   DDD  D + G            E  E    K+A  EL+ AI+ +P  SV  +LDKWV EG ++ +  +S  +++LRRRRM+ KALQFSEWLE 
Subjt:  TCGSEIDDDD--DVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEA

Query:  SGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLL
        + Q+   ERDYA+RLDLI KV G+ KAE+Y+ +IP SF+GE++YRTLLA CV   N+ KAE VF K+++L  P+T FA NQ+++L+KRLDKRK+ADVL L
Subjt:  SGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLL

Query:  MEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVR
        MEKEN+KPSL TYK+LID KG SND+ GMEQ+ +TMKAEGV  D +VL++LAKHY SGGLK +A+ +LKE+ +   KGS  A R LL +Y  L   DEV 
Subjt:  MEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVR

Query:  RLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEK
        R+WK C+ +PR+ EC+AAI AWGKL  + EAE +F+ +++ WK+LS +QY ++LKVY D K++TKGKE VK+M DSGC +GPLAWDA+V+L + AGE+EK
Subjt:  RLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEK

Query:  ADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDA-FR
        ADS L KA Q+NR++  F++Y+V+M+ YA+RGDVHNAEK+FH MR  GY  R   F  LIQAY+NAK+PAYG  ERMKA+N+FPNKA + +LAQ D  FR
Subjt:  ADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDA-FR

Query:  KTAV
        K  V
Subjt:  KTAV

TrEMBL top hitse value%identityAlignment
A0A6J1EZV3 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like1.3e-28983.14Show/hide
Query:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST
        ALR+ +    N+GYR+RTSYVFGKLE PYS +GN+ G     A+SDRCI FE NNLATW SSG+ ISSHGLSSQAGAENSGEED++EDGFSEL ETLPST
Subjt:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST

Query:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR
        + LED+KAAD+NE ELT  SE+D     DDGTQNELDLPEVETEL EKIS KRA SELF AI  APGLSVP  LDKWVSEGKE+SRA +SLAMLNLRRRR
Subjt:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR

Query:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH
        MFGKALQFSEWLEASGQL+F++RDYASRLDLIAKVHGL++AE YIAKIPKSFQGEVIYRTLLANCV+ANNVKKAEEVFNKMKDL FP+T FACNQLLLL+
Subjt:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH

Query:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL
        KRLDKRK+ADVLLLMEKEN+KPSLFTYKILIDAKGLSNDM+GMEQVVDTMKAEG+E+D N LSILAKHYASGGLKD+AKAILKEMEDV SK S+W CR+L
Subjt:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL

Query:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD
        LP+YGELQMEDEVRR+WKICE NPRIEECMAAIVAWGKLKN+ EAE+IFDRV+KTWKKLS KQYSTMLKVY+DNKMLTKGK+LVKQMADSGC IGPL W+
Subjt:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD

Query:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK
        A+VKLYVEAGEVEKADSFL KAVQKN+ KPLFTSYM+I+D YARRGDVHNAEK+FH MRL GYVARFSQFQ LIQAY+NAK PAYGMKERMKADNVFPNK
Subjt:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK

Query:  ALAEKLAQLDAFRKTAV
        ALA KLAQ+DAFRKTAV
Subjt:  ALAEKLAQLDAFRKTAV

A0A6J1F506 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like2.2e-28983.14Show/hide
Query:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST
        ALR+ +    N+GYR+RTSYVFGKLE PYS +GN+ G     A+SDRCI FE NNLATW SSG+ ISSHGLSSQAGAENSGEED++EDGFSEL ETLPST
Subjt:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST

Query:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR
        + LED+KAAD+NE ELT  SE+D     DDGTQNELDLPEVETEL EKIS KRA SELF AI  APGLSVP  LDKWVSEGKE+SRA +SLAMLNLRRRR
Subjt:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR

Query:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH
        MFGKALQFSEWLEASGQL+F++RDYASRLDLIAKVHGL++AE YIAKIPKSFQGEVIYRTLLANCV+ANNVKKAEEVFNKMKDL FP+T FACNQLLLL+
Subjt:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH

Query:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL
        KRLDKRK+ADVLLLMEKEN+KPSLFTYKILIDAKGLSNDM+GMEQVVDTMKAEG+E+D N LSILAKHYASGGLKD+AKAILKEMEDV SK S+W CR+L
Subjt:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL

Query:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD
        LP+YGELQMEDEVRR+WK+CE NPRIEECMAAIVAWGKLKN+ EAE+IFDRV+KTWKKLS KQYSTMLKVY+DNKMLTKGK+LVKQMADSGC IGPL WD
Subjt:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD

Query:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK
        A+VKLYVEAGEVEKADSFL KAVQKN+ KPLFTSYMVI+D YARRGDVHNAEK+FH MRL GYVARFS FQ LIQAY+NAK PAYGMKERMKADNVFPNK
Subjt:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK

Query:  ALAEKLAQLDAFRKTAV
        ALA KLAQ+DAFRKTAV
Subjt:  ALAEKLAQLDAFRKTAV

A0A6J1I2N1 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like4.6e-28782.66Show/hide
Query:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST
        ALR+ +    N+GYR+RTSYVFGKLE PYSWEGN+       A+SD CI FE N+LATW  SG+ ISSHGLSSQAGAENSGEED++EDGFSEL ETLPST
Subjt:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST

Query:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR
          LED+KAAD+NE ELT  SE+ DDD VD GTQNELDLPE+ETELAEKI  KRA SELF AI  APG SVP  LDKWVSEGKE+SRA +SLAMLNLRRRR
Subjt:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR

Query:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH
        MFGKALQFSEWLEASGQL+F++RDYASRLDLIAKVHGL++AE YIAKIPKSFQGEVIYRTLLANCV+ANNVKKAEEVFNKMKDL FP+T FACNQLLLL+
Subjt:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH

Query:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL
        KRLDKRK+ADVLLLMEKEN+KPSLFTYKILIDAKGLSNDM+GMEQVVDTMKAEG+E+D N LSILAKHYASGGL D+AKAILKEMEDV SK S+W CRIL
Subjt:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL

Query:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD
        LP+YGELQ+EDEVRR+WKICE NPR++ECMAAIVAWGKLKN+ EAE+IFDRV+KTWKKLS KQYSTMLKVY+DNKMLTKGK+LVKQMADSGC IGPL WD
Subjt:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD

Query:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK
        A+VKLYVEAGEVEKADSFL KAVQKN+ KPLFTSYMVI+D YARRGDVHNAEK+FH MRL GYVARFSQFQ LIQAY+NAK PAYGMKERMKADNVFPNK
Subjt:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK

Query:  ALAEKLAQLDAFRKTAV
        ALA KLAQ+DAFRKTAV
Subjt:  ALAEKLAQLDAFRKTAV

A0A6J1I524 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like1.1e-28883.14Show/hide
Query:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST
        ALR+ +    N+GYR+RTSYVFGKLE PYSWEGN+       A+SD CI FE N+LATW  SG+ ISSHGLSSQAGAENSGEED++EDGFSEL ETLPST
Subjt:  ALRKTAV---NRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPST

Query:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR
        + LED+KAAD+NE ELT  SE+ DDD VD GTQNELDLPE+ETELAEKI  KRA SELF AI  APG SVP  LDKWVSEGKE+SRA +SLAMLNLRRRR
Subjt:  SPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRR

Query:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH
        MFGKALQFSEWLEASGQL+F++RDYASRLDLIAKVHGL++AE YIAKIPKSFQGEVIYRTLLANCV+ANNVKKAEEVFNKMKDL FP+T FACNQLLLL+
Subjt:  MFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLH

Query:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL
        KRLDKRK+ADVLLLMEKEN+KPSLFTYKILIDAKGLSNDM+GMEQVVDTMKAEG+E+D + LSILAKHYASGGLKD+AKAILKEMEDV SK S+W CRIL
Subjt:  KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRIL

Query:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD
        LP+YGELQMEDEVRR+WKICE NPRIEECMAAIVAWGKLKN+ EAE+IFDRV+KTWKKLS KQYSTMLKVY+DNKMLTKGK+LVKQMADSGC IGPL WD
Subjt:  LPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD

Query:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK
        A+VKLYVEAGEVEKADSFL KAVQKN+ KPLFTSYMVI+D YARRGDVHNAEK+FH MRL GYVARFSQFQ LIQAY+NAK PAYGMKERMKADNVFPNK
Subjt:  AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNK

Query:  ALAEKLAQLDAFRKTAV
        ALA KLAQ+DAFRKTAV
Subjt:  ALAEKLAQLDAFRKTAV

A0A6J1I7V1 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X24.6e-28783.39Show/hide
Query:  VNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAA
        +N+GYR+RTSYVFG LE PYSWEGN+       A+SD CI FE N+LATW  SG+ ISSHGLSSQAGAENSGEED++EDGFSEL ETLPST+ LED+KAA
Subjt:  VNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAA

Query:  DDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFS
        D+NE ELT  SE+ DDD VD GTQNELDLPE+ETELAEKI  KRA SELF AI  APG SVP  LDKWVSEGKE+SRA +SLAMLNLRRRRMFGKALQFS
Subjt:  DDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFS

Query:  EWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMA
        EWLEASGQL+F++RDYASRLDLIAKVHGL++AE YIAKIPKSFQGEVIYRTLLANCV+ANNVKKAEEVFNKMKDL FP+T FACNQLLLL+KRLDKRK+A
Subjt:  EWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMA

Query:  DVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQM
        DVLLLMEKEN+KPSLFTYKILIDAKGLSNDM+GMEQVVDTMKAEG+E+D N LSILAKHYASGGL D+AKAILKEMEDV SK S+W CRILLP+YGELQM
Subjt:  DVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQM

Query:  EDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEA
        EDEVRR+WKICE NPR++ECMAAIVAWGKLKN+ EAE+IFDRV+KTWKKLS KQYSTMLKVY+DNKMLTKGK+LVKQMADSGC IGPL WDA+VKLYVEA
Subjt:  EDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEA

Query:  GEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQL
        GEVEKADSFL KAVQKN+ KPLFTSYMVI+D YARRGDVHNAEK+FH MRL GYVARFSQFQ LIQAY+NAK PAYGMKERMKADNVFPNKALA KLAQ+
Subjt:  GEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQL

Query:  DAFRKTAV
        DAFRKTAV
Subjt:  DAFRKTAV

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607705.9e-3725.84Show/hide
Query:  KISTKRASSE-LFNAILGAPGLSVPI--VLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERY
        K STK+   E L+N +    G  V +   L++++   K + +  V   +  LR R ++  AL+ SE +E  G    +  D A  LDL+AK   +   E Y
Subjt:  KISTKRASSE-LFNAILGAPGLSVPI--VLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERY

Query:  IAKIPKSFQGEVIYRTLLANCVIANNV-KKAEEVFNKMKDLGFPLTTFACNQLLLLH-KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLG
           +P++ + E+ Y +LL NC     + +KAE + NKMK+L    ++ + N L+ L+ K  +  K+  ++  ++ EN+ P  +TY + + A   +ND+ G
Subjt:  IAKIPKSFQGEVIYRTLLANCVIANNV-KKAEEVFNKMKDLGFPLTTFACNQLLLLH-KRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLG

Query:  MEQVVDTMKAEG-VEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLW---KICEPNPRIEECMAAIVAWGK
        +E+V++ M  +G V  D+   S +A  Y   GL  +A+  L+E+E   ++    A + L+ +YG L    EV R+W   ++  P       +  I    K
Subjt:  MEQVVDTMKAEG-VEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLW---KICEPNPRIEECMAAIVAWGK

Query:  LKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKK-----PLFT
        L ++P AE +F            +  + ++  Y+   ++ K  EL ++    G  +    W+  +  YV++G++ +A   + KAV   +       P   
Subjt:  LKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKK-----PLFT

Query:  SYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQL
        +   +M ++ ++ DV+ AE +   ++          F+ LI+ Y  A      M+ R+K +NV  N+A  + L ++
Subjt:  SYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQL

Q940Q2 Pentatricopeptide repeat-containing protein At1g07590, mitochondrial1.3e-3624.9Show/hide
Query:  LAQLDAFRKTAVETELAEKISTKRASSELFIAILGAP-GLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASR
        L  L + +K+  ET +  +   +  +  L + I   P G++V   L  W+ +G  +    V  A+  LRK     +AL+  EW+         E +Y+  
Subjt:  LAQLDAFRKTAVETELAEKISTKRASSELFIAILGAP-GLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASR

Query:  LDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKM-ADDLLLMEKENIKPSLFTY
        L+   K+HG+ + E+   ++P+ FQ E++Y  L+  C+    ++ A E   KM++LG+  +    N+L++ +    +RK+ A DL LM+ +   P + TY
Subjt:  LDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKM-ADDLLLMEKENIKPSLFTY

Query:  KILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICE--PNPR
         IL+  +   +++ G+ +  D MK  GVE +     ILA  +A   L   A+A  +E+E   +  +     IL+ +YG L  E E+ R W +     + R
Subjt:  KILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICE--PNPR

Query:  IEECLAAIVAWGKLKNIPEAEKMFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWD------AIVKLYVEAGEVEKADSFL
         +  L A  A+ ++ N+  AE+++  +         + ++++L VY  + ++ K   + ++M  +G      T+       A  KL  EA +  +    L
Subjt:  IEECLAAIVAWGKLKNIPEAEKMFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWD------AIVKLYVEAGEVEKADSFL

Query:  VKAVQKNRKKPLFTSYMVIMNHYARRGDVHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFR
          +       P   + + I+  +A +GDV N+EK+F  ++   Y      Y  L +AY+ AK     +  RM      P+      L  V+ ++
Subjt:  VKAVQKNRKKPLFTSYMVIMNHYARRGDVHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFR

Q9C977 Pentatricopeptide repeat-containing protein At1g80270, mitochondrial8.2e-17254.88Show/hide
Query:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET
        F+ N++A+     +        +S+  LSS AG ++  EED++EDGFSEL+     +   + S ++D++E +L+           D+  + ELDL  +ET
Subjt:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET

Query:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER
        +++ K + ++  SELF  I+ APGLS+   LDKWV EG EI+R  ++ AML LRRRRM+G+ALQ SEWLEA+ +++  ERDYASRLDL  K+ GL K E 
Subjt:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER

Query:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM
         + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDLGFPL+ F C+Q+LLLHKR+D++K+ADVLLLMEKENIKPSL TYKILID KG +ND+ GM
Subjt:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM

Query:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP
        EQ+++TMK EGVE+DF   ++ A+HY+  GLKD+A+ +LKEME    + ++ A + LL IY  L  EDEV+R+WKICE  P  EE +AAI A+GKL  + 
Subjt:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP

Query:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA
        EAE IF+++VK  ++ S   YS +L+VY D+KML+KGK+LVK+MA+SGC I    WDA++KLYVEAGEVEKADS L KA +++  K +  S+M IMD Y+
Subjt:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA

Query:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
        +RGDVHN EKIF  MR  GY +R  QFQ L+QAY+NAK PAYGM++R+KADN+FPNK++A +LAQ D F+KTA+
Subjt:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

Q9LRP6 Pentatricopeptide repeat-containing protein At3g15590, mitochondrial4.1e-14753.68Show/hide
Query:  KISTKRASSELFIAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLHKAERYIAK
        K + KR  SEL+ +I+     SV  VL+KWV EGK++S+A V+LA+ NLRKR+ +   LQ  EWL A+ Q +F E +YAS+LDL+AKVH L KAE ++  
Subjt:  KISTKRASSELFIAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLHKAERYIAK

Query:  IPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVV
        IP+S +GEV+YRTLLANCV+ ++V KAE++FNKMK+L FP + FACNQLLLL+  HD++K++D LLLME+ENIKPS  TY  LI++KGL+ D+ GME++V
Subjt:  IPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVV

Query:  DTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICEPNPRIEECLAAIVAWGKLKNIPEAEK
        +T+K EG+E+D  + SILAK+Y   GLK+ A+ ++KE+E    + + W CR LLP+Y ++   D VRRL    + NPR + C++AI AWGKLK + EAE 
Subjt:  DTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICEPNPRIEECLAAIVAWGKLKNIPEAEK

Query:  MFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMNHYARRGD
        +F+R+V+ +K      Y  ++++Y++NKML KG++LVK+M ++G +IGPSTW A+VKLY++AGEV KA+  L +A + N+ +P+FT+YM I+  YA+RGD
Subjt:  MFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMNHYARRGD

Query:  VHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFRKTAVSDLLD
        VHN EK+F  M+   Y A+  QY+T++ AY+NAKTPAYGM +RMKADNVFPNK+LA KLAQV+ F+K  VS LLD
Subjt:  VHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFRKTAVSDLLD

Q9XI21 Pentatricopeptide repeat-containing protein At1g15480, mitochondrial1.4e-16851.61Show/hide
Query:  ALAGKLAQVDALRKTAVNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSEL
        AL+  L +  +LR  A N         V+ KL++P   E N+A   + A + D+          +W SS        LSS AGA+ +G++D++E      
Subjt:  ALAGKLAQVDALRKTAVNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSEL

Query:  DETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAM
        D+ +   +P E S  ++D       G E   D+   +G + EL +PE            +  SE+F AI+   GLSV   LDKWV +GK+ +R     AM
Subjt:  DETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAM

Query:  LNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFAC
        L LR+RRMFG+ALQ +EWL+ + Q +  ERDYA RLDLI+KV G YK E YI  IP+SF+GE++YRTLLAN V  +NV+ AE VFNKMKDLGFPL+TF C
Subjt:  LNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFAC

Query:  NQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGS
        NQ+L+L+KR+DK+K+ADVLLL+EKEN+KP+L TYKILID KG SND+ GMEQ+V+TMK+EGVE+D    +++A+HYAS GLK++A+ +LKEME    + +
Subjt:  NQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGS

Query:  QWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCS
        +  C+ LL +YG LQ EDEVRR+WKICE NPR  E +AAI+A+GK+  + +AE +F++V+K   ++S   YS +L+VY D+KM+++GK+LVKQM+DSGC+
Subjt:  QWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCS

Query:  IGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKA
        IG L WDA++KLYVEAGEVEKA+S L KA+Q  + KPL +S+M +M  Y RRGDVHN EKIF  M+  GY +RF  +QTLIQAY+NAK PAYGMKERMKA
Subjt:  IGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKA

Query:  DNVFPNKALAEKLAQLDAFRKT
        DN+FPNK LA +LA+ D F+KT
Subjt:  DNVFPNKALAEKLAQLDAFRKT

Arabidopsis top hitse value%identityAlignment
AT1G15480.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-16951.61Show/hide
Query:  ALAGKLAQVDALRKTAVNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSEL
        AL+  L +  +LR  A N         V+ KL++P   E N+A   + A + D+          +W SS        LSS AGA+ +G++D++E      
Subjt:  ALAGKLAQVDALRKTAVNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWPSSGIYISSHGLSSQAGAENSGEEDEMEDGFSEL

Query:  DETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAM
        D+ +   +P E S  ++D       G E   D+   +G + EL +PE            +  SE+F AI+   GLSV   LDKWV +GK+ +R     AM
Subjt:  DETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAM

Query:  LNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFAC
        L LR+RRMFG+ALQ +EWL+ + Q +  ERDYA RLDLI+KV G YK E YI  IP+SF+GE++YRTLLAN V  +NV+ AE VFNKMKDLGFPL+TF C
Subjt:  LNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFAC

Query:  NQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGS
        NQ+L+L+KR+DK+K+ADVLLL+EKEN+KP+L TYKILID KG SND+ GMEQ+V+TMK+EGVE+D    +++A+HYAS GLK++A+ +LKEME    + +
Subjt:  NQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGS

Query:  QWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCS
        +  C+ LL +YG LQ EDEVRR+WKICE NPR  E +AAI+A+GK+  + +AE +F++V+K   ++S   YS +L+VY D+KM+++GK+LVKQM+DSGC+
Subjt:  QWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCS

Query:  IGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKA
        IG L WDA++KLYVEAGEVEKA+S L KA+Q  + KPL +S+M +M  Y RRGDVHN EKIF  M+  GY +RF  +QTLIQAY+NAK PAYGMKERMKA
Subjt:  IGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKA

Query:  DNVFPNKALAEKLAQLDAFRKT
        DN+FPNK LA +LA+ D F+KT
Subjt:  DNVFPNKALAEKLAQLDAFRKT

AT1G80270.1 PENTATRICOPEPTIDE REPEAT 5965.8e-17354.88Show/hide
Query:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET
        F+ N++A+     +        +S+  LSS AG ++  EED++EDGFSEL+     +   + S ++D++E +L+           D+  + ELDL  +ET
Subjt:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET

Query:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER
        +++ K + ++  SELF  I+ APGLS+   LDKWV EG EI+R  ++ AML LRRRRM+G+ALQ SEWLEA+ +++  ERDYASRLDL  K+ GL K E 
Subjt:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER

Query:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM
         + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDLGFPL+ F C+Q+LLLHKR+D++K+ADVLLLMEKENIKPSL TYKILID KG +ND+ GM
Subjt:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM

Query:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP
        EQ+++TMK EGVE+DF   ++ A+HY+  GLKD+A+ +LKEME    + ++ A + LL IY  L  EDEV+R+WKICE  P  EE +AAI A+GKL  + 
Subjt:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP

Query:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA
        EAE IF+++VK  ++ S   YS +L+VY D+KML+KGK+LVK+MA+SGC I    WDA++KLYVEAGEVEKADS L KA +++  K +  S+M IMD Y+
Subjt:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA

Query:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
        +RGDVHN EKIF  MR  GY +R  QFQ L+QAY+NAK PAYGM++R+KADN+FPNK++A +LAQ D F+KTA+
Subjt:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

AT1G80270.2 PENTATRICOPEPTIDE REPEAT 5965.8e-17354.88Show/hide
Query:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET
        F+ N++A+     +        +S+  LSS AG ++  EED++EDGFSEL+     +   + S ++D++E +L+           D+  + ELDL  +ET
Subjt:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET

Query:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER
        +++ K + ++  SELF  I+ APGLS+   LDKWV EG EI+R  ++ AML LRRRRM+G+ALQ SEWLEA+ +++  ERDYASRLDL  K+ GL K E 
Subjt:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER

Query:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM
         + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDLGFPL+ F C+Q+LLLHKR+D++K+ADVLLLMEKENIKPSL TYKILID KG +ND+ GM
Subjt:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM

Query:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP
        EQ+++TMK EGVE+DF   ++ A+HY+  GLKD+A+ +LKEME    + ++ A + LL IY  L  EDEV+R+WKICE  P  EE +AAI A+GKL  + 
Subjt:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP

Query:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA
        EAE IF+++VK  ++ S   YS +L+VY D+KML+KGK+LVK+MA+SGC I    WDA++KLYVEAGEVEKADS L KA +++  K +  S+M IMD Y+
Subjt:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA

Query:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
        +RGDVHN EKIF  MR  GY +R  QFQ L+QAY+NAK PAYGM++R+KADN+FPNK++A +LAQ D F+KTA+
Subjt:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

AT1G80270.3 PENTATRICOPEPTIDE REPEAT 5965.8e-17354.88Show/hide
Query:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET
        F+ N++A+     +        +S+  LSS AG ++  EED++EDGFSEL+     +   + S ++D++E +L+           D+  + ELDL  +ET
Subjt:  FEGNNLATWPSSGI-------YISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVET

Query:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER
        +++ K + ++  SELF  I+ APGLS+   LDKWV EG EI+R  ++ AML LRRRRM+G+ALQ SEWLEA+ +++  ERDYASRLDL  K+ GL K E 
Subjt:  ELAEKISTKRASSELFNAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAER

Query:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM
         + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDLGFPL+ F C+Q+LLLHKR+D++K+ADVLLLMEKENIKPSL TYKILID KG +ND+ GM
Subjt:  YIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGM

Query:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP
        EQ+++TMK EGVE+DF   ++ A+HY+  GLKD+A+ +LKEME    + ++ A + LL IY  L  EDEV+R+WKICE  P  EE +AAI A+GKL  + 
Subjt:  EQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYSKGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIP

Query:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA
        EAE IF+++VK  ++ S   YS +L+VY D+KML+KGK+LVK+MA+SGC I    WDA++KLYVEAGEVEKADS L KA +++  K +  S+M IMD Y+
Subjt:  EAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYA

Query:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV
        +RGDVHN EKIF  MR  GY +R  QFQ L+QAY+NAK PAYGM++R+KADN+FPNK++A +LAQ D F+KTA+
Subjt:  RRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLDAFRKTAV

AT3G15590.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-14853.68Show/hide
Query:  KISTKRASSELFIAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLHKAERYIAK
        K + KR  SEL+ +I+     SV  VL+KWV EGK++S+A V+LA+ NLRKR+ +   LQ  EWL A+ Q +F E +YAS+LDL+AKVH L KAE ++  
Subjt:  KISTKRASSELFIAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLHKAERYIAK

Query:  IPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVV
        IP+S +GEV+YRTLLANCV+ ++V KAE++FNKMK+L FP + FACNQLLLL+  HD++K++D LLLME+ENIKPS  TY  LI++KGL+ D+ GME++V
Subjt:  IPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVV

Query:  DTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICEPNPRIEECLAAIVAWGKLKNIPEAEK
        +T+K EG+E+D  + SILAK+Y   GLK+ A+ ++KE+E    + + W CR LLP+Y ++   D VRRL    + NPR + C++AI AWGKLK + EAE 
Subjt:  DTMKAEGVEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICEPNPRIEECLAAIVAWGKLKNIPEAEK

Query:  MFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMNHYARRGD
        +F+R+V+ +K      Y  ++++Y++NKML KG++LVK+M ++G +IGPSTW A+VKLY++AGEV KA+  L +A + N+ +P+FT+YM I+  YA+RGD
Subjt:  MFDRVVKTWKKLSPKHYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMNHYARRGD

Query:  VHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFRKTAVSDLLD
        VHN EK+F  M+   Y A+  QY+T++ AY+NAKTPAYGM +RMKADNVFPNK+LA KLAQV+ F+K  VS LLD
Subjt:  VHNAEKIFHNMRLWGYVARFSQYQTLIQAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFRKTAVSDLLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAATCGAGGGTACAGAGTAAGAACTTCATATGTCTTTGGCAAACTAGAGGTACCATATTCTTGGGAAGGAAATGTAGCTGGTTTTGGAACCACCGCTGCTTTATC
TGATAGATACATTTTTTTTGAGGGAAACAACCTTGCAACATGGCCGTCCTCTTGTATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAG
AGGAAGATAAAATGGAAGATGGATTTTCTGAACTTGATGAAACACTTCCAAGCACTAGTCCACTCGAAGATAGTAAGGCAGCTGATGATAATGAAGAGGAACTAACCTGT
GGATCAGAAATTGATGATGACGACGACGACATTGATGATGGGACACAAAATGAACTGGATTTACCTGAGGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGC
TTCTTCAGAGCTGTTCCATGCTATTAATGGTGCTCCAGGTTTATCTGTTCCTATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTC
TTGCCATGCTCAATCTTCGTAAACGTCGAATGTTTGGGAAGGCTTTGCAGTTTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTCTATTGAGAGAGATTATGCTTCT
CGCCTTGACTTGATTGCAAAGGTACACGGTCTCCATAAGGCAGAGAGGTACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAA
CTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACA
AGAGGCATGACAAGAGGAAAATGGCCGACGATTTGTTGTTGATGGAGAAAGAAAATATCAAGCCTTCTCTATTTACTTACAAAATCTTAATAGATGCTAAAGGCTTATCA
AATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGAGTTGAAATTGATGTCAATGTACTTTCCATATTAGCTAAGCACTATGCTTCAGGTGGGCT
TAAAGACAAAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTAACTCCAAAGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAG
ATGACGTGAGGAGGCTCTGGGAGATCTGCGAGCCAAATCCTCGTATCGAAGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGTTGAAGAACGTCCCGGAAGCAGAGAAA
ATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCCCAAAACAATATTCTACCATGTTGAACGTTTATGCAGATAATAAGATGCTGACGAAGGGCAAGGATCTAGT
CAAGCAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTTATCATGGGATGCAATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGGTTA
AGGCTGTTCAAAAAAGCCAGAAGAAGCCATTGTTTACCTCATACATGGTTATCATGGATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCACAAT
ATGAGACTGTGGGGTTACGTGGCTCGATTGAGGCAATTTCAAACTCTAATACAGGCATACGTTAACGCGAAGACTCCGGCCTATGGTATGAGAGAGAGAATGAAGGCAGA
TAATGTATTTCCAAACAAAGCTTTGGCAGGAAAATTAGCCCAAGTTGATGCTTTAAGGAAGACAGCAGTGAATCGAGGGTACAGAGTAAGAACTTCATATGTCTTTGGCA
AACTAGAGGTACCATATTCTTGGGAAGGAAATGTAGCTGGTTTTGGAACCACCGCTGCTTTATCTGATAGATGCATTTTTTTTGAGGGAAACAACCTTGCAACATGGCCG
TCCTCTGGGATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGATGAAATGGAAGATGGATTTTCTGAACTTGATGAAACACT
TCCAAGCACTAGTCCACTTGAAGATAGTAAGGCAGCTGATGATAATGAAGAGGAACTAACCTGTGGATCAGAAATCGATGATGACGACGACGTTGATGATGGGACTCAAA
ATGAACTGGATTTACCAGAGGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGCTTCTTCAGAGCTATTCAATGCTATTTTGGGTGCTCCTGGTTTATCTGTT
CCTATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTCTTGCCATGCTCAATCTTCGTAGACGTCGAATGTTTGGGAAGGCTTTGCA
GTTTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACACGGTCTCTATAAGGCAGAGAGGT
ACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAA
ATGAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACAAGAGGCTTGACAAGAGGAAAATGGCCGACGTTTTGTTGTTGATGGAGAA
AGAAAATATCAAGCCTTCACTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAG
GAGTTGAAATTGATTTTAATGTCCTTTCAATATTAGCTAAGCACTATGCTTCAGGTGGGCTTAAAGACAGAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTTACTCC
AAAGGTTCTCAATGGGCTTGCAGAATTTTACTTCCTATTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGAAGATCTGTGAGCCAAATCCTCGTATCGA
AGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGCTGAAGAACATCCCGGAAGCAGAGAAAATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCCCAAAACAAT
ATTCTACCATGTTGAAGGTTTATTCAGACAATAAGATGCTGACGAAGGGCAAGGAACTAGTCAAGCAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTTGGCATGGGAT
GCAATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGGTTAAGGCTGTTCAAAAAAACCGGAAGAAGCCATTGTTTACCTCATACATGGT
TATCATGGATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCATAATATGAGACTATGGGGTTACGTGGCTCGATTCAGTCAATTTCAAACTCTGA
TACAGGCATACCTTAACGCGAAGATTCCGGCCTATGGTATGAAAGAGAGAATGAAGGCAGATAATGTATTTCCAAACAAAGCTTTGGCAGAAAAATTAGCCCAACTTGAT
GCTTTCAGGAAGACAGCAGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGCTTCTTCAGAGCTATTCATTGCTATTTTGGGTGCTCCTGGTTTATCTGTTCC
TATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTCTTGCCATGCTCAATCTTCGTAAACGTCGAATGTTTGGGAAGGCTTTGCAGT
TTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACACGGTCTCCATAAGGCAGAGAGGTAC
ATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAAAT
GAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACAAGAGGCATGACAAGAGGAAAATGGCTGACGATTTGTTGTTGATGGAGAAAG
AAAATATCAAGCCTTCTCTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGA
GTTGAAATTGATTTTAATGTACTTTCCATATTAGCTAAGCACTATGCTTCAGGTGGGCTTAAAGACGAAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTTACTCCAA
AGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCATTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGGAGATCTGCGAGCCAAATCCTCGTATCGAAG
AATGCTTGGCTGCCATTGTTGCTTGGGGAAAGTTGAAGAACATCCCGGAAGCAGAGAAAATGTTTGATAGAGTTGTAAAAACATGGAAGAAGCTTTCCCCAAAACATTAT
TCTACCATGTTGAAGGTTTATTCAGACAATAAGATGCTGACAAAGGGCAAGGAACTAGTCAAACAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTCGACATGGGATGC
AATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTTTTGGTTAAGGCTGTTCAAAAAAACCGGAAGAAGCCATTGTTTACCTCATACATGGTTA
TCATGAATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCATAATATGAGACTATGGGGTTACGTGGCTCGATTCAGCCAATATCAAACTCTGATA
CAGGCATACCTTAACGCGAAGACTCCGGCCTATGGTATGAAAGATCGAATGAAGGCAGATAATGTATTTCCAAACAAAACTTTGGCAGGGAAATTAGCCCAAGTGGATGC
TTTCAGGAAGACAGCAGTGTCGGATTTGCTTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAATCGAGGGTACAGAGTAAGAACTTCATATGTCTTTGGCAAACTAGAGGTACCATATTCTTGGGAAGGAAATGTAGCTGGTTTTGGAACCACCGCTGCTTTATC
TGATAGATACATTTTTTTTGAGGGAAACAACCTTGCAACATGGCCGTCCTCTTGTATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAG
AGGAAGATAAAATGGAAGATGGATTTTCTGAACTTGATGAAACACTTCCAAGCACTAGTCCACTCGAAGATAGTAAGGCAGCTGATGATAATGAAGAGGAACTAACCTGT
GGATCAGAAATTGATGATGACGACGACGACATTGATGATGGGACACAAAATGAACTGGATTTACCTGAGGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGC
TTCTTCAGAGCTGTTCCATGCTATTAATGGTGCTCCAGGTTTATCTGTTCCTATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTC
TTGCCATGCTCAATCTTCGTAAACGTCGAATGTTTGGGAAGGCTTTGCAGTTTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTCTATTGAGAGAGATTATGCTTCT
CGCCTTGACTTGATTGCAAAGGTACACGGTCTCCATAAGGCAGAGAGGTACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAA
CTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACA
AGAGGCATGACAAGAGGAAAATGGCCGACGATTTGTTGTTGATGGAGAAAGAAAATATCAAGCCTTCTCTATTTACTTACAAAATCTTAATAGATGCTAAAGGCTTATCA
AATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGAGTTGAAATTGATGTCAATGTACTTTCCATATTAGCTAAGCACTATGCTTCAGGTGGGCT
TAAAGACAAAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTAACTCCAAAGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAG
ATGACGTGAGGAGGCTCTGGGAGATCTGCGAGCCAAATCCTCGTATCGAAGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGTTGAAGAACGTCCCGGAAGCAGAGAAA
ATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCCCAAAACAATATTCTACCATGTTGAACGTTTATGCAGATAATAAGATGCTGACGAAGGGCAAGGATCTAGT
CAAGCAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTTATCATGGGATGCAATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGGTTA
AGGCTGTTCAAAAAAGCCAGAAGAAGCCATTGTTTACCTCATACATGGTTATCATGGATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCACAAT
ATGAGACTGTGGGGTTACGTGGCTCGATTGAGGCAATTTCAAACTCTAATACAGGCATACGTTAACGCGAAGACTCCGGCCTATGGTATGAGAGAGAGAATGAAGGCAGA
TAATGTATTTCCAAACAAAGCTTTGGCAGGAAAATTAGCCCAAGTTGATGCTTTAAGGAAGACAGCAGTGAATCGAGGGTACAGAGTAAGAACTTCATATGTCTTTGGCA
AACTAGAGGTACCATATTCTTGGGAAGGAAATGTAGCTGGTTTTGGAACCACCGCTGCTTTATCTGATAGATGCATTTTTTTTGAGGGAAACAACCTTGCAACATGGCCG
TCCTCTGGGATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGATGAAATGGAAGATGGATTTTCTGAACTTGATGAAACACT
TCCAAGCACTAGTCCACTTGAAGATAGTAAGGCAGCTGATGATAATGAAGAGGAACTAACCTGTGGATCAGAAATCGATGATGACGACGACGTTGATGATGGGACTCAAA
ATGAACTGGATTTACCAGAGGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGCTTCTTCAGAGCTATTCAATGCTATTTTGGGTGCTCCTGGTTTATCTGTT
CCTATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTCTTGCCATGCTCAATCTTCGTAGACGTCGAATGTTTGGGAAGGCTTTGCA
GTTTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACACGGTCTCTATAAGGCAGAGAGGT
ACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAA
ATGAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACAAGAGGCTTGACAAGAGGAAAATGGCCGACGTTTTGTTGTTGATGGAGAA
AGAAAATATCAAGCCTTCACTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAG
GAGTTGAAATTGATTTTAATGTCCTTTCAATATTAGCTAAGCACTATGCTTCAGGTGGGCTTAAAGACAGAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTTACTCC
AAAGGTTCTCAATGGGCTTGCAGAATTTTACTTCCTATTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGAAGATCTGTGAGCCAAATCCTCGTATCGA
AGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGCTGAAGAACATCCCGGAAGCAGAGAAAATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCCCAAAACAAT
ATTCTACCATGTTGAAGGTTTATTCAGACAATAAGATGCTGACGAAGGGCAAGGAACTAGTCAAGCAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTTGGCATGGGAT
GCAATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGGTTAAGGCTGTTCAAAAAAACCGGAAGAAGCCATTGTTTACCTCATACATGGT
TATCATGGATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCATAATATGAGACTATGGGGTTACGTGGCTCGATTCAGTCAATTTCAAACTCTGA
TACAGGCATACCTTAACGCGAAGATTCCGGCCTATGGTATGAAAGAGAGAATGAAGGCAGATAATGTATTTCCAAACAAAGCTTTGGCAGAAAAATTAGCCCAACTTGAT
GCTTTCAGGAAGACAGCAGTAGAAACTGAACTTGCTGAAAAGATATCAACAAAAAGGGCTTCTTCAGAGCTATTCATTGCTATTTTGGGTGCTCCTGGTTTATCTGTTCC
TATTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAGCCGGGCTGTTGTCTCTCTTGCCATGCTCAATCTTCGTAAACGTCGAATGTTTGGGAAGGCTTTGCAGT
TTTCGGAGTGGTTGGAAGCAAGCGGGCAACTCGATTTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACACGGTCTCCATAAGGCAGAGAGGTAC
ATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGATAGCCAACAATGTTAAAAAAGCAGAGGAAGTATTTAACAAAAT
GAAGGACCTCGGATTCCCACTCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTCACAAGAGGCATGACAAGAGGAAAATGGCTGACGATTTGTTGTTGATGGAGAAAG
AAAATATCAAGCCTTCTCTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATGCTAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGA
GTTGAAATTGATTTTAATGTACTTTCCATATTAGCTAAGCACTATGCTTCAGGTGGGCTTAAAGACGAAGCCAAGGCCATTTTAAAGGAGATGGAAGATGTTTACTCCAA
AGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCATTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGGAGATCTGCGAGCCAAATCCTCGTATCGAAG
AATGCTTGGCTGCCATTGTTGCTTGGGGAAAGTTGAAGAACATCCCGGAAGCAGAGAAAATGTTTGATAGAGTTGTAAAAACATGGAAGAAGCTTTCCCCAAAACATTAT
TCTACCATGTTGAAGGTTTATTCAGACAATAAGATGCTGACAAAGGGCAAGGAACTAGTCAAACAGATGGCAGACAGCGGTTGCAGCATCGGTCCGTCGACATGGGATGC
AATTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTTTTGGTTAAGGCTGTTCAAAAAAACCGGAAGAAGCCATTGTTTACCTCATACATGGTTA
TCATGAATCACTATGCAAGAAGGGGGGATGTCCACAATGCAGAGAAAATCTTTCATAATATGAGACTATGGGGTTACGTGGCTCGATTCAGCCAATATCAAACTCTGATA
CAGGCATACCTTAACGCGAAGACTCCGGCCTATGGTATGAAAGATCGAATGAAGGCAGATAATGTATTTCCAAACAAAACTTTGGCAGGGAAATTAGCCCAAGTGGATGC
TTTCAGGAAGACAGCAGTGTCGGATTTGCTTGACTGAGAATTTTAACCTGATACTTTTGATGAAAGTACAGCTCTTAGGACAAAGAAATCTCGAAATTGGATCCTGCTGC
ACCTGTGTTGATTATATTTTTATTATCCATTCTTCGCTAGAATTAGAACTCTGTTTTTTCAGAAACTCAAAATGGCTTGTTGTTATATATTCTCCCACTTTAATTCGTCA
AATGGATTCAACAAATGTCTCTGGC
Protein sequenceShow/hide protein sequence
MRNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRYIFFEGNNLATWPSSCIYISSHGLSSQAGAENSGEEDKMEDGFSELDETLPSTSPLEDSKAADDNEEELTC
GSEIDDDDDDIDDGTQNELDLPEVETELAEKISTKRASSELFHAINGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDSIERDYAS
RLDLIAKVHGLHKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLS
NDMLGMEQVVDTMKAEGVEIDVNVLSILAKHYASGGLKDKAKAILKEMEDVNSKGSQWPCRILLPLYGELQMEDDVRRLWEICEPNPRIEECMAAIVAWGKLKNVPEAEK
IFDRVVKTWKKLSPKQYSTMLNVYADNKMLTKGKDLVKQMADSGCSIGPLSWDAIVKLYVEAGEVEKADSFLVKAVQKSQKKPLFTSYMVIMDHYARRGDVHNAEKIFHN
MRLWGYVARLRQFQTLIQAYVNAKTPAYGMRERMKADNVFPNKALAGKLAQVDALRKTAVNRGYRVRTSYVFGKLEVPYSWEGNVAGFGTTAALSDRCIFFEGNNLATWP
SSGIYISSHGLSSQAGAENSGEEDEMEDGFSELDETLPSTSPLEDSKAADDNEEELTCGSEIDDDDDVDDGTQNELDLPEVETELAEKISTKRASSELFNAILGAPGLSV
PIVLDKWVSEGKEISRAVVSLAMLNLRRRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLYKAERYIAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNK
MKDLGFPLTTFACNQLLLLHKRLDKRKMADVLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEGVEIDFNVLSILAKHYASGGLKDRAKAILKEMEDVYS
KGSQWACRILLPIYGELQMEDEVRRLWKICEPNPRIEECMAAIVAWGKLKNIPEAEKIFDRVVKTWKKLSPKQYSTMLKVYSDNKMLTKGKELVKQMADSGCSIGPLAWD
AIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMDHYARRGDVHNAEKIFHNMRLWGYVARFSQFQTLIQAYLNAKIPAYGMKERMKADNVFPNKALAEKLAQLD
AFRKTAVETELAEKISTKRASSELFIAILGAPGLSVPIVLDKWVSEGKEISRAVVSLAMLNLRKRRMFGKALQFSEWLEASGQLDFIERDYASRLDLIAKVHGLHKAERY
IAKIPKSFQGEVIYRTLLANCVIANNVKKAEEVFNKMKDLGFPLTTFACNQLLLLHKRHDKRKMADDLLLMEKENIKPSLFTYKILIDAKGLSNDMLGMEQVVDTMKAEG
VEIDFNVLSILAKHYASGGLKDEAKAILKEMEDVYSKGSQWPCRILLPIYGELQMEDEVRRLWEICEPNPRIEECLAAIVAWGKLKNIPEAEKMFDRVVKTWKKLSPKHY
STMLKVYSDNKMLTKGKELVKQMADSGCSIGPSTWDAIVKLYVEAGEVEKADSFLVKAVQKNRKKPLFTSYMVIMNHYARRGDVHNAEKIFHNMRLWGYVARFSQYQTLI
QAYLNAKTPAYGMKDRMKADNVFPNKTLAGKLAQVDAFRKTAVSDLLD