; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019928 (gene) of Snake gourd v1 genome

Gene IDTan0019928
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG10:18992395..18995521
RNA-Seq ExpressionTan0019928
SyntenyTan0019928
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044536.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.5e-10581.59Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H PQLGLRQNLTN  LHCCT AAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK  SKMEDVIN+KL RLLKADLFDALTEL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFM+NEEW+EPDL LYHGMI+M+GKNKMIE+AEE+FH+L+ DGLEPDTRAFNEMMGAYLQV M+ERA + Y  M ASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL
        IL+KNLEKF EEFA VVK++C +YLD+P+KF +D  QKL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL

KAG6582003.1 hypothetical protein SDJN03_22005, partial [Cucurbita argyrosperma subsp. sororia]2.0e-10585.34Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H PQLG RQNLTN  LHCCT A PPP+IICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKSASKMEDVINSKL RLLKADLFDAL EL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFMRNEEW+EPDL+LYH MI M+GKNKMIE+AEE+FHELK DGLEPDTRAFNEMMGAYLQV MVERAVE YE MKASGC PDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFL
        IL+KNLE+F EEFAAVVK+EC ++LDSPEKFL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFL

XP_022980054.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita maxima]2.1e-11085.25Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ HLPQLG RQNLTN  LHCCT A PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKSASKMEDVINSKL RLLKADLFDAL EL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFMRNEEW+EPDL LYH MI M+GKNKMIE+AEE+FH+ K DGLEPDTRAFNEMMGAYLQV MVERAVE YE MKASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI
        IL+KNLE+F EEFAAVVK+EC ++LDSPEKFL DVEQKLMK +I
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI

XP_023527118.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita pepo subsp. pepo]6.1e-11084.84Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H PQLG RQNLTN  LHCCT A PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKSASKMEDVINSKL RLLKADLFDAL EL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFMRNEEW+EPDL LYH MI M+GKNKMIE+AEE+FHELK DGLEPDTRAFNEMMGAY+QV MVERAVE YE MKASGC PDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI
        IL+KNLE+F EEFAAVVK+EC ++LDSPEKFL DVEQKLMK +I
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI

XP_038878176.1 pentatricopeptide repeat-containing protein At1g62350-like [Benincasa hispida]3.6e-11087.82Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMG LQFH PQLGLRQ+LTN  LHCCT AAPPPNIICGLRKGLRKPLGRSRVPS EAIQAVQSLKLAKS SKMEDVINSKL RLLKADLFDALTEL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKF++NEEW+EPDL LYHGMILM GKNKMIE+AEEIFH+LK DGLEPD RAFNEMMGAYLQV MVERAVE YE MKASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQK
        IL+KNLEKF EEFAAVVK+EC  YLDSPEKFL+DVEQK
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQK

TrEMBL top hitse value%identityAlignment
A0A1S3BX97 pentatricopeptide repeat-containing protein At1g62350-like2.2e-10581.17Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H P+LGLRQNLTN  LHCCT AAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK  SKMEDVIN+KL RLLKADLFDALTEL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFM+NEEW+EPDL LYHGMI+M+GKNKMIE+AEE+FH+L+ DGLEPDTRAFNEMMGAYLQV M+ERA + Y  M ASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL
        IL+KNLEKF EEFA VVK++C +YLD+P+KF +D  QKL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL

A0A5A7TQZ8 Pentatricopeptide repeat-containing protein1.7e-10581.59Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H PQLGLRQNLTN  LHCCT AAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK  SKMEDVIN+KL RLLKADLFDALTEL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFM+NEEW+EPDL LYHGMI+M+GKNKMIE+AEE+FH+L+ DGLEPDTRAFNEMMGAYLQV M+ERA + Y  M ASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL
        IL+KNLEKF EEFA VVK++C +YLD+P+KF +D  QKL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL

A0A5D3CZ95 Pentatricopeptide repeat-containing protein2.2e-10581.17Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H P+LGLRQNLTN  LHCCT AAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK  SKMEDVIN+KL RLLKADLFDALTEL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFM+NEEW+EPDL LYHGMI+M+GKNKMIE+AEE+FH+L+ DGLEPDTRAFNEMMGAYLQV M+ERA + Y  M ASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL
        IL+KNLEKF EEFA VVK++C +YLD+P+KF +D  QKL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKL

A0A6J1GUU8 pentatricopeptide repeat-containing protein At1g62350-like4.9e-10584.91Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ H PQLG RQNLTN  LHCCT A PPP+ ICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKSASKMEDVINSKL RLLKADLFDAL EL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFMRNEEW+EPDL LYH MI M+GKNKMIE+AEE+FHELK DGLEPDTRAFNEMMGAYLQV MVERAVE YE MKASGC PDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFL
        IL+KNLE+F EEFAAVVK+EC ++LDSPEKFL
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFL

A0A6J1IY98 pentatricopeptide repeat-containing protein At1g62350-like1.0e-11085.25Show/hide
Query:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL
        MKSTLMGRLQ HLPQLG RQNLTN  LHCCT A PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKSASKMEDVINSKL RLLKADLFDAL EL
Subjt:  MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTEL

Query:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR
        QRQNELELSLQVFKFMRNEEW+EPDL LYH MI M+GKNKMIE+AEE+FH+ K DGLEPDTRAFNEMMGAYLQV MVERAVE YE MKASGCTPDKLT +
Subjt:  QRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLR

Query:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI
        IL+KNLE+F EEFAAVVK+EC ++LDSPEKFL DVEQKLMK +I
Subjt:  ILMKNLEKFGEEFAAVVKEECIKYLDSPEKFLDDVEQKLMKDRI

SwissProt top hitse value%identityAlignment
Q1PFH7 Pentatricopeptide repeat-containing protein At1g623508.8e-1928.65Show/hide
Query:  STEAIQAVQSLKLAKSAS-KMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDG
        S E + A + LK  ++ S +++  I S + RLLK+DL   L E QRQN++ L +++++ +R E W+ PD+  Y  M++ML +NK ++  ++++ +LK + 
Subjt:  STEAIQAVQSLKLAKSAS-KMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDG

Query:  LEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGEEFAAVVKEECIKYL------DSPEKFLDDVEQKLMKD
        +  D   F +++  +L   +   A+ LY  M+ S   P  L  R+++K L  +  E    VK++ ++        D PE   +D +++   D
Subjt:  LEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGEEFAAVVKEECIKYL------DSPEKFLDDVEQKLMKD

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic5.5e-1332.9Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSASKMEDVINSKLG--RLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHE
        +++ S +AI    S+ L + A+K   +I  K G  +LL   + ++L E       E ++QVF+ +R + W++P++ +Y  +I+MLGK K  E A E+F E
Subjt:  SRVPSTEAIQAVQSLKLAKSASKMEDVINSKLG--RLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHE

Query:  LKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKAS-GCTPDKLTLRILMKN
        + N+G   +   +  ++ AY + G  + A  L E MK+S  C PD  T  IL+K+
Subjt:  LKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKAS-GCTPDKLTLRILMKN

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic1.9e-1331.12Show/hide
Query:  SHLHCCTAAAPPPNII---CGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEW
        SH H  +   P    +   CG R   R PL + R+ STEAIQ++QSLK A        +    L RL+K+DL   L EL RQ+   L++ V   +R E  
Subjt:  SHLHCCTAAAPPPNII---CGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEW

Query:  FEP-DLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASG-----CTPDKLTLRILMKNLEKFGE
        + P DL LY  ++  L +NK  +  + +  E+       D +A  +++ A +     E  V +Y  M+ SG        D+    +L K L + GE
Subjt:  FEP-DLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASG-----CTPDKLTLRILMKNLEKFGE

Q9LW84 Pentatricopeptide repeat-containing protein At3g160108.8e-1130.46Show/hide
Query:  QAVQSLKLAKSASKMEDVINSKLGRLLK---ADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEP
        + V++L  AK  SK   V     GR  K   +     +  L ++ + E   +V+  M NE    PD   Y  +I    K    + A  +F E+K++ ++P
Subjt:  QAVQSLKLAKSASKMEDVINSKLGRLLK---ADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEP

Query:  DTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFG
          + +  ++G Y +VG VE+A++L+E MK +GC+P   T   L+K L K G
Subjt:  DTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFG

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic5.2e-1931.68Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVA
        R PL R  ++   EA+  +  LK L +   K++  I + + RLLK D+   + EL+RQ E  L++++F+ ++ +EW++PD+ +Y  +I+ L K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVA

Query:  EEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNL
          ++ ++K + L PD++ + E++  +L+ G    A+ +YE M  S   P++L  R+L+K L
Subjt:  EEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein6.3e-2028.65Show/hide
Query:  STEAIQAVQSLKLAKSAS-KMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDG
        S E + A + LK  ++ S +++  I S + RLLK+DL   L E QRQN++ L +++++ +R E W+ PD+  Y  M++ML +NK ++  ++++ +LK + 
Subjt:  STEAIQAVQSLKLAKSAS-KMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDG

Query:  LEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGEEFAAVVKEECIKYL------DSPEKFLDDVEQKLMKD
        +  D   F +++  +L   +   A+ LY  M+ S   P  L  R+++K L  +  E    VK++ ++        D PE   +D +++   D
Subjt:  LEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGEEFAAVVKEECIKYL------DSPEKFLDDVEQKLMKD

AT3G27750.1 FUNCTIONS IN: molecular_function unknown1.4e-1431.12Show/hide
Query:  SHLHCCTAAAPPPNII---CGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEW
        SH H  +   P    +   CG R   R PL + R+ STEAIQ++QSLK A        +    L RL+K+DL   L EL RQ+   L++ V   +R E  
Subjt:  SHLHCCTAAAPPPNII---CGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEW

Query:  FEP-DLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASG-----CTPDKLTLRILMKNLEKFGE
        + P DL LY  ++  L +NK  +  + +  E+       D +A  +++ A +     E  V +Y  M+ SG        D+    +L K L + GE
Subjt:  FEP-DLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASG-----CTPDKLTLRILMKNLEKFGE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-2031.68Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVA
        R PL R  ++   EA+  +  LK L +   K++  I + + RLLK D+   + EL+RQ E  L++++F+ ++ +EW++PD+ +Y  +I+ L K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVA

Query:  EEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNL
          ++ ++K + L PD++ + E++  +L+ G    A+ +YE M  S   P++L  R+L+K L
Subjt:  EEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.9e-2433.17Show/hide
Query:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHG
        RKPL R R+ S EAIQAVQ+LK A                 S++ ++ VI SK  RLLK D+   L EL RQNE  L+L+VF+ +R E W++P + +Y  
Subjt:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHG

Query:  MILMLGKNKMIEVAEEIFHELKND-GLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGE-EFAAVVKEECIKYLDSPE
        MI ++  N ++E    ++  +K++ GL  +   FN ++   L   + +  ++ Y  M++ G  PD+ + R+L+  LE  GE   +A+V+++  +Y     
Subjt:  MILMLGKNKMIEVAEEIFHELKND-GLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGE-EFAAVVKEECIKYLDSPE

Query:  KFLDDVEQ
        +F+++ E+
Subjt:  KFLDDVEQ

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-1432.9Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSASKMEDVINSKLG--RLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHE
        +++ S +AI    S+ L + A+K   +I  K G  +LL   + ++L E       E ++QVF+ +R + W++P++ +Y  +I+MLGK K  E A E+F E
Subjt:  SRVPSTEAIQAVQSLKLAKSASKMEDVINSKLG--RLLKADLFDALTELQRQNELELSLQVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHE

Query:  LKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKAS-GCTPDKLTLRILMKN
        + N+G   +   +  ++ AY + G  + A  L E MK+S  C PD  T  IL+K+
Subjt:  LKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKAS-GCTPDKLTLRILMKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCTACTCTAATGGGTCGTTTACAATTCCATCTTCCTCAATTGGGTCTCCGCCAAAACCTCACAAACTCACACCTCCATTGTTGTACGGCGGCAGCTCCACCTCC
AAATATCATTTGTGGCCTCAGAAAGGGCCTGAGGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTCCAGTCCCTCAAGCTCGCTAAATCCG
CCTCCAAAATGGAGGACGTAATCAATAGCAAGCTCGGCAGATTGTTGAAAGCAGATTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAACTGGAGCTATCGCTT
CAGGTCTTCAAATTTATGCGGAATGAAGAATGGTTCGAGCCAGATTTAACTTTGTACCATGGGATGATTCTGATGTTGGGGAAGAACAAAATGATTGAAGTGGCTGAAGA
GATCTTCCATGAGTTAAAAAATGATGGCTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATCTGCAAGTGGGTATGGTCGAAAGAGCGGTTGAGTTGT
ATGAATCAATGAAGGCGTCAGGTTGTACTCCAGATAAACTGACTTTAAGGATTTTGATGAAGAATCTAGAGAAATTTGGGGAAGAGTTTGCTGCAGTGGTGAAGGAAGAA
TGCATTAAGTATTTGGATTCTCCTGAGAAGTTTCTTGACGATGTCGAACAGAAACTCATGAAAGATCGAATTCCTTAA
mRNA sequenceShow/hide mRNA sequence
AGTTGTATTTGTAGGGATACAATTCTGCGCAAGCGTAAATCCTCCCTTTCCCCTTCACTGAACTCAGCTTATCGCAGAAGGGCGGTCGAGATGAAATCTACTCTAATGGG
TCGTTTACAATTCCATCTTCCTCAATTGGGTCTCCGCCAAAACCTCACAAACTCACACCTCCATTGTTGTACGGCGGCAGCTCCACCTCCAAATATCATTTGTGGCCTCA
GAAAGGGCCTGAGGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTCCAGTCCCTCAAGCTCGCTAAATCCGCCTCCAAAATGGAGGACGTA
ATCAATAGCAAGCTCGGCAGATTGTTGAAAGCAGATTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAACTGGAGCTATCGCTTCAGGTCTTCAAATTTATGCG
GAATGAAGAATGGTTCGAGCCAGATTTAACTTTGTACCATGGGATGATTCTGATGTTGGGGAAGAACAAAATGATTGAAGTGGCTGAAGAGATCTTCCATGAGTTAAAAA
ATGATGGCTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATCTGCAAGTGGGTATGGTCGAAAGAGCGGTTGAGTTGTATGAATCAATGAAGGCGTCA
GGTTGTACTCCAGATAAACTGACTTTAAGGATTTTGATGAAGAATCTAGAGAAATTTGGGGAAGAGTTTGCTGCAGTGGTGAAGGAAGAATGCATTAAGTATTTGGATTC
TCCTGAGAAGTTTCTTGACGATGTCGAACAGAAACTCATGAAAGATCGAATTCCTTAATCGTGCACTTGGAAGCTTCTGCTAACATTTTTTATTTTTGAGATGGTTGGTG
TTAGGGAATTTCATCAAAGCCTCAAATCTGTCCTAACAAATTGGTATCAAAGCTCGGAATAAGTTAAATGGCAGCAACTACGTTTGAGGTGAAGTGTAATGGATATGGAG
ATTCAATTTGTGGAAGATCAAGATCAAGGCAGTCAATCTTATGGGAAAGACTAGAGTATGGAGGAAACTTGGATAAGTTGAAGATATTAATAATTAGGGAGACAACTAGG
AGAAGAGAGCATTCAGATCACAGAATTCTTAACCTTTTAAAATGCAAAAGATGTTAAAGTATAATTTATTCTTTGTAATTTAAGTATTGTGTGTTTTATTTTTAAAATTT
TTGAGCCCAACAATATTGGTGGATTTGAACCTATGACCTCTTGAACATAAACTCATATTATATTTTGTATAAGTTGAGCTATGCTCTTATTGGCAATTTAA
Protein sequenceShow/hide protein sequence
MKSTLMGRLQFHLPQLGLRQNLTNSHLHCCTAAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSASKMEDVINSKLGRLLKADLFDALTELQRQNELELSL
QVFKFMRNEEWFEPDLTLYHGMILMLGKNKMIEVAEEIFHELKNDGLEPDTRAFNEMMGAYLQVGMVERAVELYESMKASGCTPDKLTLRILMKNLEKFGEEFAAVVKEE
CIKYLDSPEKFLDDVEQKLMKDRIP