; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027002 (gene) of Chayote v1 genome

Gene IDSed0027002
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:46356400..46357884
RNA-Seq ExpressionSed0027002
SyntenySed0027002
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135810.1 pentatricopeptide repeat-containing protein At1g62350-like [Momordica charantia]2.9e-9979.44Show/hide
Query:  MSFLAPSPSP-MIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVL
        M+FLA  PSP  I    +KLLSSGEK+S  RL R +E+RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKRVKND QQLDRVYDSKISRLLKFD+MAVL
Subjt:  MSFLAPSPSP-MIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADII +LASNGLFEQ+Q++HSYLK +T+LAPEIDGFNALLRAL SYN GELAMESY+LMK+V CEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDK

Query:  TSFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTIS
        TSFRI+IKGLEST E   LRIVKQDA+K+YG+ LEFLE+EDE A   S
Subjt:  TSFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTIS

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]6.5e-9977.51Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFLA +PS  I  P   LL SG K S+ RL R + +RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQ+Q++HSY KA+T+LAPEI+GFN LL+AL SYN GELAMESY+LMKEV CEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLEST E+  LR VK+DA++LYGESLEFLE+EDE AT ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima]1.4e-9877.51Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFLA +PS  I  P   LL SG K S+ RL R + +RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQ+Q++HSYLKA+T+LAPEI+GFN LL+AL SYN GELAMESY+LMKEV CEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLEST E+  LR VK+DA++LYGESLEFLE+EDE AT ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo]1.9e-9876.71Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFLA +PSP I  P   LL SG K S+ RL R + +RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQ+SLYADI+++LASNGLFE +Q++HSYLKA+T+LAPEI+GFN LL+AL SYN GELAMESY+LMK+V CEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLEST E+  LR VK+DA++LYGESLEFLE+ED+ AT ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]1.9e-9877.11Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFL   PSP I  P+ KLLSSG + S   L R + +R+VTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLFE++Q++HSYLKA+T+LAPEIDGFNALL+AL S+N GELAMESY+LMKE+ CEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLES GE   LR VKQDA+KLYGESLEFLE+E+E A  ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

TrEMBL top hitse value%identityAlignment
A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like1.3e-9273.9Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFL   PSP I  P  KL SS  K    +L R + + RVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD++AVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLFE++Q++ SY+KA+T+LAPEIDGFNALL+AL  +N G+LAMESY+LMKEV CEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLE  GE   LR VKQDA+KLYGESLEFLE+ +EGAT IS H
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

A0A5A7UZI6 Pentatricopeptide repeat-containing protein1.3e-9273.9Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFL   PSP I  P  KL SS  K    +L R + + RVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD++AVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLFE++Q++ SY+KA+T+LAPEIDGFNALL+AL  +N G+LAMESY+LMKEV CEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLE  GE   LR VKQDA+KLYGESLEFLE+ +EGAT IS H
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like1.4e-9979.44Show/hide
Query:  MSFLAPSPSP-MIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVL
        M+FLA  PSP  I    +KLLSSGEK+S  RL R +E+RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKRVKND QQLDRVYDSKISRLLKFD+MAVL
Subjt:  MSFLAPSPSP-MIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADII +LASNGLFEQ+Q++HSYLK +T+LAPEIDGFNALLRAL SYN GELAMESY+LMK+V CEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDK

Query:  TSFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTIS
        TSFRI+IKGLEST E   LRIVKQDA+K+YG+ LEFLE+EDE A   S
Subjt:  TSFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTIS

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic3.1e-9977.51Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFLA +PS  I  P   LL SG K S+ RL R + +RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQ+Q++HSY KA+T+LAPEI+GFN LL+AL SYN GELAMESY+LMKEV CEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLEST E+  LR VK+DA++LYGESLEFLE+EDE AT ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic7.0e-9977.51Show/hide
Query:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR
        MSFLA +PS  I  P   LL SG K S+ RL R + +RRVTMRGG+ENRK LQKGRNLSIEAIQA+QSLKR K D QQLDRVYDSKI RLLKFD+MAVLR
Subjt:  MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQ+Q++HSYLKA+T+LAPEI+GFN LL+AL SYN GELAMESY+LMKEV CEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKT

Query:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH
        SFRI+IKGLEST E+  LR VK+DA++LYGESLEFLE+EDE AT ISTH
Subjt:  SFRIIIKGLESTGETDYLRIVKQDAEKLYGESLEFLEDEDEGATTISTH

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351305.1e-0624.44Show/hide
Query:  LQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVH
        L K +  + EAI   Q +KR        DR   +  +  L  +L        + ++  ++ K++ ++R  H  KP +  Y  ++   A  GL E+ + + 
Subjt:  LQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVH

Query:  SYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE
          L+ D  L P++  +NAL+ +     +   A E + LM+ + CEPD+ S+ I++      G       +  DAE ++ E
Subjt:  SYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623503.1e-1935.71Show/hide
Query:  LSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKAD
        +S E + A + LKR++  S +LDR   S +SRLLK DL++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + V   LK +
Subjt:  LSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKAD

Query:  TELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKL------YGESLEFLEDEDEGATTIS
         E+  +   F  L+R          AM  Y  M+E    P    FR+I+KGL    E      VK D  +L      Y    +  ED DE A T S
Subjt:  TELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKL------YGESLEFLEDEDEGATTIS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic6.0e-1537.23Show/hide
Query:  GGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIALLASNG
        G  +NR  L KGR LS EAIQ+IQSLKR       L       + RL+K DL++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIALLASNG

Query:  LFEQLQMVHSYLKADTELAPEIDGFN---------ALLRALFSYNFGELAMESYFLMKE-----VDCEPDKTSFRIIIKGLESTGETD
         F+++            L  EIDG +          L+RA+      E  +  Y LM+E        E D+    ++ KGL   GE D
Subjt:  LFEQLQMVHSYLKADTELAPEIDGFN---------ALLRALFSYNFGELAMESYFLMKE-----VDCEPDKTSFRIIIKGLESTGETD

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531709.3e-0825.78Show/hide
Query:  LMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEV-
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  +  +L +    +Q  ++   + ++  L P ID + +L+         + A  +   MK V 
Subjt:  LMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEV-

Query:  DCEPDKTSFRIIIKGLESTGETDYLRIV
        DC+PD  +F ++I      G  D ++ +
Subjt:  DCEPDKTSFRIIIKGLESTGETDYLRIV

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.2e-1833.15Show/hide
Query:  RKQLQKGRNL-SIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQL
        R  L +G+ L   EA+  I  LKR+K D ++LD+   + + RLLK D++AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKQLQKGRNL-SIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQL

Query:  QMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE
          +   +K +  L P+   +  ++R          AM  Y  M +    P++  FR+++KGL           VK+D E+L+ E
Subjt:  QMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-2035.71Show/hide
Query:  LSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKAD
        +S E + A + LKR++  S +LDR   S +SRLLK DL++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + V   LK +
Subjt:  LSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKAD

Query:  TELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKL------YGESLEFLEDEDEGATTIS
         E+  +   F  L+R          AM  Y  M+E    P    FR+I+KGL    E      VK D  +L      Y    +  ED DE A T S
Subjt:  TELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKL------YGESLEFLEDEDEGATTIS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown4.3e-1637.23Show/hide
Query:  GGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIALLASNG
        G  +NR  L KGR LS EAIQ+IQSLKR       L       + RL+K DL++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIALLASNG

Query:  LFEQLQMVHSYLKADTELAPEIDGFN---------ALLRALFSYNFGELAMESYFLMKE-----VDCEPDKTSFRIIIKGLESTGETD
         F+++            L  EIDG +          L+RA+      E  +  Y LM+E        E D+    ++ KGL   GE D
Subjt:  LFEQLQMVHSYLKADTELAPEIDGFN---------ALLRALFSYNFGELAMESYFLMKE-----VDCEPDKTSFRIIIKGLESTGETD

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein8.3e-2033.15Show/hide
Query:  RKQLQKGRNL-SIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQL
        R  L +G+ L   EA+  I  LKR+K D ++LD+   + + RLLK D++AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKQLQKGRNL-SIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQL

Query:  QMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE
          +   +K +  L P+   +  ++R          AM  Y  M +    P++  FR+++KGL           VK+D E+L+ E
Subjt:  QMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQDAEKLYGE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.6e-0925.78Show/hide
Query:  LMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEV-
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  +  +L +    +Q  ++   + ++  L P ID + +L+         + A  +   MK V 
Subjt:  LMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEV-

Query:  DCEPDKTSFRIIIKGLESTGETDYLRIV
        DC+PD  +F ++I      G  D ++ +
Subjt:  DCEPDKTSFRIIIKGLESTGETDYLRIV

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain8.0e-5550.69Show/hide
Query:  VTMRGGNENRKQLQKGRNLSIEAIQAIQSLKR---------------VKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWY
        + MR  ++NRK LQ+GR LSIEAIQA+Q+LKR                 + S  LDRV  SK  RLLKFD++AVLRELLRQNECSLALKVFE++RKE+WY
Subjt:  VTMRGGNENRKQLQKGRNLSIEAIQAIQSLKR---------------VKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSLALKVFEDVRKEHWY

Query:  KPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQD
        KPQV +Y D+I ++A N L E++  ++S +K++  L  EI+ FN LL  L ++   +L M+ Y  M+ +  EPD+ SFR+++ GLES GE     IV+QD
Subjt:  KPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRIVKQD

Query:  AEKLYGESLEFLEDEDE
        A + YGESLEF+E+++E
Subjt:  AEKLYGESLEFLEDEDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTTAGCACCTTCTCCGTCGCCGATGATTCGCTGTCCGACGAACAAGTTGCTGAGTTCCGGCGAGAAAACATCTTTCCCGCGACTAAACCGAACGAAGGAACA
TCGGAGAGTTACGATGAGAGGCGGAAATGAGAACCGGAAGCAGCTGCAGAAGGGGAGGAATCTCAGCATCGAAGCCATTCAAGCGATTCAATCATTGAAACGAGTCAAGA
ATGATTCACAGCAATTGGACCGAGTGTATGATTCCAAAATCAGCCGCTTATTGAAGTTCGATTTGATGGCTGTTCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTATAAGCCTCAGGTTTCGCTGTATGCTGATATTATTGCACTATTGGCTAGCAATGGATTGTTCGAACAACT
ACAAATGGTTCATTCGTATTTGAAAGCGGACACTGAACTAGCACCTGAAATCGACGGGTTTAACGCTCTTCTGAGGGCTTTATTTAGTTACAATTTCGGCGAACTTGCGA
TGGAGTCCTACTTCTTGATGAAAGAAGTTGATTGTGAGCCAGATAAGACTTCTTTCAGGATAATCATAAAAGGATTGGAATCAACCGGAGAGACGGATTATTTAAGAATT
GTGAAGCAGGATGCAGAAAAGCTGTATGGTGAATCGCTTGAGTTTCTAGAAGATGAAGACGAGGGAGCTACAACCATATCTACGCACTGA
mRNA sequenceShow/hide mRNA sequence
GCTGAGGTTAGCACTTCGTTAAAGCGTTAACTGGTAGTACCTCCCAGGAGTTGGACTTCGCCGGAGAGAGTAATCATGAGTTTCTTAGCACCTTCTCCGTCGCCGATGAT
TCGCTGTCCGACGAACAAGTTGCTGAGTTCCGGCGAGAAAACATCTTTCCCGCGACTAAACCGAACGAAGGAACATCGGAGAGTTACGATGAGAGGCGGAAATGAGAACC
GGAAGCAGCTGCAGAAGGGGAGGAATCTCAGCATCGAAGCCATTCAAGCGATTCAATCATTGAAACGAGTCAAGAATGATTCACAGCAATTGGACCGAGTGTATGATTCC
AAAATCAGCCGCTTATTGAAGTTCGATTTGATGGCTGTTCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTGGCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACA
CTGGTATAAGCCTCAGGTTTCGCTGTATGCTGATATTATTGCACTATTGGCTAGCAATGGATTGTTCGAACAACTACAAATGGTTCATTCGTATTTGAAAGCGGACACTG
AACTAGCACCTGAAATCGACGGGTTTAACGCTCTTCTGAGGGCTTTATTTAGTTACAATTTCGGCGAACTTGCGATGGAGTCCTACTTCTTGATGAAAGAAGTTGATTGT
GAGCCAGATAAGACTTCTTTCAGGATAATCATAAAAGGATTGGAATCAACCGGAGAGACGGATTATTTAAGAATTGTGAAGCAGGATGCAGAAAAGCTGTATGGTGAATC
GCTTGAGTTTCTAGAAGATGAAGACGAGGGAGCTACAACCATATCTACGCACTGAGCTGAAAACTCGGATTTGGATCATTTTAGGGAAGGCATGTACGCTAACAAGAGCC
TAACTCAATCGACATGAAGTATGTACATATGACCTAGAGGACACGCGTTTGAATTATTTAACCACATATGTTGTTGAACTCGAAAAAAGGAAGGCGTATACAGATCGAAC
TTGTAGGACTGTTTCATTTGAAGAAACAAGATTGAGAAACTTTCCTCGGATATGAACTTTAACTGTATAATGGCTAATGTCAACCTGTGCTTAGTTGTGGATATCTGTGA
TATGATAACTTTCGAGATTAATTTTTACTGGATAAATGACACTAGCTGGAACTCATGAGTTGGATTGATATACTCTCTTATTCTTGTTCTTTTGCATGACAAGTCAATCA
TCTTGCAACATAGTTCAGCTCTTCTGTCATGAGCATGAGAATTGCATCTAAGTCAACCGAAGACAGAAGCATCAGGAAAAAGCTGGACTATAATGCAAAGTGAAC
Protein sequenceShow/hide protein sequence
MSFLAPSPSPMIRCPTNKLLSSGEKTSFPRLNRTKEHRRVTMRGGNENRKQLQKGRNLSIEAIQAIQSLKRVKNDSQQLDRVYDSKISRLLKFDLMAVLRELLRQNECSL
ALKVFEDVRKEHWYKPQVSLYADIIALLASNGLFEQLQMVHSYLKADTELAPEIDGFNALLRALFSYNFGELAMESYFLMKEVDCEPDKTSFRIIIKGLESTGETDYLRI
VKQDAEKLYGESLEFLEDEDEGATTISTH