; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g03040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g03040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein THYLAKOID ASSEMBLY 8, chloroplastic
Genome locationchr9:2420331..2421020
RNA-Seq ExpressionMoc09g03040
SyntenyMoc09g03040
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014066.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.0e-9077.83Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_022150715.1 uncharacterized protein LOC111018772 [Momordica charantia]3.0e-119100Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
        EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK

Query:  ILSKGLRRLGEVELADEINREFQNLVATY
        ILSKGLRRLGEVELADEINREFQNLVATY
Subjt:  ILSKGLRRLGEVELADEINREFQNLVATY

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata]5.0e-9077.83Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima]5.6e-8977.39Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYG DLG+YAE+AAALSRNG  EEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+ V
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADEIN +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo]1.3e-9078.26Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYGADLG+YAE+AAALSRNG AEEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

TrEMBL top hitse value%identityAlignment
A0A1S3CGD3 uncharacterized protein LOC1035005972.9e-8375.43Show/hide
Query:  MASSLRPNPSPFLKSEIPTP-RSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR
        MASSL    S FLKS+I  P  ++ A AAV  S  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA+RSDPTKL  VLS+TLSRLLKADLVATLKELLR
Subjt:  MASSLRPNPSPFLKSEIPTP-RSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR

Query:  QEQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIEC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD
        QE+C LALEVF V+RSEY A+LGLYAE+AAALSRNG AEEIDRL+C+L+G +G IE   DDKGLIKLI+AVI G+RRESTVRIYRMMRR+GWGS  K DD
Subjt:  QEQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIEC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD

Query:  HTVKILSKGLRRLGEVELADEINREFQNLVAT
        + +K++SKGLRR+GE+ELADEINREFQ+LV +
Subjt:  HTVKILSKGLRRLGEVELADEINREFQNLVAT

A0A5A7V0E2 Uncharacterized protein1.3e-8375.86Show/hide
Query:  MASSLRPNPSPFLKSEIPTP-RSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR
        MASSLR   S FLKS+I  P  ++ A AAV  S  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA+RSDPTKL  VLS+TLSRLLKADLVATLKELLR
Subjt:  MASSLRPNPSPFLKSEIPTP-RSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR

Query:  QEQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIEC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD
        QE+C LALEVF V+RSEY A+LGLYAE+AAALSRNG AEEIDRL+C+L+G +G IE   DDKGLIKLI+AVI G+RRESTVRIYRMMRR+GWGS  K DD
Subjt:  QEQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIEC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD

Query:  HTVKILSKGLRRLGEVELADEINREFQNLVAT
        + +K++SKGLRR+GE+ELADEINREFQ+LV +
Subjt:  HTVKILSKGLRRLGEVELADEINREFQNLVAT

A0A6J1D998 uncharacterized protein LOC1110187721.5e-119100Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
        EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK

Query:  ILSKGLRRLGEVELADEINREFQNLVATY
        ILSKGLRRLGEVELADEINREFQNLVATY
Subjt:  ILSKGLRRLGEVELADEINREFQNLVATY

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic2.4e-9077.83Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic2.7e-8977.39Show/hide
Query:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKS+IP P   +A A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVF VVRSEYG DLG+YAE+AAALSRNG  EEIDRL+C+LE E   I+CDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+ V
Subjt:  EQCGLALEVFTVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADEIN +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

SwissProt top hitse value%identityAlignment
Q1PFH7 Pentatricopeptide repeat-containing protein At1g623501.3e-0832.85Show/hide
Query:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG
        +S E + A + LKR Q +   +L   + S +SRLLK+DLV+ L E  RQ Q  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  +L+ 
Subjt:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG

Query:  EGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS
        E E+  D      L+R  +  +     +R+Y  MR S
Subjt:  EGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic9.3e-4755.21Show/hide
Query:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFTVVRSEY-GADLGLYAELA
        V +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ+ C LA+ V + +R+EY   DL LYA++ 
Subjt:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFTVVRSEY-GADLGLYAELA

Query:  AALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN
         AL+RN   +EIDRL+ E++G  +   DDK L KLIRAV+G +RRES VR+Y +MR SGWGS + +AD++  ++LSKGL RLGE +LA +++
Subjt:  AALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.3e-0827.96Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL   + + + RLLK D++A + EL RQE+  LA+++F V++ +  Y  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEE

Query:  IDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL
           L  +++ E  +  D +   ++IR  +        + +Y  M +    S    ++   ++L KGL  L    L +++ ++F+ L
Subjt:  IDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein9.4e-1032.85Show/hide
Query:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG
        +S E + A + LKR Q +   +L   + S +SRLLK+DLV+ L E  RQ Q  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  +L+ 
Subjt:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG

Query:  EGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS
        E E+  D      L+R  +  +     +R+Y  MR S
Subjt:  EGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown6.6e-4855.21Show/hide
Query:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFTVVRSEY-GADLGLYAELA
        V +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ+ C LA+ V + +R+EY   DL LYA++ 
Subjt:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFTVVRSEY-GADLGLYAELA

Query:  AALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN
         AL+RN   +EIDRL+ E++G  +   DDK L KLIRAV+G +RRES VR+Y +MR SGWGS + +AD++  ++LSKGL RLGE +LA +++
Subjt:  AALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein9.4e-1027.96Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL   + + + RLLK D++A + EL RQE+  LA+++F V++ +  Y  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYAELAAALSRNGMAEE

Query:  IDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL
           L  +++ E  +  D +   ++IR  +        + +Y  M +    S    ++   ++L KGL  L    L +++ ++F+ L
Subjt:  IDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.6e-1731.12Show/hide
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAQ--------------RSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L  V+ S   RLLK D+VA L+ELLRQ +C LAL+VF  +R E  Y   + +Y 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAQ--------------RSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE--YGADLGLYA

Query:  ELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINRE
        ++   ++ N + EE++ L   ++ E  +  + +    L+  ++     +  +  Y  M+  G+    + D  + ++L  GL   GE+ L+  + ++
Subjt:  ELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTCTACGCCCCAATCCTTCGCCCTTCCTCAAATCTGAAATCCCAACTCCGAGGTCCGCCGCCGCCGTCGCCGCCGTAGTCTTCTCCGTTCCGGTGCGGTG
CGGGCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCACCGAAGCGATCCAAGCCATTCAATCTCTGAAACGGGCCCAAAGATCCGACCCGACGAAGC
TCCACCACGTACTCTCCAGCACGCTCTCCCGATTGCTCAAAGCGGACCTCGTCGCGACCTTGAAGGAGCTTCTCCGGCAGGAGCAGTGCGGCCTCGCGTTGGAGGTTTTC
ACCGTCGTCCGATCGGAGTACGGCGCCGACTTAGGGCTGTACGCGGAGCTGGCGGCGGCGCTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGTTGTGCGAATT
GGAGGGAGAGGGGGAGATCGAGTGCGACGATAAGGGTTTGATTAAGCTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGA
GGAGGAGCGGGTGGGGGTCCACCACCAAGGCCGATGATCACACGGTTAAGATTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAGTGGAGTTAGCTGATGAGATTAATAGG
GAATTCCAAAATTTAGTGGCCACTTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTCTACGCCCCAATCCTTCGCCCTTCCTCAAATCTGAAATCCCAACTCCGAGGTCCGCCGCCGCCGTCGCCGCCGTAGTCTTCTCCGTTCCGGTGCGGTG
CGGGCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCACCGAAGCGATCCAAGCCATTCAATCTCTGAAACGGGCCCAAAGATCCGACCCGACGAAGC
TCCACCACGTACTCTCCAGCACGCTCTCCCGATTGCTCAAAGCGGACCTCGTCGCGACCTTGAAGGAGCTTCTCCGGCAGGAGCAGTGCGGCCTCGCGTTGGAGGTTTTC
ACCGTCGTCCGATCGGAGTACGGCGCCGACTTAGGGCTGTACGCGGAGCTGGCGGCGGCGCTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGTTGTGCGAATT
GGAGGGAGAGGGGGAGATCGAGTGCGACGATAAGGGTTTGATTAAGCTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGA
GGAGGAGCGGGTGGGGGTCCACCACCAAGGCCGATGATCACACGGTTAAGATTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAGTGGAGTTAGCTGATGAGATTAATAGG
GAATTCCAAAATTTAGTGGCCACTTATTGA
Protein sequenceShow/hide protein sequence
MASSLRPNPSPFLKSEIPTPRSAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVF
TVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIECDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINR
EFQNLVATY