; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009502 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009502
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein THYLAKOID ASSEMBLY 8, chloroplastic
Genome locationscaffold813:1772135..1772809
RNA-Seq ExpressionMS009502
SyntenyMS009502
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014066.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8978.7Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_022150715.1 uncharacterized protein LOC111018772 [Momordica charantia]1.3e-11496.94Show/hide
Query:  MASSLRPNPSPFLKSQIPTPRS----AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MASSLRPNPSPFLKS+IPTPRS    AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTPRS----AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
        EQCGLALEVF VVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEI+CDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK

Query:  ILSKGLRRLGEVELADEINREFQNLVATY
        ILSKGLRRLGEVELADEINREFQNLVATY
Subjt:  ILSKGLRRLGEVELADEINREFQNLVATY

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata]1.1e-8978.7Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima]1.2e-8878.26Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYG DLG+YAE+AAALSRNG  EEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+ V
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADEIN +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo]2.9e-9079.13Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYGADLG+YAE+AAALSRNG AEEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

TrEMBL top hitse value%identityAlignment
A0A1S3CGD3 uncharacterized protein LOC1035005976.3e-8375.43Show/hide
Query:  MASSLRPNPSPFLKSQIPTP-----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR
        MASSL    S FLKSQI  P      +AAV  S  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA+RSDPTKL  VLS+TLSRLLKADLVATLKELLR
Subjt:  MASSLRPNPSPFLKSQIPTP-----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR

Query:  QEQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIQC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD
        QE+C LALEVFAV+RSEY A+LGLYAE+AAALSRNG AEEIDRL+C+L+G +G I+   DDKGLIKLI+AVI G+RRESTVRIYRMMRR+GWGS  K DD
Subjt:  QEQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIQC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD

Query:  HTVKILSKGLRRLGEVELADEINREFQNLVAT
        + +K++SKGLRR+GE+ELADEINREFQ+LV +
Subjt:  HTVKILSKGLRRLGEVELADEINREFQNLVAT

A0A5A7V0E2 Uncharacterized protein2.8e-8375.86Show/hide
Query:  MASSLRPNPSPFLKSQIPTP-----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR
        MASSLR   S FLKSQI  P      +AAV  S  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA+RSDPTKL  VLS+TLSRLLKADLVATLKELLR
Subjt:  MASSLRPNPSPFLKSQIPTP-----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLR

Query:  QEQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIQC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD
        QE+C LALEVFAV+RSEY A+LGLYAE+AAALSRNG AEEIDRL+C+L+G +G I+   DDKGLIKLI+AVI G+RRESTVRIYRMMRR+GWGS  K DD
Subjt:  QEQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEG-EGEIQC--DDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADD

Query:  HTVKILSKGLRRLGEVELADEINREFQNLVAT
        + +K++SKGLRR+GE+ELADEINREFQ+LV +
Subjt:  HTVKILSKGLRRLGEVELADEINREFQNLVAT

A0A6J1D998 uncharacterized protein LOC1110187726.3e-11596.94Show/hide
Query:  MASSLRPNPSPFLKSQIPTPRS----AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MASSLRPNPSPFLKS+IPTPRS    AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTPRS----AAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
        EQCGLALEVF VVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEI+CDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVK

Query:  ILSKGLRRLGEVELADEINREFQNLVATY
        ILSKGLRRLGEVELADEINREFQNLVATY
Subjt:  ILSKGLRRLGEVELADEINREFQNLVATY

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic5.3e-9078.7Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYGADLG+YAE+AAALSRNG  EEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+TV
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADE+N +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic5.9e-8978.26Show/hide
Query:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ
        MA SL    S FLKSQIP P     +A VV S+PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRA++SDPTKL  VLS+TLSRLLKADLVATLKELLRQ
Subjt:  MASSLRPNPSPFLKSQIPTP----RSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQ

Query:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV
        ++C LALEVFAVVRSEYG DLG+YAE+AAALSRNG  EEIDRL+C+LE E   IQCDDKGLIKLI+AVIGGDRRESTVRIYRMM+RSGWGST KADD+ V
Subjt:  EQCGLALEVFAVVRSEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGE-IQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTV

Query:  KILSKGLRRLGEVELADEINREFQNLVATY
        ++LSKGLRRLGE+E+ADEIN +FQ+LV ++
Subjt:  KILSKGLRRLGEVELADEINREFQNLVATY

SwissProt top hitse value%identityAlignment
Q1PFH7 Pentatricopeptide repeat-containing protein At1g623509.9e-0932.85Show/hide
Query:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG
        +S E + A + LKR Q +   +L   + S +SRLLK+DLV+ L E  RQ Q  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  +L+ 
Subjt:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG

Query:  EGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS
        E E+  D      L+R  +  +     +R+Y  MR S
Subjt:  EGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic7.0e-4755.21Show/hide
Query:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFAVVRSEY-GADLGLYAELA
        V +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ+ C LA+ V + +R+EY   DL LYA++ 
Subjt:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFAVVRSEY-GADLGLYAELA

Query:  AALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN
         AL+RN   +EIDRL+ E++G  + + DDK L KLIRAV+G +RRES VR+Y +MR SGWGS + +AD++  ++LSKGL RLGE +LA +++
Subjt:  AALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.7e-0827.96Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL   + + + RLLK D++A + EL RQE+  LA+++F V++ +  Y  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEE

Query:  IDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL
           L  +++ E  +  D +   ++IR  +        + +Y  M +    S    ++   ++L KGL  L    L +++ ++F+ L
Subjt:  IDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein7.1e-1032.85Show/hide
Query:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG
        +S E + A + LKR Q +   +L   + S +SRLLK+DLV+ L E  RQ Q  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  +L+ 
Subjt:  LSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEEIDRLLCELEG

Query:  EGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS
        E E+  D      L+R  +  +     +R+Y  MR S
Subjt:  EGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown5.0e-4855.21Show/hide
Query:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFAVVRSEY-GADLGLYAELA
        V +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ+ C LA+ V + +R+EY   DL LYA++ 
Subjt:  VPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSST---LSRLLKADLVATLKELLRQEQCGLALEVFAVVRSEY-GADLGLYAELA

Query:  AALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN
         AL+RN   +EIDRL+ E++G  + + DDK L KLIRAV+G +RRES VR+Y +MR SGWGS + +AD++  ++LSKGL RLGE +LA +++
Subjt:  AALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTT-KADDHTVKILSKGLRRLGEVELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-0927.96Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL   + + + RLLK D++A + EL RQE+  LA+++F V++ +  Y  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYAELAAALSRNGMAEE

Query:  IDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL
           L  +++ E  +  D +   ++IR  +        + +Y  M +    S    ++   ++L KGL  L    L +++ ++F+ L
Subjt:  IDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.2e-1731.12Show/hide
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAQ--------------RSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L  V+ S   RLLK D+VA L+ELLRQ +C LAL+VF  +R E  Y   + +Y 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAQ--------------RSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVRSE--YGADLGLYA

Query:  ELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINRE
        ++   ++ N + EE++ L   ++ E  +  + +    L+  ++     +  +  Y  M+  G+    + D  + ++L  GL   GE+ L+  + ++
Subjt:  ELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTCTACGCCCCAATCCTTCGCCCTTCCTCAAATCTCAAATCCCAACTCCGAGGTCCGCCGCCGTAGTCTTCTCCGTTCCGGTGCGGTGCGGGCCACGCGA
CAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCACCGAAGCGATCCAAGCCATTCAATCTCTGAAACGGGCCCAAAGATCCGACCCGACGAAGCTCCACCACGTCC
TCTCCAGCACGCTCTCCCGATTGCTCAAAGCGGACCTCGTCGCGACCTTGAAGGAGCTCCTCCGGCAGGAGCAGTGCGGCCTCGCGTTGGAGGTTTTCGCCGTCGTCCGA
TCGGAGTACGGCGCCGACTTAGGGCTGTACGCGGAGCTGGCGGCGGCGCTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGTTGTGCGAATTGGAGGGAGAGGG
GGAGATCCAGTGCGACGATAAGGGTTTGATTAAGCTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGAGGAGCGGGT
GGGGGTCCACCACCAAGGCCGATGATCACACGGTTAAGATTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAGTGGAGTTAGCTGATGAGATCAATAGGGAATTCCAAAAT
TTAGTGGCCACTTAT
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTCTACGCCCCAATCCTTCGCCCTTCCTCAAATCTCAAATCCCAACTCCGAGGTCCGCCGCCGTAGTCTTCTCCGTTCCGGTGCGGTGCGGGCCACGCGA
CAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCACCGAAGCGATCCAAGCCATTCAATCTCTGAAACGGGCCCAAAGATCCGACCCGACGAAGCTCCACCACGTCC
TCTCCAGCACGCTCTCCCGATTGCTCAAAGCGGACCTCGTCGCGACCTTGAAGGAGCTCCTCCGGCAGGAGCAGTGCGGCCTCGCGTTGGAGGTTTTCGCCGTCGTCCGA
TCGGAGTACGGCGCCGACTTAGGGCTGTACGCGGAGCTGGCGGCGGCGCTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGTTGTGCGAATTGGAGGGAGAGGG
GGAGATCCAGTGCGACGATAAGGGTTTGATTAAGCTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGAGGAGCGGGT
GGGGGTCCACCACCAAGGCCGATGATCACACGGTTAAGATTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAGTGGAGTTAGCTGATGAGATCAATAGGGAATTCCAAAAT
TTAGTGGCCACTTAT
Protein sequenceShow/hide protein sequence
MASSLRPNPSPFLKSQIPTPRSAAVVFSVPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFAVVR
SEYGADLGLYAELAAALSRNGMAEEIDRLLCELEGEGEIQCDDKGLIKLIRAVIGGDRRESTVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQN
LVATY