CuGenDBv2

Sgr015661 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015661
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Genome location: tig00004836:778969..779772
RNA-Seq Expression: Sgr015661
Synteny: Sgr015661
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
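
The genome location above follows a `contig:start..end` pattern. The following is a minimal sketch of how such a string might be parsed; the names `Location` and `parse_location` are hypothetical, and the code assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (assumed 1-based, inclusive on both ends)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "tig00004836:778969..779772".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'contig:start..end' string into its three components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["contig"], start, end)


loc = parse_location("tig00004836:778969..779772")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the computed span can be compared against the length of the CDS sequence listed for this gene as a quick consistency check.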


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022150715.1 uncharacterized protein LOC111018772 [Momordica charantia] | e-value: 1.7e-97 | % identity: 82.98
Query:  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKE
        +S+LRPNP S FLKS+IP PR    +AAA  AV FSVPVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAQRSDP +L HVLS+TLSRLLKADLVATLKE
Subjt:  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKE

Query:  LLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKA
        LLRQ+QC LALEVF VVRSE+GADLGLYAELAAALSRNGMAEEIDRL+ +L+GEG  +I+CDDKGLIKLIRAVIGGDRRESTVRIYR+MRRSGWGST KA
Subjt:  LLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKA

Query:  DDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY
        DD+ VK+LS+GLRRLGE++LADEINREFQNLV TY
Subjt:  DDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata] | e-value: 9.1e-91 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQIPIP P+    +AA  V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAVVRSE+GADLG+YAE+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M+RSGWGSTIKADDY V+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVGTY
        +GLRRLGEM++ADE+N +FQ+LVG++
Subjt:  EGLRRLGEMDLADEINREFQNLVGTY

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima] | e-value: 1.2e-90 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAVVRSE+G DLG+YAE+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M+RSGWGSTIKADDYMV+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVGTY
        +GLRRLGEM++ADEIN +FQ+LVG++
Subjt:  EGLRRLGEMDLADEINREFQNLVGTY

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] | e-value: 2.0e-90 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAVVRSE+GADLG+YAE+AAALSRNG AEEIDRLV DL+ E +  IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M+RSGWGSTIKADDY V+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVGTY
        +GLRRLGEM++ADE+N +FQ+LVG++
Subjt:  EGLRRLGEMDLADEINREFQNLVGTY

XP_038898717.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida] | e-value: 3.1e-91 | % identity: 83.48
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQI I  P+P +AAAAVAV  S+PVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRA+RSDP +LQ VLS TLSRLLKADLVA+LKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAV+RSE+GADLG+YAE+AAALSRNG AEEIDRLV DLDG G + IQ DDKGLIKLI+AVI GDRRESTVRIYR+MRR+GWGSTIKADDY+V+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVG
        +GLRR GEM+LADEINREFQ+LVG
Subjt:  EGLRRLGEMDLADEINREFQNLVG

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CGD3 uncharacterized protein LOC103500597 | e-value: 8.3e-90 | % identity: 72.39
Query:  MFNF-PTPGEGYNSHCEIAHPIHYHGLRSAMAFSSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSL
        +F+F P   +GY SHCE   P     L +AMA S        STFLKSQI IP P    A AAVAV F   VRCGPRDNRGPLVKGRTLSIEAIQAIQSL
Subjt:  MFNF-PTPGEGYNSHCEIAHPIHYHGLRSAMAFSSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSL

Query:  KRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDG-EGKIQIQCDDKGL
        KRA+RSDP +LQ VLS TLSRLLKADLVATLKELLRQ++CALALEVFAV+RSE+ A+LGLYAE+AAALSRNG AEEIDRLV DLDG +G I+   DDKGL
Subjt:  KRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDG-EGKIQIQCDDKGL

Query:  IKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGT
        IKLI+AVI G+RRESTVRIYR+MRR+GWGS IK DDYM+KV+S+GLRR+GE++LADEINREFQ+LVG+
Subjt:  IKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGT

A0A5A7V0E2 Uncharacterized protein | e-value: 3.3e-86 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQI IP P    A AAVAV F   VRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRA+RSDP +LQ VLS TLSRLLKADLVATLKELLRQ++CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDG-EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVL
        ALEVFAV+RSE+ A+LGLYAE+AAALSRNG AEEIDRLV DLDG +G I+   DDKGLIKLI+AVI G+RRESTVRIYR+MRR+GWGS IK DDYM+KV+
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDG-EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVL

Query:  SEGLRRLGEMDLADEINREFQNLVGT
        S+GLRR+GE++LADEINREFQ+LVG+
Subjt:  SEGLRRLGEMDLADEINREFQNLVGT

A0A6J1D998 uncharacterized protein LOC111018772 | e-value: 8.3e-98 | % identity: 82.98
Query:  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKE
        +S+LRPNP S FLKS+IP PR    +AAA  AV FSVPVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAQRSDP +L HVLS+TLSRLLKADLVATLKE
Subjt:  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKE

Query:  LLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKA
        LLRQ+QC LALEVF VVRSE+GADLGLYAELAAALSRNGMAEEIDRL+ +L+GEG  +I+CDDKGLIKLIRAVIGGDRRESTVRIYR+MRRSGWGST KA
Subjt:  LLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKA

Query:  DDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY
        DD+ VK+LS+GLRRLGE++LADEINREFQNLV TY
Subjt:  DDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic | e-value: 4.4e-91 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQIPIP P+    +AA  V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAVVRSE+GADLG+YAE+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M+RSGWGSTIKADDY V+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVGTY
        +GLRRLGEM++ADE+N +FQ+LVG++
Subjt:  EGLRRLGEMDLADEINREFQNLVGTY

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic | e-value: 5.7e-91 | % identity: 79.2
Query:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL
        STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRA++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CAL
Subjt:  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCAL

Query:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS
        ALEVFAVVRSE+G DLG+YAE+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M+RSGWGSTIKADDYMV+VLS
Subjt:  ALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS

Query:  EGLRRLGEMDLADEINREFQNLVGTY
        +GLRRLGEM++ADEIN +FQ+LVG++
Subjt:  EGLRRLGEMDLADEINREFQNLVGTY

SwissProt top hits (e-value, % identity, alignment)
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | e-value: 5.8e-08 | % identity: 31.65
Query:  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDG
        +S E + A + LKR Q +   RL   + + +SRLLK+DLV+ L E  RQ+Q  L ++++ VVR E  +  D+  Y ++   L+RN   +E  ++  DL  
Subjt:  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDG

Query:  EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRS
        E   ++  D      L+R  +  +     +R+Y  MR S
Subjt:  EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | e-value: 8.9e-49 | % identity: 56.02
Query:  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAAL
        V +RCGPRDNRGPL+KGR LS EAIQ+IQSLKRA R+  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+E+   DL LYA++  AL
Subjt:  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAAL

Query:  SRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRRLGEMDLADEIN
        +RN   +EIDRL+G++DG   I  + DDK L KLIRAV+G +RRES VR+Y LMR SGWGS + +AD+Y+ +VLS+GL RLGE DLA +++
Subjt:  SRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRRLGEMDLADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e-value: 1.9e-06 | % identity: 26.6
Query:  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  +L   +   + RLLK D++A + EL RQ++ ALA+++F V++ +  +  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEE

Query:  IDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNL
           L   +  E       D +   ++IR  +        + +Y  M +    S    ++   +VL +GL  L    L +++ ++F+ L
Subjt:  IDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 4.2e-09 | % identity: 31.65
Query:  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDG
        +S E + A + LKR Q +   RL   + + +SRLLK+DLV+ L E  RQ+Q  L ++++ VVR E  +  D+  Y ++   L+RN   +E  ++  DL  
Subjt:  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDG

Query:  EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRS
        E   ++  D      L+R  +  +     +R+Y  MR S
Subjt:  EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | e-value: 6.3e-50 | % identity: 56.02
Query:  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAAL
        V +RCGPRDNRGPL+KGR LS EAIQ+IQSLKRA R+  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+E+   DL LYA++  AL
Subjt:  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAAL

Query:  SRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRRLGEMDLADEIN
        +RN   +EIDRL+G++DG   I  + DDK L KLIRAV+G +RRES VR+Y LMR SGWGS + +AD+Y+ +VLS+GL RLGE DLA +++
Subjt:  SRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRRLGEMDLADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 1.3e-07 | % identity: 26.6
Query:  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEE
        RGPL +G+ L   EA+  I  LKR  + D  +L   +   + RLLK D++A + EL RQ++ ALA+++F V++ +  +  D+ +Y +L  +L+++   +E
Subjt:  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEE

Query:  IDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNL
           L   +  E       D +   ++IR  +        + +Y  M +    S    ++   +VL +GL  L    L +++ ++F+ L
Subjt:  IDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | e-value: 1.4e-17 | % identity: 31.37
Query:  NRGPLVKGRTLSIEAIQAIQSLKRAQ--------------RSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYA
        NR PL +GR LSIEAIQA+Q+LKRA                S  A L  V+ +   RLLK D+VA L+ELLRQ++C+LAL+VF  +R E  +   + +Y 
Subjt:  NRGPLVKGRTLSIEAIQAIQSLKRAQ--------------RSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYA

Query:  ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQ
        ++   ++ N + EE++ L   +  E  +  + +      L+  ++     +  +  Y  M+  G+    + D    +VL  GL   GEM L+  + ++  
Subjt:  ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQ

Query:  NLVG
           G
Subjt:  NLVG
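
In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and other mismatches or gaps with a space. As a rough sketch of how identities and positives could be tallied from such a midline (the function name `midline_stats` is hypothetical, and it assumes the midline has already been trimmed to the same columns as the query segment):

```python
def midline_stats(midline: str) -> dict:
    """Tally a BLAST-style midline that is already trimmed to the aligned columns.

    Letters are counted as identities, '+' as conservative substitutions,
    and anything else (normally spaces) as non-matching columns.
    """
    length = len(midline)
    if length == 0:
        raise ValueError("empty midline")
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": 100.0 * identities / length,
    }


# Final block of the first GenBank hit (XP_022150715.1), trimmed to the
# 35 aligned columns of the query segment.
stats = midline_stats("DD+ VK+LS+GLRRLGE++LADEINREFQNLV TY")
print(stats)
```

The percent identity reported for each hit is computed over the whole alignment, so the value for a single block will generally differ from the figure in the hit line.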


Sequences
CDS sequence
ATGTTTAATTTCCCAACGCCCGGCGAGGGGTATAATAGTCATTGCGAAATCGCCCACCCAATCCACTACCACGGTCTGCGCTCCGCCATGGCTTTTTCTTCCGCTCTTCG
CCCCAATCCTTCCAGTACCTTCCTCAAATCTCAAATCCCAATTCCGAGGCCCGTGCCCCCCGCCGCCGCCGCCGCCGTCGCCGTCTTTTTCTCCGTTCCAGTACGCTGCG
GCCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCATCGAGGCAATCCAAGCCATTCAATCACTGAAACGGGCACAAAGATCCGACCCGGCAAGGCTC
CAACACGTCCTCTCCAATACCCTCTCGCGATTGCTCAAAGCAGACCTCGTCGCGACGTTGAAGGAGCTCCTCCGGCAGGACCAGTGCGCCCTCGCCTTGGAGGTTTTCGC
CGTCGTCCGATCGGAGTTCGGAGCCGACCTGGGGTTGTACGCGGAGCTGGCTGCGGCACTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGGTGGGTGATTTGG
ATGGAGAGGGGAAGATCCAGATCCAGTGTGACGATAAGGGTTTGATTAAGTTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCAACGGTCAGGATTTATAGGCTG
ATGAGGAGGAGCGGTTGGGGGTCCACCATCAAAGCTGATGATTACATGGTTAAGGTTTTGAGCGAGGGTTTAAGGAGACTTGGAGAAATGGACTTGGCTGATGAGATCAA
TAGGGAATTTCAAAATTTAGTGGGCACTTATTGA
mRNA sequence
ATGTTTAATTTCCCAACGCCCGGCGAGGGGTATAATAGTCATTGCGAAATCGCCCACCCAATCCACTACCACGGTCTGCGCTCCGCCATGGCTTTTTCTTCCGCTCTTCG
CCCCAATCCTTCCAGTACCTTCCTCAAATCTCAAATCCCAATTCCGAGGCCCGTGCCCCCCGCCGCCGCCGCCGCCGTCGCCGTCTTTTTCTCCGTTCCAGTACGCTGCG
GCCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCATCGAGGCAATCCAAGCCATTCAATCACTGAAACGGGCACAAAGATCCGACCCGGCAAGGCTC
CAACACGTCCTCTCCAATACCCTCTCGCGATTGCTCAAAGCAGACCTCGTCGCGACGTTGAAGGAGCTCCTCCGGCAGGACCAGTGCGCCCTCGCCTTGGAGGTTTTCGC
CGTCGTCCGATCGGAGTTCGGAGCCGACCTGGGGTTGTACGCGGAGCTGGCTGCGGCACTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGGTGGGTGATTTGG
ATGGAGAGGGGAAGATCCAGATCCAGTGTGACGATAAGGGTTTGATTAAGTTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCAACGGTCAGGATTTATAGGCTG
ATGAGGAGGAGCGGTTGGGGGTCCACCATCAAAGCTGATGATTACATGGTTAAGGTTTTGAGCGAGGGTTTAAGGAGACTTGGAGAAATGGACTTGGCTGATGAGATCAA
TAGGGAATTTCAAAATTTAGTGGGCACTTATTGA
Protein sequence
MFNFPTPGEGYNSHCEIAHPIHYHGLRSAMAFSSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARL
QHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRL
MRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY
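
The protein sequence above corresponds to a translation of the CDS. A minimal sketch of that translation step is shown below; it assumes the standard nuclear genetic code (the record does not say which table was used), and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    outside A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First and last twelve nucleotides of the CDS listed above.
print(translate("ATGTTTAATTTC"))
print(translate("GGCACTTATTGA"))
```

Passing the full CDS (with its line breaks left in) to `translate` and comparing the result with the listed protein sequence is one way to check that the two entries agree.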