; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022062 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022062
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr05:20412587..20414427
RNA-Seq ExpressionHG10022062
SyntenyHG10022062
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033178.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-13882.05Show/hide
Query:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS
        MIRKRTND+ S NRFQRS+LWQKCT+F ALKQ HAFLVVNG NSS +ALRELIF+SAIAVSGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PA+AVS
Subjt:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS

Query:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---
        LYAQMENRGV+PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSN FVRNTLIYFHANCGDL+TARALFDAS+KRDVVPWSALTAGYARRGELD   
Subjt:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---

Query:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH
                               LG MEKARKLFDEAP KDVVTWNAMIAGYVLS LN++ALEMFDAMRD+GQRPDDVTMLSILSA+ADLGD+E+GKKI+
Subjt:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH

Query:  RSIFDMCCGDLS
        RSIFDM CGD+S
Subjt:  RSIFDMCCGDLS

XP_004141574.1 pentatricopeptide repeat-containing protein At5g15300 isoform X1 [Cucumis sativus]6.8e-14382.96Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTND+SFNRFQ+S+LWQKCT+F +LKQ HAFL+VNGLNS+T+ LRELIFVSAI VSGTMDYAHQ+FAQI++PDIFMWNTMIRGSAQ+LKPATAVSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---
        Y QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLK GFQSN FVRNTLIYFHANCGDLATARALFDAS+KR+VVPWSALTAGYARRG+LD+   
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---

Query:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                               GEMEKARKLFDE PKKDVVTWNAMIAGYVLSRLNK+ALEMFDAMRD+GQRPDDVTMLSILSASADLGD+EIGKKIHR
Subjt:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDLS
        SIFDMCCGDLS
Subjt:  SIFDMCCGDLS

XP_022133508.1 pentatricopeptide repeat-containing protein At5g15300 [Momordica charantia]2.9e-14183.23Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTNDS  NRFQRS+LWQKCTSF ALKQ HAFLVVNG NSST+ALRELIFV AIA+SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKP  AVS+
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----
        YAQMENRGVRPDKFTFSFVLKACTKLSWV LGFGIHGKV+KFGFQSN FVRNTLIYFHANCGDLATARALFDAS+KRDVVPWSALTAGYARRGELD    
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----

Query:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                              LGEMEKARKLFD+AP+KDVVTWNAMIAGYVL+ LNKQALEMFDAM D GQRPDDVTMLSILSASADLGD+E+GK IHR
Subjt:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDL
        SIF+MCCGDL
Subjt:  SIFDMCCGDL

XP_031742300.1 pentatricopeptide repeat-containing protein At5g15300 isoform X2 [Cucumis sativus]6.8e-14382.96Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTND+SFNRFQ+S+LWQKCT+F +LKQ HAFL+VNGLNS+T+ LRELIFVSAI VSGTMDYAHQ+FAQI++PDIFMWNTMIRGSAQ+LKPATAVSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---
        Y QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLK GFQSN FVRNTLIYFHANCGDLATARALFDAS+KR+VVPWSALTAGYARRG+LD+   
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---

Query:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                               GEMEKARKLFDE PKKDVVTWNAMIAGYVLSRLNK+ALEMFDAMRD+GQRPDDVTMLSILSASADLGD+EIGKKIHR
Subjt:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDLS
        SIFDMCCGDLS
Subjt:  SIFDMCCGDLS

XP_038889201.1 pentatricopeptide repeat-containing protein At5g15300, partial [Benincasa hispida]5.8e-14283.82Show/hide
Query:  RKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYA
        R+ TNDSSFNRFQRS+LWQKCTSF  LKQ HAFL+VNGLNSST+ALRELIFVSAIAVSGT++YAHQ+FAQITEPDIF+WNTMIRGSAQSLKPATAVSLY 
Subjt:  RKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYA

Query:  QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD------
        QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLI FHANCGDLATAR LFDAS+KRDVVPWSALT GYARRGELD      
Subjt:  QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD------

Query:  --------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI
                            LGE+EKA++LFDEAPKKDVVTWNAMIAGYVLS+LNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGD+EIGKKIHR+I
Subjt:  --------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI

Query:  FDMCCGDLS
        F+MCCGDLS
Subjt:  FDMCCGDLS

TrEMBL top hitse value%identityAlignment
A0A0A0KU97 Uncharacterized protein3.3e-14382.96Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTND+SFNRFQ+S+LWQKCT+F +LKQ HAFL+VNGLNS+T+ LRELIFVSAI VSGTMDYAHQ+FAQI++PDIFMWNTMIRGSAQ+LKPATAVSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---
        Y QMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLK GFQSN FVRNTLIYFHANCGDLATARALFDAS+KR+VVPWSALTAGYARRG+LD+   
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDL---

Query:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                               GEMEKARKLFDE PKKDVVTWNAMIAGYVLSRLNK+ALEMFDAMRD+GQRPDDVTMLSILSASADLGD+EIGKKIHR
Subjt:  -----------------------GEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDLS
        SIFDMCCGDLS
Subjt:  SIFDMCCGDLS

A0A5D3BN20 Pentatricopeptide repeat-containing protein2.3e-13680.39Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTND+ FNRF  S+LWQKCT+F ALKQ HAFL+VNGLNS+ + LRELIFVSA+ VSGTMDYAHQ+FAQIT+PDIFMWNTMIRGS QSLKPATAVSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----
        Y QM+NRGVRPDKFTFSFVLKACTKLSW KLG  IHGK+LK GFQSN FVRNTLIYFHANCGDLA ARALFD S+KRDVVPWSA+TAGYARRG+LD    
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----

Query:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                              LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNK+ALEMFDAMR MGQRPDDVTMLSILSASADLGD+EIGKKIHR
Subjt:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDLS
        SIFDM CGDLS
Subjt:  SIFDMCCGDLS

A0A6J1BZB3 pentatricopeptide repeat-containing protein At5g153001.4e-14183.23Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIRKRTNDS  NRFQRS+LWQKCTSF ALKQ HAFLVVNG NSST+ALRELIFV AIA+SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKP  AVS+
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----
        YAQMENRGVRPDKFTFSFVLKACTKLSWV LGFGIHGKV+KFGFQSN FVRNTLIYFHANCGDLATARALFDAS+KRDVVPWSALTAGYARRGELD    
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----

Query:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                              LGEMEKARKLFD+AP+KDVVTWNAMIAGYVL+ LNKQALEMFDAM D GQRPDDVTMLSILSASADLGD+E+GK IHR
Subjt:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFDMCCGDL
        SIF+MCCGDL
Subjt:  SIFDMCCGDL

A0A6J1GRS9 pentatricopeptide repeat-containing protein At5g153001.8e-13680.77Show/hide
Query:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS
        MIRKRTND+ S NRFQRS+LWQKCT+F ALKQ HAFLVVNG NSS +ALRELIF+S+IAVSGTM YAHQVFAQITEPDIF+WNTMIRGSAQSL PA+AVS
Subjt:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS

Query:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---
        LYAQMENRGV+PDKFTFSFVLKACTKLSWVKLGFGIHGKV+KFGFQSN FVRNTLIYFHANCGDL+TARALFDAS+K DVVPWSALTAGYARRGELD   
Subjt:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---

Query:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH
                               LG MEKARKLFDEAP KDVVTWNAMIAGYVLS LN++ALEMFDAMRD+GQRPDDVTMLSILSA+ADLGD+E+GKKI+
Subjt:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH

Query:  RSIFDMCCGDLS
        RSIFDM CGD+S
Subjt:  RSIFDMCCGDLS

A0A6J1JPF2 pentatricopeptide repeat-containing protein At5g153001.6e-13781.41Show/hide
Query:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS
        MIRKRTND+ S NRFQRS+LWQKCT+F  LKQ HAFLVVNG NSS +ALRELIF+SAIAVSGTM YAHQVFAQITEPDIFMWNTMIRGSAQSL PA+AVS
Subjt:  MIRKRTNDS-SFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVS

Query:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---
        LYAQMENRGV+PDKFTFSFVLKACTKLSWVKLGFGIHGKV+KFGFQSN FVRNTLIYFHANCGDL+TARALFDAS+KRDVVPWSALTAGYARRGELD   
Subjt:  LYAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD---

Query:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH
                               LG MEKARKLFDEAP KDVVTWNAMIAGYVLS LN++ALEMFDAMRD+GQRPDDVTMLSILSA+ADLGD+E+GKKI+
Subjt:  -----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIH

Query:  RSIFDMCCGDLS
        RSIFDM CGD+S
Subjt:  RSIFDMCCGDLS

SwissProt top hitse value%identityAlignment
P0C8Q7 Pentatricopeptide repeat-containing protein At5g083051.1e-4536.4Show/hide
Query:  SSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRG
        SS +   +S L  +C S   L + H  L+  GL+     + + +  SA++ SG +DYA++  +++++P  + WN +IRG + S  P  ++S+Y QM   G
Subjt:  SSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRG

Query:  VRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFD
        + PD  T+ F++K+ ++LS  KLG  +H  V+K G + + F+ NTLI+ + +  D A+AR LFD    +++V W+++   YA+      G++  AR +FD
Subjt:  VRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFD

Query:  EAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMG-QRPDDVTMLSILSASADLGDMEIGKKIHRSIFDM
        E  ++DVVTW++MI GYV      +ALE+FD M  MG  + ++VTM+S++ A A LG +  GK +HR I D+
Subjt:  EAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMG-QRPDDVTMLSILSASADLGDMEIGKKIHRSIFDM

Q9FG16 Pentatricopeptide repeat-containing protein At5g065405.4e-4233.08Show/hide
Query:  LWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAV--------SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVR
        L Q C+SF  LK  H FL+   L S       L+   A+ V        +  + YA+ +F+QI  P++F++N +IR  +   +P+ A   Y QM    + 
Subjt:  LWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAV--------SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVR

Query:  PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEA
        PD  TF F++KA +++  V +G   H ++++FGFQ++ +V N+L++ +ANCG +A A  +F     RDVV W+++ AGY +      G +E AR++FDE 
Subjt:  PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEA

Query:  PKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI
        P +++ TW+ MI GY  +   ++A+++F+ M+  G   ++  M+S++S+ A LG +E G++ +  +
Subjt:  PKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.3e-4841.98Show/hide
Query:  NLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELI-FVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVRPDKFTF
        +L   C +  +L+  HA ++  GL+++  AL +LI F         + YA  VF  I EP++ +WNTM RG A S  P +A+ LY  M + G+ P+ +TF
Subjt:  NLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELI-FVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVRPDKFTF

Query:  SFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVV
         FVLK+C K    K G  IHG VLK G   + +V  +LI  +   G L  A  +FD S  RDVV ++AL  GYA RG +     E A+KLFDE P KDVV
Subjt:  SFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVV

Query:  TWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSIFD
        +WNAMI+GY  +   K+ALE+F  M     RPD+ TM++++SA A  G +E+G+++H  I D
Subjt:  TWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSIFD

Q9LXF2 Pentatricopeptide repeat-containing protein At5g153004.2e-7949.34Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIR++TND + NR +R  LWQ C +   LKQ HA +VVNGL S+ + + ELI+ ++++V G + YAH++F +I +PD+ + N ++RGSAQS+KP   VSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----
        Y +ME RGV PD++TF+FVLKAC+KL W   GF  HGKV++ GF  N++V+N LI FHANCGDL  A  LFD S+K   V WS++T+GYA+RG++D    
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----

Query:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                                EM+ AR+LFD   +KDVVTWNAMI+GYV     K+AL +F  MRD G+ PD VT+LS+LSA A LGD+E GK++H 
Subjt:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFD
         I +
Subjt:  SIFD

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic5.4e-4237.98Show/hide
Query:  KCTSFHALKQCHAFLVVNGLNSST-AALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQM--ENRGVRPDKFTFSF
        +C++   LKQ HA L+  GL S T  A R L F    A    M+YA+ VF +I   + F+WNT+IRG ++S  P  A+S++  M   +  V+P + T+  
Subjt:  KCTSFHALKQCHAFLVVNGLNSST-AALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQM--ENRGVRPDKFTFSF

Query:  VLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVVTW
        V KA  +L   + G  +HG V+K G + + F+RNT+++ +  CG L  A  +F      DVV W+++  G+A+ G +D     +A+ LFDE P+++ V+W
Subjt:  VLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVVTW

Query:  NAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI
        N+MI+G+V +   K AL+MF  M++   +PD  TM+S+L+A A LG  E G+ IH  I
Subjt:  NAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-4941.98Show/hide
Query:  NLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELI-FVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVRPDKFTF
        +L   C +  +L+  HA ++  GL+++  AL +LI F         + YA  VF  I EP++ +WNTM RG A S  P +A+ LY  M + G+ P+ +TF
Subjt:  NLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELI-FVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVRPDKFTF

Query:  SFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVV
         FVLK+C K    K G  IHG VLK G   + +V  +LI  +   G L  A  +FD S  RDVV ++AL  GYA RG +     E A+KLFDE P KDVV
Subjt:  SFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVV

Query:  TWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSIFD
        +WNAMI+GY  +   K+ALE+F  M     RPD+ TM++++SA A  G +E+G+++H  I D
Subjt:  TWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSIFD

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.8e-4337.98Show/hide
Query:  KCTSFHALKQCHAFLVVNGLNSST-AALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQM--ENRGVRPDKFTFSF
        +C++   LKQ HA L+  GL S T  A R L F    A    M+YA+ VF +I   + F+WNT+IRG ++S  P  A+S++  M   +  V+P + T+  
Subjt:  KCTSFHALKQCHAFLVVNGLNSST-AALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQM--ENRGVRPDKFTFSF

Query:  VLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVVTW
        V KA  +L   + G  +HG V+K G + + F+RNT+++ +  CG L  A  +F      DVV W+++  G+A+ G +D     +A+ LFDE P+++ V+W
Subjt:  VLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVVTW

Query:  NAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI
        N+MI+G+V +   K AL+MF  M++   +PD  TM+S+L+A A LG  E G+ IH  I
Subjt:  NAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-4333.08Show/hide
Query:  LWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAV--------SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVR
        L Q C+SF  LK  H FL+   L S       L+   A+ V        +  + YA+ +F+QI  P++F++N +IR  +   +P+ A   Y QM    + 
Subjt:  LWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAV--------SGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVR

Query:  PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEA
        PD  TF F++KA +++  V +G   H ++++FGFQ++ +V N+L++ +ANCG +A A  +F     RDVV W+++ AGY +      G +E AR++FDE 
Subjt:  PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEA

Query:  PKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI
        P +++ TW+ MI GY  +   ++A+++F+ M+  G   ++  M+S++S+ A LG +E G++ +  +
Subjt:  PKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSI

AT5G08305.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-4736.4Show/hide
Query:  SSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRG
        SS +   +S L  +C S   L + H  L+  GL+     + + +  SA++ SG +DYA++  +++++P  + WN +IRG + S  P  ++S+Y QM   G
Subjt:  SSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRG

Query:  VRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFD
        + PD  T+ F++K+ ++LS  KLG  +H  V+K G + + F+ NTLI+ + +  D A+AR LFD    +++V W+++   YA+      G++  AR +FD
Subjt:  VRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFD

Query:  EAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMG-QRPDDVTMLSILSASADLGDMEIGKKIHRSIFDM
        E  ++DVVTW++MI GYV      +ALE+FD M  MG  + ++VTM+S++ A A LG +  GK +HR I D+
Subjt:  EAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMG-QRPDDVTMLSILSASADLGDMEIGKKIHRSIFDM

AT5G15300.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-8049.34Show/hide
Query:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL
        MIR++TND + NR +R  LWQ C +   LKQ HA +VVNGL S+ + + ELI+ ++++V G + YAH++F +I +PD+ + N ++RGSAQS+KP   VSL
Subjt:  MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSL

Query:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----
        Y +ME RGV PD++TF+FVLKAC+KL W   GF  HGKV++ GF  N++V+N LI FHANCGDL  A  LFD S+K   V WS++T+GYA+RG++D    
Subjt:  YAQMENRGVRPDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELD----

Query:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR
                                EM+ AR+LFD   +KDVVTWNAMI+GYV     K+AL +F  MRD G+ PD VT+LS+LSA A LGD+E GK++H 
Subjt:  ----------------------LGEMEKARKLFDEAPKKDVVTWNAMIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHR

Query:  SIFD
         I +
Subjt:  SIFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCGAAAGAGAACAAATGACAGTAGCTTCAACCGCTTTCAACGGTCTAACTTATGGCAGAAATGCACCAGCTTTCATGCTTTGAAGCAATGTCATGCCTTTCTGGT
CGTCAATGGCCTTAATTCAAGCACAGCTGCGCTCAGAGAACTCATTTTCGTCAGTGCAATAGCTGTTTCTGGGACAATGGATTATGCCCACCAAGTGTTCGCTCAAATTA
CTGAACCGGATATCTTCATGTGGAACACCATGATCAGGGGTTCGGCTCAGAGCTTGAAGCCAGCAACTGCTGTTTCTCTTTATGCCCAGATGGAAAATCGTGGGGTTAGG
CCTGATAAATTTACCTTCTCCTTTGTGCTTAAGGCCTGCACTAAGCTTTCTTGGGTTAAGTTGGGATTCGGGATTCATGGGAAGGTTCTGAAGTTTGGGTTTCAATCCAA
TAAATTTGTAAGGAATACTCTTATTTATTTCCATGCCAATTGTGGCGATTTGGCCACTGCAAGAGCACTTTTTGACGCTTCTTCCAAAAGGGATGTTGTGCCTTGGTCAG
CTTTGACAGCAGGCTATGCAAGAAGAGGGGAATTGGATCTTGGGGAGATGGAGAAGGCAAGGAAACTGTTTGATGAAGCTCCGAAGAAAGATGTTGTGACTTGGAATGCA
ATGATTGCAGGATACGTGCTTTCTAGATTGAACAAGCAAGCCTTGGAGATGTTTGATGCAATGAGAGATATGGGACAAAGGCCGGATGATGTGACAATGCTGAGTATCTT
GTCTGCTTCAGCTGATTTGGGTGATATGGAAATTGGAAAGAAGATACATCGTTCCATTTTCGACATGTGCTGCGGAGATTTAAGCCACAGAAACAAGTGCGCAGATCCAG
CTGTGGTCAAAGGCATGACTGCGGACCATGATTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCCGAAAGAGAACAAATGACAGTAGCTTCAACCGCTTTCAACGGTCTAACTTATGGCAGAAATGCACCAGCTTTCATGCTTTGAAGCAATGTCATGCCTTTCTGGT
CGTCAATGGCCTTAATTCAAGCACAGCTGCGCTCAGAGAACTCATTTTCGTCAGTGCAATAGCTGTTTCTGGGACAATGGATTATGCCCACCAAGTGTTCGCTCAAATTA
CTGAACCGGATATCTTCATGTGGAACACCATGATCAGGGGTTCGGCTCAGAGCTTGAAGCCAGCAACTGCTGTTTCTCTTTATGCCCAGATGGAAAATCGTGGGGTTAGG
CCTGATAAATTTACCTTCTCCTTTGTGCTTAAGGCCTGCACTAAGCTTTCTTGGGTTAAGTTGGGATTCGGGATTCATGGGAAGGTTCTGAAGTTTGGGTTTCAATCCAA
TAAATTTGTAAGGAATACTCTTATTTATTTCCATGCCAATTGTGGCGATTTGGCCACTGCAAGAGCACTTTTTGACGCTTCTTCCAAAAGGGATGTTGTGCCTTGGTCAG
CTTTGACAGCAGGCTATGCAAGAAGAGGGGAATTGGATCTTGGGGAGATGGAGAAGGCAAGGAAACTGTTTGATGAAGCTCCGAAGAAAGATGTTGTGACTTGGAATGCA
ATGATTGCAGGATACGTGCTTTCTAGATTGAACAAGCAAGCCTTGGAGATGTTTGATGCAATGAGAGATATGGGACAAAGGCCGGATGATGTGACAATGCTGAGTATCTT
GTCTGCTTCAGCTGATTTGGGTGATATGGAAATTGGAAAGAAGATACATCGTTCCATTTTCGACATGTGCTGCGGAGATTTAAGCCACAGAAACAAGTGCGCAGATCCAG
CTGTGGTCAAAGGCATGACTGCGGACCATGATTTCTAA
Protein sequenceShow/hide protein sequence
MIRKRTNDSSFNRFQRSNLWQKCTSFHALKQCHAFLVVNGLNSSTAALRELIFVSAIAVSGTMDYAHQVFAQITEPDIFMWNTMIRGSAQSLKPATAVSLYAQMENRGVR
PDKFTFSFVLKACTKLSWVKLGFGIHGKVLKFGFQSNKFVRNTLIYFHANCGDLATARALFDASSKRDVVPWSALTAGYARRGELDLGEMEKARKLFDEAPKKDVVTWNA
MIAGYVLSRLNKQALEMFDAMRDMGQRPDDVTMLSILSASADLGDMEIGKKIHRSIFDMCCGDLSHRNKCADPAVVKGMTADHDF