; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000726 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000726
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr04:4253574..4255352
RNA-Seq ExpressionPI0000726
SyntenyPI0000726
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044002.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]8.2e-26980.54Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MSNQ IATNI+KLILKSGL+PFKTTPSLLSKLDSRVTQ +LSD NVPT+SCLRFFNFLR+NPSCKPDLPAHLILVCRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEPKFV+ FCDMLFRVYMD RMFDS+LEVFDYARKNGFEIEERSCFEFLLALKRSGN+EL VEFLRQ+VDSGIEIRV SWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKAL+DELVCKGFKPNV TYNTLLNGYIEIKD GGVNEILSLMEKD VDYNV TYT+LIEWYSR SKIEE+EKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC FGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFTCNIIASGFCR +R+EEARRLLLTMEERGVAPD VSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG
             +T+TYTSL+N ECASGNVDRALELFNEMPQRGLNRN VTYTVMISGLSK GRADEAFKLYDEMN +GI+PDDRI+SSLI SLH+ G
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG

XP_004137973.1 pentatricopeptide repeat-containing protein At2g32630 [Cucumis sativus]3.5e-26479.86Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MS Q IA NI+KLILKSGL+PFKTTPSLLS  DSRV QLVLSD N+PT+SCLRFF+FLRQNPS KPDLPAHLIL  RLYRARKFA MKNVL FIVNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEP  V+KFCDMLFRVYMDNRMFDS+LEVFDYARK GFEI+ERSCFEFLLALKRSGN+EL VEFLRQMVDSGIEIRVCSWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKALMDELVCKGFKP+V TYNTLLNGYIEIKDVGGVNEILSLMEK+ VDYNVTTYT+LIEWYSRSSKIEEAEKLFDEMLKKGIEPDVY+Y
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWNC FGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAG+M+AAEMMVNDMQSKGVDVN+VIFNTL+DGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        D FTCNIIASGFCRSNR+EEA+RLLLTMEERGVAP+ VSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG
             +T+TYTSLI+ E ASGNVDRALELFNEMPQ GLNRNVVTYTV+ISGLSKDGRADEAFKLYDEMN EGIVPDD IYSSLI SLHK G
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG

XP_008442691.1 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g32630-like [Cucumis melo]4.0e-26880.37Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MSNQ IATNI+KLILKSGL+PFKTTPSLLSKLDSRVTQ +LSD NVPT+SCLRFFNFLR+NPSCKPDLPAHLILVCRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEPKFV+ FCDMLFRVYMD RMFDS+LEVFDYARKNGFEIEERSCFEFLLALKRSGN+EL VEFLRQ+VDSGIEIRV SWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKAL+DELVCKGFKPNV TYNTLLNGYIEIKD GGVNEILSLMEKD VDYNV TYT+LIEWYSR SKIEE+EKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC FGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYC KGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFTCNIIASGFCR +R+EEARRLLLTMEERGVAPD VSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG
             +T+TYTSL+N ECASGNVDRALELFNEMPQRGLNRN VTYTVMISGLSK GRADEAFKLYDEMN +GI+PDDRI+SSLI SLH+ G
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG

XP_023528780.1 pentatricopeptide repeat-containing protein At2g32630 [Cucurbita pepo subsp. pepo]1.0e-24775.68Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        M+NQ +ATNI+KLI+KSGLKPFKTTPSLLS LDSRVTQLVLS+ +VPTQSCL FFNFLRQNPS KPDL AHLIL+CRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
         S+ ERIVSSIGGE +EPKFVDKFCDMLFRVY+DN MFDSALEVFDYARKN FEIEERSC   LLALKRSGNVELS+EFLRQMVDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLC+KG V RAKALMDELV KGFKPNVFTYNTLL  YIE K++  VNEILSLMEKD VDYN TTYTILIEWYSRS KIEEAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWNCN GNMKRAF LFDEMTER LVPNAYTYGALINGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFT NIIASGFCRSNR++EA+ LLLTMEERGVAP+AVSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS
             +TFTY+SLIN EC  GN+D ALELFNEMPQRGLNRN++TYT +ISGLSKDGR+DEAFKLYDEM A GI PDDRIYSSL  SLH+ GS
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS

XP_038904125.1 pentatricopeptide repeat-containing protein At2g32630 [Benincasa hispida]7.2e-24976.36Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MSNQ IATNI+KLILKSGLKPFKTTPSLLS LDSRVTQLVLSD N+PTQSCL FFNFLRQNPS KPDL AHLIL+CRLYRARKFAVMKNVLNF+VNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
         S VERIVSSIG EFNEPKFVDKFCDMLFRVY+DNRMFDSALEVFDYARK+G EIEERSCF FLLALKRSGNVELS+EFL QMVDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKK  VVRAKALMDEL CKGFKPN+FTYNTLLN YIE  D+G VNEILSLMEKD VDYN +TYTILIEWYSR+ KIEEAE+LF++MLKKG+EPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWNCN  NMKRAF LFD+MTER +VPNAYTYGALING CKAG+MEAAEM+VNDMQSKG+D+N VIFNTLIDGYCKKGMIDEALRLQ+IMQQKGFE 
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFT NIIASGFCR NRQ+EARRLLLTMEERGVAP+AVSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLH
             +TFTYTSLIN EC  GNVDRALELFNEM ++GLNRNV+TYT MISGLSKDGRADEAFKLYDEM A GI PDDRIYSSL  SLH
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLH

TrEMBL top hitse value%identityAlignment
A0A0A0LDI0 Uncharacterized protein3.7e-26788.59Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MS Q IA NI+KLILKSGL+PFKTTPSLLS  DSRV QLVLSD N+PT+SCLRFF+FLRQNPS KPDLPAHLIL  RLYRARKFA MKNVL FIVNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEP  V+KFCDMLFRVYMDNRMFDS+LEVFDYARK GFEI+ERSCFEFLLALKRSGN+EL VEFLRQMVDSGIEIRVCSWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKALMDELVCKGFKP+V TYNTLLNGYIEIKDVGGVNEILSLMEK+ VDYNVTTYT+LIEWYSRSSKIEEAEKLFDEMLKKGIEPDVY+Y
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWNC FGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAG+M+AAEMMVNDMQSKGVDVN+VIFNTL+DGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLY
        D FTCNIIASGFCRSNR+EEA+RLLLTMEERGVAP+ VSF         N E ASGNVDRALELFNEMPQ GLNRNVVTYTV+ISGLSKDGRADEAFKLY
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLY

Query:  DEMNAEGIVPDDRIYSSLIVSLHKPG
        DEMN EGIVPDD IYSSLI SLHK G
Subjt:  DEMNAEGIVPDDRIYSSLIVSLHKPG

A0A1S3B5T3 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g32630-like2.0e-26880.37Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MSNQ IATNI+KLILKSGL+PFKTTPSLLSKLDSRVTQ +LSD NVPT+SCLRFFNFLR+NPSCKPDLPAHLILVCRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEPKFV+ FCDMLFRVYMD RMFDS+LEVFDYARKNGFEIEERSCFEFLLALKRSGN+EL VEFLRQ+VDSGIEIRV SWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKAL+DELVCKGFKPNV TYNTLLNGYIEIKD GGVNEILSLMEKD VDYNV TYT+LIEWYSR SKIEE+EKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC FGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYC KGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFTCNIIASGFCR +R+EEARRLLLTMEERGVAPD VSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG
             +T+TYTSL+N ECASGNVDRALELFNEMPQRGLNRN VTYTVMISGLSK GRADEAFKLYDEMN +GI+PDDRI+SSLI SLH+ G
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG

A0A5A7TL02 Pentatricopeptide repeat-containing protein3.9e-26980.54Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        MSNQ IATNI+KLILKSGL+PFKTTPSLLSKLDSRVTQ +LSD NVPT+SCLRFFNFLR+NPSCKPDLPAHLILVCRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
        WSNVERIVSSIGGEFNEPKFV+ FCDMLFRVYMD RMFDS+LEVFDYARKNGFEIEERSCFEFLLALKRSGN+EL VEFLRQ+VDSGIEIRV SWT VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLCKKG VVRAKAL+DELVCKGFKPNV TYNTLLNGYIEIKD GGVNEILSLMEKD VDYNV TYT+LIEWYSR SKIEE+EKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC FGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFTCNIIASGFCR +R+EEARRLLLTMEERGVAPD VSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG
             +T+TYTSL+N ECASGNVDRALELFNEMPQRGLNRN VTYTVMISGLSK GRADEAFKLYDEMN +GI+PDDRI+SSLI SLH+ G
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPG

A0A6J1F818 pentatricopeptide repeat-containing protein At2g32630 isoform X11.2e-24675.34Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        M+NQ +ATNI KLI+KSGLKPFKTTPSLLS LDSRVTQLVLS+ +VPTQSCL FFNFLRQNPS KPDL AHLIL+CRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
         S+ ERIVSSIGGE +EPKFVDKFCDMLFRVY+DN MFDSALEVFDYARKNGFEIEERSC   LLALKRSGNVELS+EFLRQMVDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLC+KG V RAKALMDELV KGFKPNV TYNTLLN YIE +++  VNEILSLM KD VDY+ TTYTILIEWYSRS KIEEAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWNCN GNMKRAF LFDEMTER LVPNAYTYGALINGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGF+I
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFT NIIASGFCRSNR++EAR LLLTMEERGVAP+AVSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS
             +TFTY+SLIN EC  GN+D ALELFNEMPQRGLNRN++TYT +ISGLSKDGR+DEAFKLYDEM A GI PDDRIYSSL  SLH+ GS
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS

A0A6J1J171 pentatricopeptide repeat-containing protein At2g32630 isoform X19.5e-24775.51Show/hide
Query:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL
        M+NQ +ATNI+KLI+KSGLKPFKTTPSLLS LDSRVTQLVLS+ +VPTQSCL FFNFLRQNPS KPDL AHLIL+CRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNL

Query:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD
            ERIVSSIGGE +EPKFVDKFCDMLFRVY+DN MFDSALEVFDYARKN FEIEERSC   LLALKRSGNVELS+EFLRQMVDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVD

Query:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY
        GLC+KG V RAKALMDELV KGFKPNVFTYNTLLN YIE K++  VNEILSLMEKD VDYN TTYTILIEWYSRS KIEEAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSIINWN N GNMKRAF LFDEMTER LVPNAYTYGALINGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGFEI
Subjt:  TSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------
        DVFT NIIASGFCRSNR++EA+ LLLTMEERGVAP+AVSF                                                            
Subjt:  DVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF------------------------------------------------------------

Query:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS
             +TFTY+SLIN EC  GN+D ALELFNEMPQRGLNRN++TYT +ISGLSKDGR+DEAFKLYDEM A GI PDDRIYSSL  SLH+ GS
Subjt:  -----NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS

SwissProt top hitse value%identityAlignment
O04491 Putative pentatricopeptide repeat-containing protein At1g096803.4e-6027.92Show/hide
Query:  NVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEV
        ++P +S   FF F+   P  +  +  + +L   L     F   ++++  +V+     S     +S +  E           D L   Y D      A++ 
Subjt:  NVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEV

Query:  FDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVG
        F  +RK+ F++  R C   L  + +         F  +++D+G  + V  +  +++  CK+G +  A+ + DE+  +  +P V ++NTL+NGY ++ ++ 
Subjt:  FDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVG

Query:  GVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACK
            +   MEK     +V TY+ LI    + +K++ A  LFDEM K+G+ P+  ++T++I+ +   G +      + +M  + L P+   Y  L+NG CK
Subjt:  GVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACK

Query:  AGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFT
         G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D    + +  G C+  R  +A R L  M   G+ PD V     T
Subjt:  AGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFT

Query:  YTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHK
        YT +++  C  G+     +L  EM   G   +VVTY V+++GL K G+   A  L D M   G+VPDD  Y++L+   H+
Subjt:  YTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHK

O04504 Pentatricopeptide repeat-containing protein At1g098205.8e-6028.84Show/hide
Query:  SKLDSRVTQL-------VLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVD
        SKL   VT +        L  S +    CLR++++L +N      L     L+  L  A++++ +++ L+  V +G+     +  V SI   F+     D
Subjt:  SKLDSRVTQL-------VLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVD

Query:  KFC------DMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFL-RQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALM
          C      DML   Y +N  F+   E F  +   G+++   SC   ++AL +  N    VE++ ++M+   I+  V ++  V++ LCK G + +A+ +M
Subjt:  KFC------DMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFL-RQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALM

Query:  DELVCKGFKPNVFTYNTLLNGYIEIKDVGGV---NEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGN
        +++   G  PNV +YNTL++GY ++   G +   + +L  M ++ V  N+TT+ ILI+ + +   +  + K+F EML + ++P+V  Y S+IN  CN G 
Subjt:  DELVCKGFKPNVFTYNTLLNGYIEIKDVGGV---NEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGN

Query:  MKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGF
        +  A  + D+M    + PN  TY ALING CK   ++ A  M   ++ +G      ++N LID YCK G ID+   L+  M+++G   DV T N + +G 
Subjt:  MKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGF

Query:  CRSNRQEEARRLLLTMEERGVAPDAVSFNT------------------------------FTYTSLINRECASGNVDRALELFNEM-PQRGLNRNVVTYT
        CR+   E A++L   +  +G+ PD V+F+                                TY  ++   C  GN+  A  +  +M  +R L  NV +Y 
Subjt:  CRSNRQEEARRLLLTMEERGVAPDAVSFNT------------------------------FTYTSLINRECASGNVDRALELFNEM-PQRGLNRNVVTYT

Query:  VMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIY
        V++ G S+ G+ ++A  L +EM  +G+VP+   Y
Subjt:  VMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIY

Q8S8P6 Pentatricopeptide repeat-containing protein At2g326301.8e-14946.8Show/hide
Query:  SNQVIATNISK-LILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQ-NPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGN
        S+Q  A  I+  L+ KS +   ++ PSLL  L+S VT+LVLS+  +PTQSC+ FF  LR+   + KPDL A + L  RLY  R+F  M+++LN +VNDG 
Subjt:  SNQVIATNISK-LILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQ-NPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGN

Query:  LWSNVERIVSS-IGGEFNEPK--FVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWT
            VE + S+ +  + +E K  F +KF D++FRVY+DN MF+  L VFDY  K G  I+ERSC  FL+A K+   ++L +E  R+MVDSG++I V S T
Subjt:  LWSNVERIVSS-IGGEFNEPK--FVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWT

Query:  TVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPD
         VV+GLC++G V ++K L+ E   KG KP  +TYNT++N Y++ +D  GV  +L +M+KD V YN  TYT+L+E   ++ K+ +AEKLFDEM ++GIE D
Subjt:  TVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPD

Query:  VYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK
        V+VYTS+I+WNC  GNMKRAF+LFDE+TE+ L P++YTYGALI+G CK G+M AAE+++N+MQSKGV++ QV+FNTLIDGYC+KGM+DEA  + ++M+QK
Subjt:  VYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK

Query:  GFEIDVFTCNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPDAVSFN--------------------
        GF+ DVFTCN IAS F                                   C+    EEA+RL + M  +GV P+A+++N                    
Subjt:  GFEIDVFTCNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPDAVSFN--------------------

Query:  ----------TFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKP
                  ++TYTSLI+ EC + NVD A+ LF+EM  +GL++N VTYTVMISGLSK G++DEAF LYDEM  +G   D+++Y++LI S+H P
Subjt:  ----------TFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKP

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011102.2e-5931.43Show/hide
Query:  IVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKG
        IV+S+   F+     D   D+L R Y+  R    A E F   R  GF +   +C   + +L R G VEL+    +++  SG+ I V +   +V+ LCK G
Subjt:  IVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKG

Query:  LVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINW
         + +    + ++  KG  P++ TYNTL++ Y     +    E+++ M        V TY  +I    +  K E A+++F EML+ G+ PD   Y S++  
Subjt:  LVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINW

Query:  NCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN
         C  G++     +F +M  R +VP+   + ++++   ++G ++ A M  N ++  G+  + VI+  LI GYC+KGMI  A+ L+N M Q+G  +DV T N
Subjt:  NCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN

Query:  IIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAE
         I  G C+     EA +L   M ER + PD     ++T T LI+  C  GN+  A+ELF +M ++ +  +VVTY  ++ G  K G  D A +++ +M ++
Subjt:  IIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAE

Query:  GIVPDDRIYSSLIVSLHKPG
         I+P    YS L+ +L   G
Subjt:  GIVPDDRIYSSLIVSLHKPG

Q9ZUU7 Pentatricopeptide repeat-containing protein At2g280504.5e-6032.51Show/hide
Query:  SNQVIATNISKLILKSGL--KPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSC---KPDLPAHLILVCRLYRARKFAVMKNVLNFIVN
        + Q    +I KL+L S    +   +  + LS L+    + +LSD ++ +  C+  FNF+ +NPS    +PDL  HL L  R+   R+F+  K +L  +  
Subjt:  SNQVIATNISKLILKSGL--KPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSC---KPDLPAHLILVCRLYRARKFAVMKNVLNFIVN

Query:  DGNLWSNVERIVSSIGGEFN-EPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIE-IRVCS
        D  L      IVSS+  E   E K V +F + +  VY DN  F   +EVF+Y + N  +I+E++C   LL LKR   +EL+ +F   MV+SGI+ + V S
Subjt:  DGNLWSNVERIVSSIGGEFN-EPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIE-IRVCS

Query:  WTTVVDGLCKKGLVVRAKALMDEL-VCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGI
         T VV  LC  G + RA+ L++E+ + KG K N+ T+ +++   ++  D   ++ +L LMEK++V  ++ +Y +LI+ ++   K+EEAE+L   M  K +
Subjt:  WTTVVDGLCKKGLVVRAKALMDEL-VCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGI

Query:  EPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM
          + Y+Y  I+N    FG +++   L+ EM+ R + PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM

Query:  QQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVS
         + GF      C  +A      NR +EA+ L+  + + G+ P + S
Subjt:  QQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVS

Arabidopsis top hitse value%identityAlignment
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-6127.92Show/hide
Query:  NVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEV
        ++P +S   FF F+   P  +  +  + +L   L     F   ++++  +V+     S     +S +  E           D L   Y D      A++ 
Subjt:  NVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEV

Query:  FDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVG
        F  +RK+ F++  R C   L  + +         F  +++D+G  + V  +  +++  CK+G +  A+ + DE+  +  +P V ++NTL+NGY ++ ++ 
Subjt:  FDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVG

Query:  GVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACK
            +   MEK     +V TY+ LI    + +K++ A  LFDEM K+G+ P+  ++T++I+ +   G +      + +M  + L P+   Y  L+NG CK
Subjt:  GVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACK

Query:  AGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFT
         G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D    + +  G C+  R  +A R L  M   G+ PD V     T
Subjt:  AGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFT

Query:  YTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHK
        YT +++  C  G+     +L  EM   G   +VVTY V+++GL K G+   A  L D M   G+VPDD  Y++L+   H+
Subjt:  YTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHK

AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.1e-6128.84Show/hide
Query:  SKLDSRVTQL-------VLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVD
        SKL   VT +        L  S +    CLR++++L +N      L     L+  L  A++++ +++ L+  V +G+     +  V SI   F+     D
Subjt:  SKLDSRVTQL-------VLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSSIGGEFNEPKFVD

Query:  KFC------DMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFL-RQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALM
          C      DML   Y +N  F+   E F  +   G+++   SC   ++AL +  N    VE++ ++M+   I+  V ++  V++ LCK G + +A+ +M
Subjt:  KFC------DMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFL-RQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALM

Query:  DELVCKGFKPNVFTYNTLLNGYIEIKDVGGV---NEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGN
        +++   G  PNV +YNTL++GY ++   G +   + +L  M ++ V  N+TT+ ILI+ + +   +  + K+F EML + ++P+V  Y S+IN  CN G 
Subjt:  DELVCKGFKPNVFTYNTLLNGYIEIKDVGGV---NEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGN

Query:  MKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGF
        +  A  + D+M    + PN  TY ALING CK   ++ A  M   ++ +G      ++N LID YCK G ID+   L+  M+++G   DV T N + +G 
Subjt:  MKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGF

Query:  CRSNRQEEARRLLLTMEERGVAPDAVSFNT------------------------------FTYTSLINRECASGNVDRALELFNEM-PQRGLNRNVVTYT
        CR+   E A++L   +  +G+ PD V+F+                                TY  ++   C  GN+  A  +  +M  +R L  NV +Y 
Subjt:  CRSNRQEEARRLLLTMEERGVAPDAVSFNT------------------------------FTYTSLINRECASGNVDRALELFNEM-PQRGLNRNVVTYT

Query:  VMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIY
        V++ G S+ G+ ++A  L +EM  +G+VP+   Y
Subjt:  VMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIY

AT2G28050.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-6132.51Show/hide
Query:  SNQVIATNISKLILKSGL--KPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSC---KPDLPAHLILVCRLYRARKFAVMKNVLNFIVN
        + Q    +I KL+L S    +   +  + LS L+    + +LSD ++ +  C+  FNF+ +NPS    +PDL  HL L  R+   R+F+  K +L  +  
Subjt:  SNQVIATNISKLILKSGL--KPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSC---KPDLPAHLILVCRLYRARKFAVMKNVLNFIVN

Query:  DGNLWSNVERIVSSIGGEFN-EPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIE-IRVCS
        D  L      IVSS+  E   E K V +F + +  VY DN  F   +EVF+Y + N  +I+E++C   LL LKR   +EL+ +F   MV+SGI+ + V S
Subjt:  DGNLWSNVERIVSSIGGEFN-EPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIE-IRVCS

Query:  WTTVVDGLCKKGLVVRAKALMDEL-VCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGI
         T VV  LC  G + RA+ L++E+ + KG K N+ T+ +++   ++  D   ++ +L LMEK++V  ++ +Y +LI+ ++   K+EEAE+L   M  K +
Subjt:  WTTVVDGLCKKGLVVRAKALMDEL-VCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGI

Query:  EPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM
          + Y+Y  I+N    FG +++   L+ EM+ R + PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM

Query:  QQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVS
         + GF      C  +A      NR +EA+ L+  + + G+ P + S
Subjt:  QQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVS

AT2G32630.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-15046.8Show/hide
Query:  SNQVIATNISK-LILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQ-NPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGN
        S+Q  A  I+  L+ KS +   ++ PSLL  L+S VT+LVLS+  +PTQSC+ FF  LR+   + KPDL A + L  RLY  R+F  M+++LN +VNDG 
Subjt:  SNQVIATNISK-LILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQ-NPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGN

Query:  LWSNVERIVSS-IGGEFNEPK--FVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWT
            VE + S+ +  + +E K  F +KF D++FRVY+DN MF+  L VFDY  K G  I+ERSC  FL+A K+   ++L +E  R+MVDSG++I V S T
Subjt:  LWSNVERIVSS-IGGEFNEPK--FVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWT

Query:  TVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPD
         VV+GLC++G V ++K L+ E   KG KP  +TYNT++N Y++ +D  GV  +L +M+KD V YN  TYT+L+E   ++ K+ +AEKLFDEM ++GIE D
Subjt:  TVVDGLCKKGLVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPD

Query:  VYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK
        V+VYTS+I+WNC  GNMKRAF+LFDE+TE+ L P++YTYGALI+G CK G+M AAE+++N+MQSKGV++ QV+FNTLIDGYC+KGM+DEA  + ++M+QK
Subjt:  VYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK

Query:  GFEIDVFTCNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPDAVSFN--------------------
        GF+ DVFTCN IAS F                                   C+    EEA+RL + M  +GV P+A+++N                    
Subjt:  GFEIDVFTCNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPDAVSFN--------------------

Query:  ----------TFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKP
                  ++TYTSLI+ EC + NVD A+ LF+EM  +GL++N VTYTVMISGLSK G++DEAF LYDEM  +G   D+++Y++LI S+H P
Subjt:  ----------TFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKP

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-6031.43Show/hide
Query:  IVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKG
        IV+S+   F+     D   D+L R Y+  R    A E F   R  GF +   +C   + +L R G VEL+    +++  SG+ I V +   +V+ LCK G
Subjt:  IVSSIGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKG

Query:  LVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINW
         + +    + ++  KG  P++ TYNTL++ Y     +    E+++ M        V TY  +I    +  K E A+++F EML+ G+ PD   Y S++  
Subjt:  LVVRAKALMDELVCKGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINW

Query:  NCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN
         C  G++     +F +M  R +VP+   + ++++   ++G ++ A M  N ++  G+  + VI+  LI GYC+KGMI  A+ L+N M Q+G  +DV T N
Subjt:  NCNFGNMKRAFVLFDEMTERRLVPNAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN

Query:  IIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAE
         I  G C+     EA +L   M ER + PD     ++T T LI+  C  GN+  A+ELF +M ++ +  +VVTY  ++ G  K G  D A +++ +M ++
Subjt:  IIASGFCRSNRQEEARRLLLTMEERGVAPDAVSFNTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAE

Query:  GIVPDDRIYSSLIVSLHKPG
         I+P    YS L+ +L   G
Subjt:  GIVPDDRIYSSLIVSLHKPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCAGGTAATTGCCACGAACATTTCGAAGCTAATTCTGAAATCTGGCCTTAAACCCTTCAAAACGACCCCATCGCTACTTTCAAAACTTGATTCGCGGGTAAC
ACAATTGGTTTTATCTGATTCAAATGTTCCTACTCAGTCCTGTTTGAGGTTTTTCAACTTTCTCCGACAAAACCCATCTTGTAAACCCGATCTTCCGGCACATTTAATCC
TCGTCTGTAGGTTGTATCGAGCTAGGAAGTTCGCGGTAATGAAAAATGTGTTGAACTTCATCGTTAATGATGGAAATCTTTGGAGCAATGTTGAGCGGATTGTTTCTTCG
ATTGGAGGTGAGTTTAATGAGCCGAAATTTGTTGATAAATTTTGTGATATGTTGTTTAGAGTATACATGGATAACAGAATGTTTGATTCGGCTTTGGAGGTTTTTGATTA
TGCGAGAAAGAACGGGTTTGAGATTGAGGAGAGATCATGTTTTGAGTTTTTACTTGCTTTGAAGAGATCTGGTAATGTGGAATTATCTGTAGAATTCTTGCGTCAAATGG
TCGATTCGGGTATAGAAATACGTGTTTGTTCGTGGACGACTGTGGTTGATGGGTTGTGTAAGAAAGGGTTGGTTGTAAGGGCTAAAGCTTTGATGGATGAACTTGTTTGT
AAAGGATTTAAGCCCAATGTTTTCACATATAATACTCTTTTGAATGGTTATATTGAAATTAAGGATGTGGGAGGTGTTAATGAGATTCTTAGTTTGATGGAGAAGGATGC
TGTGGATTATAATGTAACAACATATACAATTTTGATTGAATGGTATTCAAGAAGTTCGAAAATTGAGGAAGCAGAGAAGCTGTTTGATGAAATGCTTAAGAAAGGAATAG
AGCCTGATGTGTATGTTTACACCTCCATTATCAATTGGAATTGTAATTTTGGGAACATGAAGAGGGCCTTTGTTCTGTTTGATGAAATGACTGAGAGAAGGCTTGTTCCA
AATGCATACACTTATGGTGCCCTTATAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGATGGTAAATGACATGCAAAGCAAAGGGGTTGATGTAAATCA
AGTGATATTCAATACATTGATAGATGGATACTGCAAAAAAGGTATGATTGACGAAGCTCTAAGGCTGCAGAATATCATGCAGCAAAAAGGATTCGAGATTGATGTGTTTA
CTTGTAACATAATTGCCAGTGGATTTTGTAGATCGAATCGGCAAGAGGAAGCAAGGAGATTATTGCTTACAATGGAAGAAAGAGGAGTGGCTCCAGATGCAGTGAGCTTC
AACACATTTACATATACGTCACTTATAAATAGGGAATGTGCTAGTGGGAATGTGGATAGAGCGCTTGAACTATTCAATGAAATGCCACAACGAGGGCTAAATCGAAATGT
GGTAACTTACACGGTAATGATCTCTGGGTTGTCCAAGGATGGTAGAGCTGATGAAGCTTTTAAATTATACGATGAAATGAACGCAGAAGGCATTGTACCTGATGATAGAA
TATATTCTTCCTTGATAGTGAGCCTTCATAAGCCAGGATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAATCAGGTAATTGCCACGAACATTTCGAAGCTAATTCTGAAATCTGGCCTTAAACCCTTCAAAACGACCCCATCGCTACTTTCAAAACTTGATTCGCGGGTAAC
ACAATTGGTTTTATCTGATTCAAATGTTCCTACTCAGTCCTGTTTGAGGTTTTTCAACTTTCTCCGACAAAACCCATCTTGTAAACCCGATCTTCCGGCACATTTAATCC
TCGTCTGTAGGTTGTATCGAGCTAGGAAGTTCGCGGTAATGAAAAATGTGTTGAACTTCATCGTTAATGATGGAAATCTTTGGAGCAATGTTGAGCGGATTGTTTCTTCG
ATTGGAGGTGAGTTTAATGAGCCGAAATTTGTTGATAAATTTTGTGATATGTTGTTTAGAGTATACATGGATAACAGAATGTTTGATTCGGCTTTGGAGGTTTTTGATTA
TGCGAGAAAGAACGGGTTTGAGATTGAGGAGAGATCATGTTTTGAGTTTTTACTTGCTTTGAAGAGATCTGGTAATGTGGAATTATCTGTAGAATTCTTGCGTCAAATGG
TCGATTCGGGTATAGAAATACGTGTTTGTTCGTGGACGACTGTGGTTGATGGGTTGTGTAAGAAAGGGTTGGTTGTAAGGGCTAAAGCTTTGATGGATGAACTTGTTTGT
AAAGGATTTAAGCCCAATGTTTTCACATATAATACTCTTTTGAATGGTTATATTGAAATTAAGGATGTGGGAGGTGTTAATGAGATTCTTAGTTTGATGGAGAAGGATGC
TGTGGATTATAATGTAACAACATATACAATTTTGATTGAATGGTATTCAAGAAGTTCGAAAATTGAGGAAGCAGAGAAGCTGTTTGATGAAATGCTTAAGAAAGGAATAG
AGCCTGATGTGTATGTTTACACCTCCATTATCAATTGGAATTGTAATTTTGGGAACATGAAGAGGGCCTTTGTTCTGTTTGATGAAATGACTGAGAGAAGGCTTGTTCCA
AATGCATACACTTATGGTGCCCTTATAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGATGGTAAATGACATGCAAAGCAAAGGGGTTGATGTAAATCA
AGTGATATTCAATACATTGATAGATGGATACTGCAAAAAAGGTATGATTGACGAAGCTCTAAGGCTGCAGAATATCATGCAGCAAAAAGGATTCGAGATTGATGTGTTTA
CTTGTAACATAATTGCCAGTGGATTTTGTAGATCGAATCGGCAAGAGGAAGCAAGGAGATTATTGCTTACAATGGAAGAAAGAGGAGTGGCTCCAGATGCAGTGAGCTTC
AACACATTTACATATACGTCACTTATAAATAGGGAATGTGCTAGTGGGAATGTGGATAGAGCGCTTGAACTATTCAATGAAATGCCACAACGAGGGCTAAATCGAAATGT
GGTAACTTACACGGTAATGATCTCTGGGTTGTCCAAGGATGGTAGAGCTGATGAAGCTTTTAAATTATACGATGAAATGAACGCAGAAGGCATTGTACCTGATGATAGAA
TATATTCTTCCTTGATAGTGAGCCTTCATAAGCCAGGATCTTAG
Protein sequenceShow/hide protein sequence
MSNQVIATNISKLILKSGLKPFKTTPSLLSKLDSRVTQLVLSDSNVPTQSCLRFFNFLRQNPSCKPDLPAHLILVCRLYRARKFAVMKNVLNFIVNDGNLWSNVERIVSS
IGGEFNEPKFVDKFCDMLFRVYMDNRMFDSALEVFDYARKNGFEIEERSCFEFLLALKRSGNVELSVEFLRQMVDSGIEIRVCSWTTVVDGLCKKGLVVRAKALMDELVC
KGFKPNVFTYNTLLNGYIEIKDVGGVNEILSLMEKDAVDYNVTTYTILIEWYSRSSKIEEAEKLFDEMLKKGIEPDVYVYTSIINWNCNFGNMKRAFVLFDEMTERRLVP
NAYTYGALINGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRSNRQEEARRLLLTMEERGVAPDAVSF
NTFTYTSLINRECASGNVDRALELFNEMPQRGLNRNVVTYTVMISGLSKDGRADEAFKLYDEMNAEGIVPDDRIYSSLIVSLHKPGS