; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0017158 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0017158
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr04:30688278..30690056
RNA-Seq ExpressionIVF0017158
SyntenyIVF0017158
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044002.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.088.85Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------
        DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF+                                                           
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------

Query:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
              TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
Subjt:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF

XP_004137973.1 pentatricopeptide repeat-containing protein At2g32630 [Cucumis sativus]0.079.02Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MS QAIA NIAKLILKSGLQPFKTTPSLLS  DSRV Q +LSDPN+PT+SCLRFF+FLR+NPS KPDLPAHLIL  RLYRARKFAEMKNVLKFIVN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEP  VE FCDMLFRVYMD RMFDSSLEVFDYARK GFEI+ERSCFEFLLALKRSGNMELCVEFLRQ+VDSGIEIRV SWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKAL+DELVCKGFKP+VITYNTLLNGYIEIKD GGVNEILSLMEK+VVDYNV TYTMLIEWYSR SKIEE+EKLFDEMLKKGIEPDVY+Y
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNCKFGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAG+M+AAEMMVNDMQSKGVDVN+VIFNTL+DGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------
        D FTCNIIASGFCR +RREEA+RLLLTMEERGVAP+VVSF+                                                           
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------

Query:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
              TYTYTSL++ E ASGNVDRALELFNEMPQ GLNRN VTYTV+ISGLSK GRADEAFKLYDEMN +GI+PDD I+SSLIASLH+ G
Subjt:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

XP_008442691.1 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g32630-like [Cucumis melo]0.088.68Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYC KGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------
        DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF+                                                           
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFN-----------------------------------------------------------

Query:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
              TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
Subjt:  ------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF

XP_022934560.1 pentatricopeptide repeat-containing protein At2g32630 isoform X1 [Cucurbita moschata]1.35e-29270.73Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        M+NQA+ATNI KLI+KSGL+PFKTTPSLLS LDSRVTQ +LS+P+VPT+SCL FFNFLR+NPS KPDL AHLIL+CRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
         S+ ERIVSSIGGE +EPKFV+ FCDMLFRVY+D  MFDS+LEVFDYARKNGFEIEERSC   LLALKRSGN+EL +EFLRQ+VDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKAL+DELV KGFKPNV+TYNTLLN YIE ++   VNEILSLM KD VDY+  TYT+LIEWYSR  KIEE+EK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC  GNMKRAF LFDEMTER LVPNA+TYGAL+NGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGF+I
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNT----------------------------------------------------------
        DVFT NIIASGFCR +RR+EAR LLLTMEERGVAP+ VSF+T                                                          
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNT----------------------------------------------------------

Query:  -------YTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
               +TY+SL+N EC  GN+D ALELFNEMPQRGLNRN +TYT +ISGLSK GR+DEAFKLYDEM   GI PDDRI+SSL  SLH AG
Subjt:  -------YTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

XP_038904125.1 pentatricopeptide repeat-containing protein At2g32630 [Benincasa hispida]2.13e-29671.69Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MSNQAIATNIAKLILKSGL+PFKTTPSLLS LDSRVTQ +LSDPN+PT+SCL FFNFLR+NPS KPDL AHLIL+CRLYRARKFA MKNVL F+VN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
         S VERIVSSIG EFNEPKFV+ FCDMLFRVY+D RMFDS+LEVFDYARK+G EIEERSCF FLLALKRSGN+EL +EFL Q+VDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKK EVVRAKAL+DEL CKGFKPN+ TYNTLLN YIE  D G VNEILSLMEKD VDYN +TYT+LIEWYSR  KIEE+E+LF++MLKKG+EPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC   NMKRAF LFD+MTER +VPNA+TYGAL+NG CKAG+MEAAEM+VNDMQSKG+D+N VIFNTLIDGYCKKGMIDEALRLQ+IMQQKGFE 
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNT----------------------------------------------------------
        DVFT NIIASGFCRL+R++EARRLLLTMEERGVAP+ VSF+T                                                          
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNT----------------------------------------------------------

Query:  -------YTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEA
               +TYTSL+N EC  GNVDRALELFNEM ++GLNRN +TYT MISGLSK GRADEAFKLYDEM   GI PDDRI+SSL  SLH A
Subjt:  -------YTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEA

TrEMBL top hitse value%identityAlignment
A0A0A0LDI0 Uncharacterized protein1.6e-26587.64Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MS QAIA NIAKLILKSGLQPFKTTPSLLS  DSRV Q +LSDPN+PT+SCLRFF+FLR+NPS KPDLPAHLIL  RLYRARKFAEMKNVLKFIVN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEP  VE FCDMLFRVYMD RMFDSSLEVFDYARK GFEI+ERSCFEFLLALKRSGNMELCVEFLRQ+VDSGIEIRV SWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKAL+DELVCKGFKP+VITYNTLLNGYIEIKD GGVNEILSLMEK+VVDYNV TYTMLIEWYSR SKIEE+EKLFDEMLKKGIEPDVY+Y
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNCKFGNMKRAFVLFDEMTERRLVPNA+TYGAL+NGACKAG+M+AAEMMVNDMQSKGVDVN+VIFNTL+DGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLY
        D FTCNIIASGFCR +RREEA+RLLLTMEERGVAP+VVSF         N E ASGNVDRALELFNEMPQ GLNRN VTYTV+ISGLSK GRADEAFKLY
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLY

Query:  DEMNTQGIIPDDRIFSSLIASLHEAG
        DEMN +GI+PDD I+SSLIASLH+ G
Subjt:  DEMNTQGIIPDDRIFSSLIASLHEAG

A0A1S3B5T3 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g32630-like6.5e-29688.68Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYC KGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------
        DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF                                                            
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------

Query:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
             +TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
Subjt:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF

A0A5A7TL02 Pentatricopeptide repeat-containing protein1.3e-29688.85Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
        WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------
        DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF                                                            
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------

Query:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
             +TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF
Subjt:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF

A0A6J1F818 pentatricopeptide repeat-containing protein At2g32630 isoform X13.5e-23370.73Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        M+NQA+ATNI KLI+KSGL+PFKTTPSLLS LDSRVTQ +LS+P+VPT+SCL FFNFLR+NPS KPDL AHLIL+CRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
         S+ ERIVSSIGGE +EPKFV+ FCDMLFRVY+D  MFDS+LEVFDYARKNGFEIEERSC   LLALKRSGN+EL +EFLRQ+VDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKAL+DELV KGFKPNV+TYNTLLN YIE ++   VNEILSLM KD VDY+  TYT+LIEWYSR  KIEE+EK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWNC  GNMKRAF LFDEMTER LVPNA+TYGAL+NGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGF+I
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------
        DVFT NIIASGFCR +RR+EAR LLLTMEERGVAP+ VSF                                                            
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------

Query:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
             +T+TY+SL+N EC  GN+D ALELFNEMPQRGLNRN +TYT +ISGLSK GR+DEAFKLYDEM   GI PDDRI+SSL  SLH AG
Subjt:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

A0A6J1J171 pentatricopeptide repeat-containing protein At2g32630 isoform X13.0e-23270.9Show/hide
Query:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL
        M+NQA+ATNIAKLI+KSGL+PFKTTPSLLS LDSRVTQ +LS+P+VPT+SCL FFNFLR+NPS KPDL AHLIL+CRLYRARKFA MKNVL FIVN GNL
Subjt:  MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNL

Query:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD
            ERIVSSIGGE +EPKFV+ FCDMLFRVY+D  MFDS+LEVFDYARKN FEIEERSC   LLALKRSGN+EL +EFLRQ+VDSG+EI V S T VVD
Subjt:  WSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVD

Query:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKAL+DELV KGFKPNV TYNTLLN YIE K+   VNEILSLMEKD VDYN  TYT+LIEWYSR  KIEE+EK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVY

Query:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI
        TS+INWN   GNMKRAF LFDEMTER LVPNA+TYGAL+NGACKAG+MEAAEM+VNDMQSKG+DVNQVIFNTLIDGYCKKGM+DEALRLQ+IMQQKGFEI
Subjt:  TSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEI

Query:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------
        DVFT NIIASGFCR +RR+EA+ LLLTMEERGVAP+ VSF                                                            
Subjt:  DVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF------------------------------------------------------------

Query:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
             +T+TY+SL+N EC  GN+D ALELFNEMPQRGLNRN +TYT +ISGLSK GR+DEAFKLYDEM   GI PDDRI+SSL  SLH AG
Subjt:  -----NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

SwissProt top hitse value%identityAlignment
O04491 Putative pentatricopeptide repeat-containing protein At1g096809.9e-6028.49Show/hide
Query:  FKTTPSLLSKLDS----RVTQSILSDP-NVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVN--GGNLWSNVERIVSSIGGE
        F   PS+   L S     V   I  +P ++P +S   FF F+   P  +  +  + +L   L     F E +++++ +V+  G N  S+V   +S +  E
Subjt:  FKTTPSLLSKLDS----RVTQSILSDP-NVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVN--GGNLWSNVERIVSSIGGE

Query:  FNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKAL
                   D L   Y D      +++ F  +RK+ F++  R C   L  + +         F  +I+D+G  + V  +  +++  CK+G +  A+ +
Subjt:  FNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKAL

Query:  VDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMK
         DE+  +  +P V+++NTL+NGY ++ +      +   MEK     +V TY+ LI    + +K++ +  LFDEM K+G+ P+  ++T+LI+ + + G + 
Subjt:  VDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMK

Query:  RAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCR
             + +M  + L P+   Y  LVNG CK G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D    + +  G C+
Subjt:  RAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCR

Query:  LDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRI
          R  +A R L  M   G+ PD V     TYT +M+  C  G+     +L  EM   G   + VTY V+++GL K G+   A  L D M   G++PDD  
Subjt:  LDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRI

Query:  FSSLIASLH
        +++L+   H
Subjt:  FSSLIASLH

Q8S8P6 Pentatricopeptide repeat-containing protein At2g326308.2e-14746.28Show/hide
Query:  SNQAIATNI-AKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRR-NPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGN
        S+Q  A  I A L+ KS +   ++ PSLL  L+S VT+ +LS+P +PT+SC+ FF  LR    + KPDL A + L  RLY  R+F EM+++L  +VN G 
Subjt:  SNQAIATNI-AKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRR-NPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGN

Query:  LWSNVERIVSS-IGGEFNEPK--FVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWT
            VE + S+ +  + +E K  F E F D++FRVY+D  MF+  L VFDY  K G  I+ERSC  FL+A K+   ++LC+E  R++VDSG++I V S T
Subjt:  LWSNVERIVSS-IGGEFNEPK--FVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWT

Query:  AVVDGLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPD
         VV+GLC++GEV ++K L+ E   KG KP   TYNT++N Y++ +D  GV  +L +M+KD V YN  TYT+L+E   +  K+ ++EKLFDEM ++GIE D
Subjt:  AVVDGLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPD

Query:  VYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK
        V+VYTSLI+WNC+ GNMKRAF+LFDE+TE+ L P+++TYGAL++G CK G+M AAE+++N+MQSKGV++ QV+FNTLIDGYC+KGM+DEA  + ++M+QK
Subjt:  VYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK

Query:  GFEIDVFTCNIIASGFCRLDRR-----------------------------------EEARRLLLTMEERGVAPDVVSFN--------------------
        GF+ DVFTCN IAS F RL R                                    EEA+RL + M  +GV P+ +++N                    
Subjt:  GFEIDVFTCNIIASGFCRLDRR-----------------------------------EEARRLLLTMEERGVAPDVVSFN--------------------

Query:  ----------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLH
                  +YTYTSL++ EC + NVD A+ LF+EM  +GL++N VTYTVMISGLSKAG++DEAF LYDEM  +G   D++++++LI S+H
Subjt:  ----------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLH

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011104.2e-5830.71Show/hide
Query:  IVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKG
        IV+S+   F+     ++  D+L R Y+  R    + E F   R  GF +   +C   + +L R G +EL     ++I  SG+ I V +   +V+ LCK G
Subjt:  IVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKG

Query:  EVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINW
        ++ +    + ++  KG  P+++TYNTL++ Y          E+++ M        V TY  +I    +  K E ++++F EML+ G+ PD   Y SL+  
Subjt:  EVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINW

Query:  NCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN
         CK G++     +F +M  R +VP+   + ++++   ++G ++ A M  N ++  G+  + VI+  LI GYC+KGMI  A+ L+N M Q+G  +DV T N
Subjt:  NCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN

Query:  IIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQ
         I  G C+     EA +L   M ER + PD     +YT T L++  C  GN+  A+ELF +M ++ +  + VTY  ++ G  K G  D A +++ +M ++
Subjt:  IIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQ

Query:  GIIPDDRIFSSLIASLHEAG
         I+P    +S L+ +L   G
Subjt:  GIIPDDRIFSSLIASLHEAG

Q9LN69 Putative pentatricopeptide repeat-containing protein At1g192901.3e-5930.63Show/hide
Query:  SILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIV----NGGNLWSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMD
        SIL    +  ++CL  FN   +    +PD  A+  +V  L RAR + + K+ L  +V    +G  +W  + R+       F E  F     DM+ +VY +
Subjt:  SILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIV----NGGNLWSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMD

Query:  KRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKALVDELVCK-GFKPNVITYNTL
        K +  ++L VFD     G      SC   L  L R G   + +    Q++   +   V + + VV+  C+ G V +A     E     G + NV+TYN+L
Subjt:  KRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKALVDELVCK-GFKPNVITYNTL

Query:  LNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAF
        +NGY  I D  G+  +L LM +  V  NV TYT LI+ Y +   +EE+E +F+ + +K +  D ++Y  L++  C+ G ++ A  + D M E  +  N  
Subjt:  LNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAF

Query:  TYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGV
           +L+NG CK+GQ+  AE + + M    +  +   +NTL+DGYC+ G +DEAL+L + M QK     V T NI+  G+ R+    +   L   M +RGV
Subjt:  TYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGV

Query:  APDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
          D +S +T     L+      G+ + A++L+  +  RGL  + +T  VMISGL K  + +EA ++ D +N     P  + + +L    ++ G
Subjt:  APDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

Q9ZUU7 Pentatricopeptide repeat-containing protein At2g280507.6e-6032.81Show/hide
Query:  SNQAIATNIAKLILKSGL--QPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSC---KPDLPAHLILVCRLYRARKFAEMKNVLKFIVN
        + Q    +I KL+L S    Q   +  + LS L+    + ILSDP++ +  C+  FNF+  NPS    +PDL  HL L  R+   R+F+  K +LK +  
Subjt:  SNQAIATNIAKLILKSGL--QPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSC---KPDLPAHLILVCRLYRARKFAEMKNVLKFIVN

Query:  GGNLWSNVERIVSSIGGEFN-EPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIE-IRVSS
           L      IVSS+  E   E K V  F + +  VY D   F   +EVF+Y + N  +I+E++C   LL LKR   MEL  +F   +V+SGI+ + V S
Subjt:  GGNLWSNVERIVSSIGGEFN-EPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIE-IRVSS

Query:  WTAVVDGLCKKGEVVRAKALVDEL-VCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGI
         T VV  LC  GE+ RA+ LV+E+ + KG K N++T+ +++   ++  D   ++ +L LMEK+ V  ++ +Y +LI+ ++   K+EE+E+L   M  K +
Subjt:  WTAVVDGLCKKGEVVRAKALVDEL-VCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGI

Query:  EPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM
          + Y+Y  ++N   +FG +++   L+ EM+ R + PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM

Query:  QQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGVAP
         + GF      C  +A     ++R+ EA+ L+  + + G+ P
Subjt:  QQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGVAP

Arabidopsis top hitse value%identityAlignment
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein7.1e-6128.49Show/hide
Query:  FKTTPSLLSKLDS----RVTQSILSDP-NVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVN--GGNLWSNVERIVSSIGGE
        F   PS+   L S     V   I  +P ++P +S   FF F+   P  +  +  + +L   L     F E +++++ +V+  G N  S+V   +S +  E
Subjt:  FKTTPSLLSKLDS----RVTQSILSDP-NVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVN--GGNLWSNVERIVSSIGGE

Query:  FNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKAL
                   D L   Y D      +++ F  +RK+ F++  R C   L  + +         F  +I+D+G  + V  +  +++  CK+G +  A+ +
Subjt:  FNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKAL

Query:  VDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMK
         DE+  +  +P V+++NTL+NGY ++ +      +   MEK     +V TY+ LI    + +K++ +  LFDEM K+G+ P+  ++T+LI+ + + G + 
Subjt:  VDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMK

Query:  RAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCR
             + +M  + L P+   Y  LVNG CK G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D    + +  G C+
Subjt:  RAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCR

Query:  LDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRI
          R  +A R L  M   G+ PD V     TYT +M+  C  G+     +L  EM   G   + VTY V+++GL K G+   A  L D M   G++PDD  
Subjt:  LDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRI

Query:  FSSLIASLH
        +++L+   H
Subjt:  FSSLIASLH

AT1G19290.1 Pentatricopeptide repeat (PPR) superfamily protein9.2e-6130.63Show/hide
Query:  SILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIV----NGGNLWSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMD
        SIL    +  ++CL  FN   +    +PD  A+  +V  L RAR + + K+ L  +V    +G  +W  + R+       F E  F     DM+ +VY +
Subjt:  SILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIV----NGGNLWSNVERIVSSIGGEFNEPKFVENFCDMLFRVYMD

Query:  KRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKALVDELVCK-GFKPNVITYNTL
        K +  ++L VFD     G      SC   L  L R G   + +    Q++   +   V + + VV+  C+ G V +A     E     G + NV+TYN+L
Subjt:  KRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKALVDELVCK-GFKPNVITYNTL

Query:  LNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAF
        +NGY  I D  G+  +L LM +  V  NV TYT LI+ Y +   +EE+E +F+ + +K +  D ++Y  L++  C+ G ++ A  + D M E  +  N  
Subjt:  LNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAF

Query:  TYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGV
           +L+NG CK+GQ+  AE + + M    +  +   +NTL+DGYC+ G +DEAL+L + M QK     V T NI+  G+ R+    +   L   M +RGV
Subjt:  TYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGV

Query:  APDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG
          D +S +T     L+      G+ + A++L+  +  RGL  + +T  VMISGL K  + +EA ++ D +N     P  + + +L    ++ G
Subjt:  APDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAG

AT2G28050.1 Pentatricopeptide repeat (PPR) superfamily protein5.4e-6132.81Show/hide
Query:  SNQAIATNIAKLILKSGL--QPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSC---KPDLPAHLILVCRLYRARKFAEMKNVLKFIVN
        + Q    +I KL+L S    Q   +  + LS L+    + ILSDP++ +  C+  FNF+  NPS    +PDL  HL L  R+   R+F+  K +LK +  
Subjt:  SNQAIATNIAKLILKSGL--QPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSC---KPDLPAHLILVCRLYRARKFAEMKNVLKFIVN

Query:  GGNLWSNVERIVSSIGGEFN-EPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIE-IRVSS
           L      IVSS+  E   E K V  F + +  VY D   F   +EVF+Y + N  +I+E++C   LL LKR   MEL  +F   +V+SGI+ + V S
Subjt:  GGNLWSNVERIVSSIGGEFN-EPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIE-IRVSS

Query:  WTAVVDGLCKKGEVVRAKALVDEL-VCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGI
         T VV  LC  GE+ RA+ LV+E+ + KG K N++T+ +++   ++  D   ++ +L LMEK+ V  ++ +Y +LI+ ++   K+EE+E+L   M  K +
Subjt:  WTAVVDGLCKKGEVVRAKALVDEL-VCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGI

Query:  EPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM
          + Y+Y  ++N   +FG +++   L+ EM+ R + PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIM

Query:  QQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGVAP
         + GF      C  +A     ++R+ EA+ L+  + + G+ P
Subjt:  QQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGVAP

AT2G32630.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.8e-14846.28Show/hide
Query:  SNQAIATNI-AKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRR-NPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGN
        S+Q  A  I A L+ KS +   ++ PSLL  L+S VT+ +LS+P +PT+SC+ FF  LR    + KPDL A + L  RLY  R+F EM+++L  +VN G 
Subjt:  SNQAIATNI-AKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRR-NPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGN

Query:  LWSNVERIVSS-IGGEFNEPK--FVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWT
            VE + S+ +  + +E K  F E F D++FRVY+D  MF+  L VFDY  K G  I+ERSC  FL+A K+   ++LC+E  R++VDSG++I V S T
Subjt:  LWSNVERIVSS-IGGEFNEPK--FVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWT

Query:  AVVDGLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPD
         VV+GLC++GEV ++K L+ E   KG KP   TYNT++N Y++ +D  GV  +L +M+KD V YN  TYT+L+E   +  K+ ++EKLFDEM ++GIE D
Subjt:  AVVDGLCKKGEVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPD

Query:  VYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK
        V+VYTSLI+WNC+ GNMKRAF+LFDE+TE+ L P+++TYGAL++G CK G+M AAE+++N+MQSKGV++ QV+FNTLIDGYC+KGM+DEA  + ++M+QK
Subjt:  VYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQK

Query:  GFEIDVFTCNIIASGFCRLDRR-----------------------------------EEARRLLLTMEERGVAPDVVSFN--------------------
        GF+ DVFTCN IAS F RL R                                    EEA+RL + M  +GV P+ +++N                    
Subjt:  GFEIDVFTCNIIASGFCRLDRR-----------------------------------EEARRLLLTMEERGVAPDVVSFN--------------------

Query:  ----------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLH
                  +YTYTSL++ EC + NVD A+ LF+EM  +GL++N VTYTVMISGLSKAG++DEAF LYDEM  +G   D++++++LI S+H
Subjt:  ----------TYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLH

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-5930.71Show/hide
Query:  IVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKG
        IV+S+   F+     ++  D+L R Y+  R    + E F   R  GF +   +C   + +L R G +EL     ++I  SG+ I V +   +V+ LCK G
Subjt:  IVSSIGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKG

Query:  EVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINW
        ++ +    + ++  KG  P+++TYNTL++ Y          E+++ M        V TY  +I    +  K E ++++F EML+ G+ PD   Y SL+  
Subjt:  EVVRAKALVDELVCKGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINW

Query:  NCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN
         CK G++     +F +M  R +VP+   + ++++   ++G ++ A M  N ++  G+  + VI+  LI GYC+KGMI  A+ L+N M Q+G  +DV T N
Subjt:  NCKFGNMKRAFVLFDEMTERRLVPNAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCN

Query:  IIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQ
         I  G C+     EA +L   M ER + PD     +YT T L++  C  GN+  A+ELF +M ++ +  + VTY  ++ G  K G  D A +++ +M ++
Subjt:  IIASGFCRLDRREEARRLLLTMEERGVAPDVVSFNTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQ

Query:  GIIPDDRIFSSLIASLHEAG
         I+P    +S L+ +L   G
Subjt:  GIIPDDRIFSSLIASLHEAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCAGGCGATAGCCACGAACATTGCGAAGCTAATTCTGAAATCTGGCCTTCAACCCTTCAAAACGACCCCATCGCTGCTTTCAAAACTTGATTCGCGGGTAAC
ACAATCGATTTTATCTGATCCAAATGTTCCTACTAAGTCCTGTTTGAGGTTCTTCAACTTTCTCCGACGCAACCCATCTTGTAAACCCGATCTTCCGGCACATTTAATCC
TCGTCTGTAGGTTGTATCGAGCTCGCAAGTTCGCGGAAATGAAAAATGTGTTGAAATTCATCGTTAATGGTGGAAATCTTTGGAGCAATGTTGAGCGGATCGTTTCTTCG
ATTGGAGGTGAGTTTAATGAGCCGAAATTTGTTGAAAATTTTTGTGATATGTTGTTTAGAGTATACATGGATAAAAGAATGTTTGATTCGTCTTTGGAGGTTTTTGATTA
TGCGAGAAAGAACGGGTTTGAGATTGAGGAGAGATCATGTTTTGAGTTTTTACTTGCTTTGAAGAGATCTGGTAATATGGAATTATGTGTAGAATTCTTGCGCCAAATAG
TCGATTCGGGCATAGAAATACGTGTTTCTTCGTGGACGGCTGTGGTTGATGGGCTGTGTAAGAAAGGGGAGGTTGTAAGGGCTAAAGCTTTGGTGGATGAACTTGTCTGT
AAAGGATTTAAGCCCAATGTTATCACATATAATACTCTTTTAAATGGTTATATTGAAATTAAGGATGAGGGAGGTGTTAATGAGATTCTTAGTTTGATGGAGAAGGATGT
TGTGGATTATAATGTAGCAACATATACAATGTTGATTGAATGGTATTCAAGAATTTCGAAAATTGAGGAATCAGAGAAGCTGTTTGATGAAATGCTAAAGAAAGGAATAG
AGCCTGATGTGTATGTTTACACCTCCCTTATCAATTGGAATTGTAAATTTGGGAACATGAAGAGGGCCTTTGTTCTGTTTGATGAAATGACTGAGAGAAGGCTTGTTCCA
AATGCATTCACTTATGGTGCCCTTGTAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGATGGTAAATGACATGCAAAGCAAAGGGGTTGATGTAAATCA
AGTGATATTCAATACATTGATAGATGGGTACTGCAAAAAAGGAATGATTGACGAAGCTCTAAGGCTGCAGAATATCATGCAGCAAAAAGGATTTGAGATTGATGTGTTTA
CTTGTAACATAATTGCCAGTGGTTTTTGTAGATTGGATCGGCGAGAGGAAGCAAGGAGACTATTGCTTACAATGGAAGAAAGAGGAGTGGCTCCAGATGTAGTGAGCTTC
AACACATATACATATACATCACTTATGAATTGGGAATGTGCTAGTGGGAATGTGGATAGAGCGCTTGAACTATTCAATGAAATGCCACAACGAGGACTAAATCGAAATGA
AGTAACTTACACGGTAATGATCTCTGGGTTGTCCAAGGCTGGTAGAGCTGATGAAGCTTTTAAATTATACGATGAAATGAACACACAAGGCATTATACCTGATGATAGAA
TATTTTCTTCCTTGATCGCCAGCCTTCATGAGGCAGGATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAATCAGGCGATAGCCACGAACATTGCGAAGCTAATTCTGAAATCTGGCCTTCAACCCTTCAAAACGACCCCATCGCTGCTTTCAAAACTTGATTCGCGGGTAAC
ACAATCGATTTTATCTGATCCAAATGTTCCTACTAAGTCCTGTTTGAGGTTCTTCAACTTTCTCCGACGCAACCCATCTTGTAAACCCGATCTTCCGGCACATTTAATCC
TCGTCTGTAGGTTGTATCGAGCTCGCAAGTTCGCGGAAATGAAAAATGTGTTGAAATTCATCGTTAATGGTGGAAATCTTTGGAGCAATGTTGAGCGGATCGTTTCTTCG
ATTGGAGGTGAGTTTAATGAGCCGAAATTTGTTGAAAATTTTTGTGATATGTTGTTTAGAGTATACATGGATAAAAGAATGTTTGATTCGTCTTTGGAGGTTTTTGATTA
TGCGAGAAAGAACGGGTTTGAGATTGAGGAGAGATCATGTTTTGAGTTTTTACTTGCTTTGAAGAGATCTGGTAATATGGAATTATGTGTAGAATTCTTGCGCCAAATAG
TCGATTCGGGCATAGAAATACGTGTTTCTTCGTGGACGGCTGTGGTTGATGGGCTGTGTAAGAAAGGGGAGGTTGTAAGGGCTAAAGCTTTGGTGGATGAACTTGTCTGT
AAAGGATTTAAGCCCAATGTTATCACATATAATACTCTTTTAAATGGTTATATTGAAATTAAGGATGAGGGAGGTGTTAATGAGATTCTTAGTTTGATGGAGAAGGATGT
TGTGGATTATAATGTAGCAACATATACAATGTTGATTGAATGGTATTCAAGAATTTCGAAAATTGAGGAATCAGAGAAGCTGTTTGATGAAATGCTAAAGAAAGGAATAG
AGCCTGATGTGTATGTTTACACCTCCCTTATCAATTGGAATTGTAAATTTGGGAACATGAAGAGGGCCTTTGTTCTGTTTGATGAAATGACTGAGAGAAGGCTTGTTCCA
AATGCATTCACTTATGGTGCCCTTGTAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGATGGTAAATGACATGCAAAGCAAAGGGGTTGATGTAAATCA
AGTGATATTCAATACATTGATAGATGGGTACTGCAAAAAAGGAATGATTGACGAAGCTCTAAGGCTGCAGAATATCATGCAGCAAAAAGGATTTGAGATTGATGTGTTTA
CTTGTAACATAATTGCCAGTGGTTTTTGTAGATTGGATCGGCGAGAGGAAGCAAGGAGACTATTGCTTACAATGGAAGAAAGAGGAGTGGCTCCAGATGTAGTGAGCTTC
AACACATATACATATACATCACTTATGAATTGGGAATGTGCTAGTGGGAATGTGGATAGAGCGCTTGAACTATTCAATGAAATGCCACAACGAGGACTAAATCGAAATGA
AGTAACTTACACGGTAATGATCTCTGGGTTGTCCAAGGCTGGTAGAGCTGATGAAGCTTTTAAATTATACGATGAAATGAACACACAAGGCATTATACCTGATGATAGAA
TATTTTCTTCCTTGATCGCCAGCCTTCATGAGGCAGGATTTTAG
Protein sequenceShow/hide protein sequence
MSNQAIATNIAKLILKSGLQPFKTTPSLLSKLDSRVTQSILSDPNVPTKSCLRFFNFLRRNPSCKPDLPAHLILVCRLYRARKFAEMKNVLKFIVNGGNLWSNVERIVSS
IGGEFNEPKFVENFCDMLFRVYMDKRMFDSSLEVFDYARKNGFEIEERSCFEFLLALKRSGNMELCVEFLRQIVDSGIEIRVSSWTAVVDGLCKKGEVVRAKALVDELVC
KGFKPNVITYNTLLNGYIEIKDEGGVNEILSLMEKDVVDYNVATYTMLIEWYSRISKIEESEKLFDEMLKKGIEPDVYVYTSLINWNCKFGNMKRAFVLFDEMTERRLVP
NAFTYGALVNGACKAGQMEAAEMMVNDMQSKGVDVNQVIFNTLIDGYCKKGMIDEALRLQNIMQQKGFEIDVFTCNIIASGFCRLDRREEARRLLLTMEERGVAPDVVSF
NTYTYTSLMNWECASGNVDRALELFNEMPQRGLNRNEVTYTVMISGLSKAGRADEAFKLYDEMNTQGIIPDDRIFSSLIASLHEAGF