; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022609 (gene) of Snake gourd v1 genome

Gene IDTan0022609
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:18830815..18833061
RNA-Seq ExpressionTan0022609
SyntenyTan0022609
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038903328.1 pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Benincasa hispida]1.4e-27692.01Show/hide
Query:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SA+HQIFPP+A   NSN NF SRKHQFLSLLNLCSSTNHLF+IHAQILVSGLQ+DPFL  ELLR AALSPSRNLSYGRSLLFHC  HSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
         IIRGYASSDSP+EAIWVFGEMRRRGIRPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI++FLKM  HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSSSVVPNYVTFIGVLCACSHA LVDK YHYFNIMERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSAC+ RDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAADCRRAMKDRGIKKMAGESCIE+GGSL KFFSGFDA A SDGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQM
        YDLLDGLNLHMQM
Subjt:  YDLLDGLNLHMQM

XP_038903329.1 pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Benincasa hispida]1.4e-27692.01Show/hide
Query:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SA+HQIFPP+A   NSN NF SRKHQFLSLLNLCSSTNHLF+IHAQILVSGLQ+DPFL  ELLR AALSPSRNLSYGRSLLFHC  HSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
         IIRGYASSDSP+EAIWVFGEMRRRGIRPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI++FLKM  HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSSSVVPNYVTFIGVLCACSHA LVDK YHYFNIMERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSAC+ RDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAADCRRAMKDRGIKKMAGESCIE+GGSL KFFSGFDA A SDGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQM
        YDLLDGLNLHMQM
Subjt:  YDLLDGLNLHMQM

XP_038903330.1 pentatricopeptide repeat-containing protein At2g36730 isoform X3 [Benincasa hispida]8.0e-27791.49Show/hide
Query:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SA+HQIFPP+A   NSN NF SRKHQFLSLLNLCSSTNHLF+IHAQILVSGLQ+DPFL  ELLR AALSPSRNLSYGRSLLFHC  HSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
         IIRGYASSDSP+EAIWVFGEMRRRGIRPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI++FLKM  HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSSSVVPNYVTFIGVLCACSHA LVDK YHYFNIMERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSAC+ RDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAADCRRAMKDRGIKKMAGESCIE+GGSL KFFSGFDA A SDGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQMTWRE
        YDLLDGLNLHMQM   E
Subjt:  YDLLDGLNLHMQMTWRE

XP_038903331.1 pentatricopeptide repeat-containing protein At2g36730 isoform X4 [Benincasa hispida]1.4e-27692.01Show/hide
Query:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SA+HQIFPP+A   NSN NF SRKHQFLSLLNLCSSTNHLF+IHAQILVSGLQ+DPFL  ELLR AALSPSRNLSYGRSLLFHC  HSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
         IIRGYASSDSP+EAIWVFGEMRRRGIRPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI++FLKM  HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSSSVVPNYVTFIGVLCACSHA LVDK YHYFNIMERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSAC+ RDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAADCRRAMKDRGIKKMAGESCIE+GGSL KFFSGFDA A SDGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQM
        YDLLDGLNLHMQM
Subjt:  YDLLDGLNLHMQM

XP_038903332.1 pentatricopeptide repeat-containing protein At2g36730 isoform X5 [Benincasa hispida]1.4e-27692.01Show/hide
Query:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SA+HQIFPP+A   NSN NF SRKHQFLSLLNLCSSTNHLF+IHAQILVSGLQ+DPFL  ELLR AALSPSRNLSYGRSLLFHC  HSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNA--LNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
         IIRGYASSDSP+EAIWVFGEMRRRGIRPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI++FLKM  HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSSSVVPNYVTFIGVLCACSHA LVDK YHYFNIMERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSAC+ RDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAADCRRAMKDRGIKKMAGESCIE+GGSL KFFSGFDA A SDGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQM
        YDLLDGLNLHMQM
Subjt:  YDLLDGLNLHMQM

TrEMBL top hitse value%identityAlignment
A0A0A0K153 Uncharacterized protein2.1e-26287.55Show/hide
Query:  MVRLRLSAVHQIFPPNALN--SNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRL +SAVHQ FP N  N  S   F S KHQ LSLLN CSSTNHLF+IHAQILVSGLQ+D F   ELLR AALSPSRNLSYG SLLFHC  HSA +PWN
Subjt:  MVRLRLSAVHQIFPPNALN--SNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
        FIIRGY+SSDSP+EAI +FGEMRRRG+RPNNLT PFLLKACATL TLQEGKQFHA AIKCGLDLDVYVRNTLI FYGSCKRMS ARKVFDEM+ERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NAVITACVENFCFDEAI+YFLKM +HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCAR VFNCLKQ+SVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHGFANEAIELFTNMMSS +VPN+VTFIGVLCACSHAGLVDK YHYFN+MERVYGIKPMMIHYGSMVDVLGRAG+VKEAYE IM MPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSACS RDV+GGA+VAEEARKRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RR MKDRGIKKMAGESCIE+GGSL KFFSGFD+RA  DGI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQMT
        YDLLDGLNLHMQ+T
Subjt:  YDLLDGLNLHMQMT

A0A6J1F5Z2 pentatricopeptide repeat-containing protein At2g36730 isoform X18.1e-27591.59Show/hide
Query:  MVRLRLSAVHQIFPPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFI
        MVRLR+ AVHQIFPPNA NSNSNF SRKHQFLS++ LCSS NHLFQIH+QI+VSGLQ+D FL  ELLRFAALSPSRNLSY RSLLFH +LH +PLPWN I
Subjt:  MVRLRLSAVHQIFPPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFI

Query:  IRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNA
        IRGYASSDSPREAIWVF EMRRRGIRPNNLT PFL+KACATL TLQEGK+FHADAIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMS RTLVSWNA
Subjt:  IRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNA

Query:  VITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWS
        VITACVENFCFD+AIEYFLKM +HGFEPDETTMVVILSACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCARLVFNCLKQRSVWTWS
Subjt:  VITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWS

Query:  AMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPI
        AMILGLAQHGFA+EAIELFTNMMSSSV PNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVL RAGRVKEAYEFIMRMPVEPDPI
Subjt:  AMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPI

Query:  VWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYD
        VWRTLLSACS RDVDGGAQV EEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD GIKKMAGESC+EVGGSL KFFSGFD RA+S GIYD
Subjt:  VWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYD

Query:  LLDGLNLHMQM
        LLDGLNLHMQM
Subjt:  LLDGLNLHMQM

A0A6J1F6K4 pentatricopeptide repeat-containing protein At2g36730-like3.9e-26990.41Show/hide
Query:  MVRLRLSAVHQIFPPNAL--NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SAV Q+F PNA   NSNSNF SRKH+FLSLLNL SST+HLFQIHAQILV GLQ+DPFLI ELL FAALSP RNLSYGRSLLFHCDLHSAP PWN
Subjt:  MVRLRLSAVHQIFPPNAL--NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
        FIIRGYASS+SP++AI VFGEMRRRGIRPNNLT PFLLKACATL  LQEGKQFHA AIKCGLDLDVYVRNTLINFYGSCK MS+ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NA+ITACVEN CFDEAI+YFLKM +HGFEPDETTMVVILSAC ELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFN LKQRSVWT
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHGFANEAIE FTNMMSSSVVPNYVTFIGVLCACSHAGLVD+GYHYFNIMERVYGIKPMMIHY SMVD LGRAGRVKEAYEFIM M V+PD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSACS RDVDGGAQVAEEARKRLLELE KRGGNVVMVANMFAE GMWKQAADCRR MKDRGIKKMA ESCIEVGGSLCKFFSGF+ARA+S GI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHM
        YDLLDGLNLHM
Subjt:  YDLLDGLNLHM

A0A6J1J399 pentatricopeptide repeat-containing protein At2g36730-like2.1e-27090.64Show/hide
Query:  MVRLRLSAVHQIFPPNAL--NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN
        MVRLR+SAV Q+F PNA   NSNSNF SRKH+FLSLL L SST+HLFQIHAQILVSGLQ+DPFLI ELL FAALSP RNLSYGRSLLFHCDLHSAPLPWN
Subjt:  MVRLRLSAVHQIFPPNAL--NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWN

Query:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW
        FIIRGYASS+SP+ AI VFGEMRRRGIRPNNLT PFLLKACATL  LQEGKQF A AIKCGLDLDVYVRNTLINFYGSCKRMS+ARKVFDEMSERTLVSW
Subjt:  FIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSW

Query:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT
        NA+ITACVEN CFDEAI+YFLKM +HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDV CARLVFN LKQRSV T
Subjt:  NAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD
        WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVD+GYH+FNIMERVYGIKPMMIHYGSMVD+LGRAGRVKEAYEFIM M V+PD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPD

Query:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI
        PIVWRTLLSACS RDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFA+VGMWKQAADCRR MKDRGIKK+AGESCIEVGGSLCKFFSGF+ARA+S GI
Subjt:  PIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGI

Query:  YDLLDGLNLHMQM
        YDLLDGLNLHMQ+
Subjt:  YDLLDGLNLHMQM

A0A6J1KVA8 pentatricopeptide repeat-containing protein At2g36730-like isoform X14.0e-27491.39Show/hide
Query:  MVRLRLSAVHQIFPPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFI
        MVRLR+SAVHQIFPPNA NSNSNF SRKHQFLSL+ LCSS NHLFQIH+QI+V GLQ+D FL  ELLRFAALSPSRNLSY RSLLFH +LH +PLPWN I
Subjt:  MVRLRLSAVHQIFPPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFI

Query:  IRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNA
        IRGYASSDSPREAIWVF EMRRRGIRPN+LT PFL+KACATL TLQEGK+FHADAIKCGLDLDVYVRNTLINFYGSCKRMS ARKVFDEMS RTLVSWNA
Subjt:  IRGYASSDSPREAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNA

Query:  VITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWS
        VITACVENFCFDEAIEYFL+M +HGFE DETTMVVILSACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCARLVFNCLKQRSVWTWS
Subjt:  VITACVENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWS

Query:  AMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPI
        AMILGLAQHGFANEAIELFTNMMSSSV PNYVTFIGVLCACSHAGLVDKGYHYFNIMERVY IKPMMIHYGSMVDVL RAGRVKEAYEFIMRMPVEPDPI
Subjt:  AMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPI

Query:  VWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYD
        VWRTLLSACS RDVDGGAQV EEA+KRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD GIKKMAGESC+EVGGSL KFFSGFD RA+SDGIYD
Subjt:  VWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYD

Query:  LLDGLNLHMQM
        LLDGLNLHMQM
Subjt:  LLDGLNLHMQM

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210658.2e-9138.28Show/hide
Query:  SSTNHLFQIHAQILVSGLQ-HDPFLIAELLRFAALSPS-RNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRPNNLTLPFL
        SS   L QIHA  +  G+   D  L   L+ +    PS   +SY   +    +       WN +IRGYA   +   A  ++ EMR  G + P+  T PFL
Subjt:  SSTNHLFQIHAQILVSGLQ-HDPFLIAELLRFAALSPS-RNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRPNNLTLPFL

Query:  LKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVV
        +KA  T+  ++ G+  H+  I+ G    +YV+N+L++ Y +C  +++A KVFD+M E+ LV+WN+VI    EN   +EA+  + +M   G +PD  T+V 
Subjt:  LKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVV

Query:  ILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVVPNYVTF
        +LSACA++G L+LG+ VH  ++  G+  N+      +D+YA+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAIELF  M S+  ++P  +TF
Subjt:  ILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVVPNYVTF

Query:  IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPK
        +G+L ACSH G+V +G+ YF  M   Y I+P + H+G MVD+L RAG+VK+AYE+I  MP++P+ ++WRTLL AC+   V G + +AE AR ++L+LEP 
Subjt:  IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPK

Query:  RGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIY
          G+ V+++NM+A    W      R+ M   G+KK+ G S +EVG  + +F  G  +  +SD IY
Subjt:  RGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIY

Q0WQW5 Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial3.5e-8937.55Show/hide
Query:  PPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDP---FLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSP
        PP +  S S   +   +  SL   CS  + L Q+HA  L +    +P   FL  ++L+ +  S   +++Y   +    + HS+   WN +IR  A   S 
Subjt:  PPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDP---FLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSP

Query:  R-EAIWVFGEMRRRG-IRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVEN
        + EA  ++ +M  RG   P+  T PF+LKACA +    EGKQ H   +K G   DVYV N LI+ YGSC  +  ARKVFDEM ER+LVSWN++I A V  
Subjt:  R-EAIWVFGEMRRRG-IRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVEN

Query:  FCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGR---GMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILG
          +D A++ F +M+   FEPD  TM  +LSACA LG+LSLG W H+ ++ +    + ++V +  + ++MY K G +  A  VF  +++R + +W+AMILG
Subjt:  FCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGR---GMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILG

Query:  LAQHGFANEAIELFTNMMS--SSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWR
         A HG A EA+  F  M+    +V PN VTF+G+L AC+H G V+KG  YF++M R Y I+P + HYG +VD++ RAG + EA + +M MP++PD ++WR
Subjt:  LAQHGFANEAIELFTNMMS--SSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWR

Query:  TLLSACSTRDVDGGAQVAEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESD
        +LL AC  +      +++EE  + ++    + E   G   G  V+++ ++A    W      R+ M + GI+K  G S IE+ G   +FF+G  +  ++ 
Subjt:  TLLSACSTRDVDGGAQVAEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESD

Query:  GIYDLL
         IY  L
Subjt:  GIYDLL

Q8LK93 Pentatricopeptide repeat-containing protein At2g02980, chloroplastic3.9e-9337.71Show/hide
Query:  LLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSR-NLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTL
        L++ C+S   L QI A  + S ++ D   +A+L+ F   SP+  ++SY R  LF        + +N + RGY+   +P E   +F E+   GI P+N T 
Subjt:  LLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSR-NLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTL

Query:  PFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETT
        P LLKACA    L+EG+Q H  ++K GLD +VYV  TLIN Y  C+ + +AR VFD + E  +V +NA+IT        +EA+  F +M+    +P+E T
Subjt:  PFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETT

Query:  MVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYV
        ++ +LS+CA LG+L LG+W+H           V++ TA +DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ +F  M S +V P+ +
Subjt:  MVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYV

Query:  TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELE
        TF+G+L ACSH G V++G  YF+ M   +GI P + HYGSMVD+L RAG +++AYEFI ++P+ P P++WR LL+ACS+ +      +AE+  +R+ EL+
Subjt:  TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELE

Query:  PKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMT
           GG+ V+++N++A    W+     R+ MKDR   K+ G S IEV   + +FFSG   ++ +  ++  LD +   ++++
Subjt:  PKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMT

Q9CA54 Pentatricopeptide repeat-containing protein At1g746303.6e-9436.45Show/hide
Query:  HQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRP
        H  LSLLN C +   L QIH   +  G+  D +   +L+   A+S S  L Y R LL  C        +N ++RGY+ SD P  ++ VF EM R+G + P
Subjt:  HQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRP

Query:  NNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITAC----------------------
        ++ +  F++KA     +L+ G Q H  A+K GL+  ++V  TLI  YG C  +  ARKVFDEM +  LV+WNAVITAC                      
Subjt:  NNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITAC----------------------

Query:  ----------------------------------------VENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV
                                                  N  F+E+  YF +++  G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Subjt:  ----------------------------------------VENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV

Query:  QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIK
         +  A +DMY++ G+V  ARLVF  +++ R + +W++MI GLA HG   EA+ LF  M +  V P+ ++FI +L ACSHAGL+++G  YF+ M+RVY I+
Subjt:  QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIK

Query:  PMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD
        P + HYG MVD+ GR+G++++AY+FI +MP+ P  IVWRTLL ACS+    G  ++AE+ ++RL EL+P   G++V+++N +A  G WK  A  R++M  
Subjt:  PMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD

Query:  RGIKKMAGESCIEVGGSLCKFFSG
        + IKK    S +EVG ++ KF +G
Subjt:  RGIKKMAGESCIEVGGSLCKFFSG

Q9ZQA1 Pentatricopeptide repeat-containing protein At2g367301.4e-16758.75Show/hide
Query:  NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFG
        +S+S F SRKHQ L  L LCSS  HL QIH QI +S LQ+D F+I+EL+R ++LS +++L++ R+LL H    S P  WN + RGY+SSDSP E+IWV+ 
Subjt:  NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFG

Query:  EMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYF
        EM+RRGI+PN LT PFLLKACA+ + L  G+Q   + +K G D DVYV N LI+ YG+CK+ S ARKVFDEM+ER +VSWN+++TA VEN   +   E F
Subjt:  EMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYF

Query:  LKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL
         +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ R + LN +LGTA VDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++L
Subjt:  LKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL

Query:  FTNMM-SSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGG
        F+ MM  SSV PNYVTF+GVLCACSH GLVD GY YF+ ME+++ IKPMMIHYG+MVD+LGRAGR+ EAY+FI +MP EPD +VWRTLLSACS    +  
Subjt:  FTNMM-SSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGG

Query:  AQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMTWR
          + E+ +KRL+ELEPKR GN+V+VAN FAE  MW +AA+ RR MK+  +KK+AGESC+E+GGS  +FFSG+D R+E   IY+LLD     +   +R
Subjt:  AQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMTWR

Arabidopsis top hitse value%identityAlignment
AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-9037.55Show/hide
Query:  PPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDP---FLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSP
        PP +  S S   +   +  SL   CS  + L Q+HA  L +    +P   FL  ++L+ +  S   +++Y   +    + HS+   WN +IR  A   S 
Subjt:  PPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDP---FLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSP

Query:  R-EAIWVFGEMRRRG-IRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVEN
        + EA  ++ +M  RG   P+  T PF+LKACA +    EGKQ H   +K G   DVYV N LI+ YGSC  +  ARKVFDEM ER+LVSWN++I A V  
Subjt:  R-EAIWVFGEMRRRG-IRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVEN

Query:  FCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGR---GMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILG
          +D A++ F +M+   FEPD  TM  +LSACA LG+LSLG W H+ ++ +    + ++V +  + ++MY K G +  A  VF  +++R + +W+AMILG
Subjt:  FCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGR---GMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILG

Query:  LAQHGFANEAIELFTNMMS--SSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWR
         A HG A EA+  F  M+    +V PN VTF+G+L AC+H G V+KG  YF++M R Y I+P + HYG +VD++ RAG + EA + +M MP++PD ++WR
Subjt:  LAQHGFANEAIELFTNMMS--SSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWR

Query:  TLLSACSTRDVDGGAQVAEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESD
        +LL AC  +      +++EE  + ++    + E   G   G  V+++ ++A    W      R+ M + GI+K  G S IE+ G   +FF+G  +  ++ 
Subjt:  TLLSACSTRDVDGGAQVAEEARKRLL----ELEPKRG---GNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESD

Query:  GIYDLL
         IY  L
Subjt:  GIYDLL

AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-9536.45Show/hide
Query:  HQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRP
        H  LSLLN C +   L QIH   +  G+  D +   +L+   A+S S  L Y R LL  C        +N ++RGY+ SD P  ++ VF EM R+G + P
Subjt:  HQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRP

Query:  NNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITAC----------------------
        ++ +  F++KA     +L+ G Q H  A+K GL+  ++V  TLI  YG C  +  ARKVFDEM +  LV+WNAVITAC                      
Subjt:  NNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITAC----------------------

Query:  ----------------------------------------VENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV
                                                  N  F+E+  YF +++  G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Subjt:  ----------------------------------------VENFCFDEAIEYFLKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV

Query:  QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIK
         +  A +DMY++ G+V  ARLVF  +++ R + +W++MI GLA HG   EA+ LF  M +  V P+ ++FI +L ACSHAGL+++G  YF+ M+RVY I+
Subjt:  QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIK

Query:  PMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD
        P + HYG MVD+ GR+G++++AY+FI +MP+ P  IVWRTLL ACS+    G  ++AE+ ++RL EL+P   G++V+++N +A  G WK  A  R++M  
Subjt:  PMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKD

Query:  RGIKKMAGESCIEVGGSLCKFFSG
        + IKK    S +EVG ++ KF +G
Subjt:  RGIKKMAGESCIEVGGSLCKFFSG

AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-9437.71Show/hide
Query:  LLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSR-NLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTL
        L++ C+S   L QI A  + S ++ D   +A+L+ F   SP+  ++SY R  LF        + +N + RGY+   +P E   +F E+   GI P+N T 
Subjt:  LLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSR-NLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRGIRPNNLTL

Query:  PFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETT
        P LLKACA    L+EG+Q H  ++K GLD +VYV  TLIN Y  C+ + +AR VFD + E  +V +NA+IT        +EA+  F +M+    +P+E T
Subjt:  PFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETT

Query:  MVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYV
        ++ +LS+CA LG+L LG+W+H           V++ TA +DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ +F  M S +V P+ +
Subjt:  MVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYV

Query:  TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELE
        TF+G+L ACSH G V++G  YF+ M   +GI P + HYGSMVD+L RAG +++AYEFI ++P+ P P++WR LL+ACS+ +      +AE+  +R+ EL+
Subjt:  TFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELE

Query:  PKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMT
           GG+ V+++N++A    W+     R+ MKDR   K+ G S IEV   + +FFSG   ++ +  ++  LD +   ++++
Subjt:  PKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMT

AT2G36730.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-16858.75Show/hide
Query:  NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFG
        +S+S F SRKHQ L  L LCSS  HL QIH QI +S LQ+D F+I+EL+R ++LS +++L++ R+LL H    S P  WN + RGY+SSDSP E+IWV+ 
Subjt:  NSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFG

Query:  EMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYF
        EM+RRGI+PN LT PFLLKACA+ + L  G+Q   + +K G D DVYV N LI+ YG+CK+ S ARKVFDEM+ER +VSWN+++TA VEN   +   E F
Subjt:  EMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYF

Query:  LKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL
         +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ R + LN +LGTA VDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++L
Subjt:  LKMRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIEL

Query:  FTNMM-SSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGG
        F+ MM  SSV PNYVTF+GVLCACSH GLVD GY YF+ ME+++ IKPMMIHYG+MVD+LGRAGR+ EAY+FI +MP EPD +VWRTLLSACS    +  
Subjt:  FTNMM-SSSVVPNYVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGG

Query:  AQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMTWR
          + E+ +KRL+ELEPKR GN+V+VAN FAE  MW +AA+ RR MK+  +KK+AGESC+E+GGS  +FFSG+D R+E   IY+LLD     +   +R
Subjt:  AQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMTWR

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.8e-9238.28Show/hide
Query:  SSTNHLFQIHAQILVSGLQ-HDPFLIAELLRFAALSPS-RNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRPNNLTLPFL
        SS   L QIHA  +  G+   D  L   L+ +    PS   +SY   +    +       WN +IRGYA   +   A  ++ EMR  G + P+  T PFL
Subjt:  SSTNHLFQIHAQILVSGLQ-HDPFLIAELLRFAALSPS-RNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSPREAIWVFGEMRRRG-IRPNNLTLPFL

Query:  LKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVV
        +KA  T+  ++ G+  H+  I+ G    +YV+N+L++ Y +C  +++A KVFD+M E+ LV+WN+VI    EN   +EA+  + +M   G +PD  T+V 
Subjt:  LKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLKMRDHGFEPDETTMVV

Query:  ILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVVPNYVTF
        +LSACA++G L+LG+ VH  ++  G+  N+      +D+YA+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAIELF  M S+  ++P  +TF
Subjt:  ILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSS-SVVPNYVTF

Query:  IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPK
        +G+L ACSH G+V +G+ YF  M   Y I+P + H+G MVD+L RAG+VK+AYE+I  MP++P+ ++WRTLL AC+   V G + +AE AR ++L+LEP 
Subjt:  IGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPK

Query:  RGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIY
          G+ V+++NM+A    W      R+ M   G+KK+ G S +EVG  + +F  G  +  +SD IY
Subjt:  RGGNVVMVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGACTTCGCTTATCGGCCGTTCATCAGATTTTCCCACCCAACGCCCTCAATTCCAATTCCAATTTCTTCTCCAGAAAGCATCAATTCCTCTCCCTTCTCAACCT
CTGTTCCTCAACAAATCATCTATTTCAAATCCACGCTCAAATTCTCGTCTCTGGCCTTCAACATGACCCATTTCTCATCGCTGAACTCCTCCGCTTCGCTGCTCTTTCGC
CCTCCAGAAATCTCAGCTATGGCCGCTCTCTCCTCTTCCATTGCGACCTTCATTCCGCCCCTTTGCCATGGAATTTCATCATCAGAGGATATGCCTCGAGCGATTCTCCA
CGAGAGGCCATTTGGGTGTTTGGAGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTTGCCCTTCCTTCTCAAGGCCTGCGCCACGCTCATGACACTCCAAGA
AGGTAAGCAATTTCATGCTGACGCCATTAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTCTGATTAATTTCTATGGGTCATGCAAAAGAATGTCTGCTGCAC
GGAAGGTGTTCGACGAAATGTCTGAAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCATGTGTTGAGAATTTTTGCTTTGATGAAGCAATTGAGTACTTTTTGAAA
ATGCGAGACCATGGTTTTGAGCCGGATGAGACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTAAGCTTAGGGAGGTGGGTTCATTCTCAAGTGGT
GGGTAGAGGGATGGTTTTGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCAAAATCTGGCGATGTTGGATGTGCTAGACTTGTATTCAATTGTTTGAAACAGA
GAAGTGTATGGACATGGAGTGCAATGATTTTGGGGCTAGCCCAACATGGATTTGCTAATGAAGCCATTGAACTTTTCACAAATATGATGAGCTCCTCTGTAGTCCCTAAT
TATGTCACTTTCATTGGTGTCCTATGTGCCTGTAGCCATGCTGGATTGGTGGATAAAGGCTACCATTACTTCAACATTATGGAGAGAGTGTACGGGATTAAGCCGATGAT
GATACATTACGGGTCGATGGTGGATGTTTTAGGTCGTGCAGGTCGGGTCAAGGAGGCTTATGAGTTCATCATGAGGATGCCTGTGGAACCCGATCCAATTGTGTGGAGGA
CATTGCTGAGTGCATGCAGCACTCGTGATGTTGATGGTGGGGCTCAGGTTGCAGAGGAGGCAAGGAAGAGGCTGCTCGAGCTCGAGCCAAAGAGGGGTGGGAATGTGGTG
ATGGTTGCGAACATGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGCCGGAGGGCCATGAAAGATAGAGGGATCAAAAAGATGGCTGGGGAGAGCTGCATTGA
AGTGGGTGGCTCTTTGTGTAAATTCTTCTCAGGTTTTGATGCTCGGGCTGAATCTGATGGCATTTATGATTTGCTTGATGGATTGAACCTGCATATGCAAATGACATGGA
GAGAACGTGATGGAAGTGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGACTTCGCTTATCGGCCGTTCATCAGATTTTCCCACCCAACGCCCTCAATTCCAATTCCAATTTCTTCTCCAGAAAGCATCAATTCCTCTCCCTTCTCAACCT
CTGTTCCTCAACAAATCATCTATTTCAAATCCACGCTCAAATTCTCGTCTCTGGCCTTCAACATGACCCATTTCTCATCGCTGAACTCCTCCGCTTCGCTGCTCTTTCGC
CCTCCAGAAATCTCAGCTATGGCCGCTCTCTCCTCTTCCATTGCGACCTTCATTCCGCCCCTTTGCCATGGAATTTCATCATCAGAGGATATGCCTCGAGCGATTCTCCA
CGAGAGGCCATTTGGGTGTTTGGAGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTTGCCCTTCCTTCTCAAGGCCTGCGCCACGCTCATGACACTCCAAGA
AGGTAAGCAATTTCATGCTGACGCCATTAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTCTGATTAATTTCTATGGGTCATGCAAAAGAATGTCTGCTGCAC
GGAAGGTGTTCGACGAAATGTCTGAAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCATGTGTTGAGAATTTTTGCTTTGATGAAGCAATTGAGTACTTTTTGAAA
ATGCGAGACCATGGTTTTGAGCCGGATGAGACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTAAGCTTAGGGAGGTGGGTTCATTCTCAAGTGGT
GGGTAGAGGGATGGTTTTGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCAAAATCTGGCGATGTTGGATGTGCTAGACTTGTATTCAATTGTTTGAAACAGA
GAAGTGTATGGACATGGAGTGCAATGATTTTGGGGCTAGCCCAACATGGATTTGCTAATGAAGCCATTGAACTTTTCACAAATATGATGAGCTCCTCTGTAGTCCCTAAT
TATGTCACTTTCATTGGTGTCCTATGTGCCTGTAGCCATGCTGGATTGGTGGATAAAGGCTACCATTACTTCAACATTATGGAGAGAGTGTACGGGATTAAGCCGATGAT
GATACATTACGGGTCGATGGTGGATGTTTTAGGTCGTGCAGGTCGGGTCAAGGAGGCTTATGAGTTCATCATGAGGATGCCTGTGGAACCCGATCCAATTGTGTGGAGGA
CATTGCTGAGTGCATGCAGCACTCGTGATGTTGATGGTGGGGCTCAGGTTGCAGAGGAGGCAAGGAAGAGGCTGCTCGAGCTCGAGCCAAAGAGGGGTGGGAATGTGGTG
ATGGTTGCGAACATGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGCCGGAGGGCCATGAAAGATAGAGGGATCAAAAAGATGGCTGGGGAGAGCTGCATTGA
AGTGGGTGGCTCTTTGTGTAAATTCTTCTCAGGTTTTGATGCTCGGGCTGAATCTGATGGCATTTATGATTTGCTTGATGGATTGAACCTGCATATGCAAATGACATGGA
GAGAACGTGATGGAAGTGAATAA
Protein sequenceShow/hide protein sequence
MVRLRLSAVHQIFPPNALNSNSNFFSRKHQFLSLLNLCSSTNHLFQIHAQILVSGLQHDPFLIAELLRFAALSPSRNLSYGRSLLFHCDLHSAPLPWNFIIRGYASSDSP
REAIWVFGEMRRRGIRPNNLTLPFLLKACATLMTLQEGKQFHADAIKCGLDLDVYVRNTLINFYGSCKRMSAARKVFDEMSERTLVSWNAVITACVENFCFDEAIEYFLK
MRDHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIELFTNMMSSSVVPN
YVTFIGVLCACSHAGLVDKGYHYFNIMERVYGIKPMMIHYGSMVDVLGRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSTRDVDGGAQVAEEARKRLLELEPKRGGNVV
MVANMFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEVGGSLCKFFSGFDARAESDGIYDLLDGLNLHMQMTWRERDGSE