CuGenDBv2

MC07g0339 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC07g0339
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: MC07:11217719..11219410
RNA-Seq Expression: MC07g0339
Synteny: MC07g0339
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
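
The genome location above uses a `sequence:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the span can be parsed and its length computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' span (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'MC07:11217719..11219410'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeLocation(seqid, start, end)


loc = parse_location("MC07:11217719..11219410")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span on MC07 is 1692 bp; with a 0-based, half-open convention it would be 1691 bp.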


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7013750.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.00e-276; % identity: 69.5)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLVIKLGL N+L VQNK+L +YV+C+    AR LFDEM RRNVVSWNTVICG+VDCGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF  +LY+DLVLWNVMLYCYVFN+L + AI+IF LMQLEGF GDDFTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+H  LIK SFDLDILVASSLVNMYAKNN L++ARKV DEMP +NSVSWTTMIVGYGQQE GKEAV+L  RMF EDY PDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLS+NNGLI AYSKCG+++ AL+CF LIAEPDLV+WTSIICGLAFCG+E+DAVELF+KMLS GIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG VNMGLHYFNLMTN YQIVPDSEHLTCLIDLIGRAG LDEAF LLKSV +EAG DAF +FIRACRTHG  RLAKW MEF+S+P +PVN SLMSNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARMRKL+KDSCE K PG+SWIEIAG+NHLFVSSDRSHPQS DLY M+GLLLNT KK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

XP_008458191.1 PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucumis melo] (e value: 3.85e-275; % identity: 68.79)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HS+V+KLGL N+LSVQNK+L +YV+C+    AR LFDEM RRN VSWNTVICGLVD GYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF C LY+DLVLWNVMLYCYVFN L R AIE F LMQLEGF+GD+FTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+HGLLIKQSFDLDILVASSL+++YAKN++L++ARKV DEMPT+NSVSWTTMIVGYGQQE GKEAV+LF RMF +DYC DELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLSINNGLI AYSKCG VAAAL+CF LIAEPDLVTWTSIICGLAFCGLE+DAV+LF+KMLSYGIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG V+MGLHYFNLMTN YQ+VPD EHLTCLIDL+GRAG LD+AFDLLKS+ KEAG DA  AFIRACRTHGN +LAKW MEF SEP EPVNYSL+SNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARM KL+ D CE+K+PG SW+EIAG+NHLF S DRSHPQS DLY M+GLLLNT K+
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

XP_022157684.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial [Momordica charantia] (e value: 0.0; % identity: 85.46)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
        ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

XP_022958961.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] (e value: 1.72e-275; % identity: 69.5)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLVIKLGL N+LSVQNK+L +YV+C+    AR LFDEM RRNVVSWNTVICG+V+CGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF  +LY+DLVLWNVMLYCYVFN L + AIEIF+LMQLEGF GDDFTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+H  LIK SFDLDILVASSLVNMYAKNN L++ARK  DEMP +NSVSWTTMIVGYGQQE GKEAV+L  RMF EDY PDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLS+NNGLI AYSKCG+++ ALRCF LIAEPDLV+WTSIICG AFCGLE+ AVELF+KMLS GIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG VNMGLHYFNLMTN YQIVPDSEHLTCLIDLIGRAG LDEAF LLKSV +EAG DAF +FIRACRTHG  RLAKW MEF+S+P +PVN SLMSNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARMRKL+KDSCE K PG+SWIEIAG+NHLFVSSDRSHPQS DLY M+GLLLNT KK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

XP_038874466.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Benincasa hispida] (e value: 1.02e-281; % identity: 70.16)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HS  IKLGL N+LSVQNK+L +YV+C+  + AR LFDEM RRNVVSWNTVICGLV+CGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF CILYRDLVLWNVMLYCYVFN LGR AIE+F LMQLEGF+GDDFTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+HGLLIKQSFDLDILVASSLVN+YAKN++L++ARKV DEMP++NSVSWTTMIVG+GQQEDGKEAV+LF RMF EDY PDELTFASVLSSCG 
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGA ELKQVHSCL+K GFEAF SINNGLI AYSKCG ++AAL+CF LIAEPDLVTWTS ICGLA CGLE++A+ELF+KMLSY IRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG V+MGLHYFNLMTN YQIVPDSEHLTCLIDL+GRAG LDEAFDLLKS+P  AG DAF AFIRACRTHGN RLAKW MEF+SEP E VNYSL+SNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTK
        SEG WSDVARMRKLMKDSC+RK+PG+SW+EIAG+NHLFVS DRSHP+SLDLY M+GLLLNT K
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K863 Uncharacterized protein (e value: 1.19e-282; % identity: 70.39)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLV+KLGLVN+LSVQNK+L +YV+C+    AR LFDEM+RRNVVSWNTVICGLVD GYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF CILYRDLVLWNVMLYC VFN L R AIE+F LMQLEGF+GDDFTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+H LLIKQSFDLDILVASSLVN+Y KN++L++ARKV DEMPT+NSVSWTTMIVGYGQ E GKEAV+LF RMFR+DYCPDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLSINNGLI AYSKCG +AAAL+CF LIAEPDLVTWTSIICGLA CGLE+DAV+LF+KMLSYGIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG V+MGLHYFNLMTN YQ+VPDSEHLTCLIDL+GRAG LD+AFDLLKS+PKEAG DA  AFIRACRTHGN RLAK  MEF+SEP EPVNYSL+SNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARMRKL+ D CE+K+PG SW+EIAG+NHLF+S DRSHPQSLDLY M+GLLLNT KK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

A0A1S3C6T7 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 (e value: 1.87e-275; % identity: 68.79)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HS+V+KLGL N+LSVQNK+L +YV+C+    AR LFDEM RRN VSWNTVICGLVD GYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF C LY+DLVLWNVMLYCYVFN L R AIE F LMQLEGF+GD+FTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+HGLLIKQSFDLDILVASSL+++YAKN++L++ARKV DEMPT+NSVSWTTMIVGYGQQE GKEAV+LF RMF +DYC DELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLSINNGLI AYSKCG VAAAL+CF LIAEPDLVTWTSIICGLAFCGLE+DAV+LF+KMLSYGIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG V+MGLHYFNLMTN YQ+VPD EHLTCLIDL+GRAG LD+AFDLLKS+ KEAG DA  AFIRACRTHGN +LAKW MEF SEP EPVNYSL+SNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARM KL+ D CE+K+PG SW+EIAG+NHLF S DRSHPQS DLY M+GLLLNT K+
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

A0A5D3BXR6 Pentatricopeptide repeat-containing protein (e value: 1.87e-275; % identity: 68.79)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HS+V+KLGL N+LSVQNK+L +YV+C+    AR LFDEM RRN VSWNTVICGLVD GYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF C LY+DLVLWNVMLYCYVFN L R AIE F LMQLEGF+GD+FTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+HGLLIKQSFDLDILVASSL+++YAKN++L++ARKV DEMPT+NSVSWTTMIVGYGQQE GKEAV+LF RMF +DYC DELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLSINNGLI AYSKCG VAAAL+CF LIAEPDLVTWTSIICGLAFCGLE+DAV+LF+KMLSYGIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG V+MGLHYFNLMTN YQ+VPD EHLTCLIDL+GRAG LD+AFDLLKS+ KEAG DA  AFIRACRTHGN +LAKW MEF SEP EPVNYSL+SNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARM KL+ D CE+K+PG SW+EIAG+NHLF S DRSHPQS DLY M+GLLLNT K+
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

A0A6J1DX95 pentatricopeptide repeat-containing protein At2g46050, mitochondrial (e value: 0.0; % identity: 85.46)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
        ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

A0A6J1H3L2 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 (e value: 8.31e-276; % identity: 69.5)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------
        HSLVIKLGL N+LSVQNK+L +YV+C+    AR LFDEM RRNVVSWNTVICG+V+CGYG                                        
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYG----------------------------------------

Query:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI
                                                  AF  +LY+DLVLWNVMLYCYVFN L + AIEIF+LMQLEGF GDDFTFSSLLSSC + 
Subjt:  ------------------------------------------AFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFI

Query:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF
        GSGELGKQ+H  LIK SFDLDILVASSLVNMYAKNN L++ARK  DEMP +NSVSWTTMIVGYGQQE GKEAV+L  RMF EDY PDELTFASVLSSCGF
Subjt:  GSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGF

Query:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS
         SGASEL QVHSCL+K GFEAFLS+NNGLI AYSKCG+++ ALRCF LIAEPDLV+WTSIICG AFCGLE+ AVELF+KMLS GIRPD+IAFLGVLSACS
Subjt:  ISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACS

Query:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA
        HGG VNMGLHYFNLMTN YQIVPDSEHLTCLIDLIGRAG LDEAF LLKSV +EAG DAF +FIRACRTHG  RLAKW MEF+S+P +PVN SLMSNMYA
Subjt:  HGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYA

Query:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
        SEG WSDVARMRKL+KDSCE K PG+SWIEIAG+NHLFVSSDRSHPQS DLY M+GLLLNT KK
Subjt:  SEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

SwissProt top hits (columns: e value, % identity, alignment)
O82363 Pentatricopeptide repeat-containing protein At2g46050, mitochondrial (e value: 3.2e-105; % identity: 41.98)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVI-------------------------------------------------
        H  ++K G+ N L +QNK+L  Y + ++F DA KLFDEM  RN+V+WN +I                                                 
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVI-------------------------------------------------

Query:  -------------------------------CGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAI-EIFYLMQLEG-----FRGDDFTFSSLLSS
                                       CGL+      F  +L RDLVLWN ++  YV N    G I E F L++L G     FRGD FTFSSLLS+
Subjt:  -------------------------------CGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAI-EIFYLMQLEG-----FRGDDFTFSSLLSS

Query:  CNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLS
        C      E GKQIH +L K S+  DI VA++L+NMYAK+N L +AR+  + M  +N VSW  MIVG+ Q  +G+EA+RLFG+M  E+  PDELTFASVLS
Subjt:  CNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLS

Query:  SCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVL
        SC   S   E+KQV + + K G   FLS+ N LI++YS+ G+++ AL CF  I EPDLV+WTS+I  LA  G   +++++FE ML   ++PD+I FL VL
Subjt:  SCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVL

Query:  SACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSL
        SACSHGGLV  GL  F  MT  Y+I  + EH TCLIDL+GRAG +DEA D+L S+P E  + A  AF   C  H      KW  +     EP +PVNYS+
Subjt:  SACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSL

Query:  MSNMYASEGSWSDVARMRKLMKDSC-ERKSPGYSWI
        +SN Y SEG W+  A +RK  + +C   K+PG SW+
Subjt:  MSNMYASEGSWSDVARMRKLMKDSC-ERKSPGYSWI

Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g68930 (e value: 9.0e-92; % identity: 36.98)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLM
        H  VIKLG  + L V + +L +Y       DA+K+F  +  RN V +N+++ GL+ CG       L+R    D V W  M+     N L + AIE F  M
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLM

Query:  QLEGFRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGR
        +++G + D + F S+L +C  +G+   GKQIH  +I+ +F   I V S+L++MY K   LH A+ V D M  KN VSWT M+VGYGQ    +EAV++F  
Subjt:  QLEGFRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGR

Query:  MFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFE
        M R    PD  T    +S+C  +S   E  Q H   +  G   +++++N L+T Y KCG +  + R F+ +   D V+WT+++   A  G   + ++LF+
Subjt:  MFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFE

Query:  KMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKW
        KM+ +G++PD +   GV+SACS  GLV  G  YF LMT+ Y IVP   H +C+IDL  R+G L+EA   +  +P    +  +   + ACR  GN  + KW
Subjt:  KMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKW

Query:  TME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLN
          E     +P  P  Y+L+S++YAS+G W  VA++R+ M++   +K PG SWI+  G  H F + D S P    +Y  +  L N
Subjt:  TME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLN

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 (e value: 3.3e-86; % identity: 34.92)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVIC-----GLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYL
        HS ++KLGL   +SV N +L +Y +C     A+ +FD M  R++ SWN +I      G +D     F  +  RD+V WN M+  +        A++IF  
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVIC-----GLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYL

Query:  MQLEGFRGDD-FTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTK-------------------------
        M  +     D FT +S+LS+C  +    +GKQIH  ++   FD+  +V ++L++MY++   +  AR+++++  TK                         
Subjt:  MQLEGFRGDD-FTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTK-------------------------

Query:  --------NSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAAL
                + V+WT MIVGY Q     EA+ LF  M      P+  T A++LS    ++  S  KQ+H   +K G    +S++N LIT Y+K G++ +A 
Subjt:  --------NSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAAL

Query:  RCFSLI-AEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLD
        R F LI  E D V+WTS+I  LA  G   +A+ELFE ML  G+RPD I ++GV SAC+H GLVN G  YF++M +  +I+P   H  C++DL GRAGLL 
Subjt:  RCFSLI-AEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLD

Query:  EAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVS
        EA + ++ +P E     +G+ + ACR H N  L K   E     EP+    YS ++N+Y++ G W + A++RK MKD   +K  G+SWIE+    H+F  
Subjt:  EAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVS

Query:  SDRSHPQSLDLYVMVGLLLNTTKK
         D +HP+  ++Y+ +  + +  KK
Subjt:  SDRSHPQSLDLYVMVGLLLNTTKK

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 (e value: 5.9e-83; % identity: 32.69)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEG
        HSL+ K   ++ + + + ++ +Y +C    DA+++FDEM  RNVVSWN++I                           C+  N     A+++F +M    
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEG

Query:  FRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIK-QSFDLDILVASSLVNMYAKNNSLHNARKVLDEMP-------------------------------T
           D+ T +S++S+C  + + ++G+++HG ++K      DI+++++ V+MYAK + +  AR + D MP                                
Subjt:  FRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIK-QSFDLDILVASSLVNMYAKNNSLHNARKVLDEMP-------------------------------T

Query:  KNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGF------EAFLSINNGLITAYSKCGSVAAALR
        +N VSW  +I GY Q  + +EA+ LF  + RE  CP   +FA++L +C  ++      Q H  +LK GF      E  + + N LI  Y KCG V     
Subjt:  KNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGF------EAFLSINNGLITAYSKCGSVAAALR

Query:  CFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEA
         F  + E D V+W ++I G A  G   +A+ELF +ML  G +PD I  +GVLSAC H G V  G HYF+ MT  + + P  +H TC++DL+GRAG L+EA
Subjt:  CFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEA

Query:  FDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSD
          +++ +P +  S  +G+ + AC+ H N  L K+  E     EP     Y L+SNMYA  G W DV  +RK M+     K PG SWI+I G +H+F+  D
Subjt:  FDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSD

Query:  RSHPQSLDLYVMVGLLL
        +SHP+   ++ ++ +L+
Subjt:  RSHPQSLDLYVMVGLLL

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 (e value: 5.5e-81; % identity: 31.31)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICG----------------LVDCG--------------------------
        H + +KLGL   L+V N ++ +Y + +KF  AR +FD MS R+++SWN+VI G                L+ CG                          
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICG----------------LVDCG--------------------------

Query:  ------------------------YGAFGC-----ILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFIGSGELGK
                                Y    C     IL+     DLV WN M+  Y  ++ G   +++F LM  +G R DDFT +++  +C F+ +   GK
Subjt:  ------------------------YGAFGC-----ILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFIGSGELGK

Query:  QIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASEL
        Q+H   IK  +DLD+ V+S +++MY K   +  A+   D +P  + V+WTTMI G  +  + + A  +F +M      PDE T A++  +   ++   + 
Subjt:  QIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASEL

Query:  KQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNM
        +Q+H+  LK        +   L+  Y+KCGS+  A   F  I   ++  W +++ GLA  G  ++ ++LF++M S GI+PD++ F+GVLSACSH GLV+ 
Subjt:  KQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNM

Query:  GLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAK--WTMEFSSEPKEPVNYSLMSNMYASEGSW
           +   M   Y I P+ EH +CL D +GRAGL+ +A +L++S+  EA +  +   + ACR  G+    K   T     EP +   Y L+SNMYA+   W
Subjt:  GLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAK--WTMEFSSEPKEPVNYSLMSNMYASEGSW

Query:  SDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
         ++   R +MK    +K PG+SWIE+    H+FV  DRS+ Q+  +Y  V  ++   K+
Subjt:  SDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein (e value: 6.4e-93; % identity: 36.98)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLM
        H  VIKLG  + L V + +L +Y       DA+K+F  +  RN V +N+++ GL+ CG       L+R    D V W  M+     N L + AIE F  M
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLM

Query:  QLEGFRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGR
        +++G + D + F S+L +C  +G+   GKQIH  +I+ +F   I V S+L++MY K   LH A+ V D M  KN VSWT M+VGYGQ    +EAV++F  
Subjt:  QLEGFRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGR

Query:  MFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFE
        M R    PD  T    +S+C  +S   E  Q H   +  G   +++++N L+T Y KCG +  + R F+ +   D V+WT+++   A  G   + ++LF+
Subjt:  MFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFE

Query:  KMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKW
        KM+ +G++PD +   GV+SACS  GLV  G  YF LMT+ Y IVP   H +C+IDL  R+G L+EA   +  +P    +  +   + ACR  GN  + KW
Subjt:  KMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKW

Query:  TME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLN
          E     +P  P  Y+L+S++YAS+G W  VA++R+ M++   +K PG SWI+  G  H F + D S P    +Y  +  L N
Subjt:  TME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLN

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.2e-84; % identity: 32.69)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEG
        HSL+ K   ++ + + + ++ +Y +C    DA+++FDEM  RNVVSWN++I                           C+  N     A+++F +M    
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEG

Query:  FRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIK-QSFDLDILVASSLVNMYAKNNSLHNARKVLDEMP-------------------------------T
           D+ T +S++S+C  + + ++G+++HG ++K      DI+++++ V+MYAK + +  AR + D MP                                
Subjt:  FRGDDFTFSSLLSSCNFIGSGELGKQIHGLLIK-QSFDLDILVASSLVNMYAKNNSLHNARKVLDEMP-------------------------------T

Query:  KNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGF------EAFLSINNGLITAYSKCGSVAAALR
        +N VSW  +I GY Q  + +EA+ LF  + RE  CP   +FA++L +C  ++      Q H  +LK GF      E  + + N LI  Y KCG V     
Subjt:  KNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGF------EAFLSINNGLITAYSKCGSVAAALR

Query:  CFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEA
         F  + E D V+W ++I G A  G   +A+ELF +ML  G +PD I  +GVLSAC H G V  G HYF+ MT  + + P  +H TC++DL+GRAG L+EA
Subjt:  CFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEA

Query:  FDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSD
          +++ +P +  S  +G+ + AC+ H N  L K+  E     EP     Y L+SNMYA  G W DV  +RK M+     K PG SWI+I G +H+F+  D
Subjt:  FDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSD

Query:  RSHPQSLDLYVMVGLLL
        +SHP+   ++ ++ +L+
Subjt:  RSHPQSLDLYVMVGLLL

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein (e value: 2.4e-87; % identity: 34.92)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVIC-----GLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYL
        HS ++KLGL   +SV N +L +Y +C     A+ +FD M  R++ SWN +I      G +D     F  +  RD+V WN M+  +        A++IF  
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVIC-----GLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYL

Query:  MQLEGFRGDD-FTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTK-------------------------
        M  +     D FT +S+LS+C  +    +GKQIH  ++   FD+  +V ++L++MY++   +  AR+++++  TK                         
Subjt:  MQLEGFRGDD-FTFSSLLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTK-------------------------

Query:  --------NSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAAL
                + V+WT MIVGY Q     EA+ LF  M      P+  T A++LS    ++  S  KQ+H   +K G    +S++N LIT Y+K G++ +A 
Subjt:  --------NSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAAL

Query:  RCFSLI-AEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLD
        R F LI  E D V+WTS+I  LA  G   +A+ELFE ML  G+RPD I ++GV SAC+H GLVN G  YF++M +  +I+P   H  C++DL GRAGLL 
Subjt:  RCFSLI-AEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLD

Query:  EAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVS
        EA + ++ +P E     +G+ + ACR H N  L K   E     EP+    YS ++N+Y++ G W + A++RK MKD   +K  G+SWIE+    H+F  
Subjt:  EAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVS

Query:  SDRSHPQSLDLYVMVGLLLNTTKK
         D +HP+  ++Y+ +  + +  KK
Subjt:  SDRSHPQSLDLYVMVGLLLNTTKK

AT2G46050.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.3e-106; % identity: 41.98)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVI-------------------------------------------------
        H  ++K G+ N L +QNK+L  Y + ++F DA KLFDEM  RN+V+WN +I                                                 
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVI-------------------------------------------------

Query:  -------------------------------CGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAI-EIFYLMQLEG-----FRGDDFTFSSLLSS
                                       CGL+      F  +L RDLVLWN ++  YV N    G I E F L++L G     FRGD FTFSSLLS+
Subjt:  -------------------------------CGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAI-EIFYLMQLEG-----FRGDDFTFSSLLSS

Query:  CNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLS
        C      E GKQIH +L K S+  DI VA++L+NMYAK+N L +AR+  + M  +N VSW  MIVG+ Q  +G+EA+RLFG+M  E+  PDELTFASVLS
Subjt:  CNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLS

Query:  SCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVL
        SC   S   E+KQV + + K G   FLS+ N LI++YS+ G+++ AL CF  I EPDLV+WTS+I  LA  G   +++++FE ML   ++PD+I FL VL
Subjt:  SCGFISGASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVL

Query:  SACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSL
        SACSHGGLV  GL  F  MT  Y+I  + EH TCLIDL+GRAG +DEA D+L S+P E  + A  AF   C  H      KW  +     EP +PVNYS+
Subjt:  SACSHGGLVNMGLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTME--FSSEPKEPVNYSL

Query:  MSNMYASEGSWSDVARMRKLMKDSC-ERKSPGYSWI
        +SN Y SEG W+  A +RK  + +C   K+PG SW+
Subjt:  MSNMYASEGSWSDVARMRKLMKDSC-ERKSPGYSWI

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.9e-82; % identity: 31.31)
Query:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICG----------------LVDCG--------------------------
        H + +KLGL   L+V N ++ +Y + +KF  AR +FD MS R+++SWN+VI G                L+ CG                          
Subjt:  HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICG----------------LVDCG--------------------------

Query:  ------------------------YGAFGC-----ILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFIGSGELGK
                                Y    C     IL+     DLV WN M+  Y  ++ G   +++F LM  +G R DDFT +++  +C F+ +   GK
Subjt:  ------------------------YGAFGC-----ILYR----DLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSSLLSSCNFIGSGELGK

Query:  QIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASEL
        Q+H   IK  +DLD+ V+S +++MY K   +  A+   D +P  + V+WTTMI G  +  + + A  +F +M      PDE T A++  +   ++   + 
Subjt:  QIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFISGASEL

Query:  KQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNM
        +Q+H+  LK        +   L+  Y+KCGS+  A   F  I   ++  W +++ GLA  G  ++ ++LF++M S GI+PD++ F+GVLSACSH GLV+ 
Subjt:  KQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNM

Query:  GLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAK--WTMEFSSEPKEPVNYSLMSNMYASEGSW
           +   M   Y I P+ EH +CL D +GRAGL+ +A +L++S+  EA +  +   + ACR  G+    K   T     EP +   Y L+SNMYA+   W
Subjt:  GLHYFNLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAK--WTMEFSSEPKEPVNYSLMSNMYASEGSW

Query:  SDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK
         ++   R +MK    +K PG+SWIE+    H+FV  DRS+ Q+  +Y  V  ++   K+
Subjt:  SDVARMRKLMKDSCERKSPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK


Sequences
CDS sequence
CACAGCCTCGTGATCAAGTTGGGTTTGGTTAATCAACTCTCTGTCCAGAACAAAGTTTTGGGATTATATGTCAGATGCAAGAAATTCAAAGATGCACGGAAACTGTTCGA
TGAAATGTCTAGGCGAAATGTTGTGTCGTGGAACACGGTGATTTGTGGGCTTGTCGATTGTGGGTATGGAGCTTTTGGCTGCATACTGTATAGAGATCTGGTTTTGTGGA
ATGTGATGTTGTATTGTTATGTTTTCAATTATTTGGGGAGAGGGGCGATTGAAATCTTTTATTTGATGCAGTTGGAAGGCTTTAGAGGTGATGATTTTACATTCAGCAGC
CTGCTCAGTTCATGCAATTTTATAGGATCAGGGGAATTGGGTAAGCAGATTCATGGTCTTCTTATAAAACAGTCATTTGATTTAGATATTCTTGTGGCAAGCTCACTTGT
AAATATGTATGCTAAAAACAATAGTTTACATAATGCCCGCAAGGTTTTAGATGAAATGCCGACTAAAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGC
AAGAAGATGGGAAGGAGGCAGTGCGACTGTTTGGGAGAATGTTTCGGGAAGATTATTGTCCGGATGAGTTAACTTTTGCTAGCGTGCTGAGTTCATGTGGCTTTATATCT
GGGGCTAGTGAGCTGAAGCAAGTTCATTCCTGCTTGCTAAAATTTGGTTTCGAAGCATTTTTGTCGATTAATAATGGTTTGATAACTGCATACTCTAAGTGTGGTAGTGT
GGCTGCTGCGTTACGATGCTTTAGCTTAATTGCAGAGCCGGATTTGGTAACATGGACATCGATAATATGTGGTCTTGCGTTCTGTGGCCTTGAAAGGGATGCGGTTGAGT
TGTTTGAGAAGATGTTATCTTATGGCATTAGACCAGATAGAATTGCCTTTCTTGGAGTTCTCTCTGCCTGTAGTCATGGGGGATTAGTAAACATGGGGCTTCACTACTTC
AACTTAATGACGAATGGGTACCAAATTGTTCCCGATTCAGAGCATTTAACATGCCTCATTGACCTGATCGGTAGAGCGGGTCTTCTAGATGAGGCTTTTGATCTTTTGAA
ATCGGTGCCAAAGGAAGCTGGATCAGATGCTTTCGGGGCATTCATTCGGGCATGTAGAACTCATGGTAACCCAAGGTTAGCCAAATGGACCATGGAATTTTCATCAGAGC
CAAAAGAACCTGTGAATTACTCTCTAATGTCGAATATGTATGCTTCTGAAGGTAGCTGGTCAGACGTGGCGAGAATGCGCAAACTTATGAAGGATAGTTGTGAACGGAAA
TCCCCAGGCTATAGTTGGATAGAGATTGCTGGTTTTAACCATTTGTTCGTATCAAGTGATAGATCTCACCCGCAGTCTTTAGATCTTTATGTGATGGTGGGATTGTTACT
AAACACGACAAAGAAA
mRNA sequence
CACAGCCTCGTGATCAAGTTGGGTTTGGTTAATCAACTCTCTGTCCAGAACAAAGTTTTGGGATTATATGTCAGATGCAAGAAATTCAAAGATGCACGGAAACTGTTCGA
TGAAATGTCTAGGCGAAATGTTGTGTCGTGGAACACGGTGATTTGTGGGCTTGTCGATTGTGGGTATGGAGCTTTTGGCTGCATACTGTATAGAGATCTGGTTTTGTGGA
ATGTGATGTTGTATTGTTATGTTTTCAATTATTTGGGGAGAGGGGCGATTGAAATCTTTTATTTGATGCAGTTGGAAGGCTTTAGAGGTGATGATTTTACATTCAGCAGC
CTGCTCAGTTCATGCAATTTTATAGGATCAGGGGAATTGGGTAAGCAGATTCATGGTCTTCTTATAAAACAGTCATTTGATTTAGATATTCTTGTGGCAAGCTCACTTGT
AAATATGTATGCTAAAAACAATAGTTTACATAATGCCCGCAAGGTTTTAGATGAAATGCCGACTAAAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGC
AAGAAGATGGGAAGGAGGCAGTGCGACTGTTTGGGAGAATGTTTCGGGAAGATTATTGTCCGGATGAGTTAACTTTTGCTAGCGTGCTGAGTTCATGTGGCTTTATATCT
GGGGCTAGTGAGCTGAAGCAAGTTCATTCCTGCTTGCTAAAATTTGGTTTCGAAGCATTTTTGTCGATTAATAATGGTTTGATAACTGCATACTCTAAGTGTGGTAGTGT
GGCTGCTGCGTTACGATGCTTTAGCTTAATTGCAGAGCCGGATTTGGTAACATGGACATCGATAATATGTGGTCTTGCGTTCTGTGGCCTTGAAAGGGATGCGGTTGAGT
TGTTTGAGAAGATGTTATCTTATGGCATTAGACCAGATAGAATTGCCTTTCTTGGAGTTCTCTCTGCCTGTAGTCATGGGGGATTAGTAAACATGGGGCTTCACTACTTC
AACTTAATGACGAATGGGTACCAAATTGTTCCCGATTCAGAGCATTTAACATGCCTCATTGACCTGATCGGTAGAGCGGGTCTTCTAGATGAGGCTTTTGATCTTTTGAA
ATCGGTGCCAAAGGAAGCTGGATCAGATGCTTTCGGGGCATTCATTCGGGCATGTAGAACTCATGGTAACCCAAGGTTAGCCAAATGGACCATGGAATTTTCATCAGAGC
CAAAAGAACCTGTGAATTACTCTCTAATGTCGAATATGTATGCTTCTGAAGGTAGCTGGTCAGACGTGGCGAGAATGCGCAAACTTATGAAGGATAGTTGTGAACGGAAA
TCCCCAGGCTATAGTTGGATAGAGATTGCTGGTTTTAACCATTTGTTCGTATCAAGTGATAGATCTCACCCGCAGTCTTTAGATCTTTATGTGATGGTGGGATTGTTACT
AAACACGACAAAGAAA
Protein sequence
HSLVIKLGLVNQLSVQNKVLGLYVRCKKFKDARKLFDEMSRRNVVSWNTVICGLVDCGYGAFGCILYRDLVLWNVMLYCYVFNYLGRGAIEIFYLMQLEGFRGDDFTFSS
LLSSCNFIGSGELGKQIHGLLIKQSFDLDILVASSLVNMYAKNNSLHNARKVLDEMPTKNSVSWTTMIVGYGQQEDGKEAVRLFGRMFREDYCPDELTFASVLSSCGFIS
GASELKQVHSCLLKFGFEAFLSINNGLITAYSKCGSVAAALRCFSLIAEPDLVTWTSIICGLAFCGLERDAVELFEKMLSYGIRPDRIAFLGVLSACSHGGLVNMGLHYF
NLMTNGYQIVPDSEHLTCLIDLIGRAGLLDEAFDLLKSVPKEAGSDAFGAFIRACRTHGNPRLAKWTMEFSSEPKEPVNYSLMSNMYASEGSWSDVARMRKLMKDSCERK
SPGYSWIEIAGFNHLFVSSDRSHPQSLDLYVMVGLLLNTTKK