CuGenDBv2

CcUC02G030230 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G030230
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: CicolChr02:25564667..25581152
RNA-Seq Expression: CcUC02G030230
Synteny: CcUC02G030230
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
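The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits that string and computes the length of the locus. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("CicolChr02:25564667..25581152")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 16,486 bp of CicolChr02.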


Homology
GenBank top hits | e value | % identity | Alignment
KAG7011168.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 5.9e-107 | 56.56
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        M CS  FS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VKKEEV+                             + P   
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                                                                                            
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
          YTRDTVRNI NILRNCSWGSAQG +E LPIRWDSYLINQVLKTHPPLEK WLFFNWASRLQ F+HDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATDAYKE+LQSGLSP+CCTYT+LMEYLIGE K
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

XP_011649371.1 pentatricopeptide repeat-containing protein At2g01390 [Cucumis sativus] | 3.7e-109 | 57.84
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MH    FSLLLS+YVV+SAI KRIYQNISSK LHS HQYK++KP +RFSR+SRKGTKV KKEEV PRLYTRDTVRNICNILRNCSW SAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                       KH                                                                   
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWAS LQ+FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAIK+WKEMKANGC+PTVVSYTAYIKILLD GQI EAT  YK++LQSGLSP+CCTYTILMEYLIGEGK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

XP_016902133.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis melo] | 3.5e-107 | 57.07
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MH    FSLLLS+YVV SAI KRIYQNIS K LHS HQYK+EKP +RFSR SRKGTKVVKKEEV PR+YTRDTV NICNILRNCSW SAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                       KH                                                                   
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL++FKHD YTYTTMLDIFGEAGRISSMNY+FQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDA TYTSLMHWRSNSGDV+GAIKVWKEMKANGC+PTVVSYTAYIKILLD GQ KEAT  YKE+L++GLSP+CCTYTILMEYLIGEGK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

XP_022971714.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] | 2.0e-107 | 56.81
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        M CS SFS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VKKEEV+                             + P   
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                                                                                            
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
          YTRDTVRNI NILRNCSWGSAQG +E LPIRWDSYLINQVLKTHPPLEK WLFFNWASRLQ FKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATD YKE+LQSGLSP+CCTYT+LMEYLIGE K
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

XP_038901985.1 pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida] | 6.1e-112 | 59.64
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MHCS SFS LLS+YVVTSAI KRIYQNISSK LHSFHQYKQEKP  +F+RKSRKGTKVVKKEEVD R YTRDTVRNI NILR CSWGSAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                         +H                                                                 
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQ+FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEK I
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLD  QIKEATD YKE+LQSGL P+CCTYTILMEYLIGEGK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LJM3 Uncharacterized protein | 1.8e-109 | 57.84
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MH    FSLLLS+YVV+SAI KRIYQNISSK LHS HQYK++KP +RFSR+SRKGTKV KKEEV PRLYTRDTVRNICNILRNCSW SAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                       KH                                                                   
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWAS LQ+FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAIK+WKEMKANGC+PTVVSYTAYIKILLD GQI EAT  YK++LQSGLSP+CCTYTILMEYLIGEGK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

A0A1S4E1N2 pentatricopeptide repeat-containing protein At2g01390-like | 1.7e-107 | 57.07
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MH    FSLLLS+YVV SAI KRIYQNIS K LHS HQYK+EKP +RFSR SRKGTKVVKKEEV PR+YTRDTV NICNILRNCSW SAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                       KH                                                                   
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL++FKHD YTYTTMLDIFGEAGRISSMNY+FQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDA TYTSLMHWRSNSGDV+GAIKVWKEMKANGC+PTVVSYTAYIKILLD GQ KEAT  YKE+L++GLSP+CCTYTILMEYLIGEGK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

A0A6J1DMJ2 pentatricopeptide repeat-containing protein At2g01390 isoform X1 | 2.1e-102 | 55.01
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        MH S SFSLLLS+YVV SAI K+IY NIS KALHS  QYKQEKP   FSRK RKG KVV+KEEVDP+LYTRDTVRNI NILRN SW SAQ          
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                         +H                                                                 
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
                                  LE LP+RWDSYLINQV+KTHPPLEK WLFFNWA RL+ FKHD YTYTTMLDIFGEAGRISSMNY+FQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRS SGDV+GAIKVWKEMK NGCYPTVVSYTAYIKILLD  Q+KEATD YKE+LQSGLSP+CCTYT+LMEYLIG GK
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

A0A6J1EIW0 pentatricopeptide repeat-containing protein At2g01390 | 1.6e-105 | 56.04
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        M CS  FS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VKKEEV+                             + P   
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                                                                                            
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
          YTRDTVRNI NILRNCSW SAQG +E LPIRWDSYLINQVLKTHPPLEK WLFFNWASRLQ F+HDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD  ++++ATDAYKE+LQSGLSP+CCTYT+LMEYLIGE K
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

A0A6J1I9C5 pentatricopeptide repeat-containing protein At2g01390 | 9.8e-108 | 56.81
Query:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW
        M CS SFS L+S+YVVTSAI KRIYQNISSK LHS HQYKQEKP +RFSRK RKGTK VKKEEV+                             + P   
Subjt:  MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP
                                                                                                            
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDP

Query:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
          YTRDTVRNI NILRNCSWGSAQG +E LPIRWDSYLINQVLKTHPPLEK WLFFNWASRLQ FKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI
Subjt:  RLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGI

Query:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
        KIDAVTYTSLMHWRSNSGDV+GAI+VW+EMKANGCYPTVVSYTAYIKILLD G++++ATD YKE+LQSGLSP+CCTYT+LMEYLIGE K
Subjt:  KIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

SwissProt top hits | e value | % identity | Alignment
Q8GYP6 Pentatricopeptide repeat-containing protein At1g18900 | 1.1e-23 | 34.27
Query:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
        Y  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L
Subjt:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860 | 2.7e-22 | 26.72
Query:  FKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSG----DVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGR
        F  D Y+YT+++  F  +GR      VF++M+E G K   +TY  +++     G     +   ++ +K + + P  YT +T+   C   R      A   
Subjt:  FKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSG----DVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGR

Query:  LEMLPIRWDSY-------LINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGD
         E +     SY       L++   K+H P E   +       L  F     TY +++  +   G +     +  QM EKG K D  TYT+L+     +G 
Subjt:  LEMLPIRWDSY-------LINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGD

Query:  VEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILM
        VE A+ +++EM+  GC P + ++ A+IK+  + G+  E    + E+   GLSP   T+  L+
Subjt:  VEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILM

Q9SSF9 Pentatricopeptide repeat-containing protein At1g74750 | 4.1e-26 | 25.88
Query:  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDI
        F + SR+  KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y  NQVLK          FF W  R   FKHD +TYTTM+  
Subjt:  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDI

Query:  FGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLK
         G A +   +N +  +M   G K + VTY  L+H                                        S+G A    E + +       NQ+ +
Subjt:  FGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLK

Query:  THPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTA
                             + D  TY T++DI  +AG +     ++Q+M+E G+  D  TY+ +++    +G +  A +++ EM   GC P +V++  
Subjt:  THPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTA

Query:  YIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
         I +       + A   Y+++  +G  P   TY+I+ME L
Subjt:  YIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

Q9ZU29 Pentatricopeptide repeat-containing protein At2g01390 | 4.0e-74 | 63
Query:  KVVKKEEV-DPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMN
        K+VK + + DP +YTRD V NI NIL+  +W SAQ +L  L +RWDS++IN+VLK HPP++K WLFFNWA++++ FKHDH+TYTTMLDIFGEAGRI SM 
Subjt:  KVVKKEEV-DPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMN

Query:  YVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
         VF  MKEKG+ ID VTYTSL+HW S+SGDV+GA+++W+EM+ NGC PTVVSYTAY+K+L   G+++EAT+ YKE+L+S +SP+C TYT+LMEYL+  GK
Subjt:  YVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g37230 | 5.0e-24 | 24.52
Query:  ICNILRNCSWGS-AQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTS
        IC ++ N +W +  Q  +  L   WD  L+  VL     LE    FF W  R  + +HD  T+  M+ + GE  +++    +   M EKG+  D   +  
Subjt:  ICNILRNCSWGS-AQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTS

Query:  LMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTM
        L+     +G V+ ++K+ +K +    L    T+++  ++ +       +GR  M+  R+ + ++++ ++                         +TY  M
Subjt:  LMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTM

Query:  LDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCC
        L  F  + R+ +    F+ MK +GI  D  T+ ++++       ++ A K++ EMK N   P+VVSYT  IK  L + ++ +    ++E+  SG+ P+  
Subjt:  LDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCC

Query:  TYTILMEYLIGEGK
        TY+ L+  L   GK
Subjt:  TYTILMEYLIGEGK

Arabidopsis top hits | e value | % identity | Alignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein | 7.9e-25 | 34.27
Query:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
        Y  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L
Subjt:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein | 7.9e-25 | 34.27
Query:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
        Y  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L
Subjt:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein | 7.9e-25 | 34.27
Query:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
        Y  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I    G +  A D Y+ +   GLSP   TY++++  L
Subjt:  YTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.9e-27 | 25.88
Query:  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDI
        F + SR+  KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y  NQVLK          FF W  R   FKHD +TYTTM+  
Subjt:  FSRKSRKGTKVVKKEEVDPRLYTRD--TVRNICNILRNCSWG-SAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDI

Query:  FGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLK
         G A +   +N +  +M   G K + VTY  L+H                                        S+G A    E + +       NQ+ +
Subjt:  FGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLK

Query:  THPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTA
                             + D  TY T++DI  +AG +     ++Q+M+E G+  D  TY+ +++    +G +  A +++ EM   GC P +V++  
Subjt:  THPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTA

Query:  YIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL
         I +       + A   Y+++  +G  P   TY+I+ME L
Subjt:  YIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYL

AT2G01390.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.8e-75 | 63
Query:  KVVKKEEV-DPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMN
        K+VK + + DP +YTRD V NI NIL+  +W SAQ +L  L +RWDS++IN+VLK HPP++K WLFFNWA++++ FKHDH+TYTTMLDIFGEAGRI SM 
Subjt:  KVVKKEEV-DPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMN

Query:  YVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
         VF  MKEKG+ ID VTYTSL+HW S+SGDV+GA+++W+EM+ NGC PTVVSYTAY+K+L   G+++EAT+ YKE+L+S +SP+C TYT+LMEYL+  GK
Subjt:  YVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK
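
In BLAST-style pairwise output, the unlabeled line between the two sequence lines conventionally repeats the residue where both sequences agree, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities and positives in one such block. The function name `midline_stats` is hypothetical, the input is a toy block rather than data from this record, and the approach only works if the whitespace of the middle line has been preserved exactly, which may not hold for text copied from a web page. The % identity values listed above are reported per hit, so a single block will generally not reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    # Trailing blanks in a midline are often trimmed; pad it back to the query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Toy block for illustration only (not taken from this record).
ident, pos, cols = midline_stats("MKVLAT", "MK+ A")
percent_identity = 100.0 * ident / cols
print(ident, pos, cols, percent_identity)
```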


Sequences
CDS sequence
ATGCATTGTTCTATTAGTTTTTCTTTACTTCTGAGTCACTATGTGGTTACATCTGCCATCCATAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCC
TTCCACCAATACAAACAAGAGAAACCCACCACAAGATTCAGTAGAAAGTCGAGGAAGGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACT
AGAGATACAGTGAGGAACATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACGCCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATC
AACCAAGTTCTGAAAACACATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCATTATACGTACACCACG
ATGCTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCA
TTAATGCACTGGCGTTCAAATTCGGGAGATGTTGAGGGAGCAATAAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGTGAGGAAC
ATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACGCCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCTGAAAACA
CATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCATTATACGTACACCACGATGCTGGATATTTTTGGA
GAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCA
AATTCGGGAGATGTTGAGGGAGCAATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCAACGGTAGTTTCTTATACTGCTTATATAAAGATTTTGTTG
GACATTGGCCAAATTAAGGAGGCCACTGATGCATACAAGGAGCTGCTTCAATCTGGGCTATCTCCAAGTTGTTGTACTTACACCATCTTAATGGAATACCTTATT
GGAGAGGGCAAG
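
The CDS and the protein sequence listed in this record are related by translation. As a minimal sketch, the snippet below builds the standard genetic code and translates the first 30 nucleotides of the CDS shown above. The names `CODON_TABLE` and `translate` are illustrative, and the use of the standard nuclear code is an assumption (reasonable for a nuclear plant gene, but not stated in the record).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 nucleotides of the CDS sequence in this record.
cds_prefix = "ATGCATTGTTCTATTAGTTTTTCTTTACTT"
protein_prefix = translate(cds_prefix)
print(protein_prefix)  # MHCSISFSLL
```

The result matches the first ten residues of the protein sequence in this record. Joining the wrapped CDS lines into one string before calling `translate` would extend the check to the whole sequence.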
mRNA sequence
TTTTTTTTTAATTTGATTCAAGTCAACTTGGTTGGTCTGCATTATTTTTTAAGGGAAAAAAATTGGTAGTCTTTTAGGGTTTACAATCTGCCGCTACCGACACGC
GCCGCCGCTAGATCCGCCGCCGCCGACCAAAACCGCGGCCGACTTCATCTGTTCGTAGATCTGACCGAAGAAACTCACCTCTGCTATCAGATCTTCTGCTGTCGA
CCAAGATCCAAGCTACTCCGTCCGTTCGTCGCGCTATCCGTTGCGGTTCCTGTCGTCCGACTCCTCTGCTGTTGACCAAGATCCAACGCCGACACAGTCATCGCC
GTCCGTTCATCGCGCGTCAAGTTCTTCGTCGCTCTCGGTCTTCCTCTTTCGCCGGCAAAAAACGCAGTAACAGTTGCTCTTTGAATTGCGTCCAAGTAGTCCTCG
GAGTCCTCTCTTCCTCCCATTGATGCCATTCCAGCGCTTCTCCTTCTAGACACAGCACCGCCGCGTCGAGTTTATCCTTCTCCGACAGCCGATTGACCACGAAGT
AACGCTTCACTCGATGGAACCATCCGAGCGGATCCTCCCCTACCTCTCCTTTGAAGATGGGTATTTCTAAAAACCTTATTTTGTTGTCCATTCCATGCATTGTTC
TATTAGTTTTTCTTTACTTCTGAGTCACTATGTGGTTACATCTGCCATCCATAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTCCACCAATA
CAAACAAGAGAAACCCACCACAAGATTCAGTAGAAAGTCGAGGAAGGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGT
GAGGAACATATGCAATATTCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACGCCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCT
GAAAACACATCCACCATTAGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCATTATACGTACACCACGATGCTGGATAT
TTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTG
GCGTTCAAATTCGGGAGATGTTGAGGGAGCAATAAAGGTAGTTAAGAAGGAAGAAGTAGATCCAAGGCTTTACACTAGAGATACAGTGAGGAACATATGCAATAT
TCTGAGAAATTGCTCATGGGGCTCTGCTCAAGGACGCCTAGAGATGCTTCCTATAAGATGGGATTCTTATCTCATCAACCAAGTTCTGAAAACACATCCACCATT
AGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCATTATACGTACACCACGATGCTGGATATTTTTGGAGAAGCTGGGAG
AATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCGGGAGA
TGTTGAGGGAGCAATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCAACGGTAGTTTCTTATACTGCTTATATAAAGATTTTGTTGGACATTGGCCA
AATTAAGGAGGCCACTGATGCATACAAGGAGCTGCTTCAATCTGGGCTATCTCCAAGTTGTTGTACTTACACCATCTTAATGGAATACCTTATTGGAGAGGGCAA
G
Protein sequence
MHCSISFSLLLSHYVVTSAIHKRIYQNISSKALHSFHQYKQEKPTTRFSRKSRKGTKVVKKEEVDPRLYTRDTVRNICNILRNCSWGSAQGRLEMLPIRWDSYLI
NQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVEGAIKVVKKEEVDPRLYTRDTVRN
ICNILRNCSWGSAQGRLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRS
NSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDIGQIKEATDAYKELLQSGLSPSCCTYTILMEYLIGEGK