CuGenDBv2

Tan0016155 (gene) of Snake gourd v1 genome

Gene ID: Tan0016155
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG08:12989508..12990572
RNA-Seq Expression: Tan0016155
Synteny: Tan0016155
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
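
The genome location above is written as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the span length can be derived as follows; `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(location):
    """Split a 'LG08:12989508..12990572' style string into (sequence id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seq_id, start, end = parse_location("LG08:12989508..12990572")
length = span_length(start, end)
print(seq_id, start, end, length)
```

Under that assumption the span is 1065 bp, which would be consistent with a 354-residue protein plus one stop codon (354 x 3 + 3 = 1065).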


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6586124.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.8e-185; % identity 91.5
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PVDS TGNSFI YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGNLLWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPP+SL
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

KAG7020945.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 1.7e-185; % identity 91.5
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PVDS TGNSFI+YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGNLLWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPP+SL
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

XP_022937749.1 pentatricopeptide repeat-containing protein At3g42630 [Cucurbita moschata]; e value 8.4e-185; % identity 91.22
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PVDS TGNSFI YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGN+LWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPP+SL
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

XP_022966074.1 pentatricopeptide repeat-containing protein At3g42630 [Cucurbita maxima]; e value 1.5e-181; % identity 90.08
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PV S TGNSFI+YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGNLLWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYV+RRLGRNLEFVLSKMNPDQPP+  
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TD FVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

XP_023536679.1 pentatricopeptide repeat-containing protein At3g42630 [Cucurbita pepo subsp. pepo]; e value 1.9e-184; % identity 90.93
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELK E FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+K+RY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PVDS TGNSFI+YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGNLLWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKHMKIEPDLV+YGCVVDAYVDRRLGRNLEFVLSKMNPDQPP+SL
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BP30 pentatricopeptide repeat-containing protein At3g42630-like; e value 2.3e-180; % identity 88.98
Query:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC
        MP LA+EIFLELKSEGFPLNNSTLS++MVHYID G   QAQA+WEEMLNSC+ PSV VISKLFN YGKMG FD I  VLDQ+KLRYSHLLPEAYSLAISC
Subjt:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC

Query:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
        FGKH QLELMESTLREMV SGF V+SATGNSFIIYYSMFGSL+EMETAYGRLKRSRFLIEK+GI+AMAF YIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
Subjt:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN

Query:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS
        LLLLSYAANFKMKSLQREFL+MVEAGFNPDL TFNIRALAFSRMDLLWDLHLSL+HMKHM IEPDLVTYGCVVDAYVDRRLGRNLEF+LSKMNP QPPVS
Subjt:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS

Query:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        LTD FVFEALGKGDFHM+SEAFMQF++QKKWTYRELISLYLKKQ+RR+Q+FWNY
Subjt:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

A0A5D3CAE3 Pentatricopeptide repeat-containing protein; e value 2.3e-180; % identity 88.98
Query:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC
        MP LA+EIFLELKSEGFPLNNSTLS++MVHYID G   QAQA+WEEMLNSC+ PSV VISKLFN YGKMG FD I  VLDQ+KLRYSHLLPEAYSLAISC
Subjt:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC

Query:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
        FGKH QLELMESTLREMV SGF V+SATGNSFIIYYSMFGSL+EMETAYGRLKRSRFLIEK+GI+AMAF YIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
Subjt:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN

Query:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS
        LLLLSYAANFKMKSLQREFL+MVEAGFNPDL TFNIRALAFSRMDLLWDLHLSL+HMKHM IEPDLVTYGCVVDAYVDRRLGRNLEF+LSKMNP QPPVS
Subjt:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS

Query:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        LTD FVFEALGKGDFHM+SEAFMQF++QKKWTYRELISLYLKKQ+RR+Q+FWNY
Subjt:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

A0A6J1D9V6 pentatricopeptide repeat-containing protein At3g42630 isoform X2; e value 1.5e-179; % identity 88.42
Query:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC
        +PVLAQEIF ELKSEG PL NSTLS+LM  YIDGG L QAQAIWEEMLNS +VPSV +ISKLF+T+GKMGRFDDII VLDQ+KLRYSHLLPEA+SLAISC
Subjt:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC

Query:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
        FGKHGQLELME+TLREMV SGFPV+SATGNSFII+YS+FGSLMEMETAYGRLKRSRFLIEKEGI+AMAF Y+RKRKFYRLGEFLRDVGLGR+NVGNLLWN
Subjt:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN

Query:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS
        LLLLSYAANFKMKSLQREFL MVEAGF+PDL TFNIRALAFSRMDLLWDLHLSL+HM+H+KIEPDLVTYGCVVDAYVDRRLGRNL+F LSKMNPDQ PVS
Subjt:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS

Query:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        LT+ FVFEALGKGDFHM+SEAFMQFQRQKKWTYRELISLYLK+QYRRDQ+FWNY
Subjt:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

A0A6J1FC40 pentatricopeptide repeat-containing protein At3g42630; e value 4.1e-185; % identity 91.22
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PVDS TGNSFI YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGN+LWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPP+SL
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

A0A6J1HSM5 pentatricopeptide repeat-containing protein At3g42630; e value 7.2e-182; % identity 90.08
Query:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF
        PVLAQEIFLELKSE FPLNNSTLSSLMV YIDGGLL QA+AIWEEMLNSC+VPSVLVISKL NTYGKM RFDDII VLDQ+KLRY HLLPEAYSLAISCF
Subjt:  PVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCF

Query:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL
        GKHGQLELME+TLREMV SG PV S TGNSFI+YYS+FGSLMEMETAYGRLKRSRFLIEKEGILAMAF YIR+RKFYRLGEFLRDVGL RKNVGNLLWNL
Subjt:  GKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNL

Query:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL
        LLLSYAANFKMKSLQREFL MVEAGFNPD+ TFNIRA+AFSRMDLLWDLHLSL+HMKH+KIEPDLVTYGCVVDAYV+RRLGRNLEFVLSKMNPDQPP+  
Subjt:  LLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSL

Query:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        TD FVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY
Subjt:  TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

SwissProt top hits (columns: e value, % identity, alignment)
B8Y6I0 Pentatricopeptide repeat-containing protein 10, chloroplastic; e value 2.0e-11; % identity 20.87
Query:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME
        + S+G   N  T +++M  Y + G + +A A++++M  + +VP+V   + +    GK  RF  ++ +L ++           ++  ++  GK G  + + 
Subjt:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME

Query:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRF------------LIEKEGILAMAFTYIRKRK----------------FYRLGEF
          L  M   G  +   T N+ I  Y   GS       Y  +  + F            ++ ++G  + A + + K +                 Y  G  
Subjt:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRF------------LIEKEGILAMAFTYIRKRK----------------FYRLGEF

Query:  LRDVGLGRKNV---GNLL--WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA
        +  +      V   G +   W +L     ANFK + L   +  F E+   G+NPDL  FN     +++  +        D +K   + PDL+TY  ++D 
Subjt:  LRDVGLGRKNV---GNLL--WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA

Query:  YVDRRLGRNLEFVLSKMNPDQ
        Y         E +L+++   Q
Subjt:  YVDRRLGRNLEFVLSKMNPDQ

O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic; e value 8.0e-13; % identity 23.05
Query:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME
        +  +G   N  T ++++  Y   G   +A  ++  M  +  VP+    + + +  GK  R +++I +L  +K          ++  ++  G  G  + + 
Subjt:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME

Query:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGE----FLRDVG-------------------
           REM   GF  D  T N+ I  Y   GS ++    YG + R+ F        A+     RK   +R GE     ++  G                   
Subjt:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGE----FLRDVG-------------------

Query:  --LGRKNVGNLL--------WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA
          LG + + N +        W LL     ANFK ++L   +R F    + G+ PD+  FN     F+R ++       L+ ++   + PDLVTY  ++D 
Subjt:  --LGRKNVGNLL--------WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA

Query:  YVDRRLGRNLEFVLSKMNPDQ
        YV R      E +L  +   Q
Subjt:  YVDRRLGRNLEFVLSKMNPDQ

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860; e value 8.8e-12; % identity 20.69
Query:  AQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEA--YSLAISCFG
        A  +F  L+ +GF L+  + +SL+  + + G  R+A  ++++M      P+++  + + N +GKMG   + IT L + K++   + P+A  Y+  I+C  
Subjt:  AQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEA--YSLAISCFG

Query:  KHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLL
        +    +       EM  +GF  D  T N+ +  Y       E       +  + F        ++   Y R        E    +           +  L
Subjt:  KHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLL

Query:  LLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKM
        L  +    K++S    F EM  AG  P++ TFN     +       ++    D +    + PD+VT+  ++  +    +   +  V  +M
Subjt:  LLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKM

Q9M2A1 Pentatricopeptide repeat-containing protein At3g42630; e value 6.8e-121; % identity 58.19
Query:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC
        +P +A EIFL+ KS     N  TL +LM+ + + G + +A+ IW+E++NSC+VP V V+SKL + Y + G FD++  +   +  R+S LLP   SLAISC
Subjt:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC

Query:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
        FGK+GQLELME  + EM   G  +++ T N  + YYS FGSL +ME AYGR+K+   +IE+E I A+   Y+++RKFYRL EFL DVGLGR+N+GN+LWN
Subjt:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN

Query:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS
         +LLSYAA+FKMKSLQREF+ M++AGF+PDL TFNIRALAFSRM L WDLHL+L+HM+ + I PDLVT+GCVVDAY+D+RL RNLEFV ++MN D  P+ 
Subjt:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS

Query:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        LTDP  FE LGKGDFH++SEA ++F  +K WTYR+LI +YLKK+ RRDQIFWNY
Subjt:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170; e value 1.8e-12; % identity 25
Query:  QAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPE--AYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYY
        QA  ++E ML+    P++ V + L + YGK    D   + L+ +K   S   P+   +++ ISC  K G+ +L++S + EM   G    + T N+ I  Y
Subjt:  QAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPE--AYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYY

Query:  SMFGSLMEMETAYG-RLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSY--AANFKMKSLQREFLEMVEAGFNPDLAT
           G   EME+     ++    L +   + ++  +Y   R   ++  +     L         +N+L+LS+  A  +K      +F+E  +  F+    T
Subjt:  SMFGSLMEMETAYG-RLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSY--AANFKMKSLQREFLEMVEAGFNPDLAT

Query:  FNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSK-MNPDQPPVSLTDPF---VFEALGK-GDFHMNSEAFMQFQRQ
        +NI    F +   +  +      MK+  ++P+ +TY  +V+AY    L   ++ VL + +N D   V L  PF   +  A G+ GD     E ++Q + +
Subjt:  FNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSK-MNPDQPPVSLTDPF---VFEALGK-GDFHMNSEAFMQFQRQ

Query:  K----KWTYRELISLY
        K    K T+  +I  Y
Subjt:  K----KWTYRELISLY

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 4.5e-11; % identity 24.21
Query:  GLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFII
        G +  A+ ++EEM     VP  +  + + + +GK+GR DD +   +++K          Y+  I+CF K G+L +     REM  +G   +  + ++ + 
Subjt:  GLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFII

Query:  YYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYI----------RKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLEMVE
         +   G + +    Y  ++R        G++   +TY                +RLG  +  VG+   NV  + +  L+       +MK  +  F +M  
Subjt:  YYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYI----------RKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLEMVE

Query:  AGFNPDLATFNIRALAF---SRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVV
        AG  P+LA++N     F     MD   +L   L+ +K   I+PDL+ YG  +
Subjt:  AGFNPDLATFNIRALAF---SRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVV

AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 5.7e-14; % identity 23.05
Query:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME
        +  +G   N  T ++++  Y   G   +A  ++  M  +  VP+    + + +  GK  R +++I +L  +K          ++  ++  G  G  + + 
Subjt:  LKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELME

Query:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGE----FLRDVG-------------------
           REM   GF  D  T N+ I  Y   GS ++    YG + R+ F        A+     RK   +R GE     ++  G                   
Subjt:  STLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGE----FLRDVG-------------------

Query:  --LGRKNVGNLL--------WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA
          LG + + N +        W LL     ANFK ++L   +R F    + G+ PD+  FN     F+R ++       L+ ++   + PDLVTY  ++D 
Subjt:  --LGRKNVGNLL--------WNLLLLSYAANFKMKSL---QREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDA

Query:  YVDRRLGRNLEFVLSKMNPDQ
        YV R      E +L  +   Q
Subjt:  YVDRRLGRNLEFVLSKMNPDQ

AT3G42630.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 4.8e-122; % identity 58.19
Query:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC
        +P +A EIFL+ KS     N  TL +LM+ + + G + +A+ IW+E++NSC+VP V V+SKL + Y + G FD++  +   +  R+S LLP   SLAISC
Subjt:  MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISC

Query:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN
        FGK+GQLELME  + EM   G  +++ T N  + YYS FGSL +ME AYGR+K+   +IE+E I A+   Y+++RKFYRL EFL DVGLGR+N+GN+LWN
Subjt:  FGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN

Query:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS
         +LLSYAA+FKMKSLQREF+ M++AGF+PDL TFNIRALAFSRM L WDLHL+L+HM+ + I PDLVT+GCVVDAY+D+RL RNLEFV ++MN D  P+ 
Subjt:  LLLLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVS

Query:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY
        LTDP  FE LGKGDFH++SEA ++F  +K WTYR+LI +YLKK+ RRDQIFWNY
Subjt:  LTDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 1.3e-13; % identity 25
Query:  QAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPE--AYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYY
        QA  ++E ML+    P++ V + L + YGK    D   + L+ +K   S   P+   +++ ISC  K G+ +L++S + EM   G    + T N+ I  Y
Subjt:  QAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPE--AYSLAISCFGKHGQLELMESTLREMVCSGFPVDSATGNSFIIYY

Query:  SMFGSLMEMETAYG-RLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSY--AANFKMKSLQREFLEMVEAGFNPDLAT
           G   EME+     ++    L +   + ++  +Y   R   ++  +     L         +N+L+LS+  A  +K      +F+E  +  F+    T
Subjt:  SMFGSLMEMETAYG-RLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSY--AANFKMKSLQREFLEMVEAGFNPDLAT

Query:  FNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSK-MNPDQPPVSLTDPF---VFEALGK-GDFHMNSEAFMQFQRQ
        +NI    F +   +  +      MK+  ++P+ +TY  +V+AY    L   ++ VL + +N D   V L  PF   +  A G+ GD     E ++Q + +
Subjt:  FNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSK-MNPDQPPVSLTDPF---VFEALGK-GDFHMNSEAFMQFQRQ

Query:  K----KWTYRELISLY
        K    K T+  +I  Y
Subjt:  K----KWTYRELISLY

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 6.3e-13; % identity 20.69
Query:  AQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEA--YSLAISCFG
        A  +F  L+ +GF L+  + +SL+  + + G  R+A  ++++M      P+++  + + N +GKMG   + IT L + K++   + P+A  Y+  I+C  
Subjt:  AQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEA--YSLAISCFG

Query:  KHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLL
        +    +       EM  +GF  D  T N+ +  Y       E       +  + F        ++   Y R        E    +           +  L
Subjt:  KHGQLELMESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLL

Query:  LLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKM
        L  +    K++S    F EM  AG  P++ TFN     +       ++    D +    + PD+VT+  ++  +    +   +  V  +M
Subjt:  LLSYAANFKMKSLQREFLEMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKM
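
In the alignments above, the unlabeled line between each `Query:` and `Subjt:` pair follows the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below, which assumes that convention and uses the hypothetical helper `midline_stats`, counts identities for the last segment of the first GenBank hit. It covers one segment only, so its result is not expected to equal the whole-alignment % identity reported for that hit.

```python
def midline_stats(query, midline):
    """Count identities and conservative substitutions in one alignment segment.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a segment) are easily lost in plain text.
    """
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, conservative, len(query)

# Last segment of the first GenBank hit (KAG6586124.1), copied from the page.
query   = "TDPFVFEALGKGDFHMNSEAFMQFQRQKKWTYRELISLYLKKQYRRDQIFWNY"
midline = "TDPFVFEALGKGDFHM+SEAFMQFQ+QKKWTYRELISLYLKKQ+RRDQ+FWNY"

identities, conservative, columns = midline_stats(query, midline)
percent_identity = 100.0 * identities / columns
print(f"{identities}/{columns} identical ({percent_identity:.2f}%), "
      f"{conservative} conservative substitutions")
```

Summing identities and columns over all segments of a hit before dividing would give a figure comparable to the per-hit % identity column.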


Sequences
CDS sequence
ATGCCTGTATTAGCTCAAGAGATTTTCTTGGAGCTGAAGTCTGAAGGTTTCCCATTAAACAACTCTACATTGTCTTCTCTTATGGTACACTACATAGATGGGGGTCTTCT
CCGCCAAGCACAAGCGATTTGGGAAGAAATGTTAAACAGTTGTTATGTTCCTTCTGTTCTAGTAATTTCAAAGTTATTTAACACTTATGGAAAGATGGGACGCTTTGATG
ATATAATTACAGTTCTGGATCAGATAAAGTTGAGGTATTCACATTTACTGCCTGAGGCTTACTCACTAGCCATATCATGTTTTGGGAAGCATGGACAATTGGAATTGATG
GAAAGTACTTTGAGGGAAATGGTTTGCAGTGGTTTTCCAGTTGATTCCGCTACTGGAAATTCCTTTATCATATACTATAGCATGTTTGGTTCTTTAATGGAGATGGAAAC
TGCCTACGGCCGCCTTAAAAGGTCTAGATTTCTAATTGAGAAGGAAGGAATCCTGGCAATGGCATTTACCTACATAAGGAAAAGAAAATTTTACAGATTAGGTGAATTCC
TCAGGGATGTTGGTCTTGGAAGGAAAAACGTGGGGAATCTTTTATGGAATCTTCTACTTCTATCTTATGCTGCTAATTTTAAAATGAAAAGTTTGCAGCGAGAATTTCTG
GAAATGGTCGAAGCTGGATTTAATCCGGATCTTGCCACATTTAATATTAGAGCTCTAGCATTTTCAAGAATGGATTTGTTATGGGATCTTCATCTTAGTCTTGATCATAT
GAAGCATATGAAGATTGAACCCGATCTCGTGACCTACGGTTGTGTTGTTGATGCATATGTAGATAGAAGACTTGGAAGAAATTTGGAGTTCGTTTTGAGCAAAATGAATC
CAGATCAACCTCCAGTATCATTAACAGATCCATTTGTTTTTGAGGCATTGGGTAAAGGAGATTTCCACATGAACTCTGAGGCGTTCATGCAGTTCCAGAGGCAGAAGAAA
TGGACTTACAGAGAGTTAATATCATTGTATCTGAAAAAGCAATACAGGAGAGATCAAATCTTCTGGAATTACTAA
mRNA sequence
ATGCCTGTATTAGCTCAAGAGATTTTCTTGGAGCTGAAGTCTGAAGGTTTCCCATTAAACAACTCTACATTGTCTTCTCTTATGGTACACTACATAGATGGGGGTCTTCT
CCGCCAAGCACAAGCGATTTGGGAAGAAATGTTAAACAGTTGTTATGTTCCTTCTGTTCTAGTAATTTCAAAGTTATTTAACACTTATGGAAAGATGGGACGCTTTGATG
ATATAATTACAGTTCTGGATCAGATAAAGTTGAGGTATTCACATTTACTGCCTGAGGCTTACTCACTAGCCATATCATGTTTTGGGAAGCATGGACAATTGGAATTGATG
GAAAGTACTTTGAGGGAAATGGTTTGCAGTGGTTTTCCAGTTGATTCCGCTACTGGAAATTCCTTTATCATATACTATAGCATGTTTGGTTCTTTAATGGAGATGGAAAC
TGCCTACGGCCGCCTTAAAAGGTCTAGATTTCTAATTGAGAAGGAAGGAATCCTGGCAATGGCATTTACCTACATAAGGAAAAGAAAATTTTACAGATTAGGTGAATTCC
TCAGGGATGTTGGTCTTGGAAGGAAAAACGTGGGGAATCTTTTATGGAATCTTCTACTTCTATCTTATGCTGCTAATTTTAAAATGAAAAGTTTGCAGCGAGAATTTCTG
GAAATGGTCGAAGCTGGATTTAATCCGGATCTTGCCACATTTAATATTAGAGCTCTAGCATTTTCAAGAATGGATTTGTTATGGGATCTTCATCTTAGTCTTGATCATAT
GAAGCATATGAAGATTGAACCCGATCTCGTGACCTACGGTTGTGTTGTTGATGCATATGTAGATAGAAGACTTGGAAGAAATTTGGAGTTCGTTTTGAGCAAAATGAATC
CAGATCAACCTCCAGTATCATTAACAGATCCATTTGTTTTTGAGGCATTGGGTAAAGGAGATTTCCACATGAACTCTGAGGCGTTCATGCAGTTCCAGAGGCAGAAGAAA
TGGACTTACAGAGAGTTAATATCATTGTATCTGAAAAAGCAATACAGGAGAGATCAAATCTTCTGGAATTACTAA
Protein sequence
MPVLAQEIFLELKSEGFPLNNSTLSSLMVHYIDGGLLRQAQAIWEEMLNSCYVPSVLVISKLFNTYGKMGRFDDIITVLDQIKLRYSHLLPEAYSLAISCFGKHGQLELM
ESTLREMVCSGFPVDSATGNSFIIYYSMFGSLMEMETAYGRLKRSRFLIEKEGILAMAFTYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFL
EMVEAGFNPDLATFNIRALAFSRMDLLWDLHLSLDHMKHMKIEPDLVTYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPVSLTDPFVFEALGKGDFHMNSEAFMQFQRQKK
WTYRELISLYLKKQYRRDQIFWNY
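
The protein sequence can be checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 1; `translate` is a hypothetical helper written for illustration, not a function provided by the database. It is applied to the final 75 nucleotides of the CDS listed above, which begin on a codon boundary.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# Final line of the CDS sequence, copied from the page.
cds_tail = ("TGGACTTACAGAGAGTTAATATCATTGTATCTGAAAAAGCAATACAGGAGAGATCAAATC"
            "TTCTGGAATTACTAA")
protein_tail = translate(cds_tail)
print(protein_tail)
```

The translated tail can then be compared with the last line of the protein sequence; concatenating all CDS lines and translating the result extends the same check to the full protein.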