; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036900 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036900
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionpentatricopeptide repeat-containing protein At2g30100, chloroplastic
Genome locationCicolChr02:32631046..32634421
RNA-Seq ExpressionCcUC02G036900
SyntenyCcUC02G036900
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570645.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]7.1e-27091.18Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGF+PLTQFGFS SLSS LK+ER GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERM R+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.9e-27091.37Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGF+PLTQFGFS SLSS LK+ER GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]5.5e-27091.37Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGFTPLTQFGFS SLSS LK+ER GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNV DVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]5.5e-27091.37Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGFTPLTQFGFS SLSS LK+ER GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S H VVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

XP_038901728.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida]7.9e-28595.49Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQGFTPLTQFGFS SLSSALKT+RHGFSTPQLYSP PVKFCF+VSRISCNYQDSTFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YWEMGEKEKAISFVKEVLGR L+F+KDDWEGHKGGPSGYLAWKMMVD DYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEG+FSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHF DAAETLVKM++LGFLPEYLDRVAV+QGLR++IREPENVDTYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        KHKLWV+KML
Subjt:  KHKLWVIKML

TrEMBL top hitse value%identityAlignment
A0A0A0KC35 Uncharacterized protein7.5e-26590.39Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGFTPLTQFGFS SLSS L+++R GFSTP+LY  SP         ISCNYQDSTFSV RA+KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YWEMGEKEKA+ FVKEVLGR L+F+KDDWEGHKGGPSGYLAWKMMVD DYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LK YARDG VAELDKNNVELV KYQTELLADGV+LSNWVLEEG+ SI GVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        KAMKRLLTRIEITSPM KKKSLTWLLRGYIKGGHFRDAA TLVKM+NLGFLPEYLDRVAV+QGLR++IREPE+V TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        KHKLW+IKML
Subjt:  KHKLWVIKML

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic5.7e-26590.59Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGFTPLTQFGFS SLSS L+T+R+GFSTP+LY  SP         ISCNYQDSTFSV RA+KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WI KLVEGRHNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YWEMGEKEKAI FVKEVLGR L+F+KDDWEGHKGGPSGYLAWKMMVD DYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG VAELDKNNVELV KYQTELLADGVRLSNWVLEEG+ SIHGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDIVLAICASQKE 
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        KAMKRLLTRIEITSPM KKKSLTWLLRGYIKGGHFRDAA T+VKM+NLGFLPEYLDRVAV+QGLR+ IREPE V TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        KHKLW+IKML
Subjt:  KHKLWVIKML

A0A6J1D3T2 pentatricopeptide repeat-containing protein At2g30100, chloroplastic1.1e-26088.65Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHG-FSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFS SLSSALKT+R   FSTPQLY  SPV FCF++S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHG-FSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALR
        SMIEKVIS+YWEMGEKE+AISFVKEVLGRK++F+KDD EGHKGGPSGYLAWKMMVD DYRGAVK+VLHLRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALR

Query:  KLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKE
        KLKSY RDGIVAELDK+NV LVE YQTELLADGVRLSNWVLEEG+ SIHGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKE

Query:  TKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHL
        T+AM RLLTRIEI SP+ KKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAV+QGLR++IREP +V+TY  LCKCLS+ANLIGP LVYLHL
Subjt:  TKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLWVIKML
Subjt:  QKHKLWVIKML

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic2.6e-27091.37Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICAQGFTPLTQFGFS SLSS LK+ER GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNV DVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic5.0e-26990.59Show/hide
Query:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MICA GFTPLT+FGFS SLSS LK++R GFS PQL S SPV FCF+VSRI+CN+Q+STFSV RA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMC WIKKLVEG+HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK
        MIEKVIS+YW+MGEKEKAISFVKEVLGRKL F+KD+WEGHKGGPSGYLAWKMMVD DYRGAVKMVL+LRESGLKPEVYC+LIAMTAVVKELNEFAKALRK
Subjt:  MIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRK

Query:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
        LKSYARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG+ S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET
Subjt:  LKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET

Query:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ
        +AM RLL+RIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAV+QGLR++IREPENV+TYLDLCKCLS+ANLIGPSLVYLHLQ
Subjt:  KAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQ

Query:  KHKLWVIKML
        K+KLWVIKML
Subjt:  KHKLWVIKML

SwissProt top hitse value%identityAlignment
Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic8.4e-17364.35Show/hide
Query:  SRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALE
        SRI CN + +      A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LE
Subjt:  SRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALE

Query:  VFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKD-----DWE
        VFEWL+KENRVD+E MELMVSIMCGW+KKL+E   N   V DLL++MDCVGLKP FSM++KVI++Y EMG+KE A+ FVKEVL R+  F          E
Subjt:  VFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKD-----DWE

Query:  GHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNW
        G KGGP GYLAWK MVD DYR AV MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EKYQ+E L+ G++L+ W
Subjt:  GHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNW

Query:  VLEEG--NFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFR
         +EEG  N SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+L+WLLRGY+KGGHF 
Subjt:  VLEEG--NFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFR

Query:  DAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQKHKLWVIKML
        +AAETLV M++ G  PEY+DRVAVMQG+ RKI+ P +V+ Y+ LCK L +A L+GP LVY+++ K+KLW++KM+
Subjt:  DAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQKHKLWVIKML

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic1.6e-0621.86Show/hide
Query:  VKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYIC
        +K+   ++  GLKP+V  Y   +   +K  N + KA+          ++ EL  N +++                             V++  +LA+   
Subjt:  VKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYIC

Query:  AGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVM
         G+  EAE  + +MK+ G   +   Y  +L   + + + K    L+T ++    +  K  +T LL+ YIKGG F  + E L ++ + G+    +    +M
Subjt:  AGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVM

Query:  QGLRR--KIREPENV
         GL +  K+ E  ++
Subjt:  QGLRR--KIREPENV

Arabidopsis top hitse value%identityAlignment
AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-0721.86Show/hide
Query:  VKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYIC
        +K+   ++  GLKP+V  Y   +   +K  N + KA+          ++ EL  N +++                             V++  +LA+   
Subjt:  VKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYIC

Query:  AGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVM
         G+  EAE  + +MK+ G   +   Y  +L   + + + K    L+T ++    +  K  +T LL+ YIKGG F  + E L ++ + G+    +    +M
Subjt:  AGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVM

Query:  QGLRR--KIREPENV
         GL +  K+ E  ++
Subjt:  QGLRR--KIREPENV

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein6.0e-17464.35Show/hide
Query:  SRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALE
        SRI CN + +      A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LE
Subjt:  SRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALE

Query:  VFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKD-----DWE
        VFEWL+KENRVD+E MELMVSIMCGW+KKL+E   N   V DLL++MDCVGLKP FSM++KVI++Y EMG+KE A+ FVKEVL R+  F          E
Subjt:  VFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKD-----DWE

Query:  GHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNW
        G KGGP GYLAWK MVD DYR AV MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EKYQ+E L+ G++L+ W
Subjt:  GHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNW

Query:  VLEEG--NFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFR
         +EEG  N SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+L+WLLRGY+KGGHF 
Subjt:  VLEEG--NFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFR

Query:  DAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQKHKLWVIKML
        +AAETLV M++ G  PEY+DRVAVMQG+ RKI+ P +V+ Y+ LCK L +A L+GP LVY+++ K+KLW++KM+
Subjt:  DAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQKHKLWVIKML

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.9e-0427.78Show/hide
Query:  GRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVD---DDYRG--AVKMVLHLRE
        GR N  D ++ L +M  VGLKP  +M   +I+ Y + G  E+A++  + +    L             PS  LA   +++   +D R   A  ++ +++E
Subjt:  GRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYWEMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVD---DDYRG--AVKMVLHLRE

Query:  SGLKPEVYCYLIAMTAVVKELNEFAK
        +G+KP+V  Y   M A+++ +++F K
Subjt:  SGLKPEVYCYLIAMTAVVKELNEFAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTGTGCCCAGGGCTTTACCCCGTTAACCCAATTTGGGTTTTCATGTTCTTTATCTTCTGCACTGAAAACTGAGAGGCATGGGTTTTCTACTCCCCAATTG
TATAGTCCTTCGCCGGTAAAGTTTTGCTTTGTGGTTTCTCGTATTTCTTGCAACTACCAGGATTCTACTTTCTCTGTTCCGCGAGCTAGTAAGTTTCGGGACTTA
AGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACG
AGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTACTTCTCCCAAGAAGGGAGAGATTCATGGTGT
GCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTGGTTGGATCAAGAAGTTGGTCGAG
GGACGACATAACGTCGGTGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCCCATTTTAGCATGATAGAAAAGGTTATCTCTATGTACTGG
GAAATGGGTGAGAAAGAAAAAGCAATTTCGTTCGTGAAAGAGGTCTTGGGACGCAAGCTTTCTTTTATTAAGGACGATTGGGAGGGACATAAAGGGGGCCCAAGC
GGTTATCTCGCATGGAAGATGATGGTTGATGATGACTATAGGGGCGCAGTGAAAATGGTGCTACATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTAT
CTTATTGCCATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGACGGGATAGTGGCTGAACTCGATAAA
AACAATGTTGAACTTGTTGAGAAGTATCAGACAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAACTTTTCAATTCATGGGGTG
GTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAAGGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGAT
CTGTACGATATTGTGCTAGCCATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAG
AGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATGCTGCAGAAACGTTAGTAAAAATGGTAAATTTGGGTTTTCTCCCGGAGTACTTG
GACAGAGTAGCTGTGATGCAAGGACTAAGAAGAAAAATTCGGGAACCTGAAAATGTCGATACTTATCTCGATCTCTGCAAGTGTCTCTCCAATGCAAATCTAATT
GGACCTAGTCTTGTATATTTACATTTACAGAAACACAAGCTTTGGGTCATCAAAATGCTTTGA
mRNA sequenceShow/hide mRNA sequence
CGCTGCCATTGCAACCACTCACTCCGACTTCCACCGTCACCATCGCTCAATCCTCCGCCAGAAGATTAGGGTTTCTTTCTTTTGTTTCCGATCAACAATTCAATT
CAATTCACCGCTTCTTGAAGGGGTTTCTTGCTGCCTCTCTGTACTCTTCTCTCTTTCTCTTCCGTCTTTCATTTGTTTCATTTTCTGTTACATTGTTACTGTTAT
AACGAGTTCACATTGCGCTCAAAATTGATGCGGTGGAAACCCTAATCTTCATTTCTTGTTTTACTTCATTCGGTTTTGAACTTTTCTCAACTCGTTCGAGATATC
GTAACGCTTAAGTTCGCAGGTTATTCGAAATTTTAGTTTGAATTTAGTGATTTGGATTAGAATTACGAAATGATTTGTGCCCAGGGCTTTACCCCGTTAACCCAA
TTTGGGTTTTCATGTTCTTTATCTTCTGCACTGAAAACTGAGAGGCATGGGTTTTCTACTCCCCAATTGTATAGTCCTTCGCCGGTAAAGTTTTGCTTTGTGGTT
TCTCGTATTTCTTGCAACTACCAGGATTCTACTTTCTCTGTTCCGCGAGCTAGTAAGTTTCGGGACTTAAGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATC
ACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGAT
CGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTACTTCTCCCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAAT
CGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTGGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCGGTGATGTTGTTGACCTTCTT
GTGGATATGGATTGTGTAGGTTTGAAGCCCCATTTTAGCATGATAGAAAAGGTTATCTCTATGTACTGGGAAATGGGTGAGAAAGAAAAAGCAATTTCGTTCGTG
AAAGAGGTCTTGGGACGCAAGCTTTCTTTTATTAAGGACGATTGGGAGGGACATAAAGGGGGCCCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGATGAC
TATAGGGGCGCAGTGAAAATGGTGCTACATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCATGACTGCTGTGGTTAAAGAGCTGAAT
GAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGACGGGATAGTGGCTGAACTCGATAAAAACAATGTTGAACTTGTTGAGAAGTATCAGACAGAG
CTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAACTTTTCAATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCT
GGGCAAGGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTGTACGATATTGTGCTAGCCATTTGTGCTTCACAG
AAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGA
GGGCATTTCCGTGATGCTGCAGAAACGTTAGTAAAAATGGTAAATTTGGGTTTTCTCCCGGAGTACTTGGACAGAGTAGCTGTGATGCAAGGACTAAGAAGAAAA
ATTCGGGAACCTGAAAATGTCGATACTTATCTCGATCTCTGCAAGTGTCTCTCCAATGCAAATCTAATTGGACCTAGTCTTGTATATTTACATTTACAGAAACAC
AAGCTTTGGGTCATCAAAATGCTTTGAAGAAGCTCCTCAATACCTACATCTCTGCACAGGCAGCTAATAAAATGGAGCAAAAACCATATTCATATTATACAGCAC
CAGCTCTTTTTTTGGTGCTTTTGTATAGTTTGAGGGATATGCTTTTTCAGGCAGGTGAAGGTGACTCTTGAAGCTCTTTAAGCTGACCCTGAAGTGGAATACTTG
TGTATATATATATATATAACTCTTTAGCTACTGTGCACAGAGAACCAATGTTTCAATGTATAGATTGTATAAGGAAATTACATATTCTGATATTTTTAGTGTACA
GGCTTTGGTTTCTTCAGTGTTTTCACGTGTGAAGAATCAATCTGGTTTTGCTCTGTGAATCAAATGGAAAGACAATCAAAGTTTGTGAACTGTTGGTCGTGAAAG
ATGTGGCCATATTTTGCTTCTTCTTGCCCAGAAAACAACACGTGGTGTGTCCATTGATTGAACATCCAGGTTTTACACTATTGCATTATTACATTATTCTGTTGC
TTTCACCTCAACAATTTATTGTCTGTATGATTCATACAAGATGTCTTTTACTCTTTTCTTATTATGGCCAGTAAAATGAAAAATCTGTTTGGAACAAGCGTATGT
TTTCATATTGGGGAGGTGTAATGTAATTGTGTCGGTTGGGTCCGCACATATGTCTTTGAATTTATCTATTCAACATTTTAAAAATCGAAATTGAAAGTTCCGG
Protein sequenceShow/hide protein sequence
MICAQGFTPLTQFGFSCSLSSALKTERHGFSTPQLYSPSPVKFCFVVSRISCNYQDSTFSVPRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMT
REPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCGWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISMYW
EMGEKEKAISFVKEVLGRKLSFIKDDWEGHKGGPSGYLAWKMMVDDDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDK
NNVELVEKYQTELLADGVRLSNWVLEEGNFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKK
SLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVMQGLRRKIREPENVDTYLDLCKCLSNANLIGPSLVYLHLQKHKLWVIKML