; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001548 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001548
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionpentatricopeptide repeat-containing protein At2g30100, chloroplastic
Genome locationChr09:18036758..18049493
RNA-Seq ExpressionHG10001548
SyntenyHG10001548
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570645.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.7e-26791.9Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GF+PLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
         R+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM 
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WVIKML
Subjt:  WVIKML

KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]7.0e-26892.09Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GF+PLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM 
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WVIKML
Subjt:  WVIKML

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]3.2e-26892.29Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM 
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WVIKML
Subjt:  WVIKML

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]2.0e-26792.09Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S H VVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM 
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WVIKML
Subjt:  WVIKML

XP_038901728.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida]7.5e-28697.63Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSSALKT++HGFSTPQL+SP PVKFCFMVSRISCNYQDSTFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSPMRKKKSLTWLLRGYIKGGHF DAAETLVKM+DLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDL KCLSDANLIGPSLVYLHLQKHKL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WV+KML
Subjt:  WVIKML

TrEMBL top hitse value%identityAlignment
A0A0A0KC35 Uncharacterized protein2.3e-26491.7Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSS L++++ GFSTP+L         +MVS ISCNYQDSTFSVSRA+KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYWEMGEKEKA+ FVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRKLK Y
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG VAELDKNNVELV KYQTELLADGV+LSNWVLEEGS SI GVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSPM KKKSLTWLLRGYIKGGHFRDAA TLVKM++LGFLPEYLDRVAVLQGLRK+IREPE+V TYLDL KCLSDANLIGPSLVYLHLQKHKL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        W+IKML
Subjt:  WVIKML

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic2.3e-26491.9Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSS L+T+++GFSTP+L         +MVS ISCNYQDSTFSVSRA+KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEGRHNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYWEMGEKEKAI FVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG VAELDKNNVELV KYQTELLADGVRLSNWVLEEGS SIHGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDIVLAICASQKE KAMK
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSPM KKKSLTWLLRGYIKGGHFRDAA T+VKM++LGFLPEYLDRVAVLQGLRK IREPE V TYLDL KCLSDANLIGPSLVYLHLQKHKL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        W+IKML
Subjt:  WVIKML

A0A6J1D3T2 pentatricopeptide repeat-containing protein At2g30100, chloroplastic2.9e-25989.74Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHG-FSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELER
        +GFTP+TQFGFSFSLSSALKT++   FSTPQL+  SPV FCFM+S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELER
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHG-FSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELER

Query:  MTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIE
        MTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHFSMIE
Subjt:  MTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIE

Query:  KVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKS
        KVISLYWEMGEKE+AISFVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVY YLIAMTAVVKELNEFAKALRKLKS
Subjt:  KVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKS

Query:  YARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAM
        Y RDGIVAELDK+NV LVE YQTELLADGVRLSNWVLEEGS SIHGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKET+AM
Subjt:  YARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAM

Query:  KRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHK
         RLLTRIEI SP+ KKKSL+WLLRGYIKGGHF DAAETLVKMVDLGFLPEYLDRVAVLQGLRK+IREP +V+TY  L KCLSDANLIGP LVYLHLQKHK
Subjt:  KRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHK

Query:  LWVIKML
        LWVIKML
Subjt:  LWVIKML

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic1.5e-26892.29Show/hide
Query:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
        +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM
Subjt:  EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM

Query:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK
        TR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEK
Subjt:  TREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEK

Query:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
        VISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Subjt:  VISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY

Query:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK
        ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM 
Subjt:  ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMK

Query:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL
        RLLTRIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KL
Subjt:  RLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKL

Query:  WVIKML
        WVIKML
Subjt:  WVIKML

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic1.9e-26691.68Show/hide
Query:  GFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMT
        GFTPLT+FGFSFSLSS LK+++ GFS PQL S SPV FCF+VSRI+CN+Q+STFSVSRA KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMT
Subjt:  GFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMT

Query:  REPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKV
        R+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCVGLKPHFSMIEKV
Subjt:  REPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKV

Query:  ISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYA
        ISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPSGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYC+LIAMTAVVKELNEFAKALRKLKSYA
Subjt:  ISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYA

Query:  RDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKR
        RDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EGS S HGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM R
Subjt:  RDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKR

Query:  LLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLW
        LL+RIEITSP  KKKSLTWLLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLSDANLIGPSLVYLHLQK+KLW
Subjt:  LLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLW

Query:  VIKML
        VIKML
Subjt:  VIKML

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like1.1e-8541.67Show/hide
Query:  IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE----WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV--
        +R Y+ S+  D   V D+ERRCE+G + ++ LFTD L DPICR+R+SP Y MLVAE      KE+VG+I+G IK V           T +K    +V+  
Subjt:  IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE----WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV--

Query:  ----KMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAE
            K+ YILGLRV+P +RR+GIG  LV+ +EDWF  N  +Y   ATE DNHAS+NLF     Y +FRT  ILVNPV  H  NI S  + + KL+  +AE
Subjt:  ----KMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAE

Query:  AIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI
         +Y+   ++TEFFP+DI ++L NKLSLGT+VA  +     S    G+     S         SWA++S+WN  + F+L +  A     + +K+ +M+DK 
Subjt:  AIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI

Query:  LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNN
        LP  K+  +P  F+PFG +F+YG+  EGP +E++V ALC   HN+A    K+  C  +  E++G+E    L+  IPHWK+LSC ED WCIK L       
Subjt:  LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNN

Query:  NIIISNDNDNEHHILEWKNTPPIRTLFVDPRE
              ++ ++  + +W  +PP  ++FVDPRE
Subjt:  NIIISNDNDNEHHILEWKNTPPIRTLFVDPRE

Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic3.7e-17162.99Show/hide
Query:  PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVL
        P+LH    VK     SRI CN + +      A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+L
Subjt:  PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVL

Query:  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGR
        VYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E   N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R
Subjt:  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGR

Query:  NLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK
           F          EG KGGP GYLAWK MVDGDYR AV MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EK
Subjt:  NLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK

Query:  YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKS
        YQ+E L+ G++L+ W +EEG  + SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+
Subjt:  YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKS

Query:  LTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML
        L+WLLRGY+KGGHF +AAETLV M+D G  PEY+DRVAV+QG+ ++I+ P +V+ Y+ L K L DA L+GP LVY+++ K+KLW++KM+
Subjt:  LTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic2.2e-0621.4Show/hide
Query:  MMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVV
        ++ +G     +K+   ++  GLKP+V  Y   +   +K  N + KA+          ++ EL  N +++                             V+
Subjt:  MMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVV

Query:  HERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFL
        +  +LA+    G+  EAE  + +MK+ G   +   Y  +L   + + + K    L+T ++    +  K  +T LL+ YIKGG F  + E L ++   G+ 
Subjt:  HERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

Q42381 Probable N-acetyltransferase HLS16.5e-8341.65Show/hide
Query:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW---NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKM
        ++R Y+ +R  D V V D+ERRCE+G S ++ LFTD L DPICRIR+SP Y MLVAE     KE+VG+I+G IK V         HK    +V     K+
Subjt:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW---NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKM

Query:  GYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKH
         Y+LGLRV+P +RR+GIG  LV+ +E+WF  N  +Y  +ATE DN AS+NLF     Y +FRT  ILVNPV  H  N+ S  + + KL+  +AE +Y+  
Subjt:  GYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKH

Query:  MASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI
         ++TEFFP+DI ++L NKLSLGT+VA     R S   +G           +    SWA++S+WN  + F L +  A     +  K+ +++DK LP  K+ 
Subjt:  MASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI

Query:  LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISND
         +P+ F+PFG +F+YG+  EGP + ++V +LC   HN+A    K   C  +  E++G   +D L+  IPHWK+LSC ED WCIK L             D
Subjt:  LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISND

Query:  NDNEHHILEWKNTPPIRTLFVDPRE
        + ++  + +W  +PP  ++FVDPRE
Subjt:  NDNEHHILEWKNTPPIRTLFVDPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein7.6e-8741.67Show/hide
Query:  IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE----WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV--
        +R Y+ S+  D   V D+ERRCE+G + ++ LFTD L DPICR+R+SP Y MLVAE      KE+VG+I+G IK V           T +K    +V+  
Subjt:  IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE----WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV--

Query:  ----KMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAE
            K+ YILGLRV+P +RR+GIG  LV+ +EDWF  N  +Y   ATE DNHAS+NLF     Y +FRT  ILVNPV  H  NI S  + + KL+  +AE
Subjt:  ----KMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAE

Query:  AIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI
         +Y+   ++TEFFP+DI ++L NKLSLGT+VA  +     S    G+     S         SWA++S+WN  + F+L +  A     + +K+ +M+DK 
Subjt:  AIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI

Query:  LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNN
        LP  K+  +P  F+PFG +F+YG+  EGP +E++V ALC   HN+A    K+  C  +  E++G+E    L+  IPHWK+LSC ED WCIK L       
Subjt:  LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNN

Query:  NIIISNDNDNEHHILEWKNTPPIRTLFVDPRE
              ++ ++  + +W  +PP  ++FVDPRE
Subjt:  NIIISNDNDNEHHILEWKNTPPIRTLFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein9.6e-9847.42Show/hide
Query:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHKPPTGLVVKMGYILGLRVAPRY
        +IR Y++ R  D++Q+  +E+ CEIG   +  LFTDTL DPICRIRNSP + MLVA    ++VG IQGS+K V F          V++GY+LGLRV P Y
Subjt:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHKPPTGLVVKMGYILGLRVAPRY

Query:  RRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHM-ASTEFFPKDI
        RRRGIGS LVR+LE+WF S++ DY  MATEKDN AS  LFI  L Y+ FR   ILVNPV         S+I I+KLK++EAE++Y++++ A+TEFFP DI
Subjt:  RRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHM-ASTEFFPKDI

Query:  KNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPFGFYFVYGLHH
          IL+NKLS+GTWVA +            N      SWA++S+W+S +VFKLR+ +AP  +L+ TK  K+    L    + ++P+ F PFGFYF+YG+H 
Subjt:  KNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPFGFYFVYGLHH

Query:  EGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEI-SGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRT
        EGP   +LV ALC+ VHNMA  N     CK +V E+  G   DD L+  IPHWK+LSC +D WCIK LK ++N  ++               + +    +
Subjt:  EGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEI-SGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRT

Query:  LFVDPRE
        LFVDPRE
Subjt:  LFVDPRE

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein2.6e-17262.99Show/hide
Query:  PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVL
        P+LH    VK     SRI CN + +      A KFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+L
Subjt:  PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVL

Query:  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGR
        VYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E   N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R
Subjt:  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGR

Query:  NLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK
           F          EG KGGP GYLAWK MVDGDYR AV MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EK
Subjt:  NLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK

Query:  YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKS
        YQ+E L+ G++L+ W +EEG  + SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+
Subjt:  YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKS

Query:  LTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML
        L+WLLRGY+KGGHF +AAETLV M+D G  PEY+DRVAV+QG+ ++I+ P +V+ Y+ L K L DA L+GP LVY+++ K+KLW++KM+
Subjt:  LTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.6e-8441.65Show/hide
Query:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW---NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKM
        ++R Y+ +R  D V V D+ERRCE+G S ++ LFTD L DPICRIR+SP Y MLVAE     KE+VG+I+G IK V         HK    +V     K+
Subjt:  IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW---NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKM

Query:  GYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKH
         Y+LGLRV+P +RR+GIG  LV+ +E+WF  N  +Y  +ATE DN AS+NLF     Y +FRT  ILVNPV  H  N+ S  + + KL+  +AE +Y+  
Subjt:  GYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKH

Query:  MASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI
         ++TEFFP+DI ++L NKLSLGT+VA     R S   +G           +    SWA++S+WN  + F L +  A     +  K+ +++DK LP  K+ 
Subjt:  MASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI

Query:  LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISND
         +P+ F+PFG +F+YG+  EGP + ++V +LC   HN+A    K   C  +  E++G   +D L+  IPHWK+LSC ED WCIK L             D
Subjt:  LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISND

Query:  NDNEHHILEWKNTPPIRTLFVDPRE
        + ++  + +W  +PP  ++FVDPRE
Subjt:  NDNEHHILEWKNTPPIRTLFVDPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.9e-7539.9Show/hide
Query:  FNGFIIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHK-------PPTGLVVKMG
        FN  ++R Y+  R  D   V +LE  CE+G      L  D + DP+ RIR SP + MLVAE   E+VG+I+G+IK V    +         P     K+ 
Subjt:  FNGFIIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHK-------PPTGLVVKMG

Query:  YILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHM
        ++ GLRV+P YRR GIG  LV+RLE+WF+ ND  Y  + TE DN AS+ LF     Y KFRT   LVNPV NH   + S  + I KL   +AE++Y+   
Subjt:  YILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHM

Query:  ASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPF
        ++TEFFP DI +IL NKLSLGT++A     R    V+G     T  SWA++S+WNS +V++L++  A     +  KS ++ D   P  K+   PN FK F
Subjt:  ASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPF

Query:  GFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILE
          +F+YG+  EGP +  +V ALC   HN+A    +   C  +  E++  E    L++ IPHWK+LS  ED WC+K L+            D+D     ++
Subjt:  GFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILE

Query:  WKNTPPIRTLFVDPRE
        W  +PP  ++FVDPRE
Subjt:  WKNTPPIRTLFVDPRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTAACGGCTTTATTATTCGAAGCTATGAAGAGAGTCGATTATCAGATAAAGTTCAAGTTATGGATCTTGAACGACGATGTGAAATTGGTCAATCAAAACGTGT
GTTTCTCTTTACTGACACTTTAGATGACCCTATTTGTAGGATACGTAACAGTCCCATGTATAAAATGTTGGTTGCTGAGTGGAACAAGGAAGTGGTTGGTGTTATTCAAG
GCTCTATAAAAGCGGTTTTTTTTACTGCTCATAAACCACCGACCGGTTTGGTGGTTAAAATGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCGGTATCGCCGTCGTGGG
ATTGGCTCCGGCCTCGTCCGCCGTTTGGAAGATTGGTTCGTTTCTAATGATGTTGATTACTGTTGCATGGCCACTGAGAAAGATAATCACGCCTCTCTTAATCTCTTCAT
CAATAATTTGAGGTACATAAAGTTTAGAACAGGAAGAATCCTAGTAAACCCAGTAAGAAATCATCCATACAATATCAATTCATCAGAAATCAACATTCAAAAGCTAAAAA
TAGAAGAAGCAGAAGCAATATACAAAAAACACATGGCTTCAACAGAGTTCTTCCCCAAAGACATAAAAAACATATTGAAAAACAAGTTGAGTTTAGGGACATGGGTGGCA
AATTTCAAACAACGACGGTTATCGTCGACAGTCGCCGGAGGAAACGAGCAGATTACGGCGAGTAGTTGGGCCATTGTAAGTCTATGGAACAGTGGGGAAGTTTTCAAGCT
AAGGCTAGGAAAAGCACCATTTCCATGGCTTATTTACACAAAGAGTTTAAAAATGATGGATAAAATTTTGCCTTGCTTTAAAGTGATTTTGGTGCCTAATTATTTCAAGC
CATTTGGGTTCTATTTTGTTTATGGATTGCACCATGAAGGCCCTTTTTCTGAGAGATTGGTTGGAGCTTTGTGCAAATTTGTGCACAATATGGCTTTGAACAATTCAAAG
GATCATAATTGTAAAGCTATTGTTACTGAGATTAGTGGTGATGAAGATGATGATGAGCTGAAAATGAAGATTCCCCATTGGAAATTGCTATCATGTTATGAAGATTTTTG
GTGCATAAAGTCCTTGAAAAGTAAGAGAAATAATAATAATATTATTATTAGTAATGATAATGATAATGAGCATCATATATTGGAATGGAAAAATACCCCACCTATTAGAA
CTCTCTTTGTAGACCCAAGAGAGGGCTTTACTCCGTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTGCTCTGAAAACTGAGAAACATGGGTTTTCTACTCCCCAA
TTGCATAGTCCTTCGCCGGTAAAGTTTTGCTTTATGGTTTCTCGTATTTCTTGCAACTATCAGGATTCTACTTTCTCTGTCTCGCGAGCTAGTAAGTTTCGGGACTTAAG
GTTGTTCAAATCGGTTGAGTTGGATCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAAAGAATGACGAGGGAAC
CATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTATTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTT
TTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCAG
TGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTATTGGGAAATGGGTGAGAAGGAAAAGG
CAATTTCGTTTGTGAAAGAGGTCTTGGGACGCAATCTTGCTTTTATGAAGGACGATTGGGAGGGACATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTT
GATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATCGCGATGACTGCTGTGGTTAAAGAGCT
GAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGATGGGATAGTGGCTGAACTCGATAAAAACAACGTTGAACTTGTTGAGAAGTATCAGACAGAGC
TTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAGCTTTTCGATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAA
GGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTAGCGATTTGTGCTTCACAGAAGGAGACAAA
AGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATG
CTGCAGAAACATTAGTAAAAATGGTCGATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGACTAAGAAAACAGATTCGGGAACCTGAAAATGTC
GATACTTATCTCGATCTCCTCAAATGTCTCTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTAACGGCTTTATTATTCGAAGCTATGAAGAGAGTCGATTATCAGATAAAGTTCAAGTTATGGATCTTGAACGACGATGTGAAATTGGTCAATCAAAACGTGT
GTTTCTCTTTACTGACACTTTAGATGACCCTATTTGTAGGATACGTAACAGTCCCATGTATAAAATGTTGGTTGCTGAGTGGAACAAGGAAGTGGTTGGTGTTATTCAAG
GCTCTATAAAAGCGGTTTTTTTTACTGCTCATAAACCACCGACCGGTTTGGTGGTTAAAATGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCGGTATCGCCGTCGTGGG
ATTGGCTCCGGCCTCGTCCGCCGTTTGGAAGATTGGTTCGTTTCTAATGATGTTGATTACTGTTGCATGGCCACTGAGAAAGATAATCACGCCTCTCTTAATCTCTTCAT
CAATAATTTGAGGTACATAAAGTTTAGAACAGGAAGAATCCTAGTAAACCCAGTAAGAAATCATCCATACAATATCAATTCATCAGAAATCAACATTCAAAAGCTAAAAA
TAGAAGAAGCAGAAGCAATATACAAAAAACACATGGCTTCAACAGAGTTCTTCCCCAAAGACATAAAAAACATATTGAAAAACAAGTTGAGTTTAGGGACATGGGTGGCA
AATTTCAAACAACGACGGTTATCGTCGACAGTCGCCGGAGGAAACGAGCAGATTACGGCGAGTAGTTGGGCCATTGTAAGTCTATGGAACAGTGGGGAAGTTTTCAAGCT
AAGGCTAGGAAAAGCACCATTTCCATGGCTTATTTACACAAAGAGTTTAAAAATGATGGATAAAATTTTGCCTTGCTTTAAAGTGATTTTGGTGCCTAATTATTTCAAGC
CATTTGGGTTCTATTTTGTTTATGGATTGCACCATGAAGGCCCTTTTTCTGAGAGATTGGTTGGAGCTTTGTGCAAATTTGTGCACAATATGGCTTTGAACAATTCAAAG
GATCATAATTGTAAAGCTATTGTTACTGAGATTAGTGGTGATGAAGATGATGATGAGCTGAAAATGAAGATTCCCCATTGGAAATTGCTATCATGTTATGAAGATTTTTG
GTGCATAAAGTCCTTGAAAAGTAAGAGAAATAATAATAATATTATTATTAGTAATGATAATGATAATGAGCATCATATATTGGAATGGAAAAATACCCCACCTATTAGAA
CTCTCTTTGTAGACCCAAGAGAGGGCTTTACTCCGTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTGCTCTGAAAACTGAGAAACATGGGTTTTCTACTCCCCAA
TTGCATAGTCCTTCGCCGGTAAAGTTTTGCTTTATGGTTTCTCGTATTTCTTGCAACTATCAGGATTCTACTTTCTCTGTCTCGCGAGCTAGTAAGTTTCGGGACTTAAG
GTTGTTCAAATCGGTTGAGTTGGATCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAAAGAATGACGAGGGAAC
CATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTATTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTT
TTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCAG
TGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTATTGGGAAATGGGTGAGAAGGAAAAGG
CAATTTCGTTTGTGAAAGAGGTCTTGGGACGCAATCTTGCTTTTATGAAGGACGATTGGGAGGGACATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTT
GATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATCGCGATGACTGCTGTGGTTAAAGAGCT
GAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGATGGGATAGTGGCTGAACTCGATAAAAACAACGTTGAACTTGTTGAGAAGTATCAGACAGAGC
TTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAGCTTTTCGATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAA
GGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTAGCGATTTGTGCTTCACAGAAGGAGACAAA
AGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATG
CTGCAGAAACATTAGTAAAAATGGTCGATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGACTAAGAAAACAGATTCGGGAACCTGAAAATGTC
GATACTTATCTCGATCTCCTCAAATGTCTCTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTG
A
Protein sequenceShow/hide protein sequence
MEFNGFIIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHKPPTGLVVKMGYILGLRVAPRYRRRG
IGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVA
NFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSK
DHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLFVDPREGFTPLTQFGFSFSLSSALKTEKHGFSTPQ
LHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEV
FEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMV
DGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ
GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENV
DTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML