CuGenDBv2

MS023623 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS023623
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold155:727476..729611
RNA-Seq Expression: MS023623
Synteny: MS023623
Gene Ontology terms:
  GO:0009451 - RNA modification (biological process)
  GO:0000786 - nucleosome (cellular component)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003723 - RNA binding (molecular function)
  GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG7025166.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 0.0e+00 | 87.78
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR+ YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIILSGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDI  AYILFR+N DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI +D F
Subjt:  TKDSHDIDNDSF

XP_022141804.1 pentatricopeptide repeat-containing protein At1g71490-like [Momordica charantia] | 0.0e+00 | 99.72
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRG+RPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
        SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
        GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLN+VMKHGSLV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
        TKDSHDIDNDSF
Subjt:  TKDSHDIDNDSF

XP_022925519.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata] | 0.0e+00 | 87.78
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI +D F
Subjt:  TKDSHDIDNDSF

XP_022973516.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima] | 0.0e+00 | 87.92
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRE LL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSPHLEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD+LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI ND F
Subjt:  TKDSHDIDNDSF

XP_023535485.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo subsp. pepo] | 0.0e+00 | 87.08
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TF+PKQWKN  VSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHG II SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDI  AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMP RPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD+LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI +D F
Subjt:  TKDSHDIDNDSF

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CB12 pentatricopeptide repeat-containing protein At1g71490 | 0.0e+00 | 83.99
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MS S S+ I +GLS+ KL+TFIPK WK  PVSN SEFMIG +FSSLK FA HGQLSK FEAFSLIQLRT YNDSFDLILQS SILLVSCT  SSLPPG+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHG II SGL +DS LV KLV FYSS + L EAHTLVE SN+F PC WN+L+TSYVRN L+EAAIL YKQMLS+GVRPDNFTFPSILKACGETQNL FGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK INA ST WSLFV NALISMYGRCGEVDTAR LFD ML+RD VSWNSMISCY+S+GMW+EAFELF++MQSK +EIN+VTWNIIAGGCLRVGNF  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        L LLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRH +H LS VQNALVTMYARCKDI +AY+LFRLN DKSIITWNSMLSG THLDRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVA+LQHGREFHCYITKR D  D+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMKRFQIKPDHITMVAVLSACSHSGLL QGELLFAEMQSVHGLSP LEHY+CMADLFGRVGLLNKAK IITRMPYRPTSA+WATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
        GNT+IGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAK PGCSWVDVGS FVSF VGDTS+PQALE+ LLLD+L DV+KH SL+
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
        T D++D  ++ F
Subjt:  TKDSHDIDNDSF

A0A5D3BN10 Pentatricopeptide repeat-containing protein | 0.0e+00 | 83.99
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MS S S+ I +GLS+ KL+TFIPK WK  PVSN SEFMIG +FSSLK FA HGQLSK FEAFSLIQLRT YNDSFDLILQS SILLVSCT  SSLPPG+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHG II SGL +DS LV KLV FYSS + L EAHTLVE SN+F PC WN+L+TSYVRN L+EAAIL YKQMLS+GVRPDNFTFPSILKACGETQNL FGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK INA ST WSLFV NALISMYGRCGEVDTAR LFD ML+RD VSWNSMISCY+S+GMW+EAFELF++MQSK +EIN+VTWNIIAGGCLRVGNF  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        L LLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRH +H LS VQNALVTMYARCKDI +AY+LFRLN DKSIITWNSMLSG THLDRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVA+LQHGREFHCYITKR D  D+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMKRFQIKPDHITMVAVLSACSHSGLL QGELLFAEMQSVHGLSP LEHY+CMADLFGRVGLLNKAK IITRMPYRPTSA+WATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
        GNT+IGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAK PGCSWVDVGS FVSF VGDTS+PQALE+ LLLD+L DV+KH SL+
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
        T D++D  ++ F
Subjt:  TKDSHDIDNDSF

A0A6J1CJU8 pentatricopeptide repeat-containing protein At1g71490-like | 0.0e+00 | 99.72
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRG+RPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
        SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
        GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLN+VMKHGSLV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
        TKDSHDIDNDSF
Subjt:  TKDSHDIDNDSF

A0A6J1EI84 pentatricopeptide repeat-containing protein At1g71490 isoform X1 | 0.0e+00 | 87.78
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI +D F
Subjt:  TKDSHDIDNDSF

A0A6J1I8V4 pentatricopeptide repeat-containing protein At1g71490 isoform X1 | 0.0e+00 | 87.92
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQ

Query:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+GVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGA

Query:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEE

Query:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRE LL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSPHLEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  SKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIH

Query:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD+LNDVMKHG+LV
Subjt:  GNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLV

Query:  TKDSHDIDNDSF
          D +DI ND F
Subjt:  TKDSHDIDNDSF

SwissProt top hits | e value | % identity | Alignment
Q4V389 Pentatricopeptide repeat-containing protein At1g22830 | 3.4e-204 | 53.08
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS
        M SS S+ I RGL++ ++  FIP+ WK   +P+S  S     E +   +F+S +    HGQL +AF  FSL++ ++G   S + +L S + LL +C   +
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS

Query:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET
           PG+QLH   I SGLE DS+LVPKLVTFYS+F LL EA T+ ENS I HP PWN+LI SY+RN   + ++ VYK+M+S+G+R D FT+PS++KAC   
Subjt:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR
         +  +G  VH  I   S   +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S+    EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR

Query:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG
         GN++GAL  +  MRN    +  VAMI GL ACSHIGA++ GK  H   IR C   H +  V+N+L+TMY+RC D+ +A+I+F+     S+ TWNS++SG
Subjt:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG

Query:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
        + + +R EE  FL +E+LLSG  PN++T+ASILPL ARV +LQHG+EFHCYI +R+   D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVLSACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L+KA+ I   +PY P+SAM A
Subjt:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA

Query:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA
        TL+ AC IHGNTNIGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA
Subjt:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA

Q9C9I6 Pentatricopeptide repeat-containing protein At1g71490 | 2.4e-234 | 59.48
Query:  VFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSN
        +F SL   A HG L  AF+ FSL++L++    S DL+L S + LL +C +  +   G Q+H   I SG+E  S+LVPKLVTFYS+F L  EA +++ENS+
Subjt:  VFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSN

Query:  IFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNM
        I HP PWN+LI SY +N L E  I  YK+M+S+G+RPD FT+PS+LKACGET ++ FG  VH  I   S + SL+V NALISMY R   +  AR LFD M
Subjt:  IFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNM

Query:  LDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGF
         +RDAVSWN++I+CYAS+GMW EAFELFD M    +E++++TWNII+GGCL+ GN+VGAL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIHG 
Subjt:  LDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGF

Query:  TIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHC
         I   Y  +  V+N L+TMY++CKD+ +A I+FR   + S+ TWNS++SGY  L++ EEA  L RE+L++G +PN +T+ASILPLCAR+A+LQHG+EFHC
Subjt:  TIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHC

Query:  YITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELL
        YI +R+   DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEG  AL LF+EM R  IKPDH+T+VAVLSACSHS L+ +GE L
Subjt:  YITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELL

Query:  FAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA
        F +MQ  +G+ P L+H++CM DL+GR G L KAK II  MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSKLA
Subjt:  FAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA

Query:  KIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK
        ++RT+MRD GV K PGC+W+D  SGF  F VGDTS+P+A     LLD LN +MK
Subjt:  KIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK

Q9LFL5 Pentatricopeptide repeat-containing protein At5g16860 | 7.7e-124 | 34.75
Query:  SSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIF
        S ++S+  +G  +K    F L+   +   D++     +F  +  +C   SS+  G   H   +++G   +  +   LV  YS  + L++A  + +  +++
Subjt:  SSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIF

Query:  HPCPWNLLITSYVRNGLHEAAILVYKQMLSR-GVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNML
            WN +I SY + G  + A+ ++ +M +  G RPDN T  ++L  C        G ++H +        ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCPWNLLITSYVRNGLHEAAILVYKQMLSR-GVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNML

Query:  DRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G +++A  LF+ MQ + I++++VTW+    G  + G    AL +  QM + G   + V +I  L  C+ +GA+  GKEIH + 
Subjt:  DRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------CYHRLSIVQNALVTMYARCKDIMNAYILFRLNS--DKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVE--PNYVTIASILPLCARVA
        I++        +   ++V N L+ MYA+CK +  A  +F   S  ++ ++TW  M+ GY+      +AL L  E+     +  PN  TI+  L  CA +A
Subjt:  IRH-------CYHRLSIVQNALVTMYARCKDIMNAYILFRLNS--DKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVE--PNYVTIASILPLCARVA

Query:  DLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSH
         L+ G++ H Y  + +     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F EM+R   K D +T++ VL ACSH
Subjt:  DLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSH

Query:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMY
        SG++ QG   F  M++V G+SP  EHYAC+ DL GR G LN A  +I  MP  P   +W   +  C IHG   +GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEA-NLLLDNLNDVMKHGSLVTKD--SHDIDND
        A AG W  + +IR+LMR  GV K PGCSWV+   G  +F VGD ++P A E   +LLD++  +   G +       HD+D++
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEA-NLLLDNLNDVMKHGSLVTKD--SHDIDND

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | e value: 1.1e-114 | %identity: 34.38
Query:  LLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQM
        LL +C    SL   R +H ++I  GL   +  + KL+ F              S FK + E + L+          WN +   +  +    +A+ +Y  M
Subjt:  LLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQM

Query:  LSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDN
        +S G+ P+++TFP +LK+C +++    G ++H ++     +  L+V  +LISMY + G ++ A  +FD    RD VS+ ++I  YAS+G  + A +LFD 
Subjt:  LSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDN

Query:  MQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAY
        +  K    ++V+WN +  G    GN+  AL+L   M       D   M+  + AC+  G+I LG+++H +   H +     + NAL+ +Y++C ++  A 
Subjt:  MQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAY

Query:  ILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKR-EDLGDYLLLWNALVDMYARSGKVL
         LF     K +I+WN+++ GYTH++  +EAL LF+E+L SG  PN VT+ SILP CA +  +  GR  H YI KR + + +   L  +L+DMYA+ G + 
Subjt:  ILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKR-EDLGDYLLLWNALVDMYARSGKVL

Query:  EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGL
         A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD IT V +LSACSHSG+L  G  +F  M   + ++P LEHY CM DL G  GL
Subjt:  EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGL

Query:  LNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSF
          +A+ +I  M   P   +W +L+ AC +HGN  +GE  AE L++++PE+ G YVL++N+YA+AG W+++AK R L+ D G+ K PGCS +++ S    F
Subjt:  LNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSF

Query:  LVGDTSNPQALEANLLLDNLNDVMKHGSLVTKDS
        ++GD  +P+  E   +L+ +  +++    V   S
Subjt:  LVGDTSNPQALEANLLLDNLNDVMKHGSLVTKDS

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230 | e value: 2.1e-105 | %identity: 32.24
Query:  ASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACG
        +SSL    Q H RI+ SG + D  +  KL+  YS++    +A  ++++        ++ LI +  +  L   +I V+ +M S G+ PD+   P++ K C 
Subjt:  ASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACG

Query:  ETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGC
        E      G ++H        +   FVQ ++  MY RCG +  AR +FD M D+D V+ ++++  YA KG  +E   +   M+S  IE NIV+WN I  G 
Subjt:  ETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGC

Query:  LRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARC-----------------KDIMNAYI--
         R G    A+ +  ++ + G   D V +   L +      + +G+ IHG+ I+    +   V +A++ MY +                    + NAYI  
Subjt:  LRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARC-----------------KDIMNAYI--

Query:  ------------LFRLNSDK----SIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLW
                    +F L  ++    ++++W S+++G     +  EAL LFRE+ ++GV+PN+VTI S+LP C  +A L HGR  H +   R  L D + + 
Subjt:  ------------LFRLNSDK----SIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLW

Query:  NALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLE
        +AL+DMYA+ G++  ++ VF+ +  K+ V + SL+ G+ M G+  + + +F+ + R ++KPD I+  ++LSAC   GL  +G   F  M   +G+ P LE
Subjt:  NALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLE

Query:  HYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAP
        HY+CM +L GR G L +A  +I  MP+ P S +W  L+ +C +  N ++ E AAEKL  ++PE+ G YVL++N+YAA G W+++  IR  M   G+ K P
Subjt:  HYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAP

Query:  GCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK
        GCSW+ V +   + L GD S+PQ  +    +D ++  M+
Subjt:  GCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK

Arabidopsis top hits | e value | %identity | Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 8.0e-116 | %identity: 34.38
Query:  LLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQM
        LL +C    SL   R +H ++I  GL   +  + KL+ F              S FK + E + L+          WN +   +  +    +A+ +Y  M
Subjt:  LLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQM

Query:  LSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDN
        +S G+ P+++TFP +LK+C +++    G ++H ++     +  L+V  +LISMY + G ++ A  +FD    RD VS+ ++I  YAS+G  + A +LFD 
Subjt:  LSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDN

Query:  MQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAY
        +  K    ++V+WN +  G    GN+  AL+L   M       D   M+  + AC+  G+I LG+++H +   H +     + NAL+ +Y++C ++  A 
Subjt:  MQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAY

Query:  ILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKR-EDLGDYLLLWNALVDMYARSGKVL
         LF     K +I+WN+++ GYTH++  +EAL LF+E+L SG  PN VT+ SILP CA +  +  GR  H YI KR + + +   L  +L+DMYA+ G + 
Subjt:  ILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKR-EDLGDYLLLWNALVDMYARSGKVL

Query:  EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGL
         A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD IT V +LSACSHSG+L  G  +F  M   + ++P LEHY CM DL G  GL
Subjt:  EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGL

Query:  LNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSF
          +A+ +I  M   P   +W +L+ AC +HGN  +GE  AE L++++PE+ G YVL++N+YA+AG W+++AK R L+ D G+ K PGCS +++ S    F
Subjt:  LNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSF

Query:  LVGDTSNPQALEANLLLDNLNDVMKHGSLVTKDS
        ++GD  +P+  E   +L+ +  +++    V   S
Subjt:  LVGDTSNPQALEANLLLDNLNDVMKHGSLVTKDS

AT1G22830.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.4e-205 | %identity: 53.08
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS
        M SS S+ I RGL++ ++  FIP+ WK   +P+S  S     E +   +F+S +    HGQL +AF  FSL++ ++G   S + +L S + LL +C   +
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS

Query:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET
           PG+QLH   I SGLE DS+LVPKLVTFYS+F LL EA T+ ENS I HP PWN+LI SY+RN   + ++ VYK+M+S+G+R D FT+PS++KAC   
Subjt:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR
         +  +G  VH  I   S   +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S+    EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR

Query:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG
         GN++GAL  +  MRN    +  VAMI GL ACSHIGA++ GK  H   IR C   H +  V+N+L+TMY+RC D+ +A+I+F+     S+ TWNS++SG
Subjt:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG

Query:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
        + + +R EE  FL +E+LLSG  PN++T+ASILPL ARV +LQHG+EFHCYI +R+   D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVLSACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L+KA+ I   +PY P+SAM A
Subjt:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA

Query:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA
        TL+ AC IHGNTNIGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA
Subjt:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA

AT1G22830.2 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.4e-205 | %identity: 53.08
Query:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS
        M SS S+ I RGL++ ++  FIP+ WK   +P+S  S     E +   +F+S +    HGQL +AF  FSL++ ++G   S + +L S + LL +C   +
Subjt:  MSSSSSQYIFRGLSLYKLQTFIPKQWKN--KPVSNGS-----EFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNAS

Query:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET
           PG+QLH   I SGLE DS+LVPKLVTFYS+F LL EA T+ ENS I HP PWN+LI SY+RN   + ++ VYK+M+S+G+R D FT+PS++KAC   
Subjt:  SLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR
         +  +G  VH  I   S   +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S+    EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLR

Query:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG
         GN++GAL  +  MRN    +  VAMI GL ACSHIGA++ GK  H   IR C   H +  V+N+L+TMY+RC D+ +A+I+F+     S+ TWNS++SG
Subjt:  VGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSG

Query:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
        + + +R EE  FL +E+LLSG  PN++T+ASILPL ARV +LQHG+EFHCYI +R+   D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  YTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVLSACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L+KA+ I   +PY P+SAM A
Subjt:  AGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWA

Query:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA
        TL+ AC IHGNTNIGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA
Subjt:  TLIGACCIHGNTNIGEWAAEK-LLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKA

AT1G71490.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.7e-235 | %identity: 59.48
Query:  VFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSN
        +F SL   A HG L  AF+ FSL++L++    S DL+L S + LL +C +  +   G Q+H   I SG+E  S+LVPKLVTFYS+F L  EA +++ENS+
Subjt:  VFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSN

Query:  IFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNM
        I HP PWN+LI SY +N L E  I  YK+M+S+G+RPD FT+PS+LKACGET ++ FG  VH  I   S + SL+V NALISMY R   +  AR LFD M
Subjt:  IFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNM

Query:  LDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGF
         +RDAVSWN++I+CYAS+GMW EAFELFD M    +E++++TWNII+GGCL+ GN+VGAL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIHG 
Subjt:  LDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGF

Query:  TIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHC
         I   Y  +  V+N L+TMY++CKD+ +A I+FR   + S+ TWNS++SGY  L++ EEA  L RE+L++G +PN +T+ASILPLCAR+A+LQHG+EFHC
Subjt:  TIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHC

Query:  YITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELL
        YI +R+   DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEG  AL LF+EM R  IKPDH+T+VAVLSACSHS L+ +GE L
Subjt:  YITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELL

Query:  FAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA
        F +MQ  +G+ P L+H++CM DL+GR G L KAK II  MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSKLA
Subjt:  FAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLA

Query:  KIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK
        ++RT+MRD GV K PGC+W+D  SGF  F VGDTS+P+A     LLD LN +MK
Subjt:  KIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMK

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 5.5e-125 | %identity: 34.75
Query:  SSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIF
        S ++S+  +G  +K    F L+   +   D++     +F  +  +C   SS+  G   H   +++G   +  +   LV  YS  + L++A  + +  +++
Subjt:  SSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIF

Query:  HPCPWNLLITSYVRNGLHEAAILVYKQMLSR-GVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNML
            WN +I SY + G  + A+ ++ +M +  G RPDN T  ++L  C        G ++H +        ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCPWNLLITSYVRNGLHEAAILVYKQMLSR-GVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNALISMYGRCGEVDTARNLFDNML

Query:  DRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G +++A  LF+ MQ + I++++VTW+    G  + G    AL +  QM + G   + V +I  L  C+ +GA+  GKEIH + 
Subjt:  DRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------CYHRLSIVQNALVTMYARCKDIMNAYILFRLNS--DKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVE--PNYVTIASILPLCARVA
        I++        +   ++V N L+ MYA+CK +  A  +F   S  ++ ++TW  M+ GY+      +AL L  E+     +  PN  TI+  L  CA +A
Subjt:  IRH-------CYHRLSIVQNALVTMYARCKDIMNAYILFRLNS--DKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVE--PNYVTIASILPLCARVA

Query:  DLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSH
         L+ G++ H Y  + +     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F EM+R   K D +T++ VL ACSH
Subjt:  DLQHGREFHCYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSH

Query:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMY
        SG++ QG   F  M++V G+SP  EHYAC+ DL GR G LN A  +I  MP  P   +W   +  C IHG   +GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEA-NLLLDNLNDVMKHGSLVTKD--SHDIDND
        A AG W  + +IR+LMR  GV K PGCSWV+   G  +F VGD ++P A E   +LLD++  +   G +       HD+D++
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEA-NLLLDNLNDVMKHGSLVTKD--SHDIDND


Sequences
CDS sequence
ATGTCATCTTCCTCTTCTCAATATATCTTCAGAGGTCTCTCTCTATATAAGCTCCAAACATTCATACCTAAACAATGGAAAAACAAACCTGTGAGCAATGGTAGTGAATT
TATGATTGGTCCCGTTTTTTCTTCCCTTAAAAGCTTTGCCTGTCATGGTCAGTTGTCTAAAGCATTTGAAGCTTTCTCCCTCATTCAATTGCGCACTGGATATAATGATT
CATTTGACCTCATCTTGCAATCCTTCTCCATTCTTCTTGTATCATGCACCAATGCTAGCTCACTCCCACCTGGTAGGCAACTTCATGGTCGCATTATCTTGTCAGGTCTT
GAGCAAGATTCGATTTTGGTTCCTAAGCTCGTCACATTCTACTCGAGCTTTAAACTTCTGGCCGAGGCTCATACCCTTGTTGAGAATTCTAATATTTTTCACCCTTGTCC
TTGGAATTTACTCATCACATCATATGTCAGAAATGGACTTCATGAGGCAGCCATTTTAGTTTATAAACAGATGCTGAGTAGAGGAGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTGAAGGCTTGTGGTGAAACACAGAATTTGGGTTTTGGTTTAGAGGTCCACAAGTATATTAATGCTTGGTCAACTGAATGGAGTTTGTTTGTTCAGAATGCT
CTAATTTCTATGTATGGAAGATGTGGAGAGGTGGACACTGCACGTAACTTGTTTGACAATATGCTCGATCGAGATGCAGTTTCGTGGAATTCGATGATCTCTTGTTATGC
CTCCAAGGGCATGTGGAAGGAGGCATTTGAACTGTTCGACAATATGCAGAGTAAGTGTATCGAAATTAACATTGTGACTTGGAACATTATTGCGGGAGGTTGCTTGCGGG
TCGGTAATTTTGTTGGGGCACTTAAGTTATTGTCTCAAATGAGAAATTTTGGTGCTCATTTGGACTATGTAGCAATGATAATAGGTTTGGGTGCTTGTTCTCACATTGGT
GCCATTAGATTGGGAAAAGAAATCCATGGCTTTACCATCAGACATTGTTATCATAGGTTATCCATTGTTCAAAATGCATTAGTTACCATGTATGCTCGTTGTAAAGACAT
TATGAATGCATATATTTTGTTTCGATTAAACAGCGACAAAAGTATAATCACGTGGAACTCCATGCTTTCTGGTTACACGCACCTGGACCGGGTTGAGGAAGCCTTGTTTC
TATTTAGAGAATTGTTACTATCTGGGGTAGAACCAAATTATGTGACAATTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCGGATTTACAACACGGGAGAGAATTTCAT
TGCTATATTACTAAACGTGAAGATTTAGGGGATTACTTGCTATTGTGGAATGCTTTAGTGGATATGTACGCAAGATCAGGCAAGGTTTTAGAGGCAAAAAGAGTGTTCGA
TTCGTTAAGCAAGAAGGACGAAGTGACGTATACTTCCTTGATTGCAGGCTATGGGATGCAAGGAGAGGGGAGCAAAGCACTAAGACTATTTCAAGAGATGAAAAGATTTC
AGATCAAACCAGATCATATAACCATGGTTGCTGTCCTATCAGCTTGTAGTCATTCAGGACTTTTGAAACAAGGTGAACTTTTATTTGCAGAGATGCAAAGTGTCCATGGT
TTAAGTCCCCATCTGGAACACTATGCTTGTATGGCAGACCTTTTTGGGAGGGTTGGTCTGTTGAATAAAGCAAAGGGGATTATCACAAGAATGCCTTATAGACCAACGTC
CGCTATGTGGGCCACTCTTATAGGAGCATGTTGCATCCACGGAAACACGAATATTGGGGAATGGGCTGCAGAGAAACTTCTCGAAATGCAACCTGAACATTCGGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCTTGGAGTAAGCTGGCAAAAATAAGGACTCTTATGAGAGATTCTGGTGTGGCAAAAGCTCCTGGTTGTTCCTGG
GTTGATGTTGGCTCGGGATTCGTCTCGTTCTTGGTTGGGGACACTTCTAATCCACAAGCTCTTGAAGCTAATCTCTTGTTAGACAATTTGAACGATGTAATGAAACATGG
TAGTCTAGTGACAAAAGATTCTCACGATATCGACAATGACAGTTTT
mRNA sequence
ATGTCATCTTCCTCTTCTCAATATATCTTCAGAGGTCTCTCTCTATATAAGCTCCAAACATTCATACCTAAACAATGGAAAAACAAACCTGTGAGCAATGGTAGTGAATT
TATGATTGGTCCCGTTTTTTCTTCCCTTAAAAGCTTTGCCTGTCATGGTCAGTTGTCTAAAGCATTTGAAGCTTTCTCCCTCATTCAATTGCGCACTGGATATAATGATT
CATTTGACCTCATCTTGCAATCCTTCTCCATTCTTCTTGTATCATGCACCAATGCTAGCTCACTCCCACCTGGTAGGCAACTTCATGGTCGCATTATCTTGTCAGGTCTT
GAGCAAGATTCGATTTTGGTTCCTAAGCTCGTCACATTCTACTCGAGCTTTAAACTTCTGGCCGAGGCTCATACCCTTGTTGAGAATTCTAATATTTTTCACCCTTGTCC
TTGGAATTTACTCATCACATCATATGTCAGAAATGGACTTCATGAGGCAGCCATTTTAGTTTATAAACAGATGCTGAGTAGAGGAGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTGAAGGCTTGTGGTGAAACACAGAATTTGGGTTTTGGTTTAGAGGTCCACAAGTATATTAATGCTTGGTCAACTGAATGGAGTTTGTTTGTTCAGAATGCT
CTAATTTCTATGTATGGAAGATGTGGAGAGGTGGACACTGCACGTAACTTGTTTGACAATATGCTCGATCGAGATGCAGTTTCGTGGAATTCGATGATCTCTTGTTATGC
CTCCAAGGGCATGTGGAAGGAGGCATTTGAACTGTTCGACAATATGCAGAGTAAGTGTATCGAAATTAACATTGTGACTTGGAACATTATTGCGGGAGGTTGCTTGCGGG
TCGGTAATTTTGTTGGGGCACTTAAGTTATTGTCTCAAATGAGAAATTTTGGTGCTCATTTGGACTATGTAGCAATGATAATAGGTTTGGGTGCTTGTTCTCACATTGGT
GCCATTAGATTGGGAAAAGAAATCCATGGCTTTACCATCAGACATTGTTATCATAGGTTATCCATTGTTCAAAATGCATTAGTTACCATGTATGCTCGTTGTAAAGACAT
TATGAATGCATATATTTTGTTTCGATTAAACAGCGACAAAAGTATAATCACGTGGAACTCCATGCTTTCTGGTTACACGCACCTGGACCGGGTTGAGGAAGCCTTGTTTC
TATTTAGAGAATTGTTACTATCTGGGGTAGAACCAAATTATGTGACAATTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCGGATTTACAACACGGGAGAGAATTTCAT
TGCTATATTACTAAACGTGAAGATTTAGGGGATTACTTGCTATTGTGGAATGCTTTAGTGGATATGTACGCAAGATCAGGCAAGGTTTTAGAGGCAAAAAGAGTGTTCGA
TTCGTTAAGCAAGAAGGACGAAGTGACGTATACTTCCTTGATTGCAGGCTATGGGATGCAAGGAGAGGGGAGCAAAGCACTAAGACTATTTCAAGAGATGAAAAGATTTC
AGATCAAACCAGATCATATAACCATGGTTGCTGTCCTATCAGCTTGTAGTCATTCAGGACTTTTGAAACAAGGTGAACTTTTATTTGCAGAGATGCAAAGTGTCCATGGT
TTAAGTCCCCATCTGGAACACTATGCTTGTATGGCAGACCTTTTTGGGAGGGTTGGTCTGTTGAATAAAGCAAAGGGGATTATCACAAGAATGCCTTATAGACCAACGTC
CGCTATGTGGGCCACTCTTATAGGAGCATGTTGCATCCACGGAAACACGAATATTGGGGAATGGGCTGCAGAGAAACTTCTCGAAATGCAACCTGAACATTCGGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCTTGGAGTAAGCTGGCAAAAATAAGGACTCTTATGAGAGATTCTGGTGTGGCAAAAGCTCCTGGTTGTTCCTGG
GTTGATGTTGGCTCGGGATTCGTCTCGTTCTTGGTTGGGGACACTTCTAATCCACAAGCTCTTGAAGCTAATCTCTTGTTAGACAATTTGAACGATGTAATGAAACATGG
TAGTCTAGTGACAAAAGATTCTCACGATATCGACAATGACAGTTTT
Protein sequence
MSSSSSQYIFRGLSLYKLQTFIPKQWKNKPVSNGSEFMIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPPGRQLHGRIILSGL
EQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVRNGLHEAAILVYKQMLSRGVRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFVQNA
LISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCIEINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIG
AIRLGKEIHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDRVEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFH
CYITKREDLGDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKPDHITMVAVLSACSHSGLLKQGELLFAEMQSVHG
LSPHLEHYACMADLFGRVGLLNKAKGIITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKAPGCSW
VDVGSGFVSFLVGDTSNPQALEANLLLDNLNDVMKHGSLVTKDSHDIDNDSF