; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G018450 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G018450
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr01:19442701..19464036
RNA-Seq ExpressionLsi01G018450
SyntenyLsi01G018450
Gene Ontology termsGO:0070897 - transcription preinitiation complex assembly (biological process)
GO:0017025 - TBP-class protein binding (molecular function)
InterPro domainsIPR000812 - Transcription factor TFIIB
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013137 - Zinc finger, TFIIB-type
IPR013150 - Transcription factor TFIIB, cyclin-like domain
IPR013763 - Cyclin-like
IPR023486 - Transcription factor TFIIB, conserved site
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036915 - Cyclin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150613.2 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus]0.0e+0091.51Show/hide
Query:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP
        MMSNVLKPS VF NVLD LAWCLTNRK IFSKI+SKKHY  SSSFL FSTSPPS LS  Q+L+NVT CV FLQSCA H+N+NKGKQLHSLMITYGFSPSP
Subjt:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP

Query:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF
        PSITSLINMYSKCGQM EAILVF+DPCHERNVFAYNAIISGFV+NGL+SKGFQFYK+MRLEGVMPDKYTFPCVVRTCC+VMEVKKIHGCL KMGLELDVF
Subjt:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF

Query:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV
        VGSALVNTYLK GSMEDAQKVF ELSIRDVVLWNAMINGYA+IGCLDEALEVFRRMH++G+APSRFTITGILS+FA RGDLDNGKTVHGIVMKMGYDSGV
Subjt:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV

Query:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD
        +VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMI+NGLGKD
Subjt:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD

Query:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ
        DENGA+D+LLV+NAVMDMYAKCGSMNNALKIFD +S KDVASWNIMIMGYGMHGY LEAL MFS+MC A FKP+EVTLVGVLSACNHAGFVS GRLFLAQ
Subjt:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ

Query:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
        MES F VIPTIEHYTCVIDMLGRAGHLEDAYE+ QKMPIQANPVVWRALLGACRLHGNAELAEIAARQV+QLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
Subjt:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR

Query:  KTMKEQNVKKTPGCSWIELKDGVH
        KTMKEQNVKKTPGCSWIELKDGVH
Subjt:  KTMKEQNVKKTPGCSWIELKDGVH

XP_008455782.1 PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo]0.0e+0091.03Show/hide
Query:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP
        MM NVLKPS VF NVLD LAWCLTNRK IFSKI+SKKHY  SSSFL FSTSP SKLS  Q+LDNVT C+ FLQSCA HKN+NKGKQ HSLMITYGFS SP
Subjt:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP

Query:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF
        PSITSLINMYSKCGQM EAILVF+DPCHERNVFAYNAIISGFVANGL+SKGFQFY++MRLEGVMPDKYTFPCVVRTCC+V EVKKIHGC  KMGLELDVF
Subjt:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF

Query:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV
        VGSALVNTYLK GSMEDAQKVF EL +RDVVLWNAMINGYA+IGCLDEALEVFRRMH+EGIAP RFTITGILSIFA RGDLDNGKTVHGIV+KMGYDSGV
Subjt:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV

Query:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD
        AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFDKMLGSGILPDLVTITTVLPACSHLAALM GREIHGYMI+NG GKD
Subjt:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD

Query:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ
        DENGA+DDL V+NAVMDMYAKCGSMNNALKIFD +SNKDVASWNIMIMGYGMHGY LEALDMFSRMC A FKPDEVTLVGVLSACNHAGFVSQGRLF AQ
Subjt:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ

Query:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
        MES F VIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAE+AARQV+QLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
Subjt:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR

Query:  KTMKEQNVKKTPGCSWIELKDGVH
        KTMKEQNVKKTPGCSWIELKDG+H
Subjt:  KTMKEQNVKKTPGCSWIELKDGVH

XP_022966216.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita maxima]0.0e+0085.76Show/hide
Query:  VMMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPS
        V M N+LKPS  F NV+DRLAWC+TN+K I +KI+SKKHY  SSSFL  STSPPSK S F+LL+NVTTC+ FLQSCA  KN+NKGKQLHS+MITYGFS S
Subjt:  VMMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPS

Query:  PPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDV
        P SITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNAIISGFVANGL+S GFQFYKQMRLEGVMPDKYTFPCVVR+CC+VMEVKKIHGCLFKMGLELD+
Subjt:  PPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDV

Query:  FVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSG
        FVGSALVNTYLK+GSMEDAQ+VFEEL IRDVVLWNAMINGYAQIGCLDEALE+F+RMHIEG++PSRFTITGILSIFAL+G LDNG+TVHGIVMKMGYDSG
Subjt:  FVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSG

Query:  VAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGK
        VAVSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDKMLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMIVNGLG+
Subjt:  VAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGK

Query:  DDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLA
        D +NG +DDLLVNNAVMDMYAKCGSMNNA K+F+ ++NKDVASWNIMIMGYGMHGYG++ALDMFS MC A+ KPDEVT VGVLSACNHAGFV QGR+FLA
Subjt:  DDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLA

Query:  QMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEV
        QME  F VIPTIEHYTCVIDMLGRAGHLEDAY++AQ MPIQANPVVWRALLGACRLHGNAELAEIAA++VMQL+PEHCGSYVLMSNVYGV+GRYEEVLEV
Subjt:  QMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEV

Query:  RKTMKEQNVKKTPGCSWIELKDGVH
        R TMKEQ+V+KTPGCSWIELKDGVH
Subjt:  RKTMKEQNVKKTPGCSWIELKDGVH

XP_022966217.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X2 [Cucurbita maxima]0.0e+0085.87Show/hide
Query:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP
        M N+LKPS  F NV+DRLAWC+TN+K I +KI+SKKHY  SSSFL  STSPPSK S F+LL+NVTTC+ FLQSCA  KN+NKGKQLHS+MITYGFS SP 
Subjt:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP

Query:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV
        SITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNAIISGFVANGL+S GFQFYKQMRLEGVMPDKYTFPCVVR+CC+VMEVKKIHGCLFKMGLELD+FV
Subjt:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV

Query:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA
        GSALVNTYLK+GSMEDAQ+VFEEL IRDVVLWNAMINGYAQIGCLDEALE+F+RMHIEG++PSRFTITGILSIFAL+G LDNG+TVHGIVMKMGYDSGVA
Subjt:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA

Query:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD
        VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDKMLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMIVNGLG+D 
Subjt:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD

Query:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM
        +NG +DDLLVNNAVMDMYAKCGSMNNA K+F+ ++NKDVASWNIMIMGYGMHGYG++ALDMFS MC A+ KPDEVT VGVLSACNHAGFV QGR+FLAQM
Subjt:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM

Query:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK
        E  F VIPTIEHYTCVIDMLGRAGHLEDAY++AQ MPIQANPVVWRALLGACRLHGNAELAEIAA++VMQL+PEHCGSYVLMSNVYGV+GRYEEVLEVR 
Subjt:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK

Query:  TMKEQNVKKTPGCSWIELKDGVH
        TMKEQ+V+KTPGCSWIELKDGVH
Subjt:  TMKEQNVKKTPGCSWIELKDGVH

XP_038881250.1 pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida]0.0e+0092.15Show/hide
Query:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP
        M+SNVLKPS VF +VL+ LAW LTNRKIIF+KI+S KHY   SSFL FSTS PSKLSVFQLLDNVTTC+ FLQSCA HKN+NKGKQLHSLMITYGFSPSP
Subjt:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP

Query:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF
        PSITSLINMYSKCGQM EAILVFHDPCHERNVFAYNAIISGFVANGL+SKGFQFY+QMRLEGVMPDKYTFPCVVRTCC+VMEVKKIHGCLFKMGLELDVF
Subjt:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF

Query:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV
        VGSALVNTYLKIGSME+AQKVFEE+SIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGIL IFA RGDLDNG+TVHGIVMKMGYDSGV
Subjt:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV

Query:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD
        AVSNALIDMYGKCKHI DALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGS ILPDLVTITTVLPACSHLAA MHGREIHGYMIVNGLGKD
Subjt:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD

Query:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ
        DENG +DDLLVNNAVMDMYAKCGSM NALK+FD +SNKDVASWNIMIMGYGMHGYG++ALDMFSRMC   FKPDEVTLVGVLSACNH GFVSQGRL LAQ
Subjt:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ

Query:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
        MESKF VIPTIEHYTCVIDMLGRAGHLEDAY++ QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGR+EEVLEVR
Subjt:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR

Query:  KTMKEQNVKKTPGCSWIELKDGVH
        KTMKEQNVKKTPGCSWIELKDGVH
Subjt:  KTMKEQNVKKTPGCSWIELKDGVH

TrEMBL top hitse value%identityAlignment
A0A1S4E0R7 pentatricopeptide repeat-containing protein At3g14730-like0.0e+0091.03Show/hide
Query:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP
        MM NVLKPS VF NVLD LAWCLTNRK IFSKI+SKKHY  SSSFL FSTSP SKLS  Q+LDNVT C+ FLQSCA HKN+NKGKQ HSLMITYGFS SP
Subjt:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP

Query:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF
        PSITSLINMYSKCGQM EAILVF+DPCHERNVFAYNAIISGFVANGL+SKGFQFY++MRLEGVMPDKYTFPCVVRTCC+V EVKKIHGC  KMGLELDVF
Subjt:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF

Query:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV
        VGSALVNTYLK GSMEDAQKVF EL +RDVVLWNAMINGYA+IGCLDEALEVFRRMH+EGIAP RFTITGILSIFA RGDLDNGKTVHGIV+KMGYDSGV
Subjt:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV

Query:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD
        AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFDKMLGSGILPDLVTITTVLPACSHLAALM GREIHGYMI+NG GKD
Subjt:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD

Query:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ
        DENGA+DDL V+NAVMDMYAKCGSMNNALKIFD +SNKDVASWNIMIMGYGMHGY LEALDMFSRMC A FKPDEVTLVGVLSACNHAGFVSQGRLF AQ
Subjt:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ

Query:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
        MES F VIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAE+AARQV+QLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
Subjt:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR

Query:  KTMKEQNVKKTPGCSWIELKDGVH
        KTMKEQNVKKTPGCSWIELKDG+H
Subjt:  KTMKEQNVKKTPGCSWIELKDGVH

A0A6J1DFN8 pentatricopeptide repeat-containing protein At3g14730-like0.0e+0087.66Show/hide
Query:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP
        M+SN+LKP  VF +V+DRLAWCLT +KI FSKI+SKKHYL  SSFL FSTSPPSK SVFQLL+NVTT + FLQSCA HKN+N+GKQLHSLMITYGFS SP
Subjt:  MMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSP

Query:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF
         SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNA+ISGFVANGL+S G QFYKQMRLEGVMPDKYTFPCVVR+CC+ MEVKKIHGCLFKMGLELD+F
Subjt:  PSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVF

Query:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV
        VGSALVNTYLKIG ME+AQKVFEELSIRDVVLWNA+INGYAQIGCLDEALEVFRRM IEGI PSRFT+TGILSIFALRGDL+NG+TVH IV KMGY+ GV
Subjt:  VGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGV

Query:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD
        AV NALIDMYGKCKHI DAL+IF+MI+EKDIFSWNSIISVHEQ GDHDGTLRLFDKMLGSGILPDLVT+TTVLPACSHLAALMHGREIHGYMIVNG GKD
Subjt:  AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKD

Query:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ
           G +DDLLVNNAVMDMYAKCGSM NAL +FDL+SNKDVASWNI+IMGYGMHGYG+EALD+FS MC AR KPDEVT VGVLSACNHAGFVSQGRLFLAQ
Subjt:  DENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQ

Query:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
        MES+F VIPTIEHYTCVIDMLGRAGHL+DAYE+AQKMPIQANP+VWRALLGACRLHGNAELAE+AAR+VMQLEPEHCGSYVLMSNVYGV+GRY EVLEVR
Subjt:  MESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR

Query:  KTMKEQNVKKTPGCSWIELKDGVH
        KTMKEQNVKKTPGCSWIELKDGVH
Subjt:  KTMKEQNVKKTPGCSWIELKDGVH

A0A6J1EGD0 pentatricopeptide repeat-containing protein At3g147300.0e+0085.71Show/hide
Query:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP
        M N+LKPS  F NV+DRLAWC+TN+K I +KI+SKKHY  SSSFL  STS PSK S F+LLDNVTTC+ FLQSCA  KN+NKGKQLHS+MITYGFS SP 
Subjt:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP

Query:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV
        SITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNAIISGFVANGL+S GFQFYKQMRLEGVMPDKYTFPCVVR+CC+VMEVKKIHGCLFKMGLELD+FV
Subjt:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV

Query:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA
        GSALVNTYLK+GSMEDAQ+VFEEL IRDVVLWNAMINGYAQIGCLDEALE+FRRMHIEG++PSRFTITGILSIFAL+G LDNG+TVHGIVMKMGYDSGVA
Subjt:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA

Query:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD
        VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDKMLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMIVNGLG+D 
Subjt:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD

Query:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM
        +NG +DDLLVNNAVMDMYAKCGSM NA K+F+ ++NKDVASWNIMIMGYGMHGYG++ALDM S MC A+ KPDEVT VGVLSACNHAGFV QGR FLAQM
Subjt:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM

Query:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK
        E  F VIPTIEHYTCVIDMLGRAGHLEDAY++AQ MPIQANPVVWRALLGACRLHGNAELAEIAA++VMQL+PEHCGSYVLMSNVYGV+GRYEEVLEVR 
Subjt:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK

Query:  TMKEQNVKKTPGCSWIELKDGVH
        TMKEQ+V+KTPGCSWIELKDGVH
Subjt:  TMKEQNVKKTPGCSWIELKDGVH

A0A6J1HME0 pentatricopeptide repeat-containing protein At3g14730-like isoform X20.0e+0085.87Show/hide
Query:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP
        M N+LKPS  F NV+DRLAWC+TN+K I +KI+SKKHY  SSSFL  STSPPSK S F+LL+NVTTC+ FLQSCA  KN+NKGKQLHS+MITYGFS SP 
Subjt:  MSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPP

Query:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV
        SITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNAIISGFVANGL+S GFQFYKQMRLEGVMPDKYTFPCVVR+CC+VMEVKKIHGCLFKMGLELD+FV
Subjt:  SITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFV

Query:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA
        GSALVNTYLK+GSMEDAQ+VFEEL IRDVVLWNAMINGYAQIGCLDEALE+F+RMHIEG++PSRFTITGILSIFAL+G LDNG+TVHGIVMKMGYDSGVA
Subjt:  GSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVA

Query:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD
        VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDKMLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMIVNGLG+D 
Subjt:  VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDD

Query:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM
        +NG +DDLLVNNAVMDMYAKCGSMNNA K+F+ ++NKDVASWNIMIMGYGMHGYG++ALDMFS MC A+ KPDEVT VGVLSACNHAGFV QGR+FLAQM
Subjt:  ENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQM

Query:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK
        E  F VIPTIEHYTCVIDMLGRAGHLEDAY++AQ MPIQANPVVWRALLGACRLHGNAELAEIAA++VMQL+PEHCGSYVLMSNVYGV+GRYEEVLEVR 
Subjt:  ESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRK

Query:  TMKEQNVKKTPGCSWIELKDGVH
        TMKEQ+V+KTPGCSWIELKDGVH
Subjt:  TMKEQNVKKTPGCSWIELKDGVH

A0A6J1HNR1 pentatricopeptide repeat-containing protein At3g14730-like isoform X10.0e+0085.76Show/hide
Query:  VMMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPS
        V M N+LKPS  F NV+DRLAWC+TN+K I +KI+SKKHY  SSSFL  STSPPSK S F+LL+NVTTC+ FLQSCA  KN+NKGKQLHS+MITYGFS S
Subjt:  VMMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPS

Query:  PPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDV
        P SITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNAIISGFVANGL+S GFQFYKQMRLEGVMPDKYTFPCVVR+CC+VMEVKKIHGCLFKMGLELD+
Subjt:  PPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDV

Query:  FVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSG
        FVGSALVNTYLK+GSMEDAQ+VFEEL IRDVVLWNAMINGYAQIGCLDEALE+F+RMHIEG++PSRFTITGILSIFAL+G LDNG+TVHGIVMKMGYDSG
Subjt:  FVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSG

Query:  VAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGK
        VAVSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDKMLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMIVNGLG+
Subjt:  VAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGK

Query:  DDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLA
        D +NG +DDLLVNNAVMDMYAKCGSMNNA K+F+ ++NKDVASWNIMIMGYGMHGYG++ALDMFS MC A+ KPDEVT VGVLSACNHAGFV QGR+FLA
Subjt:  DDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLA

Query:  QMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEV
        QME  F VIPTIEHYTCVIDMLGRAGHLEDAY++AQ MPIQANPVVWRALLGACRLHGNAELAEIAA++VMQL+PEHCGSYVLMSNVYGV+GRYEEVLEV
Subjt:  QMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEV

Query:  RKTMKEQNVKKTPGCSWIELKDGVH
        R TMKEQ+V+KTPGCSWIELKDGVH
Subjt:  RKTMKEQNVKKTPGCSWIELKDGVH

SwissProt top hitse value%identityAlignment
P48512 Transcription initiation factor IIB-17.6e-13862.01Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN
        MSDA+C+DCK++TE+V DHSAGDT+CSECGLVLESHSIDETSEWRTFANES ++DP RVGGPTNPLLAD  L+TVIAKPNG++G+FLSSSLGRWQNR SN
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN

Query:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT
         DRGLI AFKTIATMS+                     RLGLVATIK                                                     
Subjt:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT

Query:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA
                                               DRANE+YKR+EDQKSSRGRNQDAL AACLYIACRQEDKPRT+K            EIC +A
Subjt:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA

Query:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG
        NGATKKEIGRAK+YIVK LGLE GQSV++GTIHAGDFMRRFCSNL M+N AVKAAQEAVQKSEEFDIRRSPISIAA VIYIITQLSDDKK LKDIS ATG
Subjt:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG

Query:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        VAEGTIRNSYKDLYPH+SKI PSWYAKEEDLKNL SP
Subjt:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

P48513 Transcription initiation factor IIB3.0e-15068.04Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPN-GTTGEFLSSSLGRWQNRGS
        MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDP RVGGP+NPLL DGGLSTVIAKPN G  GEFLSSSLGRWQNRGS
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPN-GTTGEFLSSSLGRWQNRGS

Query:  NPDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKI
        NPDR LI AFKTIATMSD                     RLGLVATIK                                                    
Subjt:  NPDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKI

Query:  TQSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSV
                                                DRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVK            EICSV
Subjt:  TQSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSV

Query:  ANGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVAT
        ANGATKKEIGRAKEYIVKQLGLE G +VEMGTIHAGDFMRRFCSNL MNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDIS+AT
Subjt:  ANGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVAT

Query:  GVAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        GVAEGTIRNSYKDLYPHVSKI+P+WYAKEEDLKNLCSP
Subjt:  GVAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

Q8W0W3 Transcription initiation factor IIB3.7e-14062.24Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN
        MSD+FC DCK+ TEV FDHSAGDTVC+ECGLVLE+HS+DETSEWRTFANES DNDPVRVGGPTNPLL DGGLSTVIAKPNG  GEFLSSSLGRWQNRGSN
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN

Query:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT
        PDR LILAF+TIA M+D                     RLGLVATIK                                                     
Subjt:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT

Query:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA
                                               DRANEIYK+VED KS RGRNQDA+LAACLYIACRQED+PRTVK            EICSVA
Subjt:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA

Query:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG
        NGATKKEIGRAKE+IVKQL +E GQS+EMGTIHAGDF+RRFCS LGMNNQAVKAAQEAVQ+SEE DIRRSPISIAAAVIY+ITQLSDDKKPLKDIS+ATG
Subjt:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG

Query:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        VAEGTIRNSYKDLYP+ S+++P+ YAKEEDLKNLC+P
Subjt:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

Q9LUC2 Pentatricopeptide repeat-containing protein At3g147301.6e-18855.63Show/hide
Query:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEG
        NV TC+  LQ CA  K+   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+LVF     ER+VF YNA+ISGFV NG      + Y++MR  G
Subjt:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEG

Query:  VMPDKYTFPCVVR--TCCDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIR-DVVLWNAMINGYAQIGCLDEALEVFRRMHIE
        ++PDKYTFP +++     ++ +VKK+HG  FK+G + D +VGS LV +Y K  S+EDAQKVF+EL  R D VLWNA++NGY+QI   ++AL VF +M  E
Subjt:  VMPDKYTFPCVVR--TCCDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIR-DVVLWNAMINGYAQIGCLDEALEVFRRMHIE

Query:  GIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLG
        G+  SR TIT +LS F + GD+DNG+++HG+ +K G  S + VSNALIDMYGK K + +A  IFE ++E+D+F+WNS++ VH+ CGDHDGTL LF++ML 
Subjt:  GIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLG

Query:  SGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEA
        SGI PD+VT+TTVLP C  LA+L  GREIHGYMIV+GL     N    +  ++N++MDMY KCG + +A  +FD +  KD ASWNIMI GYG+   G  A
Subjt:  SGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEA

Query:  LDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNA
        LDMFS MC A  KPDE+T VG+L AC+H+GF+++GR FLAQME+ ++++PT +HY CVIDMLGRA  LE+AYE+A   PI  NPVVWR++L +CRLHGN 
Subjt:  LDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNA

Query:  ELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHKIY
        +LA +A +++ +LEPEHCG YVLMSNVY   G+YEEVL+VR  M++QNVKKTPGCSWI LK+GVH  +
Subjt:  ELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHKIY

Q9SS44 Transcription initiation factor IIB-22.7e-15167.28Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN
        MSDAFCSDCKR TEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGL+TVI+KPNG++G+FLSSSLGRWQNRGSN
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN

Query:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT
        PDRGLI+AFKTIATM+D                     RLGLVATIK                                                     
Subjt:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT

Query:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA
                                               DRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVK            EICSVA
Subjt:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA

Query:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG
        NGATKKEIGRAKEYIVKQLGLETGQ VEMGTIHAGDFMRRFCSNLGM NQ VKAAQE+VQKSEEFDIRRSPISIAAAVIYIITQLSD+KKPL+DISVATG
Subjt:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG

Query:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        VAEGTIRNSYKDLYPH+SKI+P+WYAKEEDLKNL SP
Subjt:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

Arabidopsis top hitse value%identityAlignment
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-11436.33Show/hide
Query:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGV
        +V T    L++C    ++ +GK++H  ++ YG+      + +LI MY KCG ++ A L+F D    R++ ++NA+ISG+  NG+  +G + +  MR   V
Subjt:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGV

Query:  MPDKYTFPCVVRTC---CDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEG
         PD  T   V+  C    D    + IH  +   G  +D+ V ++L   YL  GS  +A+K+F  +  +D+V W  MI+GY      D+A++ +R M  + 
Subjt:  MPDKYTFPCVVRTC---CDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEG

Query:  IAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIIS---VHEQCGDHDGTLRLFDKM
        + P   T+  +LS  A  GDLD G  +H + +K    S V V+N LI+MY KCK I  AL IF  I  K++ SW SII+   ++ +C +      +F + 
Subjt:  IAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIIS---VHEQCGDHDGTLRLFDKM

Query:  LGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGL
        +   + P+ +T+T  L AC+ + ALM G+EIH +++  G+G DD         + NA++DMY +CG MN A   F+    KDV SWNI++ GY   G G 
Subjt:  LGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGL

Query:  EALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHG
          +++F RM  +R +PDE+T + +L  C+ +  V QG ++ ++ME  + V P ++HY CV+D+LGRAG L++A++  QKMP+  +P VW ALL ACR+H 
Subjt:  EALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHG

Query:  NAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVH
          +L E++A+ + +L+ +  G Y+L+ N+Y   G++ EV +VR+ MKE  +    GCSW+E+K  VH
Subjt:  NAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVH

AT2G41630.1 transcription factor IIB5.4e-13962.01Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN
        MSDA+C+DCK++TE+V DHSAGDT+CSECGLVLESHSIDETSEWRTFANES ++DP RVGGPTNPLLAD  L+TVIAKPNG++G+FLSSSLGRWQNR SN
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN

Query:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT
         DRGLI AFKTIATMS+                     RLGLVATIK                                                     
Subjt:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT

Query:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA
                                               DRANE+YKR+EDQKSSRGRNQDAL AACLYIACRQEDKPRT+K            EIC +A
Subjt:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA

Query:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG
        NGATKKEIGRAK+YIVK LGLE GQSV++GTIHAGDFMRRFCSNL M+N AVKAAQEAVQKSEEFDIRRSPISIAA VIYIITQLSDDKK LKDIS ATG
Subjt:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG

Query:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        VAEGTIRNSYKDLYPH+SKI PSWYAKEEDLKNL SP
Subjt:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

AT3G10330.1 Cyclin-like family protein1.9e-15267.28Show/hide
Query:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN
        MSDAFCSDCKR TEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGL+TVI+KPNG++G+FLSSSLGRWQNRGSN
Subjt:  MSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSN

Query:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT
        PDRGLI+AFKTIATM+D                     RLGLVATIK                                                     
Subjt:  PDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLVATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKIT

Query:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA
                                               DRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVK            EICSVA
Subjt:  QSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVA

Query:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG
        NGATKKEIGRAKEYIVKQLGLETGQ VEMGTIHAGDFMRRFCSNLGM NQ VKAAQE+VQKSEEFDIRRSPISIAAAVIYIITQLSD+KKPL+DISVATG
Subjt:  NGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATG

Query:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP
        VAEGTIRNSYKDLYPH+SKI+P+WYAKEEDLKNL SP
Subjt:  VAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP

AT3G14730.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-18955.63Show/hide
Query:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEG
        NV TC+  LQ CA  K+   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+LVF     ER+VF YNA+ISGFV NG      + Y++MR  G
Subjt:  NVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEG

Query:  VMPDKYTFPCVVR--TCCDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIR-DVVLWNAMINGYAQIGCLDEALEVFRRMHIE
        ++PDKYTFP +++     ++ +VKK+HG  FK+G + D +VGS LV +Y K  S+EDAQKVF+EL  R D VLWNA++NGY+QI   ++AL VF +M  E
Subjt:  VMPDKYTFPCVVR--TCCDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIR-DVVLWNAMINGYAQIGCLDEALEVFRRMHIE

Query:  GIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLG
        G+  SR TIT +LS F + GD+DNG+++HG+ +K G  S + VSNALIDMYGK K + +A  IFE ++E+D+F+WNS++ VH+ CGDHDGTL LF++ML 
Subjt:  GIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLG

Query:  SGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEA
        SGI PD+VT+TTVLP C  LA+L  GREIHGYMIV+GL     N    +  ++N++MDMY KCG + +A  +FD +  KD ASWNIMI GYG+   G  A
Subjt:  SGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEA

Query:  LDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNA
        LDMFS MC A  KPDE+T VG+L AC+H+GF+++GR FLAQME+ ++++PT +HY CVIDMLGRA  LE+AYE+A   PI  NPVVWR++L +CRLHGN 
Subjt:  LDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNA

Query:  ELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHKIY
        +LA +A +++ +LEPEHCG YVLMSNVY   G+YEEVL+VR  M++QNVKKTPGCSWI LK+GVH  +
Subjt:  ELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHKIY

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-11338.91Show/hide
Query:  KNVNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCC
        ++V+ G+QLH  ++  GF        SL+  Y K  +++ A  VF D   ER+V ++N+II+G+V+NGL+ KG   + QM + G+  D  T   V   C 
Subjt:  KNVNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCC

Query:  DVMEV---KKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIF
        D   +   + +H    K     +    + L++ Y K G ++ A+ VF E+S R VV + +MI GYA+ G   EA+++F  M  EGI+P  +T+T +L+  
Subjt:  DVMEV---KKIHGCLFKMGLELDVFVGSALVNTYLKIGSMEDAQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIF

Query:  ALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKML-GSGILPDLVTITTVLP
        A    LD GK VH  + +      + VSNAL+DMY KC  + +A ++F  +  KDI SWN+II  + +    +  L LF+ +L      PD  T+  VLP
Subjt:  ALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKML-GSGILPDLVTITTVLP

Query:  ACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPD
        AC+ L+A   GREIHGY++         NG   D  V N+++DMYAKCG++  A  +FD +++KD+ SW +MI GYGMHG+G EA+ +F++M  A  + D
Subjt:  ACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSNKDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPD

Query:  EVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEP
        E++ V +L AC+H+G V +G  F   M  +  + PT+EHY C++DML R G L  AY   + MPI  +  +W ALL  CR+H + +LAE  A +V +LEP
Subjt:  EVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEP

Query:  EHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVH
        E+ G YVLM+N+Y    ++E+V  +RK + ++ ++K PGCSWIE+K  V+
Subjt:  EHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTGATGATGTCGAATGTGTTAAAACCAAGCATAGTTTTTTACAACGTTCTCGATAGATTAGCTTGGTGCTTAACAAACAGAAAGATTATCTTCTCCAAAATTAT
CTCCAAGAAGCACTACCTTTCTTCTTCTTCCTTTCTTTGTTTCTCTACTTCACCACCTTCCAAGCTTTCAGTTTTTCAACTGCTGGATAATGTAACCACATGCGTTGTTT
TTCTACAATCATGTGCTCACCACAAGAATGTCAACAAAGGAAAACAGCTTCACTCCCTAATGATCACCTATGGTTTTTCTCCTTCACCTCCATCCATCACTAGCTTAATC
AACATGTACTCCAAATGTGGTCAAATGGAGGAGGCCATTTTGGTTTTTCATGATCCATGTCATGAGCGTAATGTGTTTGCATATAATGCTATAATTTCTGGGTTTGTCGC
TAATGGTCTTTCTTCAAAAGGGTTTCAATTTTATAAGCAAATGAGGTTAGAGGGTGTAATGCCTGATAAGTACACTTTTCCATGTGTAGTTAGAACTTGTTGTGATGTTA
TGGAGGTGAAGAAGATTCATGGATGTTTGTTTAAAATGGGGTTGGAGTTGGATGTGTTTGTTGGTAGTGCTTTGGTTAATACTTACTTAAAGATTGGCTCAATGGAGGAT
GCACAAAAAGTGTTTGAAGAACTATCAATAAGAGATGTCGTACTTTGGAATGCAATGATCAATGGGTATGCCCAGATTGGTTGCCTTGACGAGGCATTAGAGGTTTTCAG
AAGAATGCATATAGAAGGGATTGCACCTAGTAGGTTTACAATTACTGGCATTTTATCTATTTTTGCTTTAAGGGGAGACTTAGACAATGGGAAAACAGTTCACGGGATTG
TAATGAAAATGGGTTATGATTCAGGAGTTGCAGTTTCAAATGCGTTAATTGATATGTATGGGAAATGCAAACATATTGGAGATGCTTTAATAATTTTTGAGATGATAAAT
GAGAAGGATATTTTCTCATGGAATTCAATTATATCAGTTCATGAACAATGTGGTGATCATGATGGTACCTTGAGGCTTTTTGATAAGATGTTAGGTTCAGGGATTCTACC
TGATTTGGTAACCATCACAACCGTACTTCCAGCTTGCTCTCATTTGGCTGCCCTCATGCATGGTAGAGAAATTCATGGATATATGATTGTTAATGGATTGGGAAAGGATG
ATGAAAATGGAGCTTTAGATGATTTACTTGTAAATAATGCTGTTATGGATATGTATGCAAAATGTGGAAGTATGAACAATGCCCTCAAGATTTTTGATCTACTGAGCAAT
AAGGACGTGGCATCGTGGAATATCATGATTATGGGTTATGGTATGCATGGATATGGTTTGGAGGCATTGGATATGTTTTCGCGAATGTGTGCGGCCAGATTTAAGCCGGA
TGAAGTTACGCTTGTTGGAGTTCTATCAGCATGCAATCATGCGGGCTTTGTGTCTCAAGGGCGTCTGTTTTTAGCTCAAATGGAATCTAAATTTAGTGTTATTCCAACTA
TTGAGCATTATACATGTGTAATTGATATGCTCGGTCGAGCTGGGCATCTGGAGGACGCGTATGAGGTTGCTCAGAAAATGCCTATTCAAGCCAATCCTGTTGTTTGGAGG
GCTTTATTGGGAGCATGCCGACTTCATGGGAATGCAGAGTTGGCCGAAATTGCAGCACGACAAGTAATGCAACTTGAACCAGAGCATTGTGGGAGTTATGTATTGATGTC
CAATGTTTATGGAGTTATAGGTCGATATGAAGAGGTGTTGGAGGTTAGAAAAACAATGAAGGAACAAAATGTCAAGAAGACACCTGGTTGTAGTTGGATTGAACTCAAGG
ATGGGGTGCACAAGATCTATTGCGAGCTGACTTCCTTCCATTCTCTTCACTTCCCTGAGTACAATGGCTCCATTGGTAGTCGTCTCTCTCCTGTTTCGTTTCTCAGTGTT
GATCAACATGACAACACTTCCATTACTCCACAAATGACAATGAAAAACAAATTCGTTCCACTTTTATGGCTCTTCAATCTTCTGACTCATCAATCCAGAATTAAAGTGCT
TCAAGATCATGTGCATTTTCCCAATGGTCCGGCTGTGGACTGTGTCGGGTTGAGTGGTGGTTTGGCTTTATTATGGACCACCGATGTCACCATTGATCTCTTCACACTCT
CTAAATCACATATTCACATGAAAATTACTAACAATAACCATATCCTCCGTCTTAATGGGTTCTATGGCGAGCCAAAACACTCTGATAAACACTTCTCATGGACTTTATTG
AGAAGGTTACGTGGAATGTACTCACTCCCCTGGATTGTGGTAGGGGATTTTAATGAAACCCTTCAGGCTTATGAGAAAGTTAGAGGGCGTGAACAATACAGTCCTGAGTT
GACAGGCCACTACAAGCATAAAATCGAGGAGACCAAAGCTCGTATTCAATACCTTCTTAGTGGTGAATACGGGTTAGAGCCGACGAGAAGGAACAGAAGGAGGAGGAGGA
GGTTCTCAACAATGTCCGATGCGTTTTGCTCTGACTGCAAGCGTCAGACGGAGGTTGTTTTCGACCATTCCGCTGGAGACACCGTGTGTTCCGAGTGTGGTCTTGTGCTT
GAATCCCACTCCATCGATGAGACCTCCGAGTGGAGGACTTTTGCCAATGAGTCTGGGGATAACGACCCGGTTCGTGTTGGTGGACCGACCAATCCGCTTTTGGCTGATGG
TGGCCTCTCTACCGTGATTGCGAAGCCTAATGGTACGACTGGGGAGTTCTTGTCCTCGTCTTTGGGTCGGTGGCAGAATCGTGGGTCGAATCCAGATCGAGGGCTCATTC
TTGCTTTCAAGACCATTGCTACTATGTCTGATAGCCGGAAACAGATCGAATTGGGACTGTGGAGGGGTGGGAATTTGGGAAATCTTACTAGTTTAAGGTTGGGCCTTGTT
GCAACCATTAAGGTTAGTTGTCAGTATAGAATGATCGTGCAGTATTCTCAAGAAAAAGGTCTTTACGAGGACAAATGCATAGGTAAGTTAAGGAAGACGCTTGTGGCATC
TGGAGGGGTTAATATTGGTGATGCAGTGAGGAAGTTGAGCATTAACAGAAGGAAAATTACTCAATCTTATAGTGAAATTTTTCCTCCGAACTGTAAGGTGCCTGAGACTA
ATTATTCAAACGAGGAAATTTCCTCTGGTCCAGACTCCAGAATAGTGTCATTCATATCTCTAGCTAATGATCGGGCCAATGAGATATATAAAAGAGTAGAAGATCAAAAA
TCTAGTAGAGGAAGAAATCAAGATGCTTTATTGGCTGCTTGCTTATACATTGCTTGTCGACAAGAAGATAAACCCCGCACGGTCAAGGGTATTGCTTATGGATCATTTGA
TATCTGGAGCATTGAAATTTGCTCTGTTGCGAATGGGGCAACAAAGAAGGAGATTGGCCGAGCAAAAGAATACATTGTGAAACAGTTGGGGTTGGAGACAGGTCAGTCTG
TGGAGATGGGAACAATACACGCTGGAGACTTTATGAGGCGTTTTTGTTCTAATCTTGGGATGAATAATCAAGCTGTTAAAGCTGCCCAAGAAGCTGTACAGAAATCTGAA
GAGTTTGATATTAGGAGAAGCCCAATTTCCATTGCAGCAGCAGTTATTTACATTATTACTCAGCTTTCAGATGATAAGAAGCCTCTGAAAGATATATCGGTAGCAACCGG
TGTTGCAGAAGGAACAATCAGAAATTCATATAAAGATCTCTACCCACACGTGTCGAAGATATTACCGAGTTGGTATGCTAAAGAGGAGGATCTTAAGAACCTTTGCAGTC
CTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTGATGATGTCGAATGTGTTAAAACCAAGCATAGTTTTTTACAACGTTCTCGATAGATTAGCTTGGTGCTTAACAAACAGAAAGATTATCTTCTCCAAAATTAT
CTCCAAGAAGCACTACCTTTCTTCTTCTTCCTTTCTTTGTTTCTCTACTTCACCACCTTCCAAGCTTTCAGTTTTTCAACTGCTGGATAATGTAACCACATGCGTTGTTT
TTCTACAATCATGTGCTCACCACAAGAATGTCAACAAAGGAAAACAGCTTCACTCCCTAATGATCACCTATGGTTTTTCTCCTTCACCTCCATCCATCACTAGCTTAATC
AACATGTACTCCAAATGTGGTCAAATGGAGGAGGCCATTTTGGTTTTTCATGATCCATGTCATGAGCGTAATGTGTTTGCATATAATGCTATAATTTCTGGGTTTGTCGC
TAATGGTCTTTCTTCAAAAGGGTTTCAATTTTATAAGCAAATGAGGTTAGAGGGTGTAATGCCTGATAAGTACACTTTTCCATGTGTAGTTAGAACTTGTTGTGATGTTA
TGGAGGTGAAGAAGATTCATGGATGTTTGTTTAAAATGGGGTTGGAGTTGGATGTGTTTGTTGGTAGTGCTTTGGTTAATACTTACTTAAAGATTGGCTCAATGGAGGAT
GCACAAAAAGTGTTTGAAGAACTATCAATAAGAGATGTCGTACTTTGGAATGCAATGATCAATGGGTATGCCCAGATTGGTTGCCTTGACGAGGCATTAGAGGTTTTCAG
AAGAATGCATATAGAAGGGATTGCACCTAGTAGGTTTACAATTACTGGCATTTTATCTATTTTTGCTTTAAGGGGAGACTTAGACAATGGGAAAACAGTTCACGGGATTG
TAATGAAAATGGGTTATGATTCAGGAGTTGCAGTTTCAAATGCGTTAATTGATATGTATGGGAAATGCAAACATATTGGAGATGCTTTAATAATTTTTGAGATGATAAAT
GAGAAGGATATTTTCTCATGGAATTCAATTATATCAGTTCATGAACAATGTGGTGATCATGATGGTACCTTGAGGCTTTTTGATAAGATGTTAGGTTCAGGGATTCTACC
TGATTTGGTAACCATCACAACCGTACTTCCAGCTTGCTCTCATTTGGCTGCCCTCATGCATGGTAGAGAAATTCATGGATATATGATTGTTAATGGATTGGGAAAGGATG
ATGAAAATGGAGCTTTAGATGATTTACTTGTAAATAATGCTGTTATGGATATGTATGCAAAATGTGGAAGTATGAACAATGCCCTCAAGATTTTTGATCTACTGAGCAAT
AAGGACGTGGCATCGTGGAATATCATGATTATGGGTTATGGTATGCATGGATATGGTTTGGAGGCATTGGATATGTTTTCGCGAATGTGTGCGGCCAGATTTAAGCCGGA
TGAAGTTACGCTTGTTGGAGTTCTATCAGCATGCAATCATGCGGGCTTTGTGTCTCAAGGGCGTCTGTTTTTAGCTCAAATGGAATCTAAATTTAGTGTTATTCCAACTA
TTGAGCATTATACATGTGTAATTGATATGCTCGGTCGAGCTGGGCATCTGGAGGACGCGTATGAGGTTGCTCAGAAAATGCCTATTCAAGCCAATCCTGTTGTTTGGAGG
GCTTTATTGGGAGCATGCCGACTTCATGGGAATGCAGAGTTGGCCGAAATTGCAGCACGACAAGTAATGCAACTTGAACCAGAGCATTGTGGGAGTTATGTATTGATGTC
CAATGTTTATGGAGTTATAGGTCGATATGAAGAGGTGTTGGAGGTTAGAAAAACAATGAAGGAACAAAATGTCAAGAAGACACCTGGTTGTAGTTGGATTGAACTCAAGG
ATGGGGTGCACAAGATCTATTGCGAGCTGACTTCCTTCCATTCTCTTCACTTCCCTGAGTACAATGGCTCCATTGGTAGTCGTCTCTCTCCTGTTTCGTTTCTCAGTGTT
GATCAACATGACAACACTTCCATTACTCCACAAATGACAATGAAAAACAAATTCGTTCCACTTTTATGGCTCTTCAATCTTCTGACTCATCAATCCAGAATTAAAGTGCT
TCAAGATCATGTGCATTTTCCCAATGGTCCGGCTGTGGACTGTGTCGGGTTGAGTGGTGGTTTGGCTTTATTATGGACCACCGATGTCACCATTGATCTCTTCACACTCT
CTAAATCACATATTCACATGAAAATTACTAACAATAACCATATCCTCCGTCTTAATGGGTTCTATGGCGAGCCAAAACACTCTGATAAACACTTCTCATGGACTTTATTG
AGAAGGTTACGTGGAATGTACTCACTCCCCTGGATTGTGGTAGGGGATTTTAATGAAACCCTTCAGGCTTATGAGAAAGTTAGAGGGCGTGAACAATACAGTCCTGAGTT
GACAGGCCACTACAAGCATAAAATCGAGGAGACCAAAGCTCGTATTCAATACCTTCTTAGTGGTGAATACGGGTTAGAGCCGACGAGAAGGAACAGAAGGAGGAGGAGGA
GGTTCTCAACAATGTCCGATGCGTTTTGCTCTGACTGCAAGCGTCAGACGGAGGTTGTTTTCGACCATTCCGCTGGAGACACCGTGTGTTCCGAGTGTGGTCTTGTGCTT
GAATCCCACTCCATCGATGAGACCTCCGAGTGGAGGACTTTTGCCAATGAGTCTGGGGATAACGACCCGGTTCGTGTTGGTGGACCGACCAATCCGCTTTTGGCTGATGG
TGGCCTCTCTACCGTGATTGCGAAGCCTAATGGTACGACTGGGGAGTTCTTGTCCTCGTCTTTGGGTCGGTGGCAGAATCGTGGGTCGAATCCAGATCGAGGGCTCATTC
TTGCTTTCAAGACCATTGCTACTATGTCTGATAGCCGGAAACAGATCGAATTGGGACTGTGGAGGGGTGGGAATTTGGGAAATCTTACTAGTTTAAGGTTGGGCCTTGTT
GCAACCATTAAGGTTAGTTGTCAGTATAGAATGATCGTGCAGTATTCTCAAGAAAAAGGTCTTTACGAGGACAAATGCATAGGTAAGTTAAGGAAGACGCTTGTGGCATC
TGGAGGGGTTAATATTGGTGATGCAGTGAGGAAGTTGAGCATTAACAGAAGGAAAATTACTCAATCTTATAGTGAAATTTTTCCTCCGAACTGTAAGGTGCCTGAGACTA
ATTATTCAAACGAGGAAATTTCCTCTGGTCCAGACTCCAGAATAGTGTCATTCATATCTCTAGCTAATGATCGGGCCAATGAGATATATAAAAGAGTAGAAGATCAAAAA
TCTAGTAGAGGAAGAAATCAAGATGCTTTATTGGCTGCTTGCTTATACATTGCTTGTCGACAAGAAGATAAACCCCGCACGGTCAAGGGTATTGCTTATGGATCATTTGA
TATCTGGAGCATTGAAATTTGCTCTGTTGCGAATGGGGCAACAAAGAAGGAGATTGGCCGAGCAAAAGAATACATTGTGAAACAGTTGGGGTTGGAGACAGGTCAGTCTG
TGGAGATGGGAACAATACACGCTGGAGACTTTATGAGGCGTTTTTGTTCTAATCTTGGGATGAATAATCAAGCTGTTAAAGCTGCCCAAGAAGCTGTACAGAAATCTGAA
GAGTTTGATATTAGGAGAAGCCCAATTTCCATTGCAGCAGCAGTTATTTACATTATTACTCAGCTTTCAGATGATAAGAAGCCTCTGAAAGATATATCGGTAGCAACCGG
TGTTGCAGAAGGAACAATCAGAAATTCATATAAAGATCTCTACCCACACGTGTCGAAGATATTACCGAGTTGGTATGCTAAAGAGGAGGATCTTAAGAACCTTTGCAGTC
CTTGAAATGTAGAGCAAAGCAACAATGACGACGACCTCATATGACGGGAATTAAATCGAAATTGAAATTACTCTTTTTCTTTAGCTTTCTTGTATTTACTCTTTGAGATT
GCAAAGAGTGTGCTAGAAAGAAAATCTCCACCTTCTGCTTCTTCTTTTCCTAAATAGATTCATTGCCAATTATCAAATTCTAAATGACCCCCAAAACTTGCCCCATAGTT
TTCACCTCAATTCACAATGAAAGTTATCCAAAATTTTGTGATGCTCTCATGACTTATATCAATATCAATCAGTTAGCATTTGCTTCAAGTTAATAGCCATTTTCAGGCGT
TCTTAGGGCTTCCAAAGGGTTTCAAGAGTAGGCATTTGGAACTTTGCTTTCCATTTGTTCATCTTGTTCTCCAATTTTTGTAGGTGTCCTGTTTTTGCTTTTTGTGAAAT
ATTTTTAGCAATCTATTTTGTTAAATCACCATTCCATCCAAAAATTAGGTCACTTAACATTAATCTTTTCAGGGGTTTAGAATTTAGAATGAGCTATTAATATTTATTAG
AAAAATTATAATATTTAGAGTTTGAGTATAAGACCTTTTGCCTTTATATCACATTAAATTGCCACTCAATTTAGAAGACGGG
Protein sequenceShow/hide protein sequence
MVVMMSNVLKPSIVFYNVLDRLAWCLTNRKIIFSKIISKKHYLSSSSFLCFSTSPPSKLSVFQLLDNVTTCVVFLQSCAHHKNVNKGKQLHSLMITYGFSPSPPSITSLI
NMYSKCGQMEEAILVFHDPCHERNVFAYNAIISGFVANGLSSKGFQFYKQMRLEGVMPDKYTFPCVVRTCCDVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMED
AQKVFEELSIRDVVLWNAMINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILSIFALRGDLDNGKTVHGIVMKMGYDSGVAVSNALIDMYGKCKHIGDALIIFEMIN
EKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIVNGLGKDDENGALDDLLVNNAVMDMYAKCGSMNNALKIFDLLSN
KDVASWNIMIMGYGMHGYGLEALDMFSRMCAARFKPDEVTLVGVLSACNHAGFVSQGRLFLAQMESKFSVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQANPVVWR
ALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHKIYCELTSFHSLHFPEYNGSIGSRLSPVSFLSV
DQHDNTSITPQMTMKNKFVPLLWLFNLLTHQSRIKVLQDHVHFPNGPAVDCVGLSGGLALLWTTDVTIDLFTLSKSHIHMKITNNNHILRLNGFYGEPKHSDKHFSWTLL
RRLRGMYSLPWIVVGDFNETLQAYEKVRGREQYSPELTGHYKHKIEETKARIQYLLSGEYGLEPTRRNRRRRRRFSTMSDAFCSDCKRQTEVVFDHSAGDTVCSECGLVL
ESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGTTGEFLSSSLGRWQNRGSNPDRGLILAFKTIATMSDSRKQIELGLWRGGNLGNLTSLRLGLV
ATIKVSCQYRMIVQYSQEKGLYEDKCIGKLRKTLVASGGVNIGDAVRKLSINRRKITQSYSEIFPPNCKVPETNYSNEEISSGPDSRIVSFISLANDRANEIYKRVEDQK
SSRGRNQDALLAACLYIACRQEDKPRTVKGIAYGSFDIWSIEICSVANGATKKEIGRAKEYIVKQLGLETGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSE
EFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATGVAEGTIRNSYKDLYPHVSKILPSWYAKEEDLKNLCSP