; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G023040 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G023040
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionprotein TIC 62, chloroplastic
Genome locationCicolChr02:5707380..5730242
RNA-Seq ExpressionCcUC02G023040
SyntenyCcUC02G023040
Gene Ontology termsGO:0098807 - chloroplast thylakoid membrane protein complex (cellular component)
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR016040 - NAD(P)-binding domain
IPR025714 - Methyltransferase domain
IPR025757 - Ternary complex factor MIP1, leucine-zipper
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK06707.1 uncharacterized protein E5676_scaffold13G00080 [Cucumis melo var. makuwa]2.8e-26887.25Show/hide
Query:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST
        M KGN++QI DGDVQISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSFST
Subjt:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST

Query:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP
        MDDRLESY EP+  +EGEHS I+SDHIVS ETL  +QSKGRN VEEPEKLSH HRS SSLSQRS GSSRNY LSKYMAKAVDSYHS PLSMLEQS+ID P
Subjt:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP

Query:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV
        +STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFH EEF APY TMLKV
Subjt:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV

Query:  QWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLPR
        QWISRER KDSDINHMLQGFRSLIFRLKEV LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK+ LKRISLILKAAYNIGGHIISVD IQSSILGCRLPR
Subjt:  QWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLPR

Query:  PGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLE
         GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LEVAK++YI+SNLR HKGQRILLPKIVESFAKDSGLCLEDLE
Subjt:  PGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLE

Query:  DIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        + VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  DIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

XP_008454883.1 PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo]2.5e-26987.3Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF
        MEM KGN++QI DGDVQISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSF
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF

Query:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID
        STMDDRLESY EP+  +EGEHS I+SDHIVS ETL  +QSKGRN VEEPEKLSH HRS SSLSQRS GSSRNY LSKYMAKAVDSYHS PLSMLEQS+ID
Subjt:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID

Query:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML
         P+STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFH EEF APY TML
Subjt:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML

Query:  KVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL
        KVQWISRER KDSDINHMLQGFRSLIFRLKEV LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK+ LKRISLILKAAYNIGGHIISVD IQSSILGCRL
Subjt:  KVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL

Query:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED
        PR GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LEVAK++YI+SNLR HKGQRILLPKIVESFAKDSGLCLED
Subjt:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED

Query:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LE+ VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

XP_011658927.1 uncharacterized protein LOC101203131 isoform X2 [Cucumis sativus]6.2e-26887.11Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF
        MEM KGN++QI DGD QISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSF
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF

Query:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID
        STMDDRLESY EP+  +EGEHS I+SDHI S ETL  +QSKGRN VEEPE LSH HRS SSLSQRS GSSRNY LSK MAKAVDSYHS PLSMLEQS+ID
Subjt:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID

Query:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML
         P+STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFHTEEF APY TML
Subjt:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML

Query:  KVQWISRERK-DSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL
        KVQWISRERK DSDINHMLQGFRSLIFRLKEV LK MKHDEKLAFWINVHNTLVMHAYLQYGI K+ LKRISLILKAAYNIGGHIISVD IQSSILGCRL
Subjt:  KVQWISRERK-DSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL

Query:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED
        PR GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKE+YI+SNLR HKGQ+ILLPKIVESFAKDSGLCLED
Subjt:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED

Query:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LE+ VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

XP_038888458.1 uncharacterized protein LOC120078295 isoform X1 [Benincasa hispida]4.7e-27690.07Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLS----PHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQH
        MEMEKGN+KQI DG+VQ+SLKQEILQL+EQLQSQFATRHALEKAINFQPLS     HSAT+NSIPKAEM LIKQIA+LELEVVYLEKYLLSLYRRTF+QH
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLS----PHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQH

Query:  VSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQ
        VSSFSTMDDRLESYTEPH A+EGEHSFINS HIVS ETLS +QSKGRNEVEEPEKLS  HRSYSSLSQRS  SSRNYPLS   AKAVDS+HSLPLSMLEQ
Subjt:  VSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQ

Query:  SQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPY
        S+IDAPNSTSL EH  ACISNR DESPNWLSEEMIKSISAIYLELAEPPLMN NNPSPISPLSS+YELSSQD GSMRNYEKSFNSHFENPF+TEEFS   
Subjt:  SQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPY

Query:  YTMLKVQWISRERKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILG
        +TML+VQWISRERKDSDIN MLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAY+IGGHIISVDMIQSSILG
Subjt:  YTMLKVQWISRERKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILG

Query:  CRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLC
        CRLPR GQWLHLFLSSKTKFKV DAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYI+SNLRAHKGQRILLPKIVESFAKDSGLC
Subjt:  CRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLC

Query:  LEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LEDLE+IVE LR  RRI DIQQRQRKK WKSIGWIPHNF+FSFLL KELACQSL
Subjt:  LEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

XP_038888459.1 uncharacterized protein LOC120078295 isoform X2 [Benincasa hispida]1.4e-27289.53Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLS----PHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQH
        MEMEKGN+KQI DG+VQ+SLKQEILQL+EQLQSQFATRHALEKAINFQPLS     HSAT+NSIPKAEM LIKQIA+LELEVVYLEKYLLSLYRRTF+QH
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLS----PHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQH

Query:  VSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQ
        VSSFSTMDDRLESYTEPH A+EGEHSFINS HIVS ETLS +QSKGRNEVEEPEKLS  HRSYSSLSQRS  SSRNYPLS   AKAVDS+HSLPLSMLEQ
Subjt:  VSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQ

Query:  SQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPY
        S+IDAPNSTSL EH  ACISNR DESPNWLSEEMIKSISAIYLELAEPPLMN NNPSPISPLSS+YELSSQD GSMRNYEKSFNSHFENPF+TEEFS   
Subjt:  SQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPY

Query:  YTMLKVQWISRERKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILG
        +TML+VQWISRERKDSDIN MLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTL   AYLQYGIPKNSLKRISLILKAAY+IGGHIISVDMIQSSILG
Subjt:  YTMLKVQWISRERKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILG

Query:  CRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLC
        CRLPR GQWLHLFLSSKTKFKV DAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYI+SNLRAHKGQRILLPKIVESFAKDSGLC
Subjt:  CRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLC

Query:  LEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LEDLE+IVE LR  RRI DIQQRQRKK WKSIGWIPHNF+FSFLL KELACQSL
Subjt:  LEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

TrEMBL top hitse value%identityAlignment
A0A0A0K861 Uncharacterized protein3.0e-26887.11Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF
        MEM KGN++QI DGD QISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSF
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF

Query:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID
        STMDDRLESY EP+  +EGEHS I+SDHI S ETL  +QSKGRN VEEPE LSH HRS SSLSQRS GSSRNY LSK MAKAVDSYHS PLSMLEQS+ID
Subjt:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID

Query:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML
         P+STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFHTEEF APY TML
Subjt:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML

Query:  KVQWISRERK-DSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL
        KVQWISRERK DSDINHMLQGFRSLIFRLKEV LK MKHDEKLAFWINVHNTLVMHAYLQYGI K+ LKRISLILKAAYNIGGHIISVD IQSSILGCRL
Subjt:  KVQWISRERK-DSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL

Query:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED
        PR GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKE+YI+SNLR HKGQ+ILLPKIVESFAKDSGLCLED
Subjt:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED

Query:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LE+ VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

A0A1S3BZ51 uncharacterized protein LOC1034951931.2e-26987.3Show/hide
Query:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF
        MEM KGN++QI DGDVQISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSF
Subjt:  MEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSF

Query:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID
        STMDDRLESY EP+  +EGEHS I+SDHIVS ETL  +QSKGRN VEEPEKLSH HRS SSLSQRS GSSRNY LSKYMAKAVDSYHS PLSMLEQS+ID
Subjt:  STMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQID

Query:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML
         P+STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFH EEF APY TML
Subjt:  APNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTML

Query:  KVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL
        KVQWISRER KDSDINHMLQGFRSLIFRLKEV LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK+ LKRISLILKAAYNIGGHIISVD IQSSILGCRL
Subjt:  KVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRL

Query:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED
        PR GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LEVAK++YI+SNLR HKGQRILLPKIVESFAKDSGLCLED
Subjt:  PRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLED

Query:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        LE+ VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  LEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

A0A5A7SMG0 Uncharacterized protein4.8e-26686.91Show/hide
Query:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST
        M KGN++QI DGDVQISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSFST
Subjt:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST

Query:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP
        MDDRLESY EP+  +EGEHS I+SDHIVS ETL  +QSKGRN VEEPEKLSH HRS SSLSQRS GSSRNY LSKYMAKAVDSYHS PLSMLEQS+ID P
Subjt:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP

Query:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV
        +STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFH EEF APY TMLKV
Subjt:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV

Query:  QWISRER-KDSDINHMLQGF-RSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLP
        QWISRER KDSDINHMLQGF RSLIFRLKEV LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK+ LKRISLILKAAYNIGGHIISVD IQSSILGCRLP
Subjt:  QWISRER-KDSDINHMLQGF-RSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLP

Query:  RPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDL
        R GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCG+ SD AVR+YTAKRVNE+LEVAK++YI+SNLR HKGQRILLPKIVESFAKDSGLCLEDL
Subjt:  RPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDL

Query:  EDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        E+ VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  EDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

A0A5D3C4C9 Uncharacterized protein1.3e-26887.25Show/hide
Query:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST
        M KGN++QI DGDVQISLKQEILQL+EQLQSQFATRHALEKAINFQPLS +SAT+++IP+AEMELIKQIA+LELEVVYLEKYLLSLYRRTF+Q VSSFST
Subjt:  MEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFST

Query:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP
        MDDRLESY EP+  +EGEHS I+SDHIVS ETL  +QSKGRN VEEPEKLSH HRS SSLSQRS GSSRNY LSKYMAKAVDSYHS PLSMLEQS+ID P
Subjt:  MDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAP

Query:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV
        +STSL EH  AC+S R DESPNWLSEEMIKSISAIY ELAEPPLMN NNPSPISPLSSMYELSSQD GSMRNYEKS NSHFENPFH EEF APY TMLKV
Subjt:  NSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKV

Query:  QWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLPR
        QWISRER KDSDINHMLQGFRSLIFRLKEV LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK+ LKRISLILKAAYNIGGHIISVD IQSSILGCRLPR
Subjt:  QWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLPR

Query:  PGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLE
         GQWLHLFLSSKTKFKVND QKSFPINHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LEVAK++YI+SNLR HKGQRILLPKIVESFAKDSGLCLEDLE
Subjt:  PGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLE

Query:  DIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL
        + VECLR  RRINDIQQRQRKK WKSIGWIPHNFTFSFLL  EL+CQSL
Subjt:  DIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQSL

A0A6J1C5D8 uncharacterized protein LOC111007555 isoform X13.2e-25483.07Show/hide
Query:  ASYRQEMEME------KGNEKQIPDGDV-QISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLS
        ASYRQ MEME      +  +KQ+PD  V Q SLKQEI QLQEQLQSQF  RHALEKAINFQP S  SAT++SIPKA MELIKQIA+LELEVVYLEKYLLS
Subjt:  ASYRQEMEME------KGNEKQIPDGDV-QISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLS

Query:  LYRRTFDQHVSSFSTMDDRLESYTEPHFAMEGE--HSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDS
        LYRRTF Q VSS STMDDRLESY+ P F +EGE  HSFI+SDHIVS +T  G+QSKGRNEVEEPEKLSH HRSYSSL +RSPGSS NYPLSK +AKAVDS
Subjt:  LYRRTFDQHVSSFSTMDDRLESYTEPHFAMEGE--HSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDS

Query:  YHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLM-NRNNPSPISPLSSMYELSSQD-LGSMRNYEKSFNSHF
        YHSLPLSMLEQSQ DA NS SL EHF A +  R  +SPNW+SEEMIKSIS IY ELA+PPLM N NNPSPISPLSSM ELSSQD LGSMRNYEKSFNS+F
Subjt:  YHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPPLM-NRNNPSPISPLSSMYELSSQD-LGSMRNYEKSFNSHF

Query:  ENPFHTEEFSAPYYTMLKVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGG
         NPFH EEFS PY TMLKVQWISRER KDSDINHMLQGFRSLI+RLKEVDLK MKH+EKLAFWINVHNTLVMHAYLQYGIPKNSLKR SLILKAAYN+GG
Subjt:  ENPFHTEEFSAPYYTMLKVQWISRER-KDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGG

Query:  HIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILL
        HIISVDMIQSSILGC LPR GQWLHLFLSSKTKFKVNDA+KSF INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYI+SNLR HKGQRILL
Subjt:  HIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILL

Query:  PKIVESFAKDSGLCLEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQS
        PK+VESFAKDSGLCLEDLEDIVE LRP  RINDIQQ+Q+KK WKSI  IPHNFTF++LLSKELACQS
Subjt:  PKIVESFAKDSGLCLEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELACQS

SwissProt top hitse value%identityAlignment
P0DPD7 EEF1A lysine methyltransferase 41.6e-3234.66Show/hide
Query:  NPKSAQSESVAPATALAYLDPKYWDERF---SKEEHYEWFKDYSHFRHLILPLLKPDSSVLELGSGNSKLSEELYNDGITDITCIDLSAVAVEKMQRRLH
        +P + ++    P     Y + +YWD+R+   +    Y+WF D+S FR L+ P L+P+  +L LG GNS LS EL+  G  ++T +D S+V V  MQ R H
Subjt:  NPKSAQSESVAPATALAYLDPKYWDERF---SKEEHYEWFKDYSHFRHLILPLLKPDSSVLELGSGNSKLSEELYNDGITDITCIDLSAVAVEKMQRRLH

Query:  LKGMKEIKVLEADMLDMPFGDECFDVVVEKGTMDVLFVDGGNPWNPQPSTRAKVMAMLEGVHRVLKKDGIFVSITFGQPHFRRPLFNAPEFTWSFECSTF
           + +++    D+  + F    FDVV+EKGT+D L     +PW         V  +L  V RVL   G F+S+T   PHFR   +    + WS   +T+
Subjt:  LKGMKEIKVLEADMLDMPFGDECFDVVVEKGTMDVLFVDGGNPWNPQPSTRAKVMAMLEGVHRVLKKDGIFVSITFGQPHFRRPLFNAPEFTWSFECSTF

Query:  GDGFHYFLYTLRK-GRRSFGDKGEGERSDMP-----SICLLQDELEGEDYM
        G GFH+ LY + K G+ S      G +   P     S C LQD  + ED++
Subjt:  GDGFHYFLYTLRK-GRRSFGDKGEGERSDMP-----SICLLQDELEGEDYM

Q10A77 Protein TIC 62, chloroplastic5.6e-7843.12Show/hide
Query:  FSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLKLGFRVRAGVRSSQKAETLVESVKKIKLDE----AVEKLETVVCDLEKPNQ--IGPALGNA
        +++ AA A  + + +K+ DLVF+AGATGKVGSR VRE +KLGFRVRAGVRS+Q+A +LV+SV+++K+D+      E+LE V CDLEK  Q  I  A+GNA
Subjt:  FSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLKLGFRVRAGVRSSQKAETLVESVKKIKLDE----AVEKLETVVCDLEKPNQ--IGPALGNA

Query:  SIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCS-----AYVPGNQQDRIS------------CCHSQ--------------IVRPGGMERPTDAF
        +IV+C IGASEK+I D+TGPYRIDY+AT NLV+AA  +       V     +RI              C  +              IVRPGGMERPTDAF
Subjt:  SIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCS-----AYVPGNQQDRIS------------CCHSQ--------------IVRPGGMERPTDAF

Query:  KETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSYYKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEV
        KETHN  ++  DT  GGLVSNLQVAEL+ACIA N   +Y KV+E IAETTAPL P ED L  IPSK                 P P+   + +G+     
Subjt:  KETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSYYKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEV

Query:  NVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIAYGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEI-AEANPAPAPAPAPEKAVT
             P P   S   L                  SPY A+ DLKPP+SP+P  P      +      +S  T +A + + + + A   P     P++   
Subjt:  NVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIAYGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEI-AEANPAPAPAPAPEKAVT

Query:  LKPLSPYVAYEDLKPPASPSPSAPSLSFSSTSPSNCPPEPATSTVNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSL
         +PLSPY  YE+LKPP+SPSP+ PS + SS S S  P  P  +  +S      A  +         + Q PLSP+T YE+LKPP+SP+PS P L
Subjt:  LKPLSPYVAYEDLKPPASPSPSAPSLSFSSTSPSNCPPEPATSTVNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSL

Q8H0U5 Protein TIC 62, chloroplastic1.3e-9845.3Show/hide
Query:  LRSPALTTVPSSLSRTGFSDKPLLNAQPLKLSNKKTYPLAGRLKF--LHIRAQASSSTSNFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK
        LR   LTT+PS  SR GF  +     + ++ S  K + ++G+ +   L +RA      S+  +EA+P  L   +SK+EDLVFVAGATGKVGSRTVRELLK
Subjt:  LRSPALTTVPSSLSRTGFSDKPLLNAQPLKLSNKKTYPLAGRLKF--LHIRAQASSSTSNFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK

Query:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCSAY
        LGFRVRAGVRS+Q+A +LV+SVK++KL       + VEKLE V CDLEK + I PALGNAS++ICCIGASEKEI DITGPYRIDYLATKNLV+AA  SA 
Subjt:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCSAY

Query:  VPG------------------------------NQQDRI--SCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSY
        V                                  ++ +  S  +  IVRPGGMERPTDA+KETHN TL+  DTLFGG VSNLQVAEL+AC+AKNP LS+
Subjt:  VPG------------------------------NQQDRI--SCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSY

Query:  YKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIA
         K++EV+AETTAPLTP+E LL+KIPSK     P K   A + V P P +             VT++P   +   E  +  KEK     NV  +  SPY +
Subjt:  YKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIA

Query:  YGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAP-----APAPE-KAVTLK---PLSPYVAYEDLKPPASPSPSA--------
        Y DLKPPTSP P +              S A++   +A     EAN  P P      P  E K V  K   PLSPY  YE+LKPP+SPSP+A        
Subjt:  YGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAP-----APAPE-KAVTLK---PLSPYVAYEDLKPPASPSPSA--------

Query:  --PSLSFSSTSPSNCPPEPATST----------VNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSLSFSSTSPSNGPPEPATS
          P  + S T  S+   +  T T          V +++P      S +E   P     +PLSP+ IY DLKPPTSP+P+           S GP E A+ 
Subjt:  --PSLSFSSTSPSNCPPEPATST----------VNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSLSFSSTSPSNGPPEPATS

Query:  TVNSTLP
          NS LP
Subjt:  TVNSTLP

Q8SKU2 Protein TIC 62, chloroplastic1.5e-10248.37Show/hide
Query:  TTVPSSLSRTG-FSDKPLLNAQPLKLSNKKTYPLAGRLKFLHIRAQASSSTS-------NFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK
        T +PS+L+R    +DKP  +    K S+   YPL   L    IR+ +SSS+S       +  S  A  I +K DSKD++LVFVAGATGKVGSRTVREL+K
Subjt:  TTVPSSLSRTG-FSDKPLLNAQPLKLSNKKTYPLAGRLKFLHIRAQASSSTS-------NFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK

Query:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAA-----
        LGF+VRAGVR++QKA  LV+SVK++KLD      EAVEKLE V CDLEK +QIG ALGNAS VIC IGASEKEIFDITGP RIDY ATKNLV+AA     
Subjt:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAA-----

Query:  ---------------LCSAYV-----------PGNQQDRISCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSYY
                       L +A +              +    S     IVRPGGMERPTDA+KETHN TLS  DTLFGG VSNLQVAELMA +AKNP LSY 
Subjt:  ---------------LCSAYV-----------PGNQQDRISCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSYY

Query:  KVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIAY
        K++EVIAETTAPLTP E LL +IPS+             R   PSPK+   A   ++A V+ T   A        ++  KE  S     T+Q  SPY AY
Subjt:  KVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIAY

Query:  GDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAPAPAPEKAVTLKPLSPYVAYEDLKPPASPSPSAPSLSFSSTSPSNCPPEPA
         DLKPP+SP+P  P  K  +N+ + + +    IS++ P+ I E +     +     +   + LSPY AY DLKPP+SPSPS P+ S S            
Subjt:  GDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAPAPAPEKAVTLKPLSPYVAYEDLKPPASPSPSAPSLSFSSTSPSNCPPEPA

Query:  TSTVNSTLPIP---EAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPS
        T  V+S  P     E      E HL +PK+ +PLSP+ +YEDLKPP SPSPS
Subjt:  TSTVNSTLPIP---EAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPS

Q9XII1 Plastid division protein PDV21.7e-5844.05Show/hide
Query:  EEQSTAIILARATELRLKIRNSVNTTTTSSAVNSREIRDDRFSVDENNGVVGSRRSEADASGGEAEEDEEAVRLLNICDALESLENQLSSLQDLQQRQRY
        +E+   +ILARATELRLKI + ++ ++T+ + N     D        + ++G++  + D+   E  ++ EA RLL I DALE+LE+QL+SLQ+L+QRQ+Y
Subjt:  EEQSTAIILARATELRLKIRNSVNTTTTSSAVNSREIRDDRFSVDENNGVVGSRRSEADASGGEAEEDEEAVRLLNICDALESLENQLSSLQDLQQRQRY

Query:  EKEVALSEIEHSRKILLDKLKKYKGGDLEVIHEASAFVGETVQHNQDFMLPPYPSH----LG----NGYLHPFPSGHKSVSNGLIDATSNKATNKLNKSE
        EK++ALSEI++SRK+LL+KLK+YKG D EV+ E + F GE V +  D +LPPYP H    LG    NGYL   PS  KS +NG        + +  N++E
Subjt:  EKEVALSEIEHSRKILLDKLKKYKGGDLEVIHEASAFVGETVQHNQDFMLPPYPSH----LG----NGYLHPFPSGHKSVSNGLIDATSNKATNKLNKSE

Query:  RKLSKSDSQNSKNGLGFFISVAAKSVVTIVGIASILHLTGFRPKFVRKVAALKVFDGFRQSAGGNNGSHNACPPGKFLMMEDGEARCVVKERIEVPFSSV
         K     S  S +G+  F+   AK V+ I+G+ S+L  +G+ P+  ++ A+L +F      A     + N CPPGK L++EDGEARC+VKER+E+PF SV
Subjt:  RKLSKSDSQNSKNGLGFFISVAAKSVVTIVGIASILHLTGFRPKFVRKVAALKVFDGFRQSAGGNNGSHNACPPGKFLMMEDGEARCVVKERIEVPFSSV

Query:  VAKPDVNYGSG
        VAK DV YG G
Subjt:  VAKPDVNYGSG

Arabidopsis top hitse value%identityAlignment
AT3G18890.1 NAD(P)-binding Rossmann-fold superfamily protein9.0e-10045.3Show/hide
Query:  LRSPALTTVPSSLSRTGFSDKPLLNAQPLKLSNKKTYPLAGRLKF--LHIRAQASSSTSNFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK
        LR   LTT+PS  SR GF  +     + ++ S  K + ++G+ +   L +RA      S+  +EA+P  L   +SK+EDLVFVAGATGKVGSRTVRELLK
Subjt:  LRSPALTTVPSSLSRTGFSDKPLLNAQPLKLSNKKTYPLAGRLKF--LHIRAQASSSTSNFSSEAAPAILKKEDSKDEDLVFVAGATGKVGSRTVRELLK

Query:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCSAY
        LGFRVRAGVRS+Q+A +LV+SVK++KL       + VEKLE V CDLEK + I PALGNAS++ICCIGASEKEI DITGPYRIDYLATKNLV+AA  SA 
Subjt:  LGFRVRAGVRSSQKAETLVESVKKIKLD------EAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCSAY

Query:  VPG------------------------------NQQDRI--SCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSY
        V                                  ++ +  S  +  IVRPGGMERPTDA+KETHN TL+  DTLFGG VSNLQVAEL+AC+AKNP LS+
Subjt:  VPG------------------------------NQQDRI--SCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSY

Query:  YKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIA
         K++EV+AETTAPLTP+E LL+KIPSK     P K   A + V P P +             VT++P   +   E  +  KEK     NV  +  SPY +
Subjt:  YKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDPSPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIA

Query:  YGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAP-----APAPE-KAVTLK---PLSPYVAYEDLKPPASPSPSA--------
        Y DLKPPTSP P +              S A++   +A     EAN  P P      P  E K V  K   PLSPY  YE+LKPP+SPSP+A        
Subjt:  YGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAP-----APAPE-KAVTLK---PLSPYVAYEDLKPPASPSPSA--------

Query:  --PSLSFSSTSPSNCPPEPATST----------VNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSLSFSSTSPSNGPPEPATS
          P  + S T  S+   +  T T          V +++P      S +E   P     +PLSP+ IY DLKPPTSP+P+           S GP E A+ 
Subjt:  --PSLSFSSTSPSNCPPEPATST----------VNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSLSFSSTSPSNGPPEPATS

Query:  TVNSTLP
          NS LP
Subjt:  TVNSTLP

AT5G66600.1 Protein of unknown function, DUF5474.9e-11444.33Show/hide
Query:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL
        +AS R +++M + NE +      + + SLKQEI  L+ +LQ QF  R ALEKA+ ++  S +  T   D ++PK   +LIK +A+LE+EV++LE+YLLSL
Subjt:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL

Query:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY
        YR+ F+Q +SS S   +  +  + P         F   D   S+        L  +Q++ +      V+  +    F RS+   SQRS   SR       
Subjt:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY

Query:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG
          KA  S HS PL +      +  N  SL EH    IS+   E+PN LSE M+K +S IY +LAEPP +++R   SP S LSS        Y+ SS   G
Subjt:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG

Query:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS
        +      SF+   +N FH E   +FS PY ++++V  I R+ +K S++  +LQ F+SLI RL+EVD + +KH+EKLAFWINVHN LVMHA+L YGIP+N+
Subjt:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS

Query:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED
        +KR+ L+LKAAYNIGGH IS + IQSSILGC++  PGQWL L  +S+ KFK  D + ++ I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE +KE+
Subjt:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED

Query:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA
        YI  NL   K QRILLPK+VE+FAKDSGLC   L ++V    P   R+     Q    K  K+I WIPH+FTF +L+ +E A
Subjt:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA

AT5G66600.2 Protein of unknown function, DUF5474.9e-11444.33Show/hide
Query:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL
        +AS R +++M + NE +      + + SLKQEI  L+ +LQ QF  R ALEKA+ ++  S +  T   D ++PK   +LIK +A+LE+EV++LE+YLLSL
Subjt:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL

Query:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY
        YR+ F+Q +SS S   +  +  + P         F   D   S+        L  +Q++ +      V+  +    F RS+   SQRS   SR       
Subjt:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY

Query:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG
          KA  S HS PL +      +  N  SL EH    IS+   E+PN LSE M+K +S IY +LAEPP +++R   SP S LSS        Y+ SS   G
Subjt:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG

Query:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS
        +      SF+   +N FH E   +FS PY ++++V  I R+ +K S++  +LQ F+SLI RL+EVD + +KH+EKLAFWINVHN LVMHA+L YGIP+N+
Subjt:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS

Query:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED
        +KR+ L+LKAAYNIGGH IS + IQSSILGC++  PGQWL L  +S+ KFK  D + ++ I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE +KE+
Subjt:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED

Query:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA
        YI  NL   K QRILLPK+VE+FAKDSGLC   L ++V    P   R+     Q    K  K+I WIPH+FTF +L+ +E A
Subjt:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA

AT5G66600.3 Protein of unknown function, DUF5474.9e-11444.33Show/hide
Query:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL
        +AS R +++M + NE +      + + SLKQEI  L+ +LQ QF  R ALEKA+ ++  S +  T   D ++PK   +LIK +A+LE+EV++LE+YLLSL
Subjt:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL

Query:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY
        YR+ F+Q +SS S   +  +  + P         F   D   S+        L  +Q++ +      V+  +    F RS+   SQRS   SR       
Subjt:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY

Query:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG
          KA  S HS PL +      +  N  SL EH    IS+   E+PN LSE M+K +S IY +LAEPP +++R   SP S LSS        Y+ SS   G
Subjt:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG

Query:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS
        +      SF+   +N FH E   +FS PY ++++V  I R+ +K S++  +LQ F+SLI RL+EVD + +KH+EKLAFWINVHN LVMHA+L YGIP+N+
Subjt:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS

Query:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED
        +KR+ L+LKAAYNIGGH IS + IQSSILGC++  PGQWL L  +S+ KFK  D + ++ I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE +KE+
Subjt:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED

Query:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA
        YI  NL   K QRILLPK+VE+FAKDSGLC   L ++V    P   R+     Q    K  K+I WIPH+FTF +L+ +E A
Subjt:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA

AT5G66600.4 Protein of unknown function, DUF5474.9e-11444.33Show/hide
Query:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL
        +AS R +++M + NE +      + + SLKQEI  L+ +LQ QF  R ALEKA+ ++  S +  T   D ++PK   +LIK +A+LE+EV++LE+YLLSL
Subjt:  KASYRQEMEMEKGNEKQIPD--GDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSAT---DNSIPKAEMELIKQIAILELEVVYLEKYLLSL

Query:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY
        YR+ F+Q +SS S   +  +  + P         F   D   S+        L  +Q++ +      V+  +    F RS+   SQRS   SR       
Subjt:  YRRTFDQHVSSFSTMDDRLESYTEPHFAMEGEHSFINSDHIVSR------ETLSGDQSKGRN----EVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKY

Query:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG
          KA  S HS PL +      +  N  SL EH    IS+   E+PN LSE M+K +S IY +LAEPP +++R   SP S LSS        Y+ SS   G
Subjt:  MAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHFSACISNRTDESPNWLSEEMIKSISAIYLELAEPP-LMNRNNPSPISPLSS-------MYELSSQDLG

Query:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS
        +      SF+   +N FH E   +FS PY ++++V  I R+ +K S++  +LQ F+SLI RL+EVD + +KH+EKLAFWINVHN LVMHA+L YGIP+N+
Subjt:  SMRNYEKSFNSHFENPFHTE---EFSAPYYTMLKVQWISRE-RKDSDINHMLQGFRSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNS

Query:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED
        +KR+ L+LKAAYNIGGH IS + IQSSILGC++  PGQWL L  +S+ KFK  D + ++ I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE +KE+
Subjt:  LKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKED

Query:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA
        YI  NL   K QRILLPK+VE+FAKDSGLC   L ++V    P   R+     Q    K  K+I WIPH+FTF +L+ +E A
Subjt:  YIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRP--YRRINDIQQRQRKKSWKSIGWIPHNFTFSFLLSKELA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAGGCTTCTTACCGACAAGAAATGGAAATGGAAAAGGGAAATGAGAAGCAAATTCCTGATGGGGATGTTCAGATTTCCTTGAAGCAGGAGATTCTACAGCTTCA
AGAACAATTACAAAGCCAGTTCGCCACTCGCCATGCCTTGGAGAAGGCAATCAATTTTCAGCCTCTCTCGCCTCACTCCGCTACCGATAATTCAATCCCAAAGGCTGAGA
TGGAACTGATTAAACAAATAGCGATCTTGGAGTTAGAAGTCGTTTATTTGGAAAAATATCTTCTATCGCTATACCGCCGAACTTTCGACCAACACGTATCCTCTTTTTCT
ACCATGGATGATAGGCTCGAATCATATACTGAACCTCATTTTGCGATGGAAGGAGAACATTCTTTCATTAATTCTGACCATATCGTGTCGCGGGAAACTTTATCTGGTGA
TCAATCAAAAGGAAGAAATGAAGTTGAGGAACCAGAGAAGCTGTCACACTTTCATCGCAGCTATTCGTCTCTGTCGCAGAGATCACCCGGTTCATCTAGAAACTACCCTC
TGTCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCTCTTCCATTATCAATGCTGGAGCAATCTCAAATTGATGCTCCGAATTCTACAAGCCTCAGGGAGCATTTC
AGTGCCTGTATATCCAATCGAACAGACGAGTCACCTAACTGGCTTTCTGAGGAGATGATCAAGTCCATCTCTGCAATATACCTTGAACTTGCAGAACCTCCTTTGATGAA
TCGCAACAATCCTTCTCCAATCTCACCTTTGTCATCCATGTATGAACTTTCTTCACAAGATTTGGGTAGTATGAGGAACTATGAGAAGTCGTTCAACTCGCATTTTGAGA
ACCCTTTTCACACTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGAAAGAAAGGACTCAGATATCAACCACATGCTACAAGGTTTC
AGGTCACTTATTTTTCGGCTCAAAGAAGTCGATCTTAAAGTGATGAAACACGACGAAAAGCTCGCATTTTGGATTAATGTACACAATACACTTGTAATGCATGCTTATTT
GCAATATGGGATCCCCAAAAATAGTTTGAAGAGAATATCTTTGATACTCAAGGCTGCATACAATATTGGGGGCCACATAATAAGTGTAGATATGATACAAAGCTCAATTC
TGGGGTGTCGTTTGCCTCGCCCGGGACAGTGGCTGCACCTGTTCCTCTCGTCAAAAACGAAATTCAAGGTCAATGACGCACAGAAATCCTTTCCAATCAACCACCCCGAA
CCTCGGTTATACTTTGCTCTATGCTGTGGGAGTCATTCTGATCCAGCGGTCCGTATCTATACGGCCAAGAGGGTGAATGAGGAGCTGGAGGTTGCTAAAGAAGACTACAT
CATTTCAAATTTGAGGGCACACAAAGGGCAGAGAATTCTACTCCCAAAGATTGTGGAGTCGTTTGCCAAGGATTCTGGCTTATGCCTGGAAGATTTGGAGGACATTGTGG
AGTGTCTAAGGCCCTACAGGCGGATAAACGACATTCAGCAGCGGCAACGAAAGAAGTCTTGGAAGAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTG
TCCAAAGAATTGGCATGCCAATCCCTCAGGGGTTCCATTTGGGGAAAGATGGAAGAACAAAGCACCGCTATAATTTTAGCTAGAGCCACGGAGCTGAGGCTAAAGATTAG
AAACTCTGTTAACACCACCACCACCAGTTCCGCCGTAAATTCCCGGGAGATTCGGGATGATCGGTTTTCTGTGGATGAAAATAATGGCGTTGTTGGTTCCCGCCGGAGTG
AGGCGGATGCGAGTGGTGGGGAGGCGGAGGAAGATGAGGAAGCGGTGAGGCTTTTGAACATCTGCGATGCACTCGAGTCTCTTGAGAATCAGCTTTCTTCGTTGCAGGAT
TTACAACAACGGCAAAGGTATGAGAAAGAAGTAGCCCTTTCCGAGATCGAACATAGCCGCAAGATTTTACTGGATAAACTGAAGAAGTACAAAGGAGGGGATTTGGAAGT
GATACATGAGGCTTCAGCTTTTGTTGGTGAGACAGTGCAGCACAACCAGGATTTCATGCTTCCTCCGTATCCAAGCCATCTTGGTAATGGCTACTTACATCCCTTCCCTT
CTGGACACAAATCTGTGAGTAATGGGCTAATCGACGCCACGTCAAATAAAGCTACAAACAAACTTAACAAATCAGAAAGGAAACTATCAAAATCGGATTCTCAGAACTCG
AAGAATGGATTGGGATTTTTCATTAGTGTAGCTGCAAAATCAGTGGTTACTATTGTTGGCATAGCATCCATACTCCACTTGACTGGTTTTAGACCAAAGTTTGTAAGGAA
AGTTGCTGCTTTGAAGGTTTTCGACGGTTTTCGACAGTCTGCAGGTGGAAATAATGGATCACACAACGCATGTCCTCCGGGTAAATTCCTCATGATGGAAGACGGGGAAG
CTCGATGTGTAGTGAAAGAGAGAATTGAAGTTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGAAGCGGTTCTTCACAGGCCAGAAATGGAACCTTCTTT
GGGACCCATCATATCCACATGGTGTGGATTGTGGACGATGCCATTGCTGCCAACGCCAAGCTGAGGAACTATTCTCTGCGGTCACCGGCTTTAACCACAGTGCCTTCCTC
TTTATCTAGAACTGGGTTCTCAGACAAGCCATTGTTGAATGCTCAACCCCTGAAATTATCGAATAAGAAGACATACCCACTTGCAGGAAGGCTCAAATTTCTTCACATCA
GAGCTCAAGCTTCTTCCAGCACAAGCAATTTTAGCTCAGAGGCAGCTCCAGCAATTTTAAAGAAGGAAGATTCGAAGGACGAGGATCTTGTATTTGTTGCTGGTGCTACT
GGAAAAGTTGGTTCAAGAACAGTGAGGGAGCTTTTGAAACTTGGATTCCGTGTAAGAGCTGGTGTAAGGAGTTCTCAGAAAGCTGAAACTCTTGTTGAGAGTGTTAAAAA
GATTAAGCTTGATGAAGCTGTAGAAAAGCTTGAAACTGTGGTATGTGACTTGGAGAAACCAAACCAGATTGGACCTGCATTAGGGAATGCATCAATAGTTATTTGCTGCA
TTGGTGCAAGTGAGAAGGAGATTTTTGATATTACTGGACCTTATCGAATTGACTATCTGGCTACCAAAAACTTGGTTGAAGCAGCACTTTGTTCTGCTTACGTCCCTGGG
AACCAACAAGATCGGATTTCCTGCTGCCATTCTCAAATAGTGAGACCTGGAGGCATGGAGAGGCCTACAGATGCTTTCAAGGAAACTCATAATACTACTCTTTCACCAGG
AGATACTTTATTTGGTGGTCTAGTGTCAAACCTTCAGGTTGCAGAGCTAATGGCATGCATAGCAAAAAATCCTGGTCTTTCATACTATAAAGTGCTAGAAGTAATTGCTG
AGACAACTGCACCATTGACTCCCCTGGAGGATCTTCTTAAAAAAATACCATCTAAAGTTGCAAATGCTTTCCCAGAGAAGGAATATGGTGCTGTACGAACTGTTGATCCA
TCACCCAAACAGTCGAGCATTGCCAAAGGAAAGGAATCAGCTGAAGTAAATGTGACAGAACAGCCAGCCCCTCAAAGTGGTTCATCAGAACAGTTGAGCATCACAAAGGA
GAAGGAATCAGCTGTAGCAAATGTTACAAAACAGTCATCATCACCCTACATTGCGTATGGGGATTTAAAACCTCCAACGTCTCCCACTCCAGCTGCACCTGTTGGGAAAA
TAGATTTAAATGTTGTTGAAGGAATCTCATCTAGTGCTCAAACTATTTCAGCAGAGGCACCAACTGAAATTGCCGAGGCAAATCCCGCACCTGCACCTGCACCTGCCCCT
GAAAAAGCAGTGACATTGAAACCTCTCTCCCCTTATGTAGCATATGAAGATTTAAAGCCTCCTGCATCACCATCTCCTAGTGCACCTTCCCTATCATTTTCTTCAACATC
TCCTTCAAATTGTCCACCCGAGCCTGCTACCTCCACCGTAAATAGCACATTGCCGATTCCAGAGGCAGAAGACTCAAAGAGTGAAGCACATCTTCCCAAGCCGAAGAACC
AACAACCGTTATCACCTTTTACCATATATGAAGATTTAAAGCCTCCTACATCACCATCTCCTAGTGCACCTTCCCTATCATTTTCTTCAACATCTCCCTCAAACGGTCCA
CCTGAGCCCGCTACCTCCACTGTAAATAGCACATTGCCGATTCCAGAGGCAGAAGACTCAAAGAGTGAAGCACATCTTCCCAAACCGAAGAAGCAACAACCGCTATCACC
TTTTACCATCAAAATTGGTCATCTCTCCCAATGGACTGGCAGCCCATTAAACTTCAGTAGAAAACCCTTTGAACTGGTCTGCCATTTAACAACGAAAGGCCCAATTGAAG
ATCAGTGGTTTAGGGTTTGTCTATACCCATTTCCACTTAATTGCAATTTGCAATCGAAGACGGGAGTGGATATCCGACTTCAATTTTCAATTGAGAAAGTTCATATGGGT
ACGGACCAAAATCCCAAATCCGCTCAGAGTGAAAGCGTTGCTCCTGCAACTGCTTTGGCATACCTTGATCCTAAATACTGGGACGAGCGCTTTTCTAAGGAAGAGCATTA
TGAGTGGTTTAAGGATTATTCTCATTTTCGTCATCTCATACTTCCTCTTCTCAAACCCGATTCATCAGTATTGGAATTGGGTAGTGGAAATTCGAAACTTTCTGAGGAAT
TGTACAACGATGGAATCACCGATATAACATGCATTGATTTGTCTGCTGTGGCCGTTGAGAAGATGCAAAGACGTTTACATTTGAAGGGTATGAAAGAAATAAAGGTATTA
GAAGCTGACATGCTAGACATGCCTTTTGGTGATGAGTGTTTTGATGTCGTTGTCGAGAAAGGAACCATGGATGTTCTGTTTGTGGACGGTGGCAACCCATGGAATCCTCA
ACCATCCACGCGAGCGAAGGTGATGGCAATGCTGGAAGGTGTCCATAGGGTTTTGAAGAAAGATGGCATTTTTGTCTCAATTACATTTGGCCAGCCGCATTTCAGGCGTC
CCTTATTTAATGCTCCAGAGTTTACATGGTCATTTGAGTGCAGTACTTTTGGCGATGGATTTCACTACTTCCTCTATACCTTGCGCAAGGGAAGGCGATCGTTTGGCGAT
AAAGGTGAAGGTGAGAGGTCTGATATGCCATCGATCTGTTTACTTCAAGACGAGCTAGAGGGTGAAGATTACATGTTCAGAACCAATGTTGATGAGCTGAATTGCTAG
mRNA sequenceShow/hide mRNA sequence
GTTTCAAGAGAAATGAGAACAGAAATTCCCATCTTCTAATTCTTCCTCTTCTTCCTCTTCTTTTTTTATGTTCATATGAAAAAGGCTTCTTACCGACAAGAAATGGAAAT
GGAAAAGGGAAATGAGAAGCAAATTCCTGATGGGGATGTTCAGATTTCCTTGAAGCAGGAGATTCTACAGCTTCAAGAACAATTACAAAGCCAGTTCGCCACTCGCCATG
CCTTGGAGAAGGCAATCAATTTTCAGCCTCTCTCGCCTCACTCCGCTACCGATAATTCAATCCCAAAGGCTGAGATGGAACTGATTAAACAAATAGCGATCTTGGAGTTA
GAAGTCGTTTATTTGGAAAAATATCTTCTATCGCTATACCGCCGAACTTTCGACCAACACGTATCCTCTTTTTCTACCATGGATGATAGGCTCGAATCATATACTGAACC
TCATTTTGCGATGGAAGGAGAACATTCTTTCATTAATTCTGACCATATCGTGTCGCGGGAAACTTTATCTGGTGATCAATCAAAAGGAAGAAATGAAGTTGAGGAACCAG
AGAAGCTGTCACACTTTCATCGCAGCTATTCGTCTCTGTCGCAGAGATCACCCGGTTCATCTAGAAACTACCCTCTGTCAAAGTATATGGCTAAAGCAGTAGATTCATAC
CATTCTCTTCCATTATCAATGCTGGAGCAATCTCAAATTGATGCTCCGAATTCTACAAGCCTCAGGGAGCATTTCAGTGCCTGTATATCCAATCGAACAGACGAGTCACC
TAACTGGCTTTCTGAGGAGATGATCAAGTCCATCTCTGCAATATACCTTGAACTTGCAGAACCTCCTTTGATGAATCGCAACAATCCTTCTCCAATCTCACCTTTGTCAT
CCATGTATGAACTTTCTTCACAAGATTTGGGTAGTATGAGGAACTATGAGAAGTCGTTCAACTCGCATTTTGAGAACCCTTTTCACACTGAAGAATTTAGTGCACCATAC
TACACAATGTTGAAGGTGCAATGGATTTCTAGAGAAAGAAAGGACTCAGATATCAACCACATGCTACAAGGTTTCAGGTCACTTATTTTTCGGCTCAAAGAAGTCGATCT
TAAAGTGATGAAACACGACGAAAAGCTCGCATTTTGGATTAATGTACACAATACACTTGTAATGCATGCTTATTTGCAATATGGGATCCCCAAAAATAGTTTGAAGAGAA
TATCTTTGATACTCAAGGCTGCATACAATATTGGGGGCCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTGGGGTGTCGTTTGCCTCGCCCGGGACAGTGGCTG
CACCTGTTCCTCTCGTCAAAAACGAAATTCAAGGTCAATGACGCACAGAAATCCTTTCCAATCAACCACCCCGAACCTCGGTTATACTTTGCTCTATGCTGTGGGAGTCA
TTCTGATCCAGCGGTCCGTATCTATACGGCCAAGAGGGTGAATGAGGAGCTGGAGGTTGCTAAAGAAGACTACATCATTTCAAATTTGAGGGCACACAAAGGGCAGAGAA
TTCTACTCCCAAAGATTGTGGAGTCGTTTGCCAAGGATTCTGGCTTATGCCTGGAAGATTTGGAGGACATTGTGGAGTGTCTAAGGCCCTACAGGCGGATAAACGACATT
CAGCAGCGGCAACGAAAGAAGTCTTGGAAGAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAATCCCTCAGGGGTTC
CATTTGGGGAAAGATGGAAGAACAAAGCACCGCTATAATTTTAGCTAGAGCCACGGAGCTGAGGCTAAAGATTAGAAACTCTGTTAACACCACCACCACCAGTTCCGCCG
TAAATTCCCGGGAGATTCGGGATGATCGGTTTTCTGTGGATGAAAATAATGGCGTTGTTGGTTCCCGCCGGAGTGAGGCGGATGCGAGTGGTGGGGAGGCGGAGGAAGAT
GAGGAAGCGGTGAGGCTTTTGAACATCTGCGATGCACTCGAGTCTCTTGAGAATCAGCTTTCTTCGTTGCAGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGTAGC
CCTTTCCGAGATCGAACATAGCCGCAAGATTTTACTGGATAAACTGAAGAAGTACAAAGGAGGGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGTTGGTGAGACAG
TGCAGCACAACCAGGATTTCATGCTTCCTCCGTATCCAAGCCATCTTGGTAATGGCTACTTACATCCCTTCCCTTCTGGACACAAATCTGTGAGTAATGGGCTAATCGAC
GCCACGTCAAATAAAGCTACAAACAAACTTAACAAATCAGAAAGGAAACTATCAAAATCGGATTCTCAGAACTCGAAGAATGGATTGGGATTTTTCATTAGTGTAGCTGC
AAAATCAGTGGTTACTATTGTTGGCATAGCATCCATACTCCACTTGACTGGTTTTAGACCAAAGTTTGTAAGGAAAGTTGCTGCTTTGAAGGTTTTCGACGGTTTTCGAC
AGTCTGCAGGTGGAAATAATGGATCACACAACGCATGTCCTCCGGGTAAATTCCTCATGATGGAAGACGGGGAAGCTCGATGTGTAGTGAAAGAGAGAATTGAAGTTCCA
TTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGAAGCGGTTCTTCACAGGCCAGAAATGGAACCTTCTTTGGGACCCATCATATCCACATGGTGTGGATTGTGGA
CGATGCCATTGCTGCCAACGCCAAGCTGAGGAACTATTCTCTGCGGTCACCGGCTTTAACCACAGTGCCTTCCTCTTTATCTAGAACTGGGTTCTCAGACAAGCCATTGT
TGAATGCTCAACCCCTGAAATTATCGAATAAGAAGACATACCCACTTGCAGGAAGGCTCAAATTTCTTCACATCAGAGCTCAAGCTTCTTCCAGCACAAGCAATTTTAGC
TCAGAGGCAGCTCCAGCAATTTTAAAGAAGGAAGATTCGAAGGACGAGGATCTTGTATTTGTTGCTGGTGCTACTGGAAAAGTTGGTTCAAGAACAGTGAGGGAGCTTTT
GAAACTTGGATTCCGTGTAAGAGCTGGTGTAAGGAGTTCTCAGAAAGCTGAAACTCTTGTTGAGAGTGTTAAAAAGATTAAGCTTGATGAAGCTGTAGAAAAGCTTGAAA
CTGTGGTATGTGACTTGGAGAAACCAAACCAGATTGGACCTGCATTAGGGAATGCATCAATAGTTATTTGCTGCATTGGTGCAAGTGAGAAGGAGATTTTTGATATTACT
GGACCTTATCGAATTGACTATCTGGCTACCAAAAACTTGGTTGAAGCAGCACTTTGTTCTGCTTACGTCCCTGGGAACCAACAAGATCGGATTTCCTGCTGCCATTCTCA
AATAGTGAGACCTGGAGGCATGGAGAGGCCTACAGATGCTTTCAAGGAAACTCATAATACTACTCTTTCACCAGGAGATACTTTATTTGGTGGTCTAGTGTCAAACCTTC
AGGTTGCAGAGCTAATGGCATGCATAGCAAAAAATCCTGGTCTTTCATACTATAAAGTGCTAGAAGTAATTGCTGAGACAACTGCACCATTGACTCCCCTGGAGGATCTT
CTTAAAAAAATACCATCTAAAGTTGCAAATGCTTTCCCAGAGAAGGAATATGGTGCTGTACGAACTGTTGATCCATCACCCAAACAGTCGAGCATTGCCAAAGGAAAGGA
ATCAGCTGAAGTAAATGTGACAGAACAGCCAGCCCCTCAAAGTGGTTCATCAGAACAGTTGAGCATCACAAAGGAGAAGGAATCAGCTGTAGCAAATGTTACAAAACAGT
CATCATCACCCTACATTGCGTATGGGGATTTAAAACCTCCAACGTCTCCCACTCCAGCTGCACCTGTTGGGAAAATAGATTTAAATGTTGTTGAAGGAATCTCATCTAGT
GCTCAAACTATTTCAGCAGAGGCACCAACTGAAATTGCCGAGGCAAATCCCGCACCTGCACCTGCACCTGCCCCTGAAAAAGCAGTGACATTGAAACCTCTCTCCCCTTA
TGTAGCATATGAAGATTTAAAGCCTCCTGCATCACCATCTCCTAGTGCACCTTCCCTATCATTTTCTTCAACATCTCCTTCAAATTGTCCACCCGAGCCTGCTACCTCCA
CCGTAAATAGCACATTGCCGATTCCAGAGGCAGAAGACTCAAAGAGTGAAGCACATCTTCCCAAGCCGAAGAACCAACAACCGTTATCACCTTTTACCATATATGAAGAT
TTAAAGCCTCCTACATCACCATCTCCTAGTGCACCTTCCCTATCATTTTCTTCAACATCTCCCTCAAACGGTCCACCTGAGCCCGCTACCTCCACTGTAAATAGCACATT
GCCGATTCCAGAGGCAGAAGACTCAAAGAGTGAAGCACATCTTCCCAAACCGAAGAAGCAACAACCGCTATCACCTTTTACCATCAAAATTGGTCATCTCTCCCAATGGA
CTGGCAGCCCATTAAACTTCAGTAGAAAACCCTTTGAACTGGTCTGCCATTTAACAACGAAAGGCCCAATTGAAGATCAGTGGTTTAGGGTTTGTCTATACCCATTTCCA
CTTAATTGCAATTTGCAATCGAAGACGGGAGTGGATATCCGACTTCAATTTTCAATTGAGAAAGTTCATATGGGTACGGACCAAAATCCCAAATCCGCTCAGAGTGAAAG
CGTTGCTCCTGCAACTGCTTTGGCATACCTTGATCCTAAATACTGGGACGAGCGCTTTTCTAAGGAAGAGCATTATGAGTGGTTTAAGGATTATTCTCATTTTCGTCATC
TCATACTTCCTCTTCTCAAACCCGATTCATCAGTATTGGAATTGGGTAGTGGAAATTCGAAACTTTCTGAGGAATTGTACAACGATGGAATCACCGATATAACATGCATT
GATTTGTCTGCTGTGGCCGTTGAGAAGATGCAAAGACGTTTACATTTGAAGGGTATGAAAGAAATAAAGGTATTAGAAGCTGACATGCTAGACATGCCTTTTGGTGATGA
GTGTTTTGATGTCGTTGTCGAGAAAGGAACCATGGATGTTCTGTTTGTGGACGGTGGCAACCCATGGAATCCTCAACCATCCACGCGAGCGAAGGTGATGGCAATGCTGG
AAGGTGTCCATAGGGTTTTGAAGAAAGATGGCATTTTTGTCTCAATTACATTTGGCCAGCCGCATTTCAGGCGTCCCTTATTTAATGCTCCAGAGTTTACATGGTCATTT
GAGTGCAGTACTTTTGGCGATGGATTTCACTACTTCCTCTATACCTTGCGCAAGGGAAGGCGATCGTTTGGCGATAAAGGTGAAGGTGAGAGGTCTGATATGCCATCGAT
CTGTTTACTTCAAGACGAGCTAGAGGGTGAAGATTACATGTTCAGAACCAATGTTGATGAGCTGAATTGCTAGAAAGTAGTATTATTGTATCCATT
Protein sequenceShow/hide protein sequence
MKKASYRQEMEMEKGNEKQIPDGDVQISLKQEILQLQEQLQSQFATRHALEKAINFQPLSPHSATDNSIPKAEMELIKQIAILELEVVYLEKYLLSLYRRTFDQHVSSFS
TMDDRLESYTEPHFAMEGEHSFINSDHIVSRETLSGDQSKGRNEVEEPEKLSHFHRSYSSLSQRSPGSSRNYPLSKYMAKAVDSYHSLPLSMLEQSQIDAPNSTSLREHF
SACISNRTDESPNWLSEEMIKSISAIYLELAEPPLMNRNNPSPISPLSSMYELSSQDLGSMRNYEKSFNSHFENPFHTEEFSAPYYTMLKVQWISRERKDSDINHMLQGF
RSLIFRLKEVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKRISLILKAAYNIGGHIISVDMIQSSILGCRLPRPGQWLHLFLSSKTKFKVNDAQKSFPINHPE
PRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEDYIISNLRAHKGQRILLPKIVESFAKDSGLCLEDLEDIVECLRPYRRINDIQQRQRKKSWKSIGWIPHNFTFSFLL
SKELACQSLRGSIWGKMEEQSTAIILARATELRLKIRNSVNTTTTSSAVNSREIRDDRFSVDENNGVVGSRRSEADASGGEAEEDEEAVRLLNICDALESLENQLSSLQD
LQQRQRYEKEVALSEIEHSRKILLDKLKKYKGGDLEVIHEASAFVGETVQHNQDFMLPPYPSHLGNGYLHPFPSGHKSVSNGLIDATSNKATNKLNKSERKLSKSDSQNS
KNGLGFFISVAAKSVVTIVGIASILHLTGFRPKFVRKVAALKVFDGFRQSAGGNNGSHNACPPGKFLMMEDGEARCVVKERIEVPFSSVVAKPDVNYGSGSSQARNGTFF
GTHHIHMVWIVDDAIAANAKLRNYSLRSPALTTVPSSLSRTGFSDKPLLNAQPLKLSNKKTYPLAGRLKFLHIRAQASSSTSNFSSEAAPAILKKEDSKDEDLVFVAGAT
GKVGSRTVRELLKLGFRVRAGVRSSQKAETLVESVKKIKLDEAVEKLETVVCDLEKPNQIGPALGNASIVICCIGASEKEIFDITGPYRIDYLATKNLVEAALCSAYVPG
NQQDRISCCHSQIVRPGGMERPTDAFKETHNTTLSPGDTLFGGLVSNLQVAELMACIAKNPGLSYYKVLEVIAETTAPLTPLEDLLKKIPSKVANAFPEKEYGAVRTVDP
SPKQSSIAKGKESAEVNVTEQPAPQSGSSEQLSITKEKESAVANVTKQSSSPYIAYGDLKPPTSPTPAAPVGKIDLNVVEGISSSAQTISAEAPTEIAEANPAPAPAPAP
EKAVTLKPLSPYVAYEDLKPPASPSPSAPSLSFSSTSPSNCPPEPATSTVNSTLPIPEAEDSKSEAHLPKPKNQQPLSPFTIYEDLKPPTSPSPSAPSLSFSSTSPSNGP
PEPATSTVNSTLPIPEAEDSKSEAHLPKPKKQQPLSPFTIKIGHLSQWTGSPLNFSRKPFELVCHLTTKGPIEDQWFRVCLYPFPLNCNLQSKTGVDIRLQFSIEKVHMG
TDQNPKSAQSESVAPATALAYLDPKYWDERFSKEEHYEWFKDYSHFRHLILPLLKPDSSVLELGSGNSKLSEELYNDGITDITCIDLSAVAVEKMQRRLHLKGMKEIKVL
EADMLDMPFGDECFDVVVEKGTMDVLFVDGGNPWNPQPSTRAKVMAMLEGVHRVLKKDGIFVSITFGQPHFRRPLFNAPEFTWSFECSTFGDGFHYFLYTLRKGRRSFGD
KGEGERSDMPSICLLQDELEGEDYMFRTNVDELNC