; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G06870 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G06870
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function, DUF547
Genome locationChr5:6124113..6128676
RNA-Seq ExpressionCSPI05G06870
SyntenyCSPI05G06870
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039722.1 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein [Cucumis melo var. makuwa]2.9e-21192.54Show/hide
Query:  SSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH-
        SSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH 
Subjt:  SSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH-

Query:  ---------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
                             F QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
Subjt:  ---------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP

Query:  LQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQ
        LQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFP+VKQ
Subjt:  LQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQ

Query:  PQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG
        PQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG
Subjt:  PQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG

Query:  HIISANAIEQSIFFFKSPRIG-WVWTILS
        HIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  HIISANAIEQSIFFFKSPRIG-WVWTILS

XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]1.0e-23597.75Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKAQKKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]3.9e-24099.1Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

XP_016903702.1 PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo]6.7e-23293.13Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH----------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH                      F QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH----------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]6.5e-22793.92Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSG DMHMRGEESAS KRELRD+LASQRVHS HRRSRSSSD+NSN FRG  LHS+SKNDRSDAQASPLSTSGIRARSPLHE S + NDNSS+KQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSD+STGKTSGTAKVREAFSQ+KR SLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHR AS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKAQKKAGSFPKVKQPQCGPVEEQFGG KAMLEIHCIST+NSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X24.8e-23697.75Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKAQKKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

A0A1S4E657 uncharacterized protein LOC103482606 isoform X13.3e-23293.13Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH----------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH                      F QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH----------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

A0A5A7T8B7 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein1.4e-21192.54Show/hide
Query:  SSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH-
        SSSDKNSNGFRG SLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH 
Subjt:  SSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH-

Query:  ---------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
                             F QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
Subjt:  ---------------------FAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP

Query:  LQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQ
        LQPFSVNDLGKRTSNAGPNSLFG KSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFP+VKQ
Subjt:  LQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQ

Query:  PQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG
        PQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG
Subjt:  PQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG

Query:  HIISANAIEQSIFFFKSPRIG-WVWTILS
        HIISANAIEQSIFFFKSPRIG W+ TI+S
Subjt:  HIISANAIEQSIFFFKSPRIG-WVWTILS

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X11.7e-20988.31Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSG D  MRGEE+ +GKRELRDYLASQRVH+RHRRSRSSSD+NSN FRG  LHSN KND+SDAQASPLSTSGIRA+SPLHE ST FNDNSS+K RASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKT-SGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFG KS+I+TGKT SGT+KVRE  SQ+KRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKT-SGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA

Query:  SNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS
        SNKAQKK GS P VKQPQCGP+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN YRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIP  S
Subjt:  SNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS

Query:  LRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        LRRLALFHKAAYNIGGHIISANAIEQSIF FK+PRIG W+ TI+S
Subjt:  LRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

A0A6J1I221 uncharacterized protein LOC1114691311.7e-20987.84Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        MSG DMH+RGEE ASG RELRDYLAS  VH+RHRRSRSSSD+NSN  RG  LHSNSKN RSD QASPLSTSGIRARSPLHE +T+FNDNSS+K RASLEN
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVN+LGKR SNAGPNSL G K DIST K SG AKVRE  SQ+K+TSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N A+KKA SFPKVK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        RRLALFHKAAYNIGG IISANAIEQSIF FKSPRIG W+ TI+S
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G47380.1 Protein of unknown function, DUF5472.0e-9348.25Show/hide
Query:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN
        M G D++  G +          +      H R + + S  D +++G  GA   S S N+ +  QAS + T+  +   PLH       +N S+  RASLE 
Subjt:  MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-
        D+E L LRLQQE+SMR +LERAMGRASS+LSPGHRHFA Q  +LI+EIELLE EV NRE HVLSLYRSIFE  VS+  S+Q+S  +SPAH  K   RK  
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHFA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-

Query:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGSKSDI-STGKTSGTAK---VREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIY
        P++IS+AFCSS  FPL P     ++ D  ++TS    +S F  ++ I ST   S  AK   ++++ + +K  S R+LKDHL++CP+KLSE+MV+CM+ +Y
Subjt:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGSKSDI-STGKTSGTAK---VREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIY

Query:  ----CSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMH
            CS       K      S   V  P+    E++    ++M+E+  IS++  +FS+ +YAINNYR+LVEQLE+V +++M  +A+ AFWIN+YNALLMH
Subjt:  ----CSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMH

Query:  AYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS
        AYLAYG+P  SLRRLALFHK+AYNIGGHII+AN IE SIF F++PR G W+ TI+S
Subjt:  AYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG-WVWTILS

AT5G66600.1 Protein of unknown function, DUF5473.4e-3230Show/hide
Query:  RVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RS+S+S        G    SNS ++ S      +     R+    H Q   ++ N+ T    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF
        ASS +       A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF

Query:  CSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-
          S+K  +  +    ++   +R+ +    S FGS+    + S GK S +   +  + Q     +         + DH+ E P+KLSE MV+CM+ IYC  
Subjt:  CSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-

Query:  ------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNV
              LHR  S  N +   +   P  +     P                    E+ F G   +++E+ CI  +  + S     + N++ L+ +LE+V+ 
Subjt:  ------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNV

Query:  SKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG
         K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L  KAAYNIGGH ISA AI+ SI   K    G
Subjt:  SKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG

AT5G66600.2 Protein of unknown function, DUF5478.4e-3130.02Show/hide
Query:  RSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSP
        + R   DK SN     S H  S+  + D           R+    H Q   ++ N+ T    SL+ +I  L+ RLQ +  +R  LE+A+G   ASS +  
Subjt:  RSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSP

Query:  GHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAFCSSRKFP
             A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++      S+K  
Subjt:  GHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAFCSSRKFP

Query:  LGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-------L
        +  +    ++   +R+ +    S FGS+    + S GK S +   +  + Q     +         + DH+ E P+KLSE MV+CM+ IYC        L
Subjt:  LGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-------L

Query:  HRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDA
        HR  S  N +   +   P  +     P                    E+ F G   +++E+ CI  +  + S     + N++ L+ +LE+V+  K+  + 
Subjt:  HRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDA

Query:  QTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG
        + AFWINV+NAL+MHA+LAYGIP  +++R+ L  KAAYNIGGH ISA AI+ SI   K    G
Subjt:  QTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG

AT5G66600.3 Protein of unknown function, DUF5473.4e-3230Show/hide
Query:  RVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RS+S+S        G    SNS ++ S      +     R+    H Q   ++ N+ T    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF
        ASS +       A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF

Query:  CSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-
          S+K  +  +    ++   +R+ +    S FGS+    + S GK S +   +  + Q     +         + DH+ E P+KLSE MV+CM+ IYC  
Subjt:  CSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMVRCMAFIYCS-

Query:  ------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNV
              LHR  S  N +   +   P  +     P                    E+ F G   +++E+ CI  +  + S     + N++ L+ +LE+V+ 
Subjt:  ------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNV

Query:  SKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG
         K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L  KAAYNIGGH ISA AI+ SI   K    G
Subjt:  SKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG

AT5G66600.4 Protein of unknown function, DUF5478.4e-3129.58Show/hide
Query:  RVHSRHRRSRSSS---------DKNSNGFRGASLHSNSKNDRSDAQASP-LSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRS
        R+   H+RS+ SS           N   F         K   S  +AS  +     R+    H Q   ++ N+ T    SL+ +I  L+ RLQ +  +R 
Subjt:  RVHSRHRRSRSSS---------DKNSNGFRGASLHSNSKNDRSDAQASP-LSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRS

Query:  MLERAMG--RASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESR
         LE+A+G   ASS +       A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +  
Subjt:  MLERAMG--RASSTLSPGHRHFAQTK---DLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESR

Query:  KHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMV
          P ++      S+K  +  +    ++   +R+ +    S FGS+    + S GK S +   +  + Q     +         + DH+ E P+KLSE MV
Subjt:  KHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSKS---DISTGKTSGTAKVREAFSQMKRTSL-------RSLKDHLFECPSKLSEEMV

Query:  RCMAFIYCS-------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRV
        +CM+ IYC        LHR  S  N +   +   P  +     P                    E+ F G   +++E+ CI  +  + S     + N++ 
Subjt:  RCMAFIYCS-------LHRVAS--NKAQKKAGSFPKVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRV

Query:  LVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG
        L+ +LE+V+  K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L  KAAYNIGGH ISA AI+ SI   K    G
Subjt:  LVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTGGATATGCATATGCGTGGTGAAGAATCAGCTTCTGGGAAGAGAGAGCTTAGAGATTATTTAGCTTCCCAACGGGTTCATTCCCGCCACAGGCGATCTAG
AAGCTCTTCAGACAAGAACTCCAATGGCTTTAGAGGTGCAAGTCTTCATTCTAATAGCAAGAATGACCGGAGTGACGCACAAGCATCTCCACTTTCTACGAGTGGTATCA
GAGCACGAAGTCCTCTACATGAGCAGTCTACAGATTTTAATGATAATTCATCAACTAAACAGCGAGCATCTTTAGAAAATGATATCGAGTTGCTACAGCTGCGCTTGCAA
CAAGAGAGATCTATGCGAAGTATGCTTGAAAGAGCAATGGGTCGTGCATCTAGTACTTTATCTCCCGGCCATAGGCACTTCGCCCAGACGAAGGATTTGATCTCAGAGAT
TGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTTCTCTCTCTCTACAGGAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTTA
CAGCCTCTCCTGCTCATGGAAAGCACGAATCAAGAAAACACCCCAGTATCATTTCGAGCGCATTTTGTTCATCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCT
GTAAATGACTTGGGTAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTCGGAAGTAAAAGCGACATAAGTACTGGAAAAACTTCAGGCACTGCAAAGGTTCGTGAAGC
TTTTTCCCAGATGAAGAGAACTTCTCTGCGATCTCTAAAAGATCATCTTTTTGAATGTCCAAGTAAATTATCAGAGGAGATGGTGAGATGCATGGCTTTCATATACTGCT
CTCTTCATAGAGTGGCATCAAACAAGGCTCAAAAAAAGGCAGGTTCTTTCCCTAAAGTTAAACAGCCTCAATGTGGACCGGTGGAGGAACAATTTGGGGGTGGGAAAGCA
ATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCGTACGCAATCAACAATTACAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTTAG
TAAGATGGGGATTGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTAATGCATGCCTACTTAGCATATGGAATTCCTCATGGTTCTCTAAGAAGGTTGG
CTTTGTTCCACAAGGCTGCTTATAACATCGGTGGCCATATCATCAGTGCAAATGCGATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGGTATGGACT
ATTCTCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATTGGATATGCATATGCGTGGTGAAGAATCAGCTTCTGGGAAGAGAGAGCTTAGAGATTATTTAGCTTCCCAACGGGTTCATTCCCGCCACAGGCGATCTAG
AAGCTCTTCAGACAAGAACTCCAATGGCTTTAGAGGTGCAAGTCTTCATTCTAATAGCAAGAATGACCGGAGTGACGCACAAGCATCTCCACTTTCTACGAGTGGTATCA
GAGCACGAAGTCCTCTACATGAGCAGTCTACAGATTTTAATGATAATTCATCAACTAAACAGCGAGCATCTTTAGAAAATGATATCGAGTTGCTACAGCTGCGCTTGCAA
CAAGAGAGATCTATGCGAAGTATGCTTGAAAGAGCAATGGGTCGTGCATCTAGTACTTTATCTCCCGGCCATAGGCACTTCGCCCAGACGAAGGATTTGATCTCAGAGAT
TGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTTCTCTCTCTCTACAGGAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTTA
CAGCCTCTCCTGCTCATGGAAAGCACGAATCAAGAAAACACCCCAGTATCATTTCGAGCGCATTTTGTTCATCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCT
GTAAATGACTTGGGTAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTCGGAAGTAAAAGCGACATAAGTACTGGAAAAACTTCAGGCACTGCAAAGGTTCGTGAAGC
TTTTTCCCAGATGAAGAGAACTTCTCTGCGATCTCTAAAAGATCATCTTTTTGAATGTCCAAGTAAATTATCAGAGGAGATGGTGAGATGCATGGCTTTCATATACTGCT
CTCTTCATAGAGTGGCATCAAACAAGGCTCAAAAAAAGGCAGGTTCTTTCCCTAAAGTTAAACAGCCTCAATGTGGACCGGTGGAGGAACAATTTGGGGGTGGGAAAGCA
ATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCGTACGCAATCAACAATTACAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTTAG
TAAGATGGGGATTGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTAATGCATGCCTACTTAGCATATGGAATTCCTCATGGTTCTCTAAGAAGGTTGG
CTTTGTTCCACAAGGCTGCTTATAACATCGGTGGCCATATCATCAGTGCAAATGCGATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGGTATGGACT
ATTCTCTCTTAAGCTGTTGGTGCTGGTGCGGTGACTTGTCCTGGAGTTCCTATCAAATACCCTGTTAAAAGAGATGTTAAAGTGAAGTTTTGGTTTTGACTATGTTTTTG
TATTAACTACAGTGGCTTGAAACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTTATCTCTTCAAAATTGGGCCTTCCTAGTCCTCAACCTCT
TGTTTGCTTTGGCCTTTGCACTGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACGGCATCAAATGTTAAAGAGGAACTCGAAGTGGCCAAAAGGGATTTTCTCCAAG
CAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTAGAAAGATTTGCACGCGAAGCATCCATCAGCTCAGACGAACTCCCAAAATGGGTTTCTGAA
AATGTCGATGGGAAGCTCCAGGAGTCGATACAGAAATGTATGGAACACCGAACCGGCAAAAAGACATCTCAGATCATCGAATGGTTACCTTACAGCTCAAGGTTTCGGTA
TGTATTTTCTACCAATCTAACTGAAAGGCCATGGTGGTCGTAAGATTCATAATGTTATTAACTCATCGATGTGAAGTTTCGAAAATAGTAGGACCAACATTTTTCTGCTA
ATCTGAGGGGAAATGGATTTAGGTAGTTGAAACTTGAAATAACTCATATTTGATTCTGTATATATATTGGGTCTGATTGTTGCAACTTACCTTATGGCTGCATTTTGAAC
TCTACCTCTATTTATGCAAAGCCGATGTGTTGTTTCGAAAAGA
Protein sequenceShow/hide protein sequence
MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDRSDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQ
QERSMRSMLERAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFS
VNDLGKRTSNAGPNSLFGSKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPKVKQPQCGPVEEQFGGGKA
MLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWVWT
ILS