; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G004270 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G004270
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function, DUF547
Genome locationchr09:4570714..4575873
RNA-Seq ExpressionLsi09G004270
SyntenyLsi09G004270
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039722.1 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein [Cucumis melo var. makuwa]1.5e-28586.68Show/hide
Query:  SSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL
        SSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL
Subjt:  SSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL

Query:  AQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
        AQ  F      SC    +      TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
Subjt:  AQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP

Query:  LQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS
        LQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +FSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCS
Subjt:  LQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS

Query:  LHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG
        LHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG
Subjt:  LHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG

Query:  IPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL
        IPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL
Subjt:  IPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL

Query:  KVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTN
        KVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVSENVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRFRYVFSTN
Subjt:  KVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTN

Query:  LTEKPWWS
        LTE+PWWS
Subjt:  LTEKPWWS

XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]5.0e-30587.87Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ            TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +FSQ KRTSLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        +LKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCME
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        HRTGKK SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]3.5e-30688.03Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRG  LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQ            TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFG KSDIS                  G+T         +  +FSQ KRTSLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        +LKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVSENVDGKL ESIQKCME
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        HRTGKK SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

XP_016903702.1 PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo]5.0e-30586.98Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ  F      SC    +      TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISS

Query:  FSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL
        FSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL
Subjt:  FSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL

Query:  VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR
        VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR
Subjt:  VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR

Query:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGK
        KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGK
Subjt:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGK

Query:  LHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        L ESIQKCMEHRTGKK SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  LHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]5.5e-30487.56Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMHMRGEE AS KRELRD+LASQRVHS HRRSRSSSDRNSNVFRGGVLHS+SKNDRSD QASPLSTSGIRA+SPLHE S + NDNS +KQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ            TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSD+S                  G+T         +  +FSQ KR SLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        TLKDHLFECPSKLSEEMVRCMAFIYCSLHR ASNKA+KKAG FP+VKQPQCGPVEEQFGG KAMLEIHCIST+NSQFSRASYAINNYRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLER AREASI SDEL KWVSENVDGKLHESIQKCME
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X22.4e-30587.87Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ            TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +FSQ KRTSLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        +LKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCME
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        HRTGKK SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

A0A1S4E657 uncharacterized protein LOC103482606 isoform X12.4e-30586.98Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ  F      SC    +      TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISS

Query:  FSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL
        FSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL
Subjt:  FSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVL

Query:  VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR
        VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR
Subjt:  VEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALR

Query:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGK
        KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGK
Subjt:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGK

Query:  LHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
        L ESIQKCMEHRTGKK SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  LHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

A0A5A7T8B7 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein7.3e-28686.68Show/hide
Query:  SSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL
        SSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL
Subjt:  SSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHL

Query:  AQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
        AQ  F      SC    +      TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP
Subjt:  AQVTF----TFSCMHRSI------TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGP

Query:  LQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS
        LQPFSVNDLGKRTSNAGP+SLFGGKSDIS                  G+T         +  +FSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCS
Subjt:  LQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS

Query:  LHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG
        LHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG
Subjt:  LHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYG

Query:  IPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL
        IPHGSLRRLALFHK         AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL
Subjt:  IPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVL

Query:  KVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTN
        KVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVSENVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRFRYVFSTN
Subjt:  KVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTN

Query:  LTEKPWWS
        LTE+PWWS
Subjt:  LTEKPWWS

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X12.1e-28583.44Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFD  MRGEE  +GKRELRDYLASQRVH+RHRRSRSSSDRNSNVFRGGVLHSN KND+SD QASPLSTSGIRAQSPLHE S  FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ            TKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHESRKHPS+ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKS+I+                    T K       +  + SQ KRTSLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        TLKDHLFECPSKLSEEMVRCMA IYCSLHRVASNKA+KK G  P+VKQPQCGP+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN YRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KM IDAQTAFWINVYNALLMHAYLAYGIP  SLRRLALFHK         AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLER AREASIS DEL K VS+NVD +LH+SIQKCM+
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWW
        HRTGKKASQIIEWLPYSSRFRYVFSTNLTEK WW
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWW

A0A6J1I221 uncharacterized protein LOC1114691313.6e-28583.31Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH+RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLSTSGIRA+SPLHE + +FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQ            TKDLISEIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR
        HGKHES+KHPSIISSAFCSSRKFPLGPLQPFSVN+LGKR SNAGP+SL GGK DIS         KF              P+   +    SQ K+TSLR
Subjt:  HGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLR

Query:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS
        TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASN AKKKA  FP+VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLEKVNVS
Subjt:  TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVS

Query:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
        KM IDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK         AAYNIGG IISANAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLI
Subjt:  KMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI

Query:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME
        SSKLGLPS QPLVCFGLCTGASSDPVLKVYTASNVKEELE+AKR+FLQANIVVKKSKKVFLPKVLER AREASISSDELPKW+SENVDGKLHESIQKC++
Subjt:  SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME

Query:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS
         +TGKKAS IIEWLPYSSRFRYVFSTNLTEKPW S
Subjt:  HRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G47380.1 Protein of unknown function, DUF5471.9e-14548.67Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        M GFD++  G +    +R   D       H   R   +SS+R+ +    G   S S N+ +  QAS + T+  +   PLH       +N  +  RASLE 
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA
        D+E L LRLQQE+SMR +LERAMGRASS+LSPGHRH A               +LI+EIELLE EV NRE HVLSLYRSIFE  VS+  S+Q+S  +SPA
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPA

Query:  HG-KHESRKH-PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPSSLFGGKSDI--SCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAK
        H  K   RK  P++IS+AFCSS  FPL P     ++ D  ++TS    SS F  ++ I  +  C +  KS F  + +                      K
Subjt:  HG-KHESRKH-PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPSSLFGGKSDI--SCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAK

Query:  RTSLRTLKDHLFECPSKLSEEMVRCMAFIY----CSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLV
          S RTLKDHL++CP+KLSE+MV+CM+ +Y    CS       K          V  P+    E++    ++M+E+  IS++  +FS+ +YAINNYR+LV
Subjt:  RTSLRTLKDHLFECPSKLSEEMVRCMAFIY----CSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLV

Query:  EQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRK
        EQLE+V +++M  +A+ AFWIN+YNALLMHAYLAYG+P  SLRRLALFHK         +AYNIGGHII+AN IE SIF F++PR G WLETIISTALRK
Subjt:  EQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRK

Query:  KSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKL
        K  E++  + S   L  P+PLVCF LC GA SDPVLK YTASNVKEEL+ +KR+FL AN+VVK  KKV LPK++ER  +EAS+S D+L +W+ +N D KL
Subjt:  KSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKL

Query:  HESIQKCMEHR-TGKKASQIIEWLPYSSRFRYVFSTNLTEK
         ESIQKC++ +   KKASQ++EWLPYSS+FRYVFS +L EK
Subjt:  HESIQKCMEHR-TGKKASQIIEWLPYSSRFRYVFSTNLTEK

AT5G66600.1 Protein of unknown function, DUF5472.9e-5327.91Show/hide
Query:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMG-RA
        R+   H+RS+S+S        G    SNS ++ S      +  S     +  H  +             SL+ +I  L+ RLQ +  +R  LE+A+G R 
Subjt:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMG-RA

Query:  SSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKH
        +S+          +T T          DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    
Subjt:  SSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKH

Query:  PSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFEC
        P ++      S+K  +  +    ++   +R+ +    S FG +   +    +W K+          R+    P Y     +           + DH+ E 
Subjt:  PSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFEC

Query:  PSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRA
        P+KLSE MV+CM+ IYC        LHR  S  N +   + F P  +     P                    E+ F G   +++E+ CI  +  + S  
Subjt:  PSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRA

Query:  SYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWW
           + N++ L+ +LE+V+  K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L          ++AAYNIGGH ISA AI+ SI   K    G W
Subjt:  SYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWW

Query:  LETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELP
        L  + ++  + K+G+ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E  A+++ +     P
Subjt:  LETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELP

Query:  KWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV
          ++E V+  + ES +KC++    +  K  + I+W+P+S  FRY+
Subjt:  KWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV

AT5G66600.2 Protein of unknown function, DUF5471.7e-5328.99Show/hide
Query:  SLENDIELLQLRLQQERSMRSMLERAMG-RASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNS
        SL+ +I  L+ RLQ +  +R  LE+A+G R +S+          +T T          DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N 
Subjt:  SLENDIELLQLRLQQERSMRSMLERAMG-RASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNS

Query:  VTASP----------------AHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTV
           SP                   K +    P ++      S+K  +  +    ++   +R+ +    S FG +   +    +W K+          R+ 
Subjt:  VTASP----------------AHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTV

Query:  KLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP----------------
           P Y     +           + DH+ E P+KLSE MV+CM+ IYC        LHR  S  N +   + F P  +     P                
Subjt:  KLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP----------------

Query:  ---VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCM
            E+ F G   +++E+ CI  +  + S     + N++ L+ +LE+V+  K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L          +
Subjt:  ---VEEQFGGG-KAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCM

Query:  QAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQA
        +AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ 
Subjt:  QAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQA

Query:  NIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV
        N+ ++K +++ LPK++E  A+++ +     P  ++E V+  + ES +KC++    +  K  + I+W+P+S  FRY+
Subjt:  NIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV

AT5G66600.3 Protein of unknown function, DUF5472.9e-5327.91Show/hide
Query:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMG-RA
        R+   H+RS+S+S        G    SNS ++ S      +  S     +  H  +             SL+ +I  L+ RLQ +  +R  LE+A+G R 
Subjt:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQQERSMRSMLERAMG-RA

Query:  SSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKH
        +S+          +T T          DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    
Subjt:  SSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP----------------AHGKHESRKH

Query:  PSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFEC
        P ++      S+K  +  +    ++   +R+ +    S FG +   +    +W K+          R+    P Y     +           + DH+ E 
Subjt:  PSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFEC

Query:  PSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRA
        P+KLSE MV+CM+ IYC        LHR  S  N +   + F P  +     P                    E+ F G   +++E+ CI  +  + S  
Subjt:  PSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHCISTNNSQFSRA

Query:  SYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWW
           + N++ L+ +LE+V+  K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L          ++AAYNIGGH ISA AI+ SI   K    G W
Subjt:  SYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWW

Query:  LETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELP
        L  + ++  + K+G+ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E  A+++ +     P
Subjt:  LETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELP

Query:  KWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV
          ++E V+  + ES +KC++    +  K  + I+W+P+S  FRY+
Subjt:  KWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV

AT5G66600.4 Protein of unknown function, DUF5471.9e-5228.35Show/hide
Query:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERS----ADFNDNSPTKQ-------RASLENDIELLQLRLQQERSMR
        R+   H+RS+ SS     VF G        ND S P+       G +  +  HE S     D   ++ +K          SL+ +I  L+ RLQ +  +R
Subjt:  RVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERS----ADFNDNSPTKQ-------RASLENDIELLQLRLQQERSMR

Query:  SMLERAMG-RASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP---------------
          LE+A+G R +S+          +T T          DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP               
Subjt:  SMLERAMG-RASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPS-SQQNSVTASP---------------

Query:  -AHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTS
            K +    P ++      S+K  +  +    ++   +R+ +    S FG +   +    +W K+          R+    P Y     +        
Subjt:  -AHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTS

Query:  LRTLKDHLFECPSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHC
           + DH+ E P+KLSE MV+CM+ IYC        LHR  S  N +   + F P  +     P                    E+ F G   +++E+ C
Subjt:  LRTLKDHLFECPSKLSEEMVRCMAFIYCS-------LHRVAS--NKAKKKAGFFPEVKQPQCGP-------------------VEEQFGGG-KAMLEIHC

Query:  ISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSI
        I  +  + S     + N++ L+ +LE+V+  K+  + + AFWINV+NAL+MHA+LAYGIP  +++R+ L          ++AAYNIGGH ISA AI+ SI
Subjt:  ISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKVLHFSVCMQAAYNIGGHIISANAIEQSI

Query:  FFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLA
           K    G WL  + ++  + K+G+ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E  A
Subjt:  FFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLA

Query:  REASISSDELPKWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV
        +++ +     P  ++E V+  + ES +KC++    +  K  + I+W+P+S  FRY+
Subjt:  REASISSDELPKWVSENVDGKLHESIQKCME--HRTGKKASQIIEWLPYSSRFRYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAATAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTCCCGCCATAGGCGATCCAG
AAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACCCACAAGCATCCCCACTTTCTACGAGCGGTATCA
GAGCACAAAGTCCTCTACATGAAAGGTCTGCAGATTTCAATGATAATTCCCCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTACAGCTGCGCTTGCAA
CAAGAGAGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGGTAACTTTCACTTTTTCATGTAT
GCATAGAAGTATCACGAAGGATTTGATCTCAGAAATTGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAAAATT
GTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCTGCTCATGGAAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCG
AGGAAGTTTCCCTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAGTTCCTTGTTTGGAGGTAAAAGCGACATAAGTTG
TGTTTGTCCTGCTTGGCTAAAAAGTAAGTTTTGCATTGAGCTAATAATATTCGGTAGAACTGTTAAGTTGGATCCTTCCTATTACAGCATCATTAGTTCCTTTTCGCAGG
CGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGA
GTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTTCTTCCCTGAAGTTAAACAGCCCCAGTGTGGACCAGTGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAAT
ACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAGAAAGTGAATGTCAGTAAGATGGGGA
TCGATGCCCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTAATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTGAGAAGGTTGGCTTTGTTCCAT
AAGGTATTGCATTTCTCTGTCTGCATGCAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCGATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAAT
AGGATGGTGGCTTGAAACCATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTTATCTCTTCAAAATTGGGCCTTCCTAGTCCTCAACCTCTTGTTT
GCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATGTTAAAGAGGAACTGGAAGTGGCCAAAAGGGATTTTCTCCAAGCGAAT
ATAGTCGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTCGAAAGACTTGCACGCGAAGCATCCATCAGCTCAGATGAACTCCCGAAATGGGTTTCTGAAAATGT
CGATGGGAAACTCCACGAGTCGATACAGAAATGTATGGAACATCGGACTGGCAAGAAGGCATCTCAAATCATTGAGTGGTTACCTTACAGCTCAAGGTTCCGGTATGTAT
TTTCTACCAATCTAACTGAAAAGCCATGGTGGTCGTAG
mRNA sequenceShow/hide mRNA sequence
AAGAGTTTACTCATGGCGCAGTTTATTATCTTTCTTTATGATTTCTGCTATGTGGGGAAGTAGAATTATCGAGTGTTTAATAGAGGAAGAATGTGTATTCTGAATAACAA
TTATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAATAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTCCCGCCATAGGCGATCC
AGAAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACCCACAAGCATCCCCACTTTCTACGAGCGGTAT
CAGAGCACAAAGTCCTCTACATGAAAGGTCTGCAGATTTCAATGATAATTCCCCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTACAGCTGCGCTTGC
AACAAGAGAGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGGTAACTTTCACTTTTTCATGT
ATGCATAGAAGTATCACGAAGGATTTGATCTCAGAAATTGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAAAA
TTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCTGCTCATGGAAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGT
CGAGGAAGTTTCCCTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAGTTCCTTGTTTGGAGGTAAAAGCGACATAAGT
TGTGTTTGTCCTGCTTGGCTAAAAAGTAAGTTTTGCATTGAGCTAATAATATTCGGTAGAACTGTTAAGTTGGATCCTTCCTATTACAGCATCATTAGTTCCTTTTCGCA
GGCGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATA
GAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTTCTTCCCTGAAGTTAAACAGCCCCAGTGTGGACCAGTGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAA
ATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAGAAAGTGAATGTCAGTAAGATGGG
GATCGATGCCCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTAATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTGAGAAGGTTGGCTTTGTTCC
ATAAGGTATTGCATTTCTCTGTCTGCATGCAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCGATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGA
ATAGGATGGTGGCTTGAAACCATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTTATCTCTTCAAAATTGGGCCTTCCTAGTCCTCAACCTCTTGT
TTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATGTTAAAGAGGAACTGGAAGTGGCCAAAAGGGATTTTCTCCAAGCGA
ATATAGTCGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTCGAAAGACTTGCACGCGAAGCATCCATCAGCTCAGATGAACTCCCGAAATGGGTTTCTGAAAAT
GTCGATGGGAAACTCCACGAGTCGATACAGAAATGTATGGAACATCGGACTGGCAAGAAGGCATCTCAAATCATTGAGTGGTTACCTTACAGCTCAAGGTTCCGGTATGT
ATTTTCTACCAATCTAACTGAAAAGCCATGGTGGTCGTAGGATTCATAATGTCGTTGACTTATCGTTGAGAGGCTTCAAAAAAAGTAGGACCAAAAATTTTCTGCTAATC
TGAGGGGAAGCGGATTTAGGTAGTTGAAAGAACTCACCATTTGATTGTGTATATATATTGGGTCTGATTGTTTCTGCAAGAAAAAGACTGCATTTTGAACTCTCCCTCTG
TTTATGCAAAAGCCAATTTGTTGCTTCCTGTTAACACTGAAAAAGGTTCAAATTCATATCTATATTTCAAGATCTTTGAAACATATCTTTAGGAGAATGAAGCAATGTGA
TGGTTCGATTTCAATTAAACAATATAAACCCTCC
Protein sequenceShow/hide protein sequence
MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELLQLRLQ
QERSMRSMLERAMGRASSTLSPGHRHLAQVTFTFSCMHRSITKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSS
RKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISCVCPAWLKSKFCIELIIFGRTVKLDPSYYSIISSFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHR
VASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFH
KVLHFSVCMQAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQAN
IVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFRYVFSTNLTEKPWWS