; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G002110 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G002110
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSET domain-containing protein
Genome locationchr08:4642386..4647340
RNA-Seq ExpressionLsi08G002110
SyntenyLsi08G002110
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]2.9e-28177.02Show/hide
Query:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL
        MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKCSLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLL
Subjt:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHPS   S PP RIFGLLTNRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDSIG+TIGIAVYAPTF WINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG VRSN+ DF+RE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             EISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS
         HFL LNAYTAL SAYKVRSCDLLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGESLLILA HSSLWA TTN+S
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS

Query:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK
         WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGISNCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK     +N  D+  H ID SCACSK
Subjt:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK

Query:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        TKD+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Subjt:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

XP_011656459.1 protein SET DOMAIN GROUP 41 [Cucumis sativus]2.8e-27675.65Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLL
        MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS  L YCSLKCSLSHSDPLT AFFS HPFP  SSDTSDLRASLRLLHLL
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHPS   S PP+RI+GLLTNRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDSIG+TIGIAVYA TF WINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG VRSN+ DFIRE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             EIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYLS SSPESC EKLQNLLT GF DEQ ED E KQ V+LRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS
        LHFL LNAYTAL SAYKVRSCDL+ALSS+MD D+ N+  A  M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAGESLLILA HSSLWA TTN+S
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS

Query:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK
         W  P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIGISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK     +N QD+    ID SCACSK
Subjt:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK

Query:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Subjt:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

XP_022932824.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]2.2e-24971.1Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS
        MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS  C  S SD LTAA FS   FP SDTSDLRASLRLLHLLLS
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS

Query:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS
          SA+ SAPPERIFGLLTNR KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G+TIGIAVY PTFCWINHSCS
Subjt:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS

Query:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------
        PNACYRFETPSDS  TRLRI+P CTD+ T EGSCNQM TVR N S FI +     GYGPRV+VRSIK +RKGEAVTIAYCDLLQPK              
Subjt:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------

Query:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH
                           EISA  VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI SPESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+H
Subjt:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH

Query:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG
        FL LN YTALASAYKVRS           NDDENQ  A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLLIL  HSSLW  +N+SK  
Subjt:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG

Query:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD
         P+G+  C NCSWVDKFN +RI GRSIEADF EFSIGISNCIA++S K WSFL H C YLKAFTDP DFSWPK ITT  NY          SC CSK +D
Subjt:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD

Query:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        V        S Q+R+SI  LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Subjt:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]4.6e-25572.78Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS
        MEMEMRAMEDIEMAEDITPPL PLTAALHD+FLLTHCSSCFSPLPN  ISHSNLLRYCS  C  SHSD LTAA FS   FP SDTSDLRASLRLLHLLLS
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS

Query:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS
         PSA+ SAPPERIFGLLTNR KLM+  DDS+VF+K+REG DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+GRTIGIAVY PTFCWINHSCS
Subjt:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS

Query:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------
        PNACYRFETPSDS  TRLRI+P CTD+ T EGSC+QM TVR N S FI +     GYGPRV+VRSIK IR GEAVTIAYCDLLQPK              
Subjt:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------

Query:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH
                           EISAV VE LDSTSISNFD+D A+ RIDDYV+NAI EYLSI S ESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+H
Subjt:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH

Query:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG
        FL LNAYTALASAYKVRS           N DENQ  A+ MS+TSAAYSLFLAGATHHLFLS+PSLIASAANCWVVAGESLLIL  HSSLW  +N+SK  
Subjt:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG

Query:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD
         P+G+  C NCSWVDKFN SRI GRSIEADF EFSIGISNCIAN+SQK WSFL H C YLKAFTDP DFSWPK ITT SNY+       D SC CSK +D
Subjt:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD

Query:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        V        S+Q+R+SI  LGIHCLFYGGYLASICYGHHSHLASQIQ ILHD++
Subjt:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida]3.8e-28978.51Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPS--SDTSDLRASLRLLHLL
        MEMEM AMEDIEMAEDITPPL PLT+ALHDSFL THCSSCFS LPNPPISHSNLLRYCS KCSLSHSDPLTAAFFS HPFPS  S TSDLRASLRLLHLL
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPS--SDTSDLRASLRLLHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHP A  S PPERIFGLLTNRHKLM PQ D+++F KLREGVDAIAA     SADI HG+ L EA LCLV TNAVDV DS GRTIGIAVY PTFCWINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFET S STTTR RIAPSCTDL+T +GSC+QMGTVRSNLSDFI E     G GPRV+VRSIK IR+GEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             E+SA KVE  DSTSISNFDHD+AVRRIDDYV++AITEYLSI SPESC EKL+NLLTLGF DEQAED E+KQPVNLRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK
        LHFLSLN YTALASAYKVRSCDLLALSS+MD D+E+Q  AS M + SAAYSLFLAGATHHLFLS+PSLI SA+ CWV+AGESLL LA HS LWATTN+SK
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK

Query:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT
        WG PVGKRMCS CSWVDKFNASRI G+ IEADF EFSIGISNCIANMS+KSWSFLTHGCPYLKAFTDP +FSWPK I  YS+ +D++AHSID  CACS +
Subjt:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT

Query:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        KDVCFQ EPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNIL+DL+
Subjt:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

TrEMBL top hitse value%identityAlignment
A0A0A0KAK3 SET domain-containing protein4.5e-28076.1Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLL
        MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS  L YCSLKCSLSHSDPLT AFFS HPFP  SSDTSDLRASLRLLHLL
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHPS   S PP+RI+GLLTNRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDSIG+TIGIAVYA TF WINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG VRSN+ DFIREGA L+G GPRVVVRSIK I+KGEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             EIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYLS SSPESC EKLQNLLT GF DEQ ED E KQ V+LRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS
        LHFL LNAYTAL SAYKVRSCDL+ALSS+MD D+ N+  A  M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAGESLLILA HSSLWA TTN+S
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS

Query:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK
         W  P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIGISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK     +N QD+    ID SCACSK
Subjt:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK

Query:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Subjt:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X11.4e-28177.02Show/hide
Query:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL
        MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKCSLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLL
Subjt:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHPS   S PP RIFGLLTNRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDSIG+TIGIAVYAPTF WINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG VRSN+ DF+RE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             EISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS
         HFL LNAYTAL SAYKVRSCDLLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGESLLILA HSSLWA TTN+S
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS

Query:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK
         WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGISNCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK     +N  D+  H ID SCACSK
Subjt:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK

Query:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        TKD+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Subjt:  TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

A0A1S3CJZ3 protein SET DOMAIN GROUP 41 isoform X23.6e-22176.97Show/hide
Query:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL
        MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKCSLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLL
Subjt:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLL

Query:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS
        LSHPS   S PP RIFGLLTNRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDSIG+TIGIAVYAPTF WINHS
Subjt:  LSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHS

Query:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------
        CSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG VRSN+ DF+RE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPK            
Subjt:  CSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------

Query:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP
                             EISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP
Subjt:  ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHP

Query:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS
         HFL LNAYTAL SAYKVRSCDLLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGESLLILA HSSLWA TTN+S
Subjt:  LHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWA-TTNSS

Query:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADF
         WG P+GKRMCSNCSWVD+FN SRI GR I+ADF
Subjt:  KWGLPVGKRMCSNCSWVDKFNASRILGRSIEADF

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X11.1e-24971.1Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS
        MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS  C  S SD LTAA FS   FP SDTSDLRASLRLLHLLLS
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS

Query:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS
          SA+ SAPPERIFGLLTNR KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G+TIGIAVY PTFCWINHSCS
Subjt:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS

Query:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------
        PNACYRFETPSDS  TRLRI+P CTD+ T EGSCNQM TVR N S FI +     GYGPRV+VRSIK +RKGEAVTIAYCDLLQPK              
Subjt:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------

Query:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH
                           EISA  VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI SPESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+H
Subjt:  -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLH

Query:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG
        FL LN YTALASAYKVRS           NDDENQ  A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLLIL  HSSLW  +N+SK  
Subjt:  FLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWG

Query:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD
         P+G+  C NCSWVDKFN +RI GRSIEADF EFSIGISNCIA++S K WSFL H C YLKAFTDP DFSWPK ITT  NY          SC CSK +D
Subjt:  LPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD

Query:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        V        S Q+R+SI  LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Subjt:  VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X11.5e-24670.53Show/hide
Query:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS
        MEME+RAMEDIEMAEDITPPL PLTAALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS  C  S+SD LTAA FS   F  SDTSDLRASLRLLHLLLS
Subjt:  MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLS

Query:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS
          SA+ S PPERIFGLLTNR KLM+  DDS+VF K+R+G DAIA SRR NSADIR+ NALEEA++CLVLTNAV+VQDS+G+TIGIAVY PTFCWINHSCS
Subjt:  HPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCS

Query:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------
        PNACYRFETPSDS  TRLRI+P CTD+ T EGSC+QM TVR N S FI +     GYGPRV+VRSIK IRKGEAVTIAYCDLLQPK              
Subjt:  PNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK--------------

Query:  -------------------EISAVKV-EFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL
                           EI AV V E LDSTSISNFD+D A+ RIDDYV+NAI EYLSI SPESC EKLQNLLTLGF DEQA+D + KQ +NLRLHP+
Subjt:  -------------------EISAVKV-EFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL

Query:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKW
        HFL LN YTALASAYKVRS           ND+ENQ   S MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL L  HSSLW  +N+SK 
Subjt:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKW

Query:  GLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTK
          P+G+  C NCSWVDKFN SRI GRSIE DF EFSIGISNCIAN+S K WSFLTH CPYLKAFTDP DFSWPK ITT SNY+       D  C  SK +
Subjt:  GLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTK

Query:  DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
        DV        S+Q+R+SI  LGIHCLFYGGYLASICYGH SHL+SQIQ IL D++
Subjt:  DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD

SwissProt top hitse value%identityAlignment
Q3ECY6 Protein SET DOMAIN GROUP 411.9e-8635.29Show/hide
Query:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHP
        ME+RA EDIE+  D+ PPL PL ++L+DSFL +HCSSCFS LP  P        YCS  CS      LT +F ++  FP   T  L + +R    LL+  
Subjt:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHP

Query:  SAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPN
        +   S+ P R+  LLTN H LM    D  + + +    + IA   R N    R    LEEA +C VLTNAV+V DS G  +GIA+Y  +F WINHSCSPN
Subjt:  SAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPN

Query:  ACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI-------------
        +CYRF          +    S  D+ VTN  + + +          +  G   +G GP+++VRSIK I+ GE +T++Y DLLQP  +             
Subjt:  ACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI-------------

Query:  -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSIS-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL
             +A    ++DS            T++ +FD     D+AV +++DY+  AI ++LS +  P++C E ++++L  G      + +E+ QP  LRLH  
Subjt:  -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSIS-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL

Query:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK
        H+++LNAY  LA+AY++RS D             ++ G    MSR SAAYSLFLAG +HHLF ++ S   SAA  W  AGE L  LA    +  +  S  
Subjt:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK

Query:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT
                 C+ C  ++  N+ R        D  E S  I +C+ ++SQ +WSFLT GCPYL+ F  P DFS      T +N +                
Subjt:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT

Query:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ
                 + S  +  ++L L  HCL Y   L  +CYG  SHL S+ +
Subjt:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ

Arabidopsis top hitse value%identityAlignment
AT1G43245.1 SET domain-containing protein1.4e-8735.29Show/hide
Query:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHP
        ME+RA EDIE+  D+ PPL PL ++L+DSFL +HCSSCFS LP  P        YCS  CS      LT +F ++  FP   T  L + +R    LL+  
Subjt:  MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHP

Query:  SAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPN
        +   S+ P R+  LLTN H LM    D  + + +    + IA   R N    R    LEEA +C VLTNAV+V DS G  +GIA+Y  +F WINHSCSPN
Subjt:  SAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPN

Query:  ACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI-------------
        +CYRF          +    S  D+ VTN  + + +          +  G   +G GP+++VRSIK I+ GE +T++Y DLLQP  +             
Subjt:  ACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI-------------

Query:  -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSIS-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL
             +A    ++DS            T++ +FD     D+AV +++DY+  AI ++LS +  P++C E ++++L  G      + +E+ QP  LRLH  
Subjt:  -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSIS-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPL

Query:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK
        H+++LNAY  LA+AY++RS D             ++ G    MSR SAAYSLFLAG +HHLF ++ S   SAA  W  AGE L  LA    +  +  S  
Subjt:  HFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSK

Query:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT
                 C+ C  ++  N+ R        D  E S  I +C+ ++SQ +WSFLT GCPYL+ F  P DFS      T +N +                
Subjt:  WGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT

Query:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ
                 + S  +  ++L L  HCL Y   L  +CYG  SHL S+ +
Subjt:  KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTG
TTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCT
TCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCC
GAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAG
AAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTG
GAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATC
GCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGGTTA
TGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGAAATCTCTGCTGTCAAGGTGG
AATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCT
GAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACGAGGAAGAAAAACAGCCAGTTAACCTGAGGCTGCATCCTTTGCA
CTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTG
GAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGT
TGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAA
CTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCAC
AAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCAAGAT
CTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCT
TGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAA
mRNA sequenceShow/hide mRNA sequence
TGAAATTTTTAGTTGGAGGAGGACAGGAGAGACAGAAATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCA
CCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAA
TGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCT
CCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCA
AGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAAC
GCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGA
AACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTT
CGGATTTCATAAGAGAAGGTGCGTTTCTTCACGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGAC
TTGTTGCAACCTAAGGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGA
CAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACGAGGAAG
AAAAACAGCCAGTTAACCTGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTG
AGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCT
TTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAA
AATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTT
TCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTG
GCCAAAGGCTATCACAACATATTCGAATTACCAAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTC
AGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCT
CAGATTCAAAATATTTTACATGACTTGGATTAATACAGTTATCAAGTAGAAATGTAAATTATTCTGAGATTGAAACTTTTTTCTCCCACCCTACCCATGGTATAAAATTA
GGCCTCGTTTCAT
Protein sequenceShow/hide protein sequence
MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPP
ERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRI
APSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSP
ESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANC
WVVAGESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQD
LQAHSIDPSCACSKTKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD