; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC06G113160 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC06G113160
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionTranscription factor EGL1-like
Genome locationCicolChr06:4267815..4284844
RNA-Seq ExpressionCcUC06G113160
SyntenyCcUC06G113160
Gene Ontology termsGO:0048629 - trichome patterning (biological process)
GO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR025610 - Transcription factor MYC/MYB N-terminal
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575174.1 Transcription factor GLABRA 3, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0071.25Show/hide
Query:  MINDHTHPSGLVLSNTETIRSFLTAASIDSRLSEELRPIASDLASQNNIPYKPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND  HPSGLV+++TE IRSFLT+ASIDSR+SEELR +AS+LASQ+N+PYKPLR IWFATES TRPDLLRLL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDHTHPSGLVLSNTETIRSFLTAASIDSRLSEELRPIASDLASQNNIPYKPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELFLT--------------SHEVVAGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLIETLLFIIRSSNYDNRSSPSKLK
        DVAERKAYQEL                   ++  GLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGML+ETLLFIIR+S            
Subjt:  DVAERKAYQELFLT--------------SHEVVAGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLIETLLFIIRSSNYDNRSSPSKLK

Query:  KSQVSTIYMTSSYSIKTRFLVRFGCFHPELQHRILISSFLPDTHKSCGGRKEKKKKFFFSKGSVKKVNPKLEFVSPTIRFLIKKLLLLCVARLKPKRLLG
                                                                                     I FL +K LLL VARLKPK LLG
Subjt:  KSQVSTIYMTSSYSIKTRFLVRFGCFHPELQHRILISSFLPDTHKSCGGRKEKKKKFFFSKGSVKKVNPKLEFVSPTIRFLIKKLLLLCVARLKPKRLLG

Query:  FAILWHWNQILSPLWLMACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQL
        FAILW  NQILS LWLMACEPGFLRKQLAVAVKSIQWSYAIFWSPS RQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++ MGLHRSEQL
Subjt:  FAILWHWNQILSPLWLMACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQL

Query:  RELYKSLLEGESEQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELG
        RELYKSLL+GESEQRAKKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELG
Subjt:  RELYKSLLEGESEQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELG

Query:  VTEQVSEDPNLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQY
        VTEQVSEDP+LLQHVKDFL+KFSKPICSKK  S+AYKDDN KEPM AKSDNEIVEVLAMEN++ LT  +   KAVNGIQRK +EFGIDSL+ FSNGCE++
Subjt:  VTEQVSEDPNLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQY

Query:  HHQVESPLRLEGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTI
        H  V+ PLRLEGVEGGAS F+SLQFLDDDFSYGFQDSMNPSDCISEALAN  EKVS    S G NDL LKELQNSN TKS SLD R+DED+HYKRTIFTI
Subjt:  HHQVESPLRLEGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTI

Query:  LGSSTQLVGSPLLHNFSSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS
        LGSSTQL GSPLLH+FSSRSSF+PWKKGMAE NT P+QQ+MLKKILFT                                                    
Subjt:  LGSSTQLVGSPLLHNFSSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS

Query:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM
           INKVSILNDTIKYLKMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDY+KIEG+ KPSTNKRKACEMDETDLKLKNNIPKDG KLDVKVTM
Subjt:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM

Query:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
         EQEVLVDMHCPYREYILVDVMD LNDLQLDAHSVQSSD NGVFSLTLKSKFRG+VAAS GM+K+ALLKVANKS
Subjt:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

XP_008458230.1 PREDICTED: LOW QUALITY PROTEIN: transcription factor EGL1-like [Cucumis melo]0.0e+0087.67Show/hide
Query:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA
        MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELY+SLLEGESEQR 
Subjt:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA

Query:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK
        KKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELGVTEQV+EDP LLQHVK
Subjt:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK

Query:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        DFL+KFSKPICSKKP SAAYKDDN KEPMTAKSDNEIVE LAMENLYC T V   GK+VNGIQR  NEFGIDSL+DFSNGCEQY HQ+E  LRLEGVEGG
Subjt:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
        ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ+KVSSSPR   AN+LPLKELQN N T+SGSLD  SDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS

Query:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
        +RS+F PWKK MAET+TPPMQQRMLKKILF VPLLSAGSL GLKD +RSILKQGNN+ CTKN   DKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
Subjt:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL

Query:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI
        KMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACEMDETDLKLK++ PK GHKLDVKV+M+E EVL+DMHCPYREYI
Subjt:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI

Query:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        LVDV+DALNDLQLDA+SVQSSDHNG FSLTLKSKFRGI AASVGMIK+ALLKVANKS
Subjt:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

XP_011656339.1 transcription factor EGL1 isoform X1 [Cucumis sativus]0.0e+0087.37Show/hide
Query:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA
        MACEPGFLRKQLAVAVKSIQWSYAIFWSPS+RQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELY+SLLEGESEQR 
Subjt:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA

Query:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK
        KKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAES+VFSRSLLAK     TVVCFPYLGGVIELGVTEQVSEDP+LLQHVK
Subjt:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK

Query:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        DFL+KFSKPICSKKP SAAYKDDN KEPMTAKSDNEIVEVLAMENLYC T V   GK+VNGIQRK NEFGIDSL+DFSNGCEQY H +E  LRLEG EGG
Subjt:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
        ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPR   AN+LPLKE QN NHT+SGSLD  SDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS

Query:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
        +RS+F+PWKK +AET+TPPMQQRMLKKILF VPLLSAGSL GLKD ++SILKQGNND CTKN   DKL+ENEKFMALKSMLPSLNEINKVSILNDTIKYL
Subjt:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL

Query:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI
        KMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M+E EVLVDMHCPYREYI
Subjt:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI

Query:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        LVDVMDALNDLQLDA+SVQSSDHNG+FSLTLKSKFRG+ AASVGMIK+ALLKV NKS
Subjt:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

XP_038875172.1 transcription factor EGL1-like isoform X1 [Benincasa hispida]0.0e+0089.31Show/hide
Query:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP
        E GFLRKQLAVAVKSIQWSYAIFWSPSTRQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELYKSLLEGESEQ+AKKP
Subjt:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP

Query:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL
        PA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+D+TIWLCNAQYAESSVFSRSLLAK     TVVCFP LGGVIELGV EQVSEDP+LLQHVKDFL
Subjt:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL

Query:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR
        +KFSK ICSKKP SAAYKDDN KEPMT KSDN  VEVLA+ENLYCLT V  +GKA+N IQRK N+FGIDSL+DFSNGCEQY HQ+E PLRLEGVEGGASR
Subjt:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR

Query:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS
        F SLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRS GANDLPL+ELQNSNHTKSGSLD RSDEDMHYKRTIFTILGSSTQLVGS LLHNFSSRS
Subjt:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS

Query:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKM
        SFVPWKKGM ET+TPPMQQRMLKKILFTVPLLS+G SLNGLK G++SI +QGNNDFCTKNV+HDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKM
Subjt:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKM

Query:  LEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILV
        LEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACE+DETDLKLKN IPKDGHKLDVKV+MK+QEVLVDMHCPYREYILV
Subjt:  LEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILV

Query:  DVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        DVMDALNDLQLDA+SVQSSDHNGVFSLTLKSKFRGIVAASVGMIK+ALLKVANKS
Subjt:  DVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

XP_038875173.1 transcription factor EGL1-like isoform X2 [Benincasa hispida]0.0e+0089.36Show/hide
Query:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA
        MACE GFLRKQLAVAVKSIQWSYAIFWSPSTRQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELYKSLLEGESEQ+A
Subjt:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA

Query:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK
        KKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+D+TIWLCNAQYAESSVFSRSLLAK     TVVCFP LGGVIELGV EQVSEDP+LLQHVK
Subjt:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK

Query:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        DFL+KFSK ICSKKP SAAYKDDN KEPMT KSDN  VEVLA+ENLYCLT V  +GKA+N IQRK N+FGIDSL+DFSNGCEQY HQ+E PLRLEGVEGG
Subjt:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
        ASRF SLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRS GANDLPL+ELQNSNHTKSGSLD RSDEDMHYKRTIFTILGSSTQLVGS LLHNFS
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS

Query:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
        SRSSFVPWKKGM ET+TPPMQQRMLKKILFTVPLLS+G SLNGLK G++SI +QGNNDFCTKNV+HDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
Subjt:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY

Query:  LKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREY
        LKMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACE+DETDLKLKN IPKDGHKLDVKV+MK+QEVLVDMHCPYREY
Subjt:  LKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREY

Query:  ILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        ILVDVMDALNDLQLDA+SVQSSDHNGVFSLTLKSKFRGIVAASVGMIK+ALLKVANKS
Subjt:  ILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

TrEMBL top hitse value%identityAlignment
A0A1S3C7Z7 LOW QUALITY PROTEIN: transcription factor EGL1-like0.0e+0087.67Show/hide
Query:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA
        MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELY+SLLEGESEQR 
Subjt:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA

Query:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK
        KKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELGVTEQV+EDP LLQHVK
Subjt:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK

Query:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        DFL+KFSKPICSKKP SAAYKDDN KEPMTAKSDNEIVE LAMENLYC T V   GK+VNGIQR  NEFGIDSL+DFSNGCEQY HQ+E  LRLEGVEGG
Subjt:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
        ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ+KVSSSPR   AN+LPLKELQN N T+SGSLD  SDEDMHYKRTIFTILGSSTQLVGSPLLHNFS
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFS

Query:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
        +RS+F PWKK MAET+TPPMQQRMLKKILF VPLLSAGSL GLKD +RSILKQGNN+ CTKN   DKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
Subjt:  SRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL

Query:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI
        KMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACEMDETDLKLK++ PK GHKLDVKV+M+E EVL+DMHCPYREYI
Subjt:  KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYI

Query:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        LVDV+DALNDLQLDA+SVQSSDHNG FSLTLKSKFRGI AASVGMIK+ALLKVANKS
Subjt:  LVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

A0A5D3BTG5 Transcription factor EGL1-like0.0e+0087.77Show/hide
Query:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP
        EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELY+SLLEGESEQR KKP
Subjt:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP

Query:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL
        PA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELGVTEQV+EDP LLQHVKDFL
Subjt:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL

Query:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR
        +KFSKPICSKKP SAAYKDDN KEPMTAKSDNEIVE LAMENLYC T V   GK+VNGIQRK NEFGIDSL+DFSNGCEQY HQ+E  LRLEGVEGGASR
Subjt:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR

Query:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS
        FQSLQFLDDDFSYGFQDSMNPSDCISEALANQ+KVSSSPR   AN+LPLKELQN N T+SGSLD  SDEDMHYKRTIFTILGSSTQLVGSPLLHNFS+RS
Subjt:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS

Query:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKML
        +F PWKK MAET+TPPMQQRMLKKILF VPLLSAGSL GLKD +RSILKQGNN+ CTKN   DKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKML
Subjt:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKML

Query:  EARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILVD
        EARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACEMDETDLKLK++ PK GHKLDVKV+M+E EVL+DMHCPYREYILVD
Subjt:  EARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILVD

Query:  VMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        V+DALNDLQLDA+SVQSSDHNG FSLTLKSKFRGI AASVGMIK+ALLKVANKS
Subjt:  VMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

A0A6J1H4N1 transcription factor EGL1-like isoform X20.0e+0085.74Show/hide
Query:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA
        MACEPG LRKQLAVAVKSIQWSYAIFWSPS RQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++ MGLHRSEQLRELYKSLL+GESEQRA
Subjt:  MACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA

Query:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK
        KKPPA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELGVTEQVSEDP+LLQHVK
Subjt:  KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVK

Query:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        DFL+KFSKPICSKK  S+AYKDDN KEPM AKSDNEIVEVLAMEN++ LT  +   KAVNGIQRK +EFGIDSL+ FSNGCE++H  V+ PLRLEGVEGG
Subjt:  DFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
        AS F+SLQFLDDDFSYGFQDSMNPSDCISEALAN  EKVS    S G NDL LKELQNSN TKS SLD R+DED+HYK+TIFTILGSSTQL GSPLLH+F
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIK
        SSRSSF+PWKKGMAE NT P+QQ+MLKKILFTVPLLSAG SLN LKD +RSILKQGN+DFCTK+VVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIK
Subjt:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIK

Query:  YLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYRE
        YLKMLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDY+KIEG+ KPSTNKRKACEMDETDLKLKNNIPKDG KLDVKVTM EQEVLVDMHCPYRE
Subjt:  YLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYRE

Query:  YILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        YILVDVMD LNDLQLDAHSVQSSD NGVFSLTLKSKFRG+V AS GM+K+ALLKVANKS
Subjt:  YILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

A0A6J1H626 transcription factor EGL1-like isoform X10.0e+0085.67Show/hide
Query:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP
        EPG LRKQLAVAVKSIQWSYAIFWSPS RQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++ MGLHRSEQLRELYKSLL+GESEQRAKKP
Subjt:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP

Query:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL
        PA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAESSVFSRSLLAK     TVVCFPYLGGVIELGVTEQVSEDP+LLQHVKDFL
Subjt:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL

Query:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR
        +KFSKPICSKK  S+AYKDDN KEPM AKSDNEIVEVLAMEN++ LT  +   KAVNGIQRK +EFGIDSL+ FSNGCE++H  V+ PLRLEGVEGGAS 
Subjt:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLT--EVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR

Query:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSR
        F+SLQFLDDDFSYGFQDSMNPSDCISEALAN  EKVS    S G NDL LKELQNSN TKS SLD R+DED+HYK+TIFTILGSSTQL GSPLLH+FSSR
Subjt:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSR

Query:  SSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLK
        SSF+PWKKGMAE NT P+QQ+MLKKILFTVPLLSAG SLN LKD +RSILKQGN+DFCTK+VVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLK
Subjt:  SSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAG-SLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLK

Query:  MLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYIL
        MLEARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDY+KIEG+ KPSTNKRKACEMDETDLKLKNNIPKDG KLDVKVTM EQEVLVDMHCPYREYIL
Subjt:  MLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYIL

Query:  VDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        VDVMD LNDLQLDAHSVQSSD NGVFSLTLKSKFRG+V AS GM+K+ALLKVANKS
Subjt:  VDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

I6N8K6 GL30.0e+0086.85Show/hide
Query:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP
        EPGFLRKQLAVAVKSIQWSYA+FWSPS+RQHG            VLEWCDGYYNGDIKTRKTVQAEDVH++NMGLHRSEQLRELY+SLLEGESEQR KKP
Subjt:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKP

Query:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL
        PA+LSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA+DRTIWLCNAQYAES+VFSRSLLAK     TVVCFPYLGGVIELGVTEQVSEDP+LLQHVKDFL
Subjt:  PAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFL

Query:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR
        +KFS+PICSKKP SAAYKDDN KEPMTAKSDNEIVEVLAMENLYC T V   GK+VNGIQRK NEFGIDSL+DFSNGCEQY H +E  LRLEG EGGASR
Subjt:  VKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV--HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGASR

Query:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS
        FQSLQFLDDDFSYGFQDSMNPSDCISEALA+QEKVSSSPR   AN+LPLKE QN NHT+SGSLD  SDEDMHYKRTIFTILGSSTQLVGSPLLHNFS+RS
Subjt:  FQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSSRS

Query:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKML
        +F+PWKK +AET+TPPMQQRMLKKILF VPLLSAGSL GLKD ++SILKQGNND CTKN   DKL+ENEKFMALKSMLPSLNEINKVSILNDTIKYLKML
Subjt:  SFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKML

Query:  EARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILVD
        EARVQELETC+DSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS KPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M+E EVLVDMHCPYREYILVD
Subjt:  EARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILVD

Query:  VMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS
        VMDALNDLQLDA+SVQSSDHNG+FSLTLKSKFRG+ AASVGMIK+ALLKV NKS
Subjt:  VMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS

SwissProt top hitse value%identityAlignment
E3SXU4 Basic helix-loop-helix protein A3.2e-7031.16Show/hide
Query:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKPPAAL
        L+  L  AV+S+QW+Y++FW    +Q              +L W DGYYNG IKTRKTVQ  +V      L RS+QLRELY+SL  GE+    ++P A+L
Subjt:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKPPAAL

Query:  SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFLVKFS
        SPEDL+++EW+YL+C+SF F  G GLPG+A A  + +WL  A   +S  FSR++LAK     TVVC P L GV+E+G T+++ ED N ++HV+ F +   
Subjt:  SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFLVKFS

Query:  KPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAV-NGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG-ASRFQSL
              KP  + +   N        S + I  +     +Y + +    A+ N      +E   D  ++  +G E   +Q  +      +E    S    +
Subjt:  KPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAV-NGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG-ASRFQSL

Query:  QFLDDDFSYGFQDSMN--PSDCISEALANQEKVSSSPRSS-----GANDLPLK---ELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLH
        +  DD       D  N   SD    A++NQ   S    S      G  + PL    ++Q S+      L+  + ED HY +T+ TIL    Q + SP ++
Subjt:  QFLDDDFSYGFQDSMN--PSDCISEALANQEKVSSSPRSS-----GANDLPLK---ELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLH

Query:  --NFSSRSSFVPWKKGMAETNTPP---MQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNND--------------FCTKNVVHDKLRE---NEKFM
          N+S++SSF  W         PP     Q ++K ILFTVP L   + +      R      +ND                  +V+ ++ R    NE+F+
Subjt:  --NFSSRSSFVPWKKGMAETNTPP---MQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNND--------------FCTKNVVHDKLRE---NEKFM

Query:  ALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGH
         L+S++P + +++K SIL DTI+YLK L  ++Q+LET    +  E    +  + ++   ++      +EG+      + KA E+                
Subjt:  ALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGH

Query:  KLDVKVTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFR
           V+V++ E + L+++ C  RE +L+DVM  L +L+++   VQSS +NGVF   L++K +
Subjt:  KLDVKVTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFR

P13526 Anthocyanin regulatory Lc protein3.3e-6730.86Show/hide
Query:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA--K
        E   +R QLA A +SI WSYA+FWS S  Q G            VL W DG+YNG++KTRK   + ++  + + + RS+QLRELY++LL GE ++RA   
Subjt:  EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA--K

Query:  KPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKD
        +P  +LSPEDL D EWYY+V M++ F  GQGLPGR+ A+D  +WLCNA  A S  F R+LLAK     +++C P +GGV+ELG T+ V E P+L+     
Subjt:  KPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQHVKD

Query:  FLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV-HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGAS
                       +AA+ +  C     +   NE  E  A +  +   E+ H   ++ I+      G               H  E  LRL   E   S
Subjt:  FLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEV-HGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGGAS

Query:  RFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKE---LQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
           SL+ +  +    +            +L ++  + + P       LPL++   +  SN     S    +   +           S   + G       
Subjt:  RFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKE---LQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETN----------TPPMQ--QRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVV--HDKLRE--NEKFMALKSMLPS
        S  +SF+ W +   +++           P ++  QR+LKK++       AG       G  +   Q  +   TKN V    K RE  NE F+ LKS+LPS
Subjt:  SSRSSFVPWKKGMAETN----------TPPMQ--QRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVV--HDKLRE--NEKFMALKSMLPS

Query:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQ-TSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVT
        ++ +NK SIL +TI YLK L+ RVQELE+  +          + +    +  +++   E   GS      KRK+ E+   D++    +  D    +V VT
Subjt:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQ-TSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVT

Query:  MKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANK
        + +++VL+++ C + E ++  V DA+  L LD  SVQ+S  +G   L ++++F G  A    MI  AL K   K
Subjt:  MKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANK

Q8W2F1 Transcription factor MYC11.1e-8334.93Show/hide
Query:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGES-----------
        LRKQLA+AV+S+QWSYAIFWS S  Q G            VLEW +G YNGD+K RK  ++ + H    GL +S++LR+LY S+LEG+S           
Subjt:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGES-----------

Query:  ---EQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDP
           +         LSP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+     TVVCFPYLGGVIELGVTE +SED 
Subjt:  ---EQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDP

Query:  NLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLE
        NLL+++K  L++ S           A++D++ ++ M  K   E                                               HQ        
Subjt:  NLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLE

Query:  GVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG---
                                                              LPL                 SDED+HYKRTI T+L  S    G   
Subjt:  GVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG---

Query:  ------SPLLHNFSSRSSFVPWKKGMAETN---TPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS
               P +      SSF+ WK+   + +        Q +L+KIL  VPL+       +   Q S L Q +          D+ +ENEKF  L++M+P+
Subjt:  ------SPLLHNFSSRSSFVPWKKGMAETN---TPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS

Query:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM
        +NE++K SILN+TIKYL+ LEARV+ELE+C+ S+ + ER R+      E  +D+   E+  G++  ST      ++D+   + +  +     K  ++V +
Subjt:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM

Query:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKV
        KE EV++++ C YR+YI+ D+M+ L++L +DA SV+S   N   +L LK+KFRG   ASVGMIK  L +V
Subjt:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKV

Q9CAD0 Transcription factor EGL12.7e-10939.06Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ
        P  L+KQLAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA +V I+ +GL RSEQLRELY+SL        G S+ 
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ

Query:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH
          +   AALSPEDL+D EWYYLVCMSF FN G+G+PG AL+N   IWLCNA+ A+S VF+RSLLAK     TVVCFP+LGGV+E+G TE + ED N++Q 
Subjt:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH

Query:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        VK   ++         P++      + +E     SD++   V   E                      F   S + F    E +   +         +GG
Subjt:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
        AS+ QS QF+ ++ S     S+N SDC+S+  +    +++  PR S    L   + + SNH          D+D+HY+  I TI  ++ QL+  P   NF
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
          RSSF  WK+  +        Q+M+KKILF VPL     +N  ++      ++  N   ++    +KL  NE+FM L+S++PS+++I+KVSIL+DTI+Y
Subjt:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY

Query:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD
        L+ L+ RVQELE+C +S   E R     R+K  D  E+ S N    K +GS                D+ +  + P D    G   +++++    EV+++
Subjt:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD

Query:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        + C +RE IL+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMI+ AL +VA
Subjt:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA

Q9FN69 Transcription factor GLABRA 34.4e-11240.36Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA----
        P  L+K LAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA ++  + +GL RSEQL ELY+SL   ES        
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA----

Query:  -----KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLA-----KTVVCFPYLGGVIELGVTEQVSEDPNL
             +   AALSPEDL+D EWYYLVCMSF FN G+G+PGR  AN   IWLCNA  A+S VFSRSLLA     KTVVCFP+LGGV+E+G TE ++ED N+
Subjt:  -----KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLA-----KTVVCFPYLGGVIELGVTEQVSEDPNL

Query:  LQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQV---ESPLRL
        +Q VK   ++   P  +  P  + Y  DN  +P     D     + + E                      F   S +  +NG +Q H QV         
Subjt:  LQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQV---ESPLRL

Query:  EGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG
        E + GGAS+ QS Q +DD+ S     S+N SDC+S+        +V+   R S    L   + Q  N  K+ S D R+D D+HY+  I TI  ++ QL+ 
Subjt:  EGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG

Query:  SPLLHNFSSRSSFVPWKKGMAE-----TNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRE--NEKFMALKSMLPSLN
         P   N   +SSF  WKK  +      T T P  Q MLKKI+F VP +       + D   +  + GN+      V+  K RE  NE+FM L+ ++PS+N
Subjt:  SPLLHNFSSRSSFVPWKKGMAE-----TNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRE--NEKFMALKSMLPSLN

Query:  EINKVSILNDTIKYLKMLEARVQELETCVDSLYYEER-----FRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVK
        +I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+K  D  E+TS N    +     K S N     E  +T           G   +++
Subjt:  EINKVSILNDTIKYLKMLEARVQELETCVDSLYYEER-----FRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVK

Query:  VTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        +     EV++++ C +RE +L+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMIK AL +VA
Subjt:  VTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA

Arabidopsis top hitse value%identityAlignment
AT1G63650.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.9e-11039.06Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ
        P  L+KQLAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA +V I+ +GL RSEQLRELY+SL        G S+ 
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ

Query:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH
          +   AALSPEDL+D EWYYLVCMSF FN G+G+PG AL+N   IWLCNA+ A+S VF+RSLLAK     TVVCFP+LGGV+E+G TE + ED N++Q 
Subjt:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH

Query:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        VK   ++         P++      + +E     SD++   V   E                      F   S + F    E +   +         +GG
Subjt:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
        AS+ QS QF+ ++ S     S+N SDC+S+  +    +++  PR S    L   + + SNH          D+D+HY+  I TI  ++ QL+  P   NF
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
          RSSF  WK+  +        Q+M+KKILF VPL     +N  ++      ++  N   ++    +KL  NE+FM L+S++PS+++I+KVSIL+DTI+Y
Subjt:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY

Query:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD
        L+ L+ RVQELE+C +S   E R     R+K  D  E+ S N    K +GS                D+ +  + P D    G   +++++    EV+++
Subjt:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD

Query:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        + C +RE IL+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMI+ AL +VA
Subjt:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA

AT1G63650.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.9e-11039.06Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ
        P  L+KQLAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA +V I+ +GL RSEQLRELY+SL        G S+ 
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ

Query:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH
          +   AALSPEDL+D EWYYLVCMSF FN G+G+PG AL+N   IWLCNA+ A+S VF+RSLLAK     TVVCFP+LGGV+E+G TE + ED N++Q 
Subjt:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH

Query:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        VK   ++         P++      + +E     SD++   V   E                      F   S + F    E +   +         +GG
Subjt:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
        AS+ QS QF+ ++ S     S+N SDC+S+  +    +++  PR S    L   + + SNH          D+D+HY+  I TI  ++ QL+  P   NF
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
          RSSF  WK+  +        Q+M+KKILF VPL     +N  ++      ++  N   ++    +KL  NE+FM L+S++PS+++I+KVSIL+DTI+Y
Subjt:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY

Query:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD
        L+ L+ RVQELE+C +S   E R     R+K  D  E+ S N    K +GS                D+ +  + P D    G   +++++    EV+++
Subjt:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD

Query:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        + C +RE IL+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMI+ AL +VA
Subjt:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA

AT1G63650.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.9e-11039.06Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ
        P  L+KQLAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA +V I+ +GL RSEQLRELY+SL        G S+ 
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSL------LEGESEQ

Query:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH
          +   AALSPEDL+D EWYYLVCMSF FN G+G+PG AL+N   IWLCNA+ A+S VF+RSLLAK     TVVCFP+LGGV+E+G TE + ED N++Q 
Subjt:  RAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDPNLLQH

Query:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG
        VK   ++         P++      + +E     SD++   V   E                      F   S + F    E +   +         +GG
Subjt:  VKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLEGVEGG

Query:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF
        AS+ QS QF+ ++ S     S+N SDC+S+  +    +++  PR S    L   + + SNH          D+D+HY+  I TI  ++ QL+  P   NF
Subjt:  ASRFQSLQFLDDDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVGSPLLHNF

Query:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY
          RSSF  WK+  +        Q+M+KKILF VPL     +N  ++      ++  N   ++    +KL  NE+FM L+S++PS+++I+KVSIL+DTI+Y
Subjt:  SSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKY

Query:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD
        L+ L+ RVQELE+C +S   E R     R+K  D  E+ S N    K +GS                D+ +  + P D    G   +++++    EV+++
Subjt:  LKMLEARVQELETCVDSLYYEERF----RRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKD----GHKLDVKVTMKEQEVLVD

Query:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        + C +RE IL+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMI+ AL +VA
Subjt:  MHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA

AT4G00480.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein8.1e-8534.93Show/hide
Query:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGES-----------
        LRKQLA+AV+S+QWSYAIFWS S  Q G            VLEW +G YNGD+K RK  ++ + H    GL +S++LR+LY S+LEG+S           
Subjt:  LRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGES-----------

Query:  ---EQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDP
           +         LSP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+     TVVCFPYLGGVIELGVTE +SED 
Subjt:  ---EQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLAK-----TVVCFPYLGGVIELGVTEQVSEDP

Query:  NLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLE
        NLL+++K  L++ S           A++D++ ++ M  K   E                                               HQ        
Subjt:  NLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQVESPLRLE

Query:  GVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG---
                                                              LPL                 SDED+HYKRTI T+L  S    G   
Subjt:  GVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG---

Query:  ------SPLLHNFSSRSSFVPWKKGMAETN---TPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS
               P +      SSF+ WK+   + +        Q +L+KIL  VPL+       +   Q S L Q +          D+ +ENEKF  L++M+P+
Subjt:  ------SPLLHNFSSRSSFVPWKKGMAETN---TPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPS

Query:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM
        +NE++K SILN+TIKYL+ LEARV+ELE+C+ S+ + ER R+      E  +D+   E+  G++  ST      ++D+   + +  +     K  ++V +
Subjt:  LNEINKVSILNDTIKYLKMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTM

Query:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKV
        KE EV++++ C YR+YI+ D+M+ L++L +DA SV+S   N   +L LK+KFRG   ASVGMIK  L +V
Subjt:  KEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKV

AT5G41315.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.2e-11340.36Show/hide
Query:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA----
        P  L+K LAV+V++IQWSY IFWS S  Q G            VLEW DGYYNGDIKTRKT+QA ++  + +GL RSEQL ELY+SL   ES        
Subjt:  PGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCDGYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRA----

Query:  -----KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLA-----KTVVCFPYLGGVIELGVTEQVSEDPNL
             +   AALSPEDL+D EWYYLVCMSF FN G+G+PGR  AN   IWLCNA  A+S VFSRSLLA     KTVVCFP+LGGV+E+G TE ++ED N+
Subjt:  -----KKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLA-----KTVVCFPYLGGVIELGVTEQVSEDPNL

Query:  LQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQV---ESPLRL
        +Q VK   ++   P  +  P  + Y  DN  +P     D     + + E                      F   S +  +NG +Q H QV         
Subjt:  LQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNGCEQYHHQV---ESPLRL

Query:  EGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG
        E + GGAS+ QS Q +DD+ S     S+N SDC+S+        +V+   R S    L   + Q  N  K+ S D R+D D+HY+  I TI  ++ QL+ 
Subjt:  EGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQLVG

Query:  SPLLHNFSSRSSFVPWKKGMAE-----TNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRE--NEKFMALKSMLPSLN
         P   N   +SSF  WKK  +      T T P  Q MLKKI+F VP +       + D   +  + GN+      V+  K RE  NE+FM L+ ++PS+N
Subjt:  SPLLHNFSSRSSFVPWKKGMAE-----TNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRE--NEKFMALKSMLPSLN

Query:  EINKVSILNDTIKYLKMLEARVQELETCVDSLYYEER-----FRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVK
        +I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+K  D  E+TS N    +     K S N     E  +T           G   +++
Subjt:  EINKVSILNDTIKYLKMLEARVQELETCVDSLYYEER-----FRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVK

Query:  VTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA
        +     EV++++ C +RE +L+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GMIK AL +VA
Subjt:  VTMKEQEVLVDMHCPYREYILVDVMDALNDLQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAAACCTTTCCGGCGAAAGGTAAGGTACTCCGGAATCGATTTGCCGTCGATCAGCCAGGCTCCGGCCGGAGAACAATCAAGGAACATGAGTTCAAGAATTCACCG
GCGACTCACTGGTCGGCGACAGAGGAGAGAGAGAGAGAGATTGATGCTCAAGCGTGCGAGCATTTGCTTTGAGCTCTCGTCTTCCACCGGTCTCGTCGGCGCCAAGGATA
TCGCTGTTCTCGAACCGGAAAAACCATATTCTACGATGATCAACGACCACACACATCCTTCTGGTCTTGTTCTATCTAACACTGAAACAATTCGCTCATTTCTCACCGCG
GCGTCCATAGACTCACGACTTTCTGAGGAACTCCGGCCGATTGCTTCAGATCTCGCTTCACAAAACAACATTCCGTATAAGCCTCTGAGAGCTATCTGGTTTGCTACGGA
ATCGTGCACACGGCCGGATTTGCTCCGTCTTTTGGCTGGATCGGAGTTCGTCTTTACAAGCCCTAAACCCAGGGAGAAGAGTGAGGAGTTAAAGGCTAGACTGAAGAAGC
TTGCAGATGTAGCAGAGAGGAAGGCCTATCAGGAACTGTTCTTAACAAGTCATGAGGTTGTTGCAGGTTTACATGTCGCGTTGATAATGTTTACTGGCTATCTTGTTGGA
TATGCATTATTCCGAGCATTGTTTAGGCATGATCCAATCATGAGTGCTGCTGGAGGTATCCTTGGATTAGTTTTCGGCATGCTCATAGAAACACTTCTTTTCATTATTCG
ATCGTCCAATTATGATAATCGGTCTTCCCCTTCTAAGCTAAAGAAGAGTCAAGTGTCTACTATATATATGACTTCGAGCTACAGTATCAAGACACGATTTTTGGTTCGGT
TCGGTTGCTTTCATCCTGAACTTCAACACCGGATTTTGATATCCTCCTTCCTTCCAGATACGCACAAGTCATGCGGAGGAAGAAAAGAAAAAAAAAAAAAATTCTTCTTC
TCTAAAGGGTCAGTCAAAAAGGTAAACCCCAAATTAGAATTTGTCAGCCCCACGATCAGATTTCTCATAAAAAAGCTGCTGTTGCTCTGCGTCGCACGGCTAAAGCCTAA
AAGGCTTTTGGGTTTTGCCATTCTGTGGCATTGGAACCAAATCTTAAGCCCCTTATGGCTAATGGCTTGCGAACCTGGGTTTCTCAGAAAGCAGCTTGCTGTTGCTGTTA
AGAGCATCCAGTGGAGCTATGCGATTTTCTGGTCACCATCGACTAGGCAACATGGATCAATTCACCAGATGGATGTTGAATATCCTAATTGGGTGCTGGAATGGTGTGAT
GGCTACTACAATGGAGACATCAAGACAAGGAAAACAGTTCAAGCTGAGGATGTCCATATCAATAATATGGGCTTACACAGAAGTGAGCAATTGAGAGAGCTCTATAAGTC
TCTCTTAGAAGGCGAAAGTGAACAACGAGCGAAAAAGCCCCCTGCCGCTTTGTCTCCTGAAGATCTATCTGATGCAGAATGGTATTACTTGGTTTGCATGTCCTTTTTCT
TCAATCAAGGCCAAGGTTTGCCTGGAAGAGCGTTAGCTAATGATCGAACTATCTGGTTATGCAATGCTCAATATGCGGAGAGTAGTGTATTCTCCCGCTCCTTGCTTGCA
AAGACTGTGGTTTGCTTTCCTTACCTTGGTGGTGTTATTGAACTAGGTGTAACTGAGCAGGTCTCGGAGGATCCTAATCTTCTTCAACATGTCAAAGATTTTTTAGTGAA
GTTCTCAAAGCCAATATGCTCTAAGAAACCTTTTTCCGCTGCTTATAAAGATGATAATTGTAAAGAACCAATGACTGCCAAATCGGACAATGAGATTGTTGAAGTTTTGG
CAATGGAGAACCTCTACTGCTTGACAGAAGTTCACGGGAAGGCAGTAAATGGGATTCAAAGGAAAATAAATGAGTTTGGCATTGATTCTCTCAATGATTTTTCGAATGGT
TGTGAACAATATCACCACCAAGTGGAAAGTCCTTTAAGACTTGAAGGTGTCGAGGGAGGGGCTTCTCGTTTCCAGAGTTTGCAGTTTCTGGATGATGACTTCAGTTACGG
TTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCTGAAGCTTTGGCCAATCAGGAGAAAGTCTCATCTTCTCCAAGATCGAGTGGTGCCAACGATTTACCTTTGAAAG
AACTTCAAAACTCAAATCACACTAAATCAGGTTCCTTAGATTCCAGATCTGATGAAGACATGCACTACAAGAGAACTATCTTCACCATTTTAGGAAGTTCAACTCAACTA
GTTGGAAGTCCCCTTCTCCATAACTTCTCAAGCAGATCCAGTTTCGTGCCATGGAAGAAAGGAATGGCTGAGACAAACACGCCCCCCATGCAACAAAGAATGTTAAAGAA
GATTTTGTTTACAGTTCCATTATTATCTGCTGGTTCTCTAAATGGCCTCAAGGATGGGCAACGGTCGATCTTGAAACAGGGTAACAACGATTTCTGCACTAAAAATGTCG
TGCATGACAAATTGAGAGAAAATGAAAAATTTATGGCCCTTAAGTCGATGTTACCTTCGCTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATCTG
AAGATGCTTGAAGCAAGAGTACAGGAGTTAGAAACTTGCGTGGATTCATTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAGCAGACTTCAGACAA
CTATGACTATGAAAAGATTGAAGGCAGCTTCAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACCGACCTGAAGCTGAAGAATAACATTCCCAAGGATG
GCCATAAGCTAGATGTGAAAGTCACCATGAAAGAGCAAGAAGTTCTTGTTGACATGCACTGTCCTTATCGAGAATATATATTGGTCGACGTCATGGATGCTTTGAATGAT
TTGCAACTGGACGCCCACTCGGTTCAATCTTCTGATCATAATGGTGTTTTCTCCTTGACCCTCAAGTCTAAGTTTCGAGGGATAGTGGCTGCATCAGTTGGGATGATCAA
AGTAGCACTTCTGAAAGTTGCCAACAAGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGAAACCTTTCCGGCGAAAGGTAAGGTACTCCGGAATCGATTTGCCGTCGATCAGCCAGGCTCCGGCCGGAGAACAATCAAGGAACATGAGTTCAAGAATTCACCG
GCGACTCACTGGTCGGCGACAGAGGAGAGAGAGAGAGAGATTGATGCTCAAGCGTGCGAGCATTTGCTTTGAGCTCTCGTCTTCCACCGGTCTCGTCGGCGCCAAGGATA
TCGCTGTTCTCGAACCGGAAAAACCATATTCTACGATGATCAACGACCACACACATCCTTCTGGTCTTGTTCTATCTAACACTGAAACAATTCGCTCATTTCTCACCGCG
GCGTCCATAGACTCACGACTTTCTGAGGAACTCCGGCCGATTGCTTCAGATCTCGCTTCACAAAACAACATTCCGTATAAGCCTCTGAGAGCTATCTGGTTTGCTACGGA
ATCGTGCACACGGCCGGATTTGCTCCGTCTTTTGGCTGGATCGGAGTTCGTCTTTACAAGCCCTAAACCCAGGGAGAAGAGTGAGGAGTTAAAGGCTAGACTGAAGAAGC
TTGCAGATGTAGCAGAGAGGAAGGCCTATCAGGAACTGTTCTTAACAAGTCATGAGGTTGTTGCAGGTTTACATGTCGCGTTGATAATGTTTACTGGCTATCTTGTTGGA
TATGCATTATTCCGAGCATTGTTTAGGCATGATCCAATCATGAGTGCTGCTGGAGGTATCCTTGGATTAGTTTTCGGCATGCTCATAGAAACACTTCTTTTCATTATTCG
ATCGTCCAATTATGATAATCGGTCTTCCCCTTCTAAGCTAAAGAAGAGTCAAGTGTCTACTATATATATGACTTCGAGCTACAGTATCAAGACACGATTTTTGGTTCGGT
TCGGTTGCTTTCATCCTGAACTTCAACACCGGATTTTGATATCCTCCTTCCTTCCAGATACGCACAAGTCATGCGGAGGAAGAAAAGAAAAAAAAAAAAAATTCTTCTTC
TCTAAAGGGTCAGTCAAAAAGGTAAACCCCAAATTAGAATTTGTCAGCCCCACGATCAGATTTCTCATAAAAAAGCTGCTGTTGCTCTGCGTCGCACGGCTAAAGCCTAA
AAGGCTTTTGGGTTTTGCCATTCTGTGGCATTGGAACCAAATCTTAAGCCCCTTATGGCTAATGGCTTGCGAACCTGGGTTTCTCAGAAAGCAGCTTGCTGTTGCTGTTA
AGAGCATCCAGTGGAGCTATGCGATTTTCTGGTCACCATCGACTAGGCAACATGGATCAATTCACCAGATGGATGTTGAATATCCTAATTGGGTGCTGGAATGGTGTGAT
GGCTACTACAATGGAGACATCAAGACAAGGAAAACAGTTCAAGCTGAGGATGTCCATATCAATAATATGGGCTTACACAGAAGTGAGCAATTGAGAGAGCTCTATAAGTC
TCTCTTAGAAGGCGAAAGTGAACAACGAGCGAAAAAGCCCCCTGCCGCTTTGTCTCCTGAAGATCTATCTGATGCAGAATGGTATTACTTGGTTTGCATGTCCTTTTTCT
TCAATCAAGGCCAAGGTTTGCCTGGAAGAGCGTTAGCTAATGATCGAACTATCTGGTTATGCAATGCTCAATATGCGGAGAGTAGTGTATTCTCCCGCTCCTTGCTTGCA
AAGACTGTGGTTTGCTTTCCTTACCTTGGTGGTGTTATTGAACTAGGTGTAACTGAGCAGGTCTCGGAGGATCCTAATCTTCTTCAACATGTCAAAGATTTTTTAGTGAA
GTTCTCAAAGCCAATATGCTCTAAGAAACCTTTTTCCGCTGCTTATAAAGATGATAATTGTAAAGAACCAATGACTGCCAAATCGGACAATGAGATTGTTGAAGTTTTGG
CAATGGAGAACCTCTACTGCTTGACAGAAGTTCACGGGAAGGCAGTAAATGGGATTCAAAGGAAAATAAATGAGTTTGGCATTGATTCTCTCAATGATTTTTCGAATGGT
TGTGAACAATATCACCACCAAGTGGAAAGTCCTTTAAGACTTGAAGGTGTCGAGGGAGGGGCTTCTCGTTTCCAGAGTTTGCAGTTTCTGGATGATGACTTCAGTTACGG
TTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCTGAAGCTTTGGCCAATCAGGAGAAAGTCTCATCTTCTCCAAGATCGAGTGGTGCCAACGATTTACCTTTGAAAG
AACTTCAAAACTCAAATCACACTAAATCAGGTTCCTTAGATTCCAGATCTGATGAAGACATGCACTACAAGAGAACTATCTTCACCATTTTAGGAAGTTCAACTCAACTA
GTTGGAAGTCCCCTTCTCCATAACTTCTCAAGCAGATCCAGTTTCGTGCCATGGAAGAAAGGAATGGCTGAGACAAACACGCCCCCCATGCAACAAAGAATGTTAAAGAA
GATTTTGTTTACAGTTCCATTATTATCTGCTGGTTCTCTAAATGGCCTCAAGGATGGGCAACGGTCGATCTTGAAACAGGGTAACAACGATTTCTGCACTAAAAATGTCG
TGCATGACAAATTGAGAGAAAATGAAAAATTTATGGCCCTTAAGTCGATGTTACCTTCGCTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATCTG
AAGATGCTTGAAGCAAGAGTACAGGAGTTAGAAACTTGCGTGGATTCATTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAGCAGACTTCAGACAA
CTATGACTATGAAAAGATTGAAGGCAGCTTCAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACCGACCTGAAGCTGAAGAATAACATTCCCAAGGATG
GCCATAAGCTAGATGTGAAAGTCACCATGAAAGAGCAAGAAGTTCTTGTTGACATGCACTGTCCTTATCGAGAATATATATTGGTCGACGTCATGGATGCTTTGAATGAT
TTGCAACTGGACGCCCACTCGGTTCAATCTTCTGATCATAATGGTGTTTTCTCCTTGACCCTCAAGTCTAAGTTTCGAGGGATAGTGGCTGCATCAGTTGGGATGATCAA
AGTAGCACTTCTGAAAGTTGCCAACAAGAGCTGAGGTGAGAAGAAAGCTTGGGATCAATGGTGAAGCATTTTCTGGAAGTCAGTAGAGTAGAAATGAGCAGTGAAATGCC
AATTAAGAGAAGAAAAGAAGCTGACAATTCTAACATCCTCTCCCCTTGGGGCTTACAAATTAGGCAAGCTCAGCATCAACTGATAAAGATTGACTTGTTAACTTTTGAGT
GATAAATTTGATCATATACTGTTACTATTTGATACCCACAATCA
Protein sequenceShow/hide protein sequence
MTKPFRRKVRYSGIDLPSISQAPAGEQSRNMSSRIHRRLTGRRQRRERERLMLKRASICFELSSSTGLVGAKDIAVLEPEKPYSTMINDHTHPSGLVLSNTETIRSFLTA
ASIDSRLSEELRPIASDLASQNNIPYKPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLADVAERKAYQELFLTSHEVVAGLHVALIMFTGYLVG
YALFRALFRHDPIMSAAGGILGLVFGMLIETLLFIIRSSNYDNRSSPSKLKKSQVSTIYMTSSYSIKTRFLVRFGCFHPELQHRILISSFLPDTHKSCGGRKEKKKKFFF
SKGSVKKVNPKLEFVSPTIRFLIKKLLLLCVARLKPKRLLGFAILWHWNQILSPLWLMACEPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGSIHQMDVEYPNWVLEWCD
GYYNGDIKTRKTVQAEDVHINNMGLHRSEQLRELYKSLLEGESEQRAKKPPAALSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALANDRTIWLCNAQYAESSVFSRSLLA
KTVVCFPYLGGVIELGVTEQVSEDPNLLQHVKDFLVKFSKPICSKKPFSAAYKDDNCKEPMTAKSDNEIVEVLAMENLYCLTEVHGKAVNGIQRKINEFGIDSLNDFSNG
CEQYHHQVESPLRLEGVEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRSSGANDLPLKELQNSNHTKSGSLDSRSDEDMHYKRTIFTILGSSTQL
VGSPLLHNFSSRSSFVPWKKGMAETNTPPMQQRMLKKILFTVPLLSAGSLNGLKDGQRSILKQGNNDFCTKNVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYL
KMLEARVQELETCVDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSFKPSTNKRKACEMDETDLKLKNNIPKDGHKLDVKVTMKEQEVLVDMHCPYREYILVDVMDALND
LQLDAHSVQSSDHNGVFSLTLKSKFRGIVAASVGMIKVALLKVANKS