CuGenDBv2

Tan0016390 (gene) of Snake gourd v1 genome

Gene ID: Tan0016390
Organism: Trichosanthes anguina (Snake gourd v1)
Description: GCFC domain-containing protein
Genome location: LG11:28822613..28841758
RNA-Seq Expression: Tan0016390
Synteny: Tan0016390
Gene Ontology terms:
GO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domains:
IPR012890 - Intron Large complex component GCFC2-like
IPR022783 - GCF, C-terminal


Homology
GenBank top hits (e value, % identity, alignment)
XP_008446554.1 PREDICTED: PAX3- and PAX7-binding protein 1 [Cucumis melo] | e value: 0.0e+00 | % identity: 76.33
Query:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------
        T+P +    G +  SFASDEENDAPL  SSSK S+S+KPSSARLAKPSSTHKIT LKDRIAHSSS SASVPSNVQPQAG Y+KEAL +L           
Subjt:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------

Query:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV
             KPSAEPVIVLKGLLKP EQ+PE+ RE KE SSEDE  GS+ KSA S RRSKEDTLARMASMGI  GKDSS  SIPDQATINAIRAKRERMRQAGV
Subjt:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV

Query:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS
        AA DYISLDAGSN TAPGEL                    ++KGVFEEVDEQ ID VRTNIIEH+DEDEEEK  EEEQFRKGLGKR+DDGSTRV S S  
Subjt:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS

Query:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----
        ++QSV QQNLIYP T GY+SVPS ST TSIGGSV VS          QAEIAKKA+Q++MGRLK  S+ R++S+    +  + A  + I    +A     
Subjt:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----

Query:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN
                + D V       +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIETAVKA T ILNKKGSS+EM+ AATSAAQAAI S++EQ N
Subjt:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN

Query:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE
        LP+KLDEFGRDLNLQKRMDMKRRAEARKRRR+QYDSKRL S EVDGHQ VEGES+TDESDS+SAAYQSNRDLLLQTA QIFSDAAEEFSQLSVVKQRFEE
Subjt:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE

Query:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV
        WKR YSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS   
Subjt:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV

Query:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG
             NAAFATSLITNYVP SSEALT+LLVVIRTRLS A+EDLTVPTWN+LV KAVPN AR+AAYRFGMS+RL+RNICL K IIALPILEKL +EELLYG
Subjt:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG

Query:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL
        KVLPHVRSITAN+HDAVTRTERIIASL+GVWTG  I G R  +LQPLVDYVLLLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL
Subjt:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL

Query:  KEAL
        KEAL
Subjt:  KEAL

XP_022945398.1 transcriptional repressor ILP1 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 78.09
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL T SSKP+NS+KPSSARLAKPSSTHKIT LKDRIAHSSSTSASVPSNVQPQAGTY+KEAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKPVEQI ++ +E KE SSEDE  GS+ KSAGSFRRSKED LARMASMGI  GKDS+  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEE DEQAID VRTNIIEH+DEDEEEK  E EQFRKGLGKR+DDGSTRV S S  +I SVPQQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA
        TAGYNSVPSIST TSIGGSVGVS          QAEIAKKA++DNMGRLK  S+ R++++    +  + A  ++I                 ++ R  V+
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA

Query:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
         + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIE AVKA   ILNKKGSSNEMIAAATSAAQAAI SAKEQ NLP+K+DEFGRDLNL
Subjt:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRRA+YDSKRL STEVDGHQ VEGES+TDESDSE+AAYQSN DLLLQTA QIFSDAAEEFSQLSVVKQRFE+WKR YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLS  AIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHE+AHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEALT+LLVVIRTRLSSAVEDLTVPTW+ALVMKAVPN AR+AAYRFG+S+RLMRNICL K IIALPILEKL +EELLYGKVLPHVRSITAN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASLSGVWTGP +TG R  +LQPLVDYV+LLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL+EAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

XP_022968423.1 transcriptional repressor ILP1 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 77.42
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL T SSKP+NS+KPSSARLAKPSSTHKIT LKDRIAHSSSTSASVPSNVQPQAG Y++EAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKPVEQI ++ +E KE SSEDE  GS+ KSAGSFRRSKED LARMASMGI  GKDS+  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEE DEQAID VRTNIIEH+DEDEEEK  E EQFRKGLGKR+DDGSTRV S S  +I SV QQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA
        TAGYNSVPSIST TSIGGSVGVS          QAEIAKKA++DNMGRLK  S+ R++++    +  + A  ++I                 ++ R  V+
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA

Query:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
         + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEI+ AVKA   ILNKKGSSNEMIAAATSAAQAAI SAKEQ NLP+K+DEFGRDLNL
Subjt:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRRA+YDSKRL STEVDGHQ VEGES+TDESDSE+AAYQSN DLLLQTA QIFSDAAEEFSQLSVVKQRFE+WKR YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLS  AIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEAL +LLVVIRTRLSSAVEDLTVPTW+ALVMKAVPN AR+AAYRFG+S+RLMRNICL K IIALPILEKL +EELLYGKVLPHVRSITAN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASL GVWTGP +TG R  +LQPLVDYV+LLGRTLEKK  SG+AESETSGLA+RLKKMLVELNEYDNARDIAKTFHL+EAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

XP_023542827.1 transcriptional repressor ILP1 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 78.09
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL T SSKP+NS+KPSSARLAKPSSTHKIT LKDRIAHSSSTSASVPSNVQPQAGTY+KEAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKPVEQI ++ +E KE SSEDE  GS+ KSAGSFRRSKED LARMASMGI  GKDS+  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEE DEQAID VRTNIIEH+DEDEEEK  E EQFRKGLGKR+DDGSTRV S S  +I SVPQQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA
        TAGYNSVPSIST TSIGGSVGVS          QAEIAKKA++DNMGRLK  S+ R++++    +  + A  ++I                 ++ R  V+
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA

Query:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
         + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIE AVKA   ILNKKGSSNEMIAAATSAAQAAI SAKEQ NLP+K+DEFGRDLNL
Subjt:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRRA+YDSKRL STEVDGHQ VEGES+TDESDSE+AAYQSN DLLLQTA QIFSDAAEEFSQLSVVKQRFE+WKR YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLS  AIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEALT+LLVVIRTRLSSAVEDLTVPTW+ALVMKAVPN AR+AAYRFG+S+RLMRNICL K IIALPILEKL +EELLYGKVLPHVRSITAN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASLSGVWTGP +TG R  +LQPLVDYV+LLGRTLEKK  SG+AESETSGLA+RLKKMLVELNEYDNARDIAKTFHL+EAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

XP_038892418.1 transcriptional repressor ILP1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 78.31
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL  SSSK +NS+KPSSARLAKPSSTHKIT LKDRIAHSSS  ASVPSNVQPQAGTY+KEAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKP EQIP++ REAKE +SEDE  GS+ KSAGS RRSKEDTLARMASMGI  GKDSS  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEEVDEQ  D VRTNIIEH+DEDEEEK  EEEQFRKGLGKR+DDGSTRV S S S++QSV QQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA------------SVADVVE
        T GYNSVPS+ST TSIGGSVGVS          QAEIAKKA+QDNMGRLK  S+ R++S+    +  +    ++I    +A             + D V 
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA------------SVADVVE

Query:  -----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
              +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIE AVKA T ILNKKGSS +MIAAATSAAQAAI S++EQ NLP+KLDEFGRDLNL
Subjt:  -----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRR+QYDSKRL STEVDGHQ VEGES+TDESDS+SAAYQSNRDLLLQTA QIFSDAAEEFSQLSVVKQRFEEWK+ YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDG DFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEALT+LLVVIRTRLSSAVEDL VPTWNALVMKAVPN AR+AAYRFGMS+RL+RNICL K IIALPILEKL +EELLYGKVLPHVRSI AN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASLSGVWTGP+ITG R  +LQPLVDYVLLLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHLKEAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWD3 GCFC domain-containing protein | e value: 0.0e+00 | % identity: 75.44
Query:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------
        T+P +    G++  SFASDEENDAPL  SSSK S+S+KPSSARLAKPSSTHKIT LKDRIAHSSS SASVPSNVQPQAG Y+KEAL +L           
Subjt:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------

Query:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV
             KPSAEPVIVLKGLLKP EQ+P++ REAKE SSED+  GS+ KSA S RRSKEDTLARMASMGI  GKDSS  SIPDQATINAIRAKRERMRQAGV
Subjt:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV

Query:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS
        AA DYISLDAGSN TAPGEL                    ++KGVFEEVDEQ ID  RTNIIEH+DEDEEEK  EEEQFRKGLGKR+DDGSTRV S S  
Subjt:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS

Query:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSS----SASEHIEARIY------------A
        ++ SV  QNLIYP T GY+SVPS+ST TSIGGSV +S          QAEIAK A+Q++MGRLK  S+ R++       E++ A +              
Subjt:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSS----SASEHIEARIY------------A

Query:  DAVSIARRHRASVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN
        D     ++ R  V+ + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIETAVKA   ILNKKGSSNEM+ AATSAAQAAI  ++EQ N
Subjt:  DAVSIARRHRASVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN

Query:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE
        LP+KLDEFGRDLNLQKRMDMKRRAEARKRRR+QYDSKRL S EVDGHQ VEGES+TDESDS+SAAYQSNRDLLLQTA QIFSDAAEEFSQLSVVKQRFE 
Subjt:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE

Query:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV
        WKR YSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS   
Subjt:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV

Query:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG
             NAAFATSLITNYVP SSEALT+LLVVIRTRLS A+EDLTVPTWN+LV KAVPN AR+AAYRFGMS+RLMRNICL K IIALPILEKL +EELLYG
Subjt:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG

Query:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL
        KVLPHVRSITAN+HDAVTRTERIIASL+GVWTG  I G R  +LQPLVDYVLLLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL
Subjt:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL

Query:  KEAL
        KEAL
Subjt:  KEAL

A0A1S3BG51 PAX3- and PAX7-binding protein 1 | e value: 0.0e+00 | % identity: 76.33
Query:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------
        T+P +    G +  SFASDEENDAPL  SSSK S+S+KPSSARLAKPSSTHKIT LKDRIAHSSS SASVPSNVQPQAG Y+KEAL +L           
Subjt:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------

Query:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV
             KPSAEPVIVLKGLLKP EQ+PE+ RE KE SSEDE  GS+ KSA S RRSKEDTLARMASMGI  GKDSS  SIPDQATINAIRAKRERMRQAGV
Subjt:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV

Query:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS
        AA DYISLDAGSN TAPGEL                    ++KGVFEEVDEQ ID VRTNIIEH+DEDEEEK  EEEQFRKGLGKR+DDGSTRV S S  
Subjt:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS

Query:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----
        ++QSV QQNLIYP T GY+SVPS ST TSIGGSV VS          QAEIAKKA+Q++MGRLK  S+ R++S+    +  + A  + I    +A     
Subjt:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----

Query:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN
                + D V       +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIETAVKA T ILNKKGSS+EM+ AATSAAQAAI S++EQ N
Subjt:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN

Query:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE
        LP+KLDEFGRDLNLQKRMDMKRRAEARKRRR+QYDSKRL S EVDGHQ VEGES+TDESDS+SAAYQSNRDLLLQTA QIFSDAAEEFSQLSVVKQRFEE
Subjt:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE

Query:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV
        WKR YSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS   
Subjt:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV

Query:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG
             NAAFATSLITNYVP SSEALT+LLVVIRTRLS A+EDLTVPTWN+LV KAVPN AR+AAYRFGMS+RL+RNICL K IIALPILEKL +EELLYG
Subjt:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG

Query:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL
        KVLPHVRSITAN+HDAVTRTERIIASL+GVWTG  I G R  +LQPLVDYVLLLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL
Subjt:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL

Query:  KEAL
        KEAL
Subjt:  KEAL

A0A5D3CCM3 PAX3-and PAX7-binding protein 1 | e value: 0.0e+00 | % identity: 76.33
Query:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------
        T+P +    G +  SFASDEENDAPL  SSSK S+S+KPSSARLAKPSSTHKIT LKDRIAHSSS SASVPSNVQPQAG Y+KEAL +L           
Subjt:  TRPSQT---GVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL-----------

Query:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV
             KPSAEPVIVLKGLLKP EQ+PE+ RE KE SSEDE  GS+ KSA S RRSKEDTLARMASMGI  GKDSS  SIPDQATINAIRAKRERMRQAGV
Subjt:  -----KPSAEPVIVLKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGV

Query:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS
        AA DYISLDAGSN TAPGEL                    ++KGVFEEVDEQ ID VRTNIIEH+DEDEEEK  EEEQFRKGLGKR+DDGSTRV S S  
Subjt:  AALDYISLDAGSNHTAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISAS

Query:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----
        ++QSV QQNLIYP T GY+SVPS ST TSIGGSV VS          QAEIAKKA+Q++MGRLK  S+ R++S+    +  + A  + I    +A     
Subjt:  IIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRA-----

Query:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN
                + D V       +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIETAVKA T ILNKKGSS+EM+ AATSAAQAAI S++EQ N
Subjt:  -------SVADVVE-----YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTN

Query:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE
        LP+KLDEFGRDLNLQKRMDMKRRAEARKRRR+QYDSKRL S EVDGHQ VEGES+TDESDS+SAAYQSNRDLLLQTA QIFSDAAEEFSQLSVVKQRFEE
Subjt:  LPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEE

Query:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV
        WKR YSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS   
Subjt:  WKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTV

Query:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG
             NAAFATSLITNYVP SSEALT+LLVVIRTRLS A+EDLTVPTWN+LV KAVPN AR+AAYRFGMS+RL+RNICL K IIALPILEKL +EELLYG
Subjt:  VPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYG

Query:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL
        KVLPHVRSITAN+HDAVTRTERIIASL+GVWTG  I G R  +LQPLVDYVLLLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL
Subjt:  KVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL

Query:  KEAL
        KEAL
Subjt:  KEAL

A0A6J1G0Q4 transcriptional repressor ILP1 | e value: 0.0e+00 | % identity: 78.09
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL T SSKP+NS+KPSSARLAKPSSTHKIT LKDRIAHSSSTSASVPSNVQPQAGTY+KEAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKPVEQI ++ +E KE SSEDE  GS+ KSAGSFRRSKED LARMASMGI  GKDS+  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEE DEQAID VRTNIIEH+DEDEEEK  E EQFRKGLGKR+DDGSTRV S S  +I SVPQQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA
        TAGYNSVPSIST TSIGGSVGVS          QAEIAKKA++DNMGRLK  S+ R++++    +  + A  ++I                 ++ R  V+
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA

Query:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
         + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEIE AVKA   ILNKKGSSNEMIAAATSAAQAAI SAKEQ NLP+K+DEFGRDLNL
Subjt:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRRA+YDSKRL STEVDGHQ VEGES+TDESDSE+AAYQSN DLLLQTA QIFSDAAEEFSQLSVVKQRFE+WKR YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLS  AIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHE+AHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEALT+LLVVIRTRLSSAVEDLTVPTW+ALVMKAVPN AR+AAYRFG+S+RLMRNICL K IIALPILEKL +EELLYGKVLPHVRSITAN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASLSGVWTGP +TG R  +LQPLVDYV+LLGRTLEKK  SGIAESETSGLA+RLKKMLVELNEYDNARDIAKTFHL+EAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

A0A6J1HXZ2 transcriptional repressor ILP1 | e value: 0.0e+00 | % identity: 77.42
Query:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV
        SFASDEENDAPL T SSKP+NS+KPSSARLAKPSSTHKIT LKDRIAHSSSTSASVPSNVQPQAG Y++EAL +L                KPSAEPVIV
Subjt:  SFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDL----------------KPSAEPVIV

Query:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH
        LKGLLKPVEQI ++ +E KE SSEDE  GS+ KSAGSFRRSKED LARMASMGI  GKDS+  SIPDQATINAIRAKRERMRQAGVAA DYISLDAGSN 
Subjt:  LKGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNH

Query:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA
        TAPGEL                    ++KGVFEE DEQAID VRTNIIEH+DEDEEEK  E EQFRKGLGKR+DDGSTRV S S  +I SV QQNLIYP 
Subjt:  TAPGEL--------------------TRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPA

Query:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA
        TAGYNSVPSIST TSIGGSVGVS          QAEIAKKA++DNMGRLK  S+ R++++    +  + A  ++I                 ++ R  V+
Subjt:  TAGYNSVPSISTTTSIGGSVGVS----------QAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIA----------------RRHRASVA

Query:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
         + ++ +HKAPFIEELEEQMQKLHEER STVVERR+ADNDDEMVEI+ AVKA   ILNKKGSSNEMIAAATSAAQAAI SAKEQ NLP+K+DEFGRDLNL
Subjt:  DVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM
        QKRMDMKRRAEARKRRRA+YDSKRL STEVDGHQ VEGES+TDESDSE+AAYQSN DLLLQTA QIFSDAAEEFSQLSVVKQRFE+WKR YSATYRDAYM
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYM

Query:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI
        SLS  AIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLS        NAAFATSLI
Subjt:  SLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLI

Query:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH
        TNYVPTSSEAL +LLVVIRTRLSSAVEDLTVPTW+ALVMKAVPN AR+AAYRFG+S+RLMRNICL K IIALPILEKL +EELLYGKVLPHVRSITAN+H
Subjt:  TNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVH

Query:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        DAVTRTERIIASL GVWTGP +TG R  +LQPLVDYV+LLGRTLEKK  SG+AESETSGLA+RLKKMLVELNEYDNARDIAKTFHL+EAL
Subjt:  DAVTRTERIIASLSGVWTGPNITGSR--ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

SwissProt top hits (e value, % identity, alignment)
P16383 Intron Large complex component GCFC2 | e value: 1.4e-10 | % identity: 22.44
Query:  EAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTA--------------
        E  E  + D +   + K   S     +  L+  +S  +   + SS   IPD A I A R KRE  R    A  DYISLD    HT+              
Subjt:  EAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTA--------------

Query:  ----------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGL----GKRID----DGSTRVGSISASIIQSVPQQNLIYPATAGY
                  P  L  + + + + E++I        E + EDE++   E++Q RK +     + ID    +GS++V     SI  S P  NL        
Subjt:  ----------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGL----GKRID----DGSTRVGSISASIIQSVPQQNLIYPATAGY

Query:  NSVPSISTTTSIGGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRASVADVVE-YRHKAPFIEELEEQMQKLHEERVST
             ++T  ++      S     +K +QD +   K       SS+++ +  + Y       +  +  V ++++    K   I+E+E  M  L  ++  T
Subjt:  NSVPSISTTTSIGGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARIYADAVSIARRHRASVADVVE-YRHKAPFIEELEEQMQKLHEERVST

Query:  VVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEV
         ++RR     DE+    T ++                           +S K++T   S    F  D   +K   +    E+R+ +R Q    R+ S   
Subjt:  VVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEV

Query:  DGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLH-ENADF
        + HQ  EG S+ DE  S E   +Q ++  +LQ   ++F +  ++F  +  +  +F++W+  +  +Y +A++SL IP + +P +R++L+ W+PL  E+   
Subjt:  DGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLH-ENADF

Query:  FDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEALT----------DLLVVI
         +M W   +  +           + +D  ++  ++ K  +P L   +   WD LS +           TSLIT+      E  T          DLL  I
Subjt:  FDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEALT----------DLLVVI

Query:  RTRLSSAVE-DLTVPTW--NALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRTERIIASLSG
         +R+  AVE D+ +P +  +A+  K  P+ ++    +F   ++L RNI L   ++    L++L + +LL   ++  + + T    D V +  ++ A L  
Subjt:  RTRLSSAVE-DLTVPTW--NALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRTERIIASLSG

Query:  VW--TGPNITGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL
         W       T   +L+  + ++L        + A  ++ SE     + +  +LV++   + A       HL
Subjt:  VW--TGPNITGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHL

P58501 PAX3- and PAX7-binding protein 1    1.2e-27    23.78
Query:  PVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRP-SIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGE
        P + +  ++    E+  E E E    K+ G+F  +       ++S+ +       RP  IPD A I+A R KR+  R+ G    D+   D   +    G 
Subjt:  PVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRP-SIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGE

Query:  LTRKGVFEEVDEQAIDEVRTNI-----------------IEHND--------EDEEEKFLEEEQFRKGLGKRIDDGS--TRVGSISASIIQSVP---QQN
        L R+   +  D++  DE R  +                 IE +D        +DEE    E+EQ RKG+       S  + V     +  Q++P      
Subjt:  LTRKGVFEEVDEQAIDEVRTNI-----------------IEHND--------EDEEEKFLEEEQFRKGLGKRIDDGS--TRVGSISASIIQSVP---QQN

Query:  LIYPATA-GYNSVPSISTTTSI-----GGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARI---------------YADAVSIARRHRASVA
        + Y  TA G +   S  T  ++        +     ++ K+ L+D +  +K           +H+++R+                 +     +  R  V 
Subjt:  LIYPATA-GYNSVPSISTTTSI-----GGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARI---------------YADAVSIARRHRASVA

Query:  DVVE-YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL
        D++E +  K P I ELE  + +L+++R S +V+RR  D  DE  E  +              SN+ + A                     LD FGRD  L
Subjt:  DVVE-YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNL

Query:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAY
         +    +R AE   RR  +  ++  T    D   ++EG S+ DE  S +   +   +D +L+ + ++F D  E F  +  +K +FE W+  Y  +Y+DAY
Subjt:  QKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAY

Query:  MSLSIPAIFSPYVRLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATS
        + L +P +F+P +RL+LL W PL     DF  M W   L  YG  +   +   ++AD  L+P +VEKV LP L       WD  S T             
Subjt:  MSLSIPAIFSPYVRLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATS

Query:  LITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAY-----RFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVR
        LI  Y    +    +  V ++  L      L    +  L  K V        Y     +F  S++L+ N      I +   L++L ++ LL   +L   +
Subjt:  LITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAY-----RFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVR

Query:  SITANVHDAVTRTERIIASLSGVWTGPNITGSR---ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLK---KMLVELNEYDNARDIAKTFHLKE
        + +    D++ + + +I      W   N+ G R   +L+    Y++ L  T+  + + G ++ E     + +K   K+L  +   D+A  +A   ++KE
Subjt:  SITANVHDAVTRTERIIASLSGVWTGPNITGSR---ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLK---KMLVELNEYDNARDIAKTFHLKE

Q8BKT3 Intron Large complex component GCFC2    8.8e-10    23.13
Query:  SEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGELTRKGVFEEVDEQAIDE
        S DE EG+   +     RS        +S  ++    S    IPD A I A R KRE  R  G    DYISLD   NH+      ++   E+ +    D 
Subjt:  SEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGELTRKGVFEEVDEQAIDE

Query:  VRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS------QAEIAKKALQDNM
         +  +     +   ++  EE   R       ++ S          I    Q        AG N+  S S+ +        S        EI KK L + +
Subjt:  VRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVS------QAEIAKKALQDNM

Query:  GRLK--HHSFER-----------SSSASEHIE-ARIYADAVSIARRHRASVADVVE-YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIET
          L+  H S +R           S +A +++E A  +A      R  ++ V ++++    K   I ELE  M  L  +R   +++RR     DE+     
Subjt:  GRLK--HHSFER-----------SSSASEHIE-ARIYADAVSIARRHRASVADVVE-YRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIET

Query:  AVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDE-SDS
          K  +  L +    +E     TSA  +  +  K+Q  L                       EAR+ +R Q    R  S   D HQ  EG S+ DE S +
Subjt:  AVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDE-SDS

Query:  ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFD-MNWHSLLFNYGMPEDG
        E   +   +  +LQ   ++F D  ++F  +  +  +F++W+  +  +Y +A++   +P + SP +R++LL W+PL  ++   D M W + +  + M    
Subjt:  ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENADFFD-MNWHSLLFNYGMPEDG

Query:  SDFAPND-ADANLVPELVEKVALPILHHEIAHCWDMLSVT---VVPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVE-DLTVPTW--NALVM
         D    D +D  ++  ++ K  +P L   +   WD LS +    + +    AF      N V  + +   DLL  I  R+  ++E D+ +P +  ++   
Subjt:  SDFAPND-ADANLVPELVEKVALPILHHEIAHCWDMLSVT---VVPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAVE-DLTVPTW--NALVM

Query:  KAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSRELQPLVDYVLLLG
        K  P+ ++    +F  +++L RNI L   ++    L+ L + +LL   ++  + +      D V +  +I A L   W   N      +  L +++  L 
Subjt:  KAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRTERIIASLSGVWTGPNITGSRELQPLVDYVLLLG

Query:  RTLEKKQAS
        ++ +K  +S
Subjt:  RTLEKKQAS

Q9FNN3 Transcriptional repressor ILP1    2.0e-200    48.99
Query:  SFASDEEND---APLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDLK------------PSAEPVIVL
        SFA DEE +   AP  T   K       SS+RL    S+H+ ++ K+R   S        SNV PQAG+YSKEAL +L+             +AEP +VL
Subjt:  SFASDEEND---APLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDLK------------PSAEPVIVL

Query:  KGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDI--GKDSSRPSIPDQATINAIRAKRERMRQAGVA-ALDYISLDAG-
        KGL+KP +     D E                     ++S +D + +++ +  D    ++    +  DQA I  IRAK+ERMRQ+  A A DYISLD G 
Subjt:  KGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDI--GKDSSRPSIPDQATINAIRAKRERMRQAGVA-ALDYISLDAG-

Query:  SNHTA-------------------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTR------VGSISASIIQSVP
         NH+A                   P +  +KGVF+  DE    +  T    + DEDEE+K  EEEQF+KG+GKR+D+GS R      +G    S  Q++P
Subjt:  SNHTA-------------------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTR------VGSISASIIQSVP

Query:  QQNLIYPATAGYNSVPSISTTTSIGGSVGV------SQAEIAKKALQDNMGRLKHHSFERSSS---ASEHIEARIYA------------DAVSIARRHRA
        QQ     A      +P++S   +IG +  V       QAE+AKKAL+DN+ +LK    +  SS     E++ A + +            D     ++ R 
Subjt:  QQNLIYPATAGYNSVPSISTTTSIGGSVGV------SQAEIAKKALQDNMGRLKHHSFERSSS---ASEHIEARIYA------------DAVSIARRHRA

Query:  SVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRD
         ++ + ++ ++K   IEE+E+QM++L+E+   +++ERRIADN+DEM+E+  AVKA   +LNK GSS+ +IAAAT AA AA  S ++Q N P KLDEFGRD
Subjt:  SVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRD

Query:  LNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQ-NVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYR
         NLQKR ++++RA AR++RRA++++KR ++ EVDG    +EGES+TDESD+E++AY+  RD LLQ A ++FSDA+EE+SQLS VK RFE WKR YS+TYR
Subjt:  LNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQ-NVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYR

Query:  DAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFA
        DAYMSL++P+IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG PEDG DFAP+D DANLVPELVEKVA+PILHH+I  CWD+LS        NA  A
Subjt:  DAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFA

Query:  TSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT
        TSL+TNYV  SSEAL +L   IR RL  A+  ++VPTW+ LV+KAVPN  +VAAYRFG S+RLMRNIC+ K I+ALP+LE L + +LL+GKVLPHVRSI 
Subjt:  TSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT

Query:  ANVHDAVTRTERIIASLSGVWTGPNI--TGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        +N+HDAVTRTERI+ASLSGVWTGP++  T SR LQPLVD  L L R LEK+  SG+ ++ET+GLA+RLK++LVEL+E+D+AR+I +TF+LKEA+
Subjt:  ANVHDAVTRTERIIASLSGVWTGPNI--TGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

Q9Y5B6 PAX3- and PAX7-binding protein 1    4.5e-30    24.43
Query:  ELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRP-SIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGELTRKGVFEEVDEQ
        E+  E E E    K+ G+F  +       ++S+ +       RP  IPD A I+A R KR+  R+ G    D+   D   N    G L R+   +  D++
Subjt:  ELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRP-SIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGELTRKGVFEEVDEQ

Query:  AIDEVRTNI-----------------IEHND--------EDEEEKFLEEEQFRKGLGKRIDDGS--TRVGSISASIIQSVP---QQNLIYPATA-GYNSV
          DE R  +                 IE +D        +DEE    E+EQ RKG+       S    V     +  Q++P      + Y  TA G +  
Subjt:  AIDEVRTNI-----------------IEHND--------EDEEEKFLEEEQFRKGLGKRIDDGS--TRVGSISASIIQSVP---QQNLIYPATA-GYNSV

Query:  PSISTTTSI-----GGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARI---------------YADAVSIARRHRASVADVVE-YRHKAPFI
         S  T  ++        +     ++ KK L+D +  +K           +H+++R+                 +     +  R  V D++E +  K P I
Subjt:  PSISTTTSI-----GGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHIEARI---------------YADAVSIARRHRASVADVVE-YRHKAPFI

Query:  EELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEAR
         ELE  + +L+++R S +V+RR  D  DE  E  +              SN+ + A                     LD FGRD  L +    +R AE  
Subjt:  EELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRDLNLQKRMDMKRRAEAR

Query:  KRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYV
         RR  +  ++  T    D   ++EG S+ DE  S +   +   +D + + +G++F D  E F  +  +K +FE W+  Y  +Y+DAY+ L +P +F+P +
Subjt:  KRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDS-ESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYV

Query:  RLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEAL
        RL+LL W PL     DF +M W   L  YG  E   +   +D D  L+P +VEKV LP L     + WD  S T             LI  Y    +   
Subjt:  RLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEAL

Query:  TDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAY-----RFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRT
         +  V ++  L      L    +  L  K V        Y     +F  S++L+ N      I +   L++L ++ LL   +L   ++ +    D++ + 
Subjt:  TDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAY-----RFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRT

Query:  ERIIASLSGVWTGPNITGSR---ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLK---KMLVELNEYDNARDIAKTFHLKE
        + +I      W   N+ G R   +L+    Y++ L  T+  + + G ++ E     + +K   K+L  +   D+A  +A   ++KE
Subjt:  ERIIASLSGVWTGPNITGSR---ELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLK---KMLVELNEYDNARDIAKTFHLKE

Arabidopsis top hits    e value    %identity    Alignment
AT5G08550.1 GC-rich sequence DNA-binding factor-like protein    1.5e-201    48.99
Query:  SFASDEEND---APLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDLK------------PSAEPVIVL
        SFA DEE +   AP  T   K       SS+RL    S+H+ ++ K+R   S        SNV PQAG+YSKEAL +L+             +AEP +VL
Subjt:  SFASDEEND---APLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDLK------------PSAEPVIVL

Query:  KGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDI--GKDSSRPSIPDQATINAIRAKRERMRQAGVA-ALDYISLDAG-
        KGL+KP +     D E                     ++S +D + +++ +  D    ++    +  DQA I  IRAK+ERMRQ+  A A DYISLD G 
Subjt:  KGLLKPVEQIPENDREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDI--GKDSSRPSIPDQATINAIRAKRERMRQAGVA-ALDYISLDAG-

Query:  SNHTA-------------------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTR------VGSISASIIQSVP
         NH+A                   P +  +KGVF+  DE    +  T    + DEDEE+K  EEEQF+KG+GKR+D+GS R      +G    S  Q++P
Subjt:  SNHTA-------------------PGELTRKGVFEEVDEQAIDEVRTNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTR------VGSISASIIQSVP

Query:  QQNLIYPATAGYNSVPSISTTTSIGGSVGV------SQAEIAKKALQDNMGRLKHHSFERSSS---ASEHIEARIYA------------DAVSIARRHRA
        QQ     A      +P++S   +IG +  V       QAE+AKKAL+DN+ +LK    +  SS     E++ A + +            D     ++ R 
Subjt:  QQNLIYPATAGYNSVPSISTTTSIGGSVGV------SQAEIAKKALQDNMGRLKHHSFERSSS---ASEHIEARIYA------------DAVSIARRHRA

Query:  SVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRD
         ++ + ++ ++K   IEE+E+QM++L+E+   +++ERRIADN+DEM+E+  AVKA   +LNK GSS+ +IAAAT AA AA  S ++Q N P KLDEFGRD
Subjt:  SVADVVEY-RHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKLDEFGRD

Query:  LNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQ-NVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYR
         NLQKR ++++RA AR++RRA++++KR ++ EVDG    +EGES+TDESD+E++AY+  RD LLQ A ++FSDA+EE+SQLS VK RFE WKR YS+TYR
Subjt:  LNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQ-NVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYR

Query:  DAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFA
        DAYMSL++P+IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG PEDG DFAP+D DANLVPELVEKVA+PILHH+I  CWD+LS        NA  A
Subjt:  DAYMSLSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFA

Query:  TSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT
        TSL+TNYV  SSEAL +L   IR RL  A+  ++VPTW+ LV+KAVPN  +VAAYRFG S+RLMRNIC+ K I+ALP+LE L + +LL+GKVLPHVRSI 
Subjt:  TSLITNYVPTSSEALTDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT

Query:  ANVHDAVTRTERIIASLSGVWTGPNI--TGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL
        +N+HDAVTRTERI+ASLSGVWTGP++  T SR LQPLVD  L L R LEK+  SG+ ++ET+GLA+RLK++LVEL+E+D+AR+I +TF+LKEA+
Subjt:  ANVHDAVTRTERIIASLSGVWTGPNI--TGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL

AT5G09210.1 GC-rich sequence DNA-binding factor-like protein    1.3e-85    54.74
Query:  EVDGHQ-NVEGESNT-DESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENA
        +VDG+   VEG+S+T DESD E++AY+  RD LLQ A +IFSDA+  +S+LS VK  F+   R  S  +R AY SL++P+++SPY+RLELL+WDPLH++ 
Subjt:  EVDGHQ-NVEGESNT-DESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHENA

Query:  DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAV
        DF DMNWH LLF+  +    +    N    N V ELV+ VA+PILHH I  CWD+LS        N   ATSL+  YV  SSEAL +L + I  RL  A+
Subjt:  DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEALTDLLVVIRTRLSSAV

Query:  EDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT--ANVHDAVTRTERIIASLSGVWTGPNI--
          ++VPTW+  V K VPN  +VAAYRFG S+RLMRNIC+ K ++ LP+LEKL + +LL+GKVLPHVRSI   +N+HDAVT+TERI+ASLSGVWTGP++  
Subjt:  EDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSIT--ANVHDAVTRTERIIASLSGVWTGPNI--

Query:  TGSRELQPLVDYVLLLGRTLEKKQASG
        T S  LQPLVD  L LGR LEKK   G
Subjt:  TGSRELQPLVDYVLLLGRTLEKKQASG


Sequences
CDS sequence
ATGACCAGACCCTCGCAGACTGGAGTTCGTAACACGAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTTGCACTTCCTCTTCCAAACCATCAAATTCCGAGAAGCC
CTCTTCTGCTCGACTTGCCAAGCCTTCCTCCACTCACAAGATCACTACCTTGAAGGATCGCATCGCTCACTCTTCTTCAACTTCGGCTTCTGTTCCCTCCAATGTGCAGC
CTCAAGCTGGAACTTATTCTAAGGAGGCCCTCAGCGATCTTAAGCCTTCCGCCGAGCCTGTTATTGTTCTAAAGGGTCTTCTCAAACCCGTCGAACAAATTCCAGAAAAT
GATAGAGAAGCTAAAGAGTTGAGCTCCGAGGACGAAGCAGAAGGTAGTGATGGGAAGAGCGCTGGTTCGTTTCGGAGGAGTAAGGAAGATACCTTAGCTCGAATGGCTTC
AATGGGGATTGACATAGGAAAGGATTCATCTAGGCCGTCAATTCCCGATCAAGCGACCATTAACGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAG
CTCTGGATTATATATCGTTAGATGCAGGAAGCAACCACACCGCGCCCGGGGAGTTGACTCGGAAGGGAGTGTTTGAGGAAGTAGATGAGCAAGCAATAGATGAGGTAAGA
ACGAATATTATTGAGCACAACGACGAGGACGAGGAGGAAAAATTTTTGGAAGAGGAGCAGTTTAGGAAGGGACTCGGGAAGAGAATAGACGATGGTTCTACTAGGGTGGG
GAGCATTAGTGCCTCTATCATTCAGAGTGTTCCGCAACAAAACTTAATTTACCCCGCTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAACTACAAGTATAGGAG
GCTCTGTTGGTGTTTCACAGGCTGAGATTGCTAAAAAAGCTCTACAAGATAATATGGGAAGGCTTAAGCATCATTCATTCGAGCGATCGTCAAGCGCGTCGGAACATATA
GAAGCCAGGATCTATGCCGACGCGGTGTCTATAGCGCGTCGGCATAGAGCATCTGTTGCCGACGTGGTGGAGTACCGCCATAAAGCTCCATTCATAGAGGAGCTCGAGGA
GCAGATGCAAAAACTTCATGAAGAACGCGTTTCTACAGTTGTGGAAAGAAGAATAGCTGATAACGACGATGAAATGGTGGAGATAGAAACAGCTGTAAAAGCAGGAACGT
TAATTTTGAATAAGAAAGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCAGCCCAAGCAGCAATCATCTCTGCAAAAGAACAGACAAATCTACCATCGAAGTTA
GATGAATTTGGTAGGGATTTAAATTTACAGAAACGTATGGATATGAAACGGAGAGCTGAGGCTCGAAAGCGCCGGAGAGCTCAATATGATTCGAAGAGACTAACATCAAC
AGAGGTTGATGGCCATCAAAATGTAGAAGGAGAGTCTAACACTGATGAGAGTGATAGTGAGAGTGCAGCATACCAGTCAAACCGTGATTTATTGCTTCAGACTGCTGGTC
AGATTTTTAGTGATGCAGCTGAGGAGTTTTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAAGAATGGAAGAGAGGTTATTCAGCAACGTACCGTGATGCATATATGTCA
TTAAGCATTCCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGACCCCCTACATGAAAATGCAGATTTTTTTGATATGAACTGGCATTCTTTGTTGTT
CAATTATGGTATGCCGGAGGATGGAAGTGATTTTGCTCCAAATGATGCTGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCGATATTGCACCATGAAA
TTGCTCATTGTTGGGACATGCTTAGTGTCACGGTCGTACCTCTCGAACCAAATGCAGCTTTTGCTACAAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTT
ACAGACTTGTTGGTTGTCATTCGCACCCGTTTGTCGAGTGCTGTTGAAGATCTTACGGTTCCTACTTGGAATGCACTTGTGATGAAAGCTGTTCCAAATGTCGCTCGAGT
TGCAGCATATCGGTTTGGCATGTCCATCCGTTTGATGAGAAACATATGTTTGTGTAAGGTAATTATTGCATTGCCCATTTTAGAAAAGCTTGTTGTTGAAGAGCTCTTAT
ATGGAAAAGTTCTACCTCATGTTAGAAGTATCACAGCAAACGTCCATGATGCAGTCACAAGAACAGAGAGAATCATTGCTTCTCTTTCAGGAGTGTGGACAGGCCCCAAC
ATCACCGGCAGTCGCGAGTTGCAACCATTGGTAGACTATGTTCTACTGCTTGGAAGAACATTGGAGAAAAAACAGGCTTCAGGCATAGCTGAGAGTGAGACCAGTGGACT
AGCTCAACGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAAGGAGGCATTATGA
mRNA sequence
ATGACCAGACCCTCGCAGACTGGAGTTCGTAACACGAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTTGCACTTCCTCTTCCAAACCATCAAATTCCGAGAAGCC
CTCTTCTGCTCGACTTGCCAAGCCTTCCTCCACTCACAAGATCACTACCTTGAAGGATCGCATCGCTCACTCTTCTTCAACTTCGGCTTCTGTTCCCTCCAATGTGCAGC
CTCAAGCTGGAACTTATTCTAAGGAGGCCCTCAGCGATCTTAAGCCTTCCGCCGAGCCTGTTATTGTTCTAAAGGGTCTTCTCAAACCCGTCGAACAAATTCCAGAAAAT
GATAGAGAAGCTAAAGAGTTGAGCTCCGAGGACGAAGCAGAAGGTAGTGATGGGAAGAGCGCTGGTTCGTTTCGGAGGAGTAAGGAAGATACCTTAGCTCGAATGGCTTC
AATGGGGATTGACATAGGAAAGGATTCATCTAGGCCGTCAATTCCCGATCAAGCGACCATTAACGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAG
CTCTGGATTATATATCGTTAGATGCAGGAAGCAACCACACCGCGCCCGGGGAGTTGACTCGGAAGGGAGTGTTTGAGGAAGTAGATGAGCAAGCAATAGATGAGGTAAGA
ACGAATATTATTGAGCACAACGACGAGGACGAGGAGGAAAAATTTTTGGAAGAGGAGCAGTTTAGGAAGGGACTCGGGAAGAGAATAGACGATGGTTCTACTAGGGTGGG
GAGCATTAGTGCCTCTATCATTCAGAGTGTTCCGCAACAAAACTTAATTTACCCCGCTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAACTACAAGTATAGGAG
GCTCTGTTGGTGTTTCACAGGCTGAGATTGCTAAAAAAGCTCTACAAGATAATATGGGAAGGCTTAAGCATCATTCATTCGAGCGATCGTCAAGCGCGTCGGAACATATA
GAAGCCAGGATCTATGCCGACGCGGTGTCTATAGCGCGTCGGCATAGAGCATCTGTTGCCGACGTGGTGGAGTACCGCCATAAAGCTCCATTCATAGAGGAGCTCGAGGA
GCAGATGCAAAAACTTCATGAAGAACGCGTTTCTACAGTTGTGGAAAGAAGAATAGCTGATAACGACGATGAAATGGTGGAGATAGAAACAGCTGTAAAAGCAGGAACGT
TAATTTTGAATAAGAAAGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCAGCCCAAGCAGCAATCATCTCTGCAAAAGAACAGACAAATCTACCATCGAAGTTA
GATGAATTTGGTAGGGATTTAAATTTACAGAAACGTATGGATATGAAACGGAGAGCTGAGGCTCGAAAGCGCCGGAGAGCTCAATATGATTCGAAGAGACTAACATCAAC
AGAGGTTGATGGCCATCAAAATGTAGAAGGAGAGTCTAACACTGATGAGAGTGATAGTGAGAGTGCAGCATACCAGTCAAACCGTGATTTATTGCTTCAGACTGCTGGTC
AGATTTTTAGTGATGCAGCTGAGGAGTTTTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAAGAATGGAAGAGAGGTTATTCAGCAACGTACCGTGATGCATATATGTCA
TTAAGCATTCCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGACCCCCTACATGAAAATGCAGATTTTTTTGATATGAACTGGCATTCTTTGTTGTT
CAATTATGGTATGCCGGAGGATGGAAGTGATTTTGCTCCAAATGATGCTGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCGATATTGCACCATGAAA
TTGCTCATTGTTGGGACATGCTTAGTGTCACGGTCGTACCTCTCGAACCAAATGCAGCTTTTGCTACAAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTT
ACAGACTTGTTGGTTGTCATTCGCACCCGTTTGTCGAGTGCTGTTGAAGATCTTACGGTTCCTACTTGGAATGCACTTGTGATGAAAGCTGTTCCAAATGTCGCTCGAGT
TGCAGCATATCGGTTTGGCATGTCCATCCGTTTGATGAGAAACATATGTTTGTGTAAGGTAATTATTGCATTGCCCATTTTAGAAAAGCTTGTTGTTGAAGAGCTCTTAT
ATGGAAAAGTTCTACCTCATGTTAGAAGTATCACAGCAAACGTCCATGATGCAGTCACAAGAACAGAGAGAATCATTGCTTCTCTTTCAGGAGTGTGGACAGGCCCCAAC
ATCACCGGCAGTCGCGAGTTGCAACCATTGGTAGACTATGTTCTACTGCTTGGAAGAACATTGGAGAAAAAACAGGCTTCAGGCATAGCTGAGAGTGAGACCAGTGGACT
AGCTCAACGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAAGGAGGCATTATGA
Protein sequence
MTRPSQTGVRNTSFASDEENDAPLCTSSSKPSNSEKPSSARLAKPSSTHKITTLKDRIAHSSSTSASVPSNVQPQAGTYSKEALSDLKPSAEPVIVLKGLLKPVEQIPEN
DREAKELSSEDEAEGSDGKSAGSFRRSKEDTLARMASMGIDIGKDSSRPSIPDQATINAIRAKRERMRQAGVAALDYISLDAGSNHTAPGELTRKGVFEEVDEQAIDEVR
TNIIEHNDEDEEEKFLEEEQFRKGLGKRIDDGSTRVGSISASIIQSVPQQNLIYPATAGYNSVPSISTTTSIGGSVGVSQAEIAKKALQDNMGRLKHHSFERSSSASEHI
EARIYADAVSIARRHRASVADVVEYRHKAPFIEELEEQMQKLHEERVSTVVERRIADNDDEMVEIETAVKAGTLILNKKGSSNEMIAAATSAAQAAIISAKEQTNLPSKL
DEFGRDLNLQKRMDMKRRAEARKRRRAQYDSKRLTSTEVDGHQNVEGESNTDESDSESAAYQSNRDLLLQTAGQIFSDAAEEFSQLSVVKQRFEEWKRGYSATYRDAYMS
LSIPAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSVTVVPLEPNAAFATSLITNYVPTSSEAL
TDLLVVIRTRLSSAVEDLTVPTWNALVMKAVPNVARVAAYRFGMSIRLMRNICLCKVIIALPILEKLVVEELLYGKVLPHVRSITANVHDAVTRTERIIASLSGVWTGPN
ITGSRELQPLVDYVLLLGRTLEKKQASGIAESETSGLAQRLKKMLVELNEYDNARDIAKTFHLKEAL