; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033318 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033318
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransducin/WD40 repeat-like superfamily protein
Genome locationscaffold5:4138829..4148878
RNA-Seq ExpressionSpg033318
SyntenySpg033318
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017956 - AT hook, DNA-binding motif
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043896.1 DNA binding protein, putative isoform 1 [Cucumis melo var. makuwa]0.0e+0072.83Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEN+E        SDLSSQPK+PRGRPPGRK+K  S LPS PKRPRGRPKK+ +ES D KGDN QL
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL

Query:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS
        VQ  S+E PV SS+ LEIDGVPKN+E  VLLEN+VER+RSTLQEVS CNSEDEVPA+KRRVRRK  ++N VDD+G  SLTE QEDGS   N +A+ NV S
Subjt:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS

Query:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV
        E SGED LLC DISEN VLD SSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKP N+CT   KHRMGYLAVLLGNGSLEVWEVPFPH VK I+S+FNG 
Subjt:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV

Query:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL
        GTDPRFVKLKPIFRCS LR+AN+QSIPLTVEWS  PPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSES+L
Subjt:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL

Query:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS
        ESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PRC+ LSFDDGTLRLLSL KAA DVP TG+PFT  KQKGLH Y CSS+AIWS QVS
Subjt:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS

Query:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA
        RQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+S ITF SP  +VP PLKK+SN+S+ PLSMRAILS+ +QSNEGNHKTA
Subjt:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA

Query:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC
        T    ENE  ++C D D GVESGSE+T +S KKKN+TQ KCKKKG +N +LEC+ +P +DA  DA VE+ ++  L                         
Subjt:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC

Query:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
             +A+ DAD+VP SG+ FE  PPKSV++HR+RWNMN+GSEKWLCYGGA+GILRCQE+VLSALD KLMKKK
Subjt:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

XP_008442823.1 PREDICTED: uncharacterized protein LOC103486595 [Cucumis melo]0.0e+0072.83Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEN+E        SDLSSQPK+PRGRPPGRK+K  S LPS PKRPRGRPKK+ +ES D KGDN QL
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL

Query:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS
        VQ  S+E PV SS+ LEIDGVPKN+E  VLLEN+VER+RSTLQEVS CNSEDEVPA+KRRVRRK  ++N VDD+G  SLTE QEDGS   N +A+ NV S
Subjt:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS

Query:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV
        E SGED LLC DISEN VLD SSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKP N+CT   KHRMGYLAVLLGNGSLEVWEVPFPH VK I+S+FNG 
Subjt:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV

Query:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL
        GTDPRFVKLKPIFRCS LR+AN+QSIPLTVEWS  PPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSES+L
Subjt:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL

Query:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS
        ESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PR I+LSFDDGTLRLLSL KAA DVP TG+PFT  KQKGLH Y CSS+AIWS QVS
Subjt:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS

Query:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA
        RQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+S ITF SP  +VP PLKK+SN+S+ PLSMRAILS+ +QSNEGNHKTA
Subjt:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA

Query:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC
        T    ENE  ++C D D GVESGSE+T +S KKKN+TQ KCKKKG +N +LEC+ +P +DA  DA VE+ ++  L                         
Subjt:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC

Query:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
             +A+ DAD+VP SG+ FE  PPKSV++HR+RWNMN+GSEKWLCYGGA+GILRCQE+VLSALD KLMKKK
Subjt:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

XP_022144681.1 uncharacterized protein LOC111014310 isoform X1 [Momordica charantia]0.0e+0073.18Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE
        FIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTENHE EP            +  +SDLSSQPKRPRGRPPG K+KG SDLPSQPKRPRGRPKKK E 
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE

Query:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG
        SNDN GDN Q+VQ+LS+EYP  SSN LEIDG PKNSEEL+LL NSVERQ+STLQ VS CNS+DE PAQKRRVRRK GTKN++DDMGT   T N+EDGS+ 
Subjt:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG

Query:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV
        I+ Q N NVISE SGEDTLLCN+IS+NA       +FSIPESVALPRVVLCLAHNGKVAWDLKWKP+N+CT   KHRMGYLAVLLGNGSLEVWE+PFPHV
Subjt:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV

Query:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR
        VKAI+S+FN  GTDPRFVKLKPIFR +ML+SAN QSIPLTVEWS+TPPYDYL AGC+DGT        VALWKFSA   NSTCEDTRPLLRFSADTVPIR
Subjt:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR

Query:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC
         VAWAP+ESD ESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIY LDWLPDPRC+ILSFDDGTLRLLSL KAAYDVPVTGKPFTGTKQ+GLH Y  
Subjt:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC

Query:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-
        SSFAIWS QVSRQT       GMVAYC ADG V+RFQLTT+AVEK++SRNRTPHF+CEYLTEE+S IT HSPA+ VPFPLKK SN+SD PLS RAILS+ 
Subjt:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-

Query:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK
        I+SNEGNHKTAT   SENE LA+  D+D  V+SGSE+TLMSMKKKN                                              QTQSKCKK
Subjt:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK

Query:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
        K  D+Q LECSDEPNDAQT  D +PGSG+ FE FPPKSV+LHR+RWNMN GSE+WLCYGG AGI+RCQEIVLS  DKKLM+KK
Subjt:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

XP_022144682.1 uncharacterized protein LOC111014310 isoform X2 [Momordica charantia]0.0e+0073.18Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE
        FIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTENHE EP            +  +SDLSSQPKRPRGRPPG K+KG SDLPSQPKRPRGRPKKK E 
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE

Query:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG
        SNDN GDN Q+VQ+LS+EYP  SSN LEIDG PKNSEEL+LL NSVERQ+STLQ VS CNS+DE PAQKRRVRRK GTKN++DDMGT   T N+EDGS+ 
Subjt:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG

Query:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV
        I+ Q N NVISE SGEDTLLCN+IS+NA       +FSIPESVALPRVVLCLAHNGKVAWDLKWKP+N+CT   KHRMGYLAVLLGNGSLEVWE+PFPHV
Subjt:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV

Query:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR
        VKAI+S+FN  GTDPRFVKLKPIFR +ML+SAN QSIPLTVEWS+TPPYDYL AGC+DGT        VALWKFSA   NSTCEDTRPLLRFSADTVPIR
Subjt:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR

Query:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC
         VAWAP+ESD ESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIY LDWLPDPRC+ILSFDDGTLRLLSL KAAYDVPVTGKPFTGTKQ+GLH Y  
Subjt:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC

Query:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-
        SSFAIWS QVSRQT       GMVAYC ADG V+RFQLTT+AVEK++SRNRTPHF+CEYLTEE+S IT HSPA+ VPFPLKK SN+SD PLS RAILS+ 
Subjt:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-

Query:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK
        I+SNEGNHKTAT   SENE LA+  D+D  V+SGSE+TLMSMKKKN                                              QTQSKCKK
Subjt:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK

Query:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
        K  D+Q LECSDEPNDAQT  D +PGSG+ FE FPPKSV+LHR+RWNMN GSE+WLCYGG AGI+RCQEIVLS  DKKLM+KK
Subjt:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

XP_038903194.1 uncharacterized protein LOC120089853 [Benincasa hispida]0.0e+0075.23Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVAN--SDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNY
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWC VHGTE++  EPTNV    +DLSSQPKRPRGRP GRK+ G S LP QPKRPRGRPKKK EESND KGD+ 
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVAN--SDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNY

Query:  QLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNV
         LVQA SIE PV SSN LE+DGVPKNSE +VLLENSVER+RSTLQEVS CNSEDEVPAQKRRVRRKT  KN+V D+G  SLTEN+EDGSN I+L+AN NV
Subjt:  QLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNV

Query:  ISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFN
        + E SGED LLC +IS NAVLDTSSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKPTN+ T   K RMGYLAVLLGNGSLEVWEVPFPH VKAI+S+FN
Subjt:  ISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFN

Query:  GVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSES
        G GTDPRFVKLKPIFRCSMLR+AN+QSIPLTVEWS TPPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSES
Subjt:  GVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSES

Query:  DLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQ
          ESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PRC+ LSFDDGTLRLLSL KAAYDVPVTG+PFT  KQKGLH Y CSS+AIWS Q
Subjt:  DLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQ

Query:  VSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHK
        VSRQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+STIT HSP  ++PF LKK+SN+S+ PLSMRAILS+ +QSNEGNHK
Subjt:  VSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHK

Query:  TATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPN-DAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDL
        TAT P  ENE+ ALC D D GVESG E+TLMS+KKKN+TQSKC KKG +NQ L+CSD+PN DAQ DA V                               
Subjt:  TATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPN-DAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDL

Query:  ECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
               D QTDA +VPGS + FE  PPKSV++HR+RWNMN+GSE+WLCYGGAAGILRCQEIVLS LD KLMKKK
Subjt:  ECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

TrEMBL top hitse value%identityAlignment
A0A0A0LGM2 Uncharacterized protein0.0e+0072.48Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESND-NKGDNYQ
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTE++E        SDLSSQPKRPRGRPPGRKEKG S LPSQPKRPRGRPKK+ +ESND  KGDN Q
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESND-NKGDNYQ

Query:  LVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVI
        LVQ  S+E PV SSN LEIDGVPKN+E  VLLEN+VER+ STLQEVS C+SEDEVPA+KRRVRRK   +N VDD+G  SL E QEDGS   N +AN NV 
Subjt:  LVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVI

Query:  SECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNG
        SE SGED LLC DISEN VLD SSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKP N+CT   KHRMGYLAVLLGNGSLEVWEVPFPH VKAI+S+FNG
Subjt:  SECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNG

Query:  VGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESD
         GTDPRF+KLKPIFRCS LR+ N+QSIPLTVEWS TPPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSESD
Subjt:  VGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESD

Query:  LESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQV
        LESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PRC+ LSFDDGTLRLLSL KAA DVP TG+PFT  KQKGLH Y CSS+AIWS QV
Subjt:  LESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQV

Query:  SRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKT
        SRQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+S ITF SP  +VP PLKK+SN+S+ PLSMRAILS+ +QSNE   KT
Subjt:  SRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKT

Query:  ATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLE
        AT    ENE   +C D D  VESGSE+TL   KKKN+TQ KC K+G +  +LECSD+P +DA  DA V++ ++  L                        
Subjt:  ATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLE

Query:  CSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
              +AQ DAD +P SG+ FE  PPKSV++HR+RWNMN+GSE+WLCYGGAAGILRC+EIVLSALD KLMKKK
Subjt:  CSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

A0A1S3B6M4 uncharacterized protein LOC1034865950.0e+0072.83Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEN+E        SDLSSQPK+PRGRPPGRK+K  S LPS PKRPRGRPKK+ +ES D KGDN QL
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL

Query:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS
        VQ  S+E PV SS+ LEIDGVPKN+E  VLLEN+VER+RSTLQEVS CNSEDEVPA+KRRVRRK  ++N VDD+G  SLTE QEDGS   N +A+ NV S
Subjt:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS

Query:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV
        E SGED LLC DISEN VLD SSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKP N+CT   KHRMGYLAVLLGNGSLEVWEVPFPH VK I+S+FNG 
Subjt:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV

Query:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL
        GTDPRFVKLKPIFRCS LR+AN+QSIPLTVEWS  PPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSES+L
Subjt:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL

Query:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS
        ESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PR I+LSFDDGTLRLLSL KAA DVP TG+PFT  KQKGLH Y CSS+AIWS QVS
Subjt:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS

Query:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA
        RQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+S ITF SP  +VP PLKK+SN+S+ PLSMRAILS+ +QSNEGNHKTA
Subjt:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA

Query:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC
        T    ENE  ++C D D GVESGSE+T +S KKKN+TQ KCKKKG +N +LEC+ +P +DA  DA VE+ ++  L                         
Subjt:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC

Query:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
             +A+ DAD+VP SG+ FE  PPKSV++HR+RWNMN+GSEKWLCYGGA+GILRCQE+VLSALD KLMKKK
Subjt:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

A0A5D3DPQ1 DNA binding protein, putative isoform 10.0e+0072.83Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL
        FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEN+E        SDLSSQPK+PRGRPPGRK+K  S LPS PKRPRGRPKK+ +ES D KGDN QL
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEESNDNKGDNYQL

Query:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS
        VQ  S+E PV SS+ LEIDGVPKN+E  VLLEN+VER+RSTLQEVS CNSEDEVPA+KRRVRRK  ++N VDD+G  SLTE QEDGS   N +A+ NV S
Subjt:  VQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQANVNVIS

Query:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV
        E SGED LLC DISEN VLD SSI+FSIPESVALPRVVLCLAHNGKVAWDLKWKP N+CT   KHRMGYLAVLLGNGSLEVWEVPFPH VK I+S+FNG 
Subjt:  ECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGV

Query:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL
        GTDPRFVKLKPIFRCS LR+AN+QSIPLTVEWS  PPYDYLLAGCHDGT        VALWKFSA   NS+CEDTRPLLRFSADTVPIR VAWAPSES+L
Subjt:  GTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDL

Query:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS
        ESANV+LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IY LDWLP+PRC+ LSFDDGTLRLLSL KAA DVP TG+PFT  KQKGLH Y CSS+AIWS QVS
Subjt:  ESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVS

Query:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA
        RQT       GMVAYCGADG VVRFQLTTKA +KENSR+RTPH+VCEYLTEE+S ITF SP  +VP PLKK+SN+S+ PLSMRAILS+ +QSNEGNHKTA
Subjt:  RQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-IQSNEGNHKTA

Query:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC
        T    ENE  ++C D D GVESGSE+T +S KKKN+TQ KCKKKG +N +LEC+ +P +DA  DA VE+ ++  L                         
Subjt:  TVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKP-NDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLEC

Query:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
             +A+ DAD+VP SG+ FE  PPKSV++HR+RWNMN+GSEKWLCYGGA+GILRCQE+VLSALD KLMKKK
Subjt:  SDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

A0A6J1CU50 uncharacterized protein LOC111014310 isoform X10.0e+0073.18Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE
        FIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTENHE EP            +  +SDLSSQPKRPRGRPPG K+KG SDLPSQPKRPRGRPKKK E 
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE

Query:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG
        SNDN GDN Q+VQ+LS+EYP  SSN LEIDG PKNSEEL+LL NSVERQ+STLQ VS CNS+DE PAQKRRVRRK GTKN++DDMGT   T N+EDGS+ 
Subjt:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG

Query:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV
        I+ Q N NVISE SGEDTLLCN+IS+NA       +FSIPESVALPRVVLCLAHNGKVAWDLKWKP+N+CT   KHRMGYLAVLLGNGSLEVWE+PFPHV
Subjt:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV

Query:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR
        VKAI+S+FN  GTDPRFVKLKPIFR +ML+SAN QSIPLTVEWS+TPPYDYL AGC+DGT        VALWKFSA   NSTCEDTRPLLRFSADTVPIR
Subjt:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR

Query:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC
         VAWAP+ESD ESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIY LDWLPDPRC+ILSFDDGTLRLLSL KAAYDVPVTGKPFTGTKQ+GLH Y  
Subjt:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC

Query:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-
        SSFAIWS QVSRQT       GMVAYC ADG V+RFQLTT+AVEK++SRNRTPHF+CEYLTEE+S IT HSPA+ VPFPLKK SN+SD PLS RAILS+ 
Subjt:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-

Query:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK
        I+SNEGNHKTAT   SENE LA+  D+D  V+SGSE+TLMSMKKKN                                              QTQSKCKK
Subjt:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK

Query:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
        K  D+Q LECSDEPNDAQT  D +PGSG+ FE FPPKSV+LHR+RWNMN GSE+WLCYGG AGI+RCQEIVLS  DKKLM+KK
Subjt:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

A0A6J1CUD2 uncharacterized protein LOC111014310 isoform X20.0e+0073.18Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE
        FIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTENHE EP            +  +SDLSSQPKRPRGRPPG K+KG SDLPSQPKRPRGRPKKK E 
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPT-----------NVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKKHEE

Query:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG
        SNDN GDN Q+VQ+LS+EYP  SSN LEIDG PKNSEEL+LL NSVERQ+STLQ VS CNS+DE PAQKRRVRRK GTKN++DDMGT   T N+EDGS+ 
Subjt:  SNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNG

Query:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV
        I+ Q N NVISE SGEDTLLCN+IS+NA       +FSIPESVALPRVVLCLAHNGKVAWDLKWKP+N+CT   KHRMGYLAVLLGNGSLEVWE+PFPHV
Subjt:  INLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHV

Query:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR
        VKAI+S+FN  GTDPRFVKLKPIFR +ML+SAN QSIPLTVEWS+TPPYDYL AGC+DGT        VALWKFSA   NSTCEDTRPLLRFSADTVPIR
Subjt:  VKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIR

Query:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC
         VAWAP+ESD ESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIY LDWLPDPRC+ILSFDDGTLRLLSL KAAYDVPVTGKPFTGTKQ+GLH Y  
Subjt:  GVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCC

Query:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-
        SSFAIWS QVSRQT       GMVAYC ADG V+RFQLTT+AVEK++SRNRTPHF+CEYLTEE+S IT HSPA+ VPFPLKK SN+SD PLS RAILS+ 
Subjt:  SSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSE-

Query:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK
        I+SNEGNHKTAT   SENE LA+  D+D  V+SGSE+TLMSMKKKN                                              QTQSKCKK
Subjt:  IQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKK

Query:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK
        K  D+Q LECSDEPNDAQT  D +PGSG+ FE FPPKSV+LHR+RWNMN GSE+WLCYGG AGI+RCQEIVLS  DKKLM+KK
Subjt:  KGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEIVLSALDKKLMKKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19485.1 Transducin/WD40 repeat-like superfamily protein2.6e-16344.52Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDL--------------SSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKK
        F+AV+ HPP S  HK+GIPL GRG++QIWC+++ T   ++   +     L              +++PK+PRGRP  RK    +   ++PK+PRGRP+KK
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDL--------------SSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKK

Query:  H-EESNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVP-KNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQE
           E      D+   V+ALS+ YP +S     +   P +   E  + E  V  + S  Q +S  N+  ++P +++R + K+  ++    +  +S      
Subjt:  H-EESNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVP-KNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQE

Query:  DGSNGINLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEV
                +A  NV S+ S       + ISE              + VALPRVVLCLAHNGKV WD+KW+P+ +  +  KH MGYLAVLLGNGSLEVW+V
Subjt:  DGSNGINLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEV

Query:  PFPHVVKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSAD
        P P    A++       TDPRFVKL P+F+CS L+  +++SIPLTVEWS     D+LLAGCHDGT        VALWKFS      + EDTRPLL FSAD
Subjt:  PFPHVVKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSAD

Query:  TVPIRGVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGL
        T PIR VAWAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IY LDWL DP C++LSFDDGTLR+LSL K AYDVP TG+P+  TKQ+GL
Subjt:  TVPIRGVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGL

Query:  HCYCCSSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKK-VSNRSDPPLSMR
          Y CS+F IWS QVSR       L+G+ AYC ADG++  F+LTTKAVEK+ +RNRTPH++C  LT + ST   HSP  D+P  LKK V    +    +R
Subjt:  HCYCCSSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKK-VSNRSDPPLSMR

Query:  AILSEIQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQ
        ++L+E  S   ++      VS+ + LA    +D G+ES SE T       N+      KKG++N   E              E  +   L+ +K+    +
Subjt:  AILSEIQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQ

Query:  SKCKKKGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEI
           +K   +N                     +G   E FPPK V++HR+RWNMN GSE+WLCYGGAAGI+RCQEI
Subjt:  SKCKKKGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEI

AT1G19485.2 Transducin/WD40 repeat-like superfamily protein2.6e-16344.52Show/hide
Query:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDL--------------SSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKK
        F+AV+ HPP S  HK+GIPL GRG++QIWC+++ T   ++   +     L              +++PK+PRGRP  RK    +   ++PK+PRGRP+KK
Subjt:  FIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDL--------------SSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPKKK

Query:  H-EESNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVP-KNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQE
           E      D+   V+ALS+ YP +S     +   P +   E  + E  V  + S  Q +S  N+  ++P +++R + K+  ++    +  +S      
Subjt:  H-EESNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVP-KNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQE

Query:  DGSNGINLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEV
                +A  NV S+ S       + ISE              + VALPRVVLCLAHNGKV WD+KW+P+ +  +  KH MGYLAVLLGNGSLEVW+V
Subjt:  DGSNGINLQANVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEV

Query:  PFPHVVKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSAD
        P P    A++       TDPRFVKL P+F+CS L+  +++SIPLTVEWS     D+LLAGCHDGT        VALWKFS      + EDTRPLL FSAD
Subjt:  PFPHVVKAIFSRFNGVGTDPRFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSAD

Query:  TVPIRGVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGL
        T PIR VAWAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IY LDWL DP C++LSFDDGTLR+LSL K AYDVP TG+P+  TKQ+GL
Subjt:  TVPIRGVAWAPSESDLESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGL

Query:  HCYCCSSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKK-VSNRSDPPLSMR
          Y CS+F IWS QVSR       L+G+ AYC ADG++  F+LTTKAVEK+ +RNRTPH++C  LT + ST   HSP  D+P  LKK V    +    +R
Subjt:  HCYCCSSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVRFQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKK-VSNRSDPPLSMR

Query:  AILSEIQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQ
        ++L+E  S   ++      VS+ + LA    +D G+ES SE T       N+      KKG++N   E              E  +   L+ +K+    +
Subjt:  AILSEIQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKNQTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQ

Query:  SKCKKKGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEI
           +K   +N                     +G   E FPPK V++HR+RWNMN GSE+WLCYGGAAGI+RCQEI
Subjt:  SKCKKKGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWLCYGGAAGILRCQEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCTTACGGTGCCACTTTCACTTTTGATTGCTTCTGCTGTTGGTTATAGTAGTTTTTATCAGTGGGCTAGTGTTTTTATTGCTGTTTCTGCTCATCCGCCTGGCTC
TTCTTATCACAAGATGGGTATCCCACTCACTGGAAGAGGTATGGTGCAGATATGGTGCTTAGTGCATGGCACTGAAAACCATGAAACAGAACCGACCAATGTAGCAAATT
CAGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGGCCACCAGGGCGTAAGGAAAAAGGGGGATCAGACTTGCCATCTCAACCAAAGAGGCCTAGAGGAAGACCTAAA
AAGAAACACGAAGAATCCAATGATAACAAGGGTGACAATTACCAACTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGATTCATCCAACTTCCTTGAGATTGATGGTGT
CCCCAAAAATTCTGAAGAACTTGTATTACTGGAAAACAGTGTTGAAAGACAGAGGAGTACCTTACAAGAAGTTTCTATATGCAATTCTGAAGATGAAGTTCCTGCACAGA
AGAGAAGAGTGAGAAGAAAAACTGGGACTAAGAATTATGTTGATGACATGGGAACATTTTCACTTACAGAGAATCAAGAAGATGGATCCAATGGTATCAATCTTCAGGCA
AATGTGAATGTTATAAGTGAATGTTCTGGGGAAGATACTCTATTATGTAATGATATTTCAGAGAATGCTGTTTTAGACACTAGCTCAATTAAATTTTCTATTCCTGAGAG
CGTTGCTTTGCCTAGAGTAGTATTGTGCTTGGCTCACAATGGAAAGGTAGCATGGGATTTAAAATGGAAGCCAACTAATTCGTGTACTGCCAAATACAAGCACCGAATGG
GCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTTTTCTAGATTCAATGGGGTGGGCACAGATCCT
CGCTTTGTGAAGTTGAAGCCTATTTTCAGATGCTCGATGTTGAGAAGTGCAAATTCACAAAGCATCCCCCTGACGGTGGAATGGTCGGCAACACCTCCTTATGATTATCT
ACTCGCTGGATGCCATGATGGAACACATGAGAAAATTGAAGTTCCTGAGGTCGCCTTGTGGAAGTTTTCTGCAAATAGTATCAATAGTACCTGCGAAGATACAAGGCCTT
TGCTTCGTTTTAGTGCAGATACAGTTCCAATAAGAGGGGTCGCATGGGCACCAAGTGAAAGTGATCTCGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTA
AAATTTTGGGACCTAAGAGATCCTTTCCGTCCATTGTGGGACCTTCATCCAGCACCGAGGATGATATATGGTCTGGATTGGCTTCCTGATCCTAGATGCATTATTTTATC
CTTTGATGACGGAACGTTGAGACTTCTCAGTTTGCCAAAGGCTGCATATGATGTTCCAGTAACTGGTAAACCCTTCACAGGGACAAAACAAAAAGGTTTACACTGTTACT
GTTGTTCATCATTTGCTATCTGGAGTACTCAAGTTTCAAGGCAGACAGTGGCCTCTACAGCTCTCAGTGGTATGGTTGCATACTGTGGTGCTGATGGAACAGTTGTCCGT
TTCCAGCTTACTACCAAAGCAGTGGAGAAAGAAAATTCTCGAAACAGAACCCCACATTTCGTATGTGAATACTTAACTGAGGAGCAATCAACTATTACATTCCACTCTCC
AGCAGCAGATGTTCCATTCCCTTTGAAGAAGGTGTCCAACAGATCTGACCCGCCATTATCCATGCGAGCTATTTTATCTGAAATTCAGTCAAATGAAGGAAATCACAAAA
CCGCCACAGTTCCAGTATCAGAAAATGAAACATTAGCCCTTTGCTTTGACGATGATACCGGTGTTGAATCTGGATCTGAGGAAACACTGATGTCCATGAAGAAGAAAAAC
CAAACTCAATCAAAGTGCAAGAAGAAGGGAGAGGATAACCAAGACTTGGAATGTAGCGACAAGCCTAACGATGCACAGACAGATGCTGGTGTTGAATCTGGATCTGAGGA
AACACTGATGTCCATGAAGAAGAAAAAACAAACTCAATCAAAGTGCAAGAAGAAGGGAGAGGATAACCAAGACTTGGAATGTAGCGACGAGCCTAACGATGCACAGACGG
ATGCTGACATAGTGCCTGGTTCGGGGAATGGTTTTGAAAAATTCCCTCCCAAATCAGTTTCACTGCATCGAATGAGATGGAACATGAACCTCGGGAGTGAAAAATGGTTG
TGCTATGGTGGAGCAGCTGGAATTTTACGCTGTCAGGAGATTGTGCTGTCTGCTCTCGATAAGAAGTTGATGAAGAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCTTACGGTGCCACTTTCACTTTTGATTGCTTCTGCTGTTGGTTATAGTAGTTTTTATCAGTGGGCTAGTGTTTTTATTGCTGTTTCTGCTCATCCGCCTGGCTC
TTCTTATCACAAGATGGGTATCCCACTCACTGGAAGAGGTATGGTGCAGATATGGTGCTTAGTGCATGGCACTGAAAACCATGAAACAGAACCGACCAATGTAGCAAATT
CAGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGGCCACCAGGGCGTAAGGAAAAAGGGGGATCAGACTTGCCATCTCAACCAAAGAGGCCTAGAGGAAGACCTAAA
AAGAAACACGAAGAATCCAATGATAACAAGGGTGACAATTACCAACTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGATTCATCCAACTTCCTTGAGATTGATGGTGT
CCCCAAAAATTCTGAAGAACTTGTATTACTGGAAAACAGTGTTGAAAGACAGAGGAGTACCTTACAAGAAGTTTCTATATGCAATTCTGAAGATGAAGTTCCTGCACAGA
AGAGAAGAGTGAGAAGAAAAACTGGGACTAAGAATTATGTTGATGACATGGGAACATTTTCACTTACAGAGAATCAAGAAGATGGATCCAATGGTATCAATCTTCAGGCA
AATGTGAATGTTATAAGTGAATGTTCTGGGGAAGATACTCTATTATGTAATGATATTTCAGAGAATGCTGTTTTAGACACTAGCTCAATTAAATTTTCTATTCCTGAGAG
CGTTGCTTTGCCTAGAGTAGTATTGTGCTTGGCTCACAATGGAAAGGTAGCATGGGATTTAAAATGGAAGCCAACTAATTCGTGTACTGCCAAATACAAGCACCGAATGG
GCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTTTTCTAGATTCAATGGGGTGGGCACAGATCCT
CGCTTTGTGAAGTTGAAGCCTATTTTCAGATGCTCGATGTTGAGAAGTGCAAATTCACAAAGCATCCCCCTGACGGTGGAATGGTCGGCAACACCTCCTTATGATTATCT
ACTCGCTGGATGCCATGATGGAACACATGAGAAAATTGAAGTTCCTGAGGTCGCCTTGTGGAAGTTTTCTGCAAATAGTATCAATAGTACCTGCGAAGATACAAGGCCTT
TGCTTCGTTTTAGTGCAGATACAGTTCCAATAAGAGGGGTCGCATGGGCACCAAGTGAAAGTGATCTCGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTA
AAATTTTGGGACCTAAGAGATCCTTTCCGTCCATTGTGGGACCTTCATCCAGCACCGAGGATGATATATGGTCTGGATTGGCTTCCTGATCCTAGATGCATTATTTTATC
CTTTGATGACGGAACGTTGAGACTTCTCAGTTTGCCAAAGGCTGCATATGATGTTCCAGTAACTGGTAAACCCTTCACAGGGACAAAACAAAAAGGTTTACACTGTTACT
GTTGTTCATCATTTGCTATCTGGAGTACTCAAGTTTCAAGGCAGACAGTGGCCTCTACAGCTCTCAGTGGTATGGTTGCATACTGTGGTGCTGATGGAACAGTTGTCCGT
TTCCAGCTTACTACCAAAGCAGTGGAGAAAGAAAATTCTCGAAACAGAACCCCACATTTCGTATGTGAATACTTAACTGAGGAGCAATCAACTATTACATTCCACTCTCC
AGCAGCAGATGTTCCATTCCCTTTGAAGAAGGTGTCCAACAGATCTGACCCGCCATTATCCATGCGAGCTATTTTATCTGAAATTCAGTCAAATGAAGGAAATCACAAAA
CCGCCACAGTTCCAGTATCAGAAAATGAAACATTAGCCCTTTGCTTTGACGATGATACCGGTGTTGAATCTGGATCTGAGGAAACACTGATGTCCATGAAGAAGAAAAAC
CAAACTCAATCAAAGTGCAAGAAGAAGGGAGAGGATAACCAAGACTTGGAATGTAGCGACAAGCCTAACGATGCACAGACAGATGCTGGTGTTGAATCTGGATCTGAGGA
AACACTGATGTCCATGAAGAAGAAAAAACAAACTCAATCAAAGTGCAAGAAGAAGGGAGAGGATAACCAAGACTTGGAATGTAGCGACGAGCCTAACGATGCACAGACGG
ATGCTGACATAGTGCCTGGTTCGGGGAATGGTTTTGAAAAATTCCCTCCCAAATCAGTTTCACTGCATCGAATGAGATGGAACATGAACCTCGGGAGTGAAAAATGGTTG
TGCTATGGTGGAGCAGCTGGAATTTTACGCTGTCAGGAGATTGTGCTGTCTGCTCTCGATAAGAAGTTGATGAAGAAGAAATGA
Protein sequenceShow/hide protein sequence
MLLTVPLSLLIASAVGYSSFYQWASVFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENHETEPTNVANSDLSSQPKRPRGRPPGRKEKGGSDLPSQPKRPRGRPK
KKHEESNDNKGDNYQLVQALSIEYPVDSSNFLEIDGVPKNSEELVLLENSVERQRSTLQEVSICNSEDEVPAQKRRVRRKTGTKNYVDDMGTFSLTENQEDGSNGINLQA
NVNVISECSGEDTLLCNDISENAVLDTSSIKFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNSCTAKYKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIFSRFNGVGTDP
RFVKLKPIFRCSMLRSANSQSIPLTVEWSATPPYDYLLAGCHDGTHEKIEVPEVALWKFSANSINSTCEDTRPLLRFSADTVPIRGVAWAPSESDLESANVVLTAGHGGL
KFWDLRDPFRPLWDLHPAPRMIYGLDWLPDPRCIILSFDDGTLRLLSLPKAAYDVPVTGKPFTGTKQKGLHCYCCSSFAIWSTQVSRQTVASTALSGMVAYCGADGTVVR
FQLTTKAVEKENSRNRTPHFVCEYLTEEQSTITFHSPAADVPFPLKKVSNRSDPPLSMRAILSEIQSNEGNHKTATVPVSENETLALCFDDDTGVESGSEETLMSMKKKN
QTQSKCKKKGEDNQDLECSDKPNDAQTDAGVESGSEETLMSMKKKKQTQSKCKKKGEDNQDLECSDEPNDAQTDADIVPGSGNGFEKFPPKSVSLHRMRWNMNLGSEKWL
CYGGAAGILRCQEIVLSALDKKLMKKK