; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G003630 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G003630
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBED-type domain-containing protein
Genome locationchr03:4271516..4274669
RNA-Seq ExpressionLsi03G003630
SyntenyLsi03G003630
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138492.2 uncharacterized protein LOC101220029 [Cucumis sativus]0.0e+0079.26Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
         MTGEVD ISNHMDMDSSIHLIEVAEPLET S LLL HEEGTSNKVGRKKGSKGK SSC+ERE+IVIPNGGGILDSNR  NQVHMAVGRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNS+EEVRSDFDRCKATWG+TGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQN  LFDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFD LGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVLDNPP    M LPQ+D YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

XP_008441385.1 PREDICTED: uncharacterized protein LOC103485517 [Cucumis melo]0.0e+0078.99Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
        AMT EVDAISNHMDMDSSIHLIEVAEPL+T S LLL HEEGTSNKVGRKKGSKGK SSC++RE+IVIPNGGGILDSNR  NQVHMA+GRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNSVEEVR DFDRCKATWGMTGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGNIE VNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQ+ FDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFDGLGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVL+NPP    M LPQND YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

XP_022955231.1 uncharacterized protein LOC111457243 isoform X2 [Cucurbita moschata]0.0e+0076.46Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQ VPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAG KGNASTCHSVPPEVQ IMQESLDGV MKK+KRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV
        AMTGEVDAISNHMDMDSS+ LIEVAEPLET S LL+NHEEGTSNK+GRK+GSKGKSSC+ER +IVIPNGGGILDSN+V+NQVHMAVGRFLYDIGASLEAV
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV

Query:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY
        NSAYFQPMIESIVSA TGIIPPSY DIRGWILKNSVEEVRSDFDRCKATWG TGCSVMVDQWRTEAGRTMLIFLVYCP+GTVFLESVDASGIMDSPDLLY
Subjt:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY

Query:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC
        ELLKKVVEQVGVKHV+QVITR EEN+AIAGRKLSDTYPTLYWTPCAASCVDLILGD GNIE VNT+IEQARSITRFVYNNSMVLNMVRRYTYGNDI+EPC
Subjt:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC

Query:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------
         TRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM                                                        
Subjt:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDT +QDKIIKEI SYKNA GD  RK AIRAR TLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWST     CP L+RLA RILSQTCSSVG KQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQL++MAAKA E Y +DPLSFDGLGIVDDWVW++DL AE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        D GNLEWTVLDNPPSGST LLP ++D DDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

XP_023550765.1 uncharacterized protein LOC111808808 [Cucurbita pepo subsp. pepo]0.0e+0075.37Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        M+ GLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAG+KGNASTCHSVPPEVQN+MQESLDGVMMKK+KRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV
        A+TGEVDAI+NHMDMDSS HLIEVA+P+ET SGLL+NHEEG+SNKVGRKKGSKGKSSC++R++IV+PNGGGILDSN+++NQVHMA+GRFL+DIGASLEAV
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV

Query:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY
        NSAYFQPMIESIVSAGTGIIPPSY DIRGWILKNSVEEVRSDFDRCKATWG TGCSVMVDQWRTEAGRTMLIFLVYCP+GTVFLESVDASGIMDSPDLLY
Subjt:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY

Query:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC
        EL KKVVEQVGVKHV+QVITRFEEN+AIAGRKLSDTYPTLYWTPCAAS VDLILGDIGNIEGVNT+IEQARSITRFVYNN+MVL+MVRRYTYGNDI+EPC
Subjt:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC

Query:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------
         TRSATNFATLNRMV+LKRCLQTMVTSQEWMDSPYSKRPGGLEM                                                        
Subjt:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------

Query:  ----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEW
                                                                  LVSDT +QD IIKE+  YKNA  D  RK AIR R TLLPAEW
Subjt:  ----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEW

Query:  WSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAED
        WST G G C NL  LATRILSQTCSSVGF+QNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQL++MAAKA E +P+DPLSFDGLGIVDDWVW++DL AED
Subjt:  WSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAED

Query:  CGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        CGNLEWT+LDNPPS STMLLP NDDYDDL AGFDDLEVFKRQRESEDD IS
Subjt:  CGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

XP_038886683.1 uncharacterized protein LOC120076824 [Benincasa hispida]0.0e+0081.62Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV
        AMTGEVDAISNHMDMDSSIHLIEVAEPLET SGLLL HEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFL+DIGASLEAV
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV

Query:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY
        NSAYFQPMIESIVSAGTGIIPPSY DIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLLY
Subjt:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY

Query:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC
        ELLKKVVEQVGVKHVVQVITRFEENFA AGRKLS+TYPTLYWTPC+ASCVDLILGDIGNIEGVNT+IEQARSITRFVYNNS+VLNM+RRYTYGNDIVEPC
Subjt:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC

Query:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------
        VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM                                                        
Subjt:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------

Query:  ----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEW
                                                                  LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEW
Subjt:  ----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEW

Query:  WSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAED
        WSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQL++MA  A EHYP+DPLSFDGLGIVDDWVW+KDLSAED
Subjt:  WSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAED

Query:  CGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        CGNLEWTVLD+PPSGSTMLLP NDDYDDLVAGFDDLEVF++QRESEDDNIS
Subjt:  CGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

TrEMBL top hitse value%identityAlignment
A0A0A0KD75 BED-type domain-containing protein0.0e+0079.26Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
         MTGEVD ISNHMDMDSSIHLIEVAEPLET S LLL HEEGTSNKVGRKKGSKGK SSC+ERE+IVIPNGGGILDSNR  NQVHMAVGRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNS+EEVRSDFDRCKATWG+TGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQN  LFDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFD LGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVLDNPP    M LPQ+D YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

A0A1S4DTW8 uncharacterized protein LOC1034855170.0e+0078.99Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
        AMT EVDAISNHMDMDSSIHLIEVAEPL+T S LLL HEEGTSNKVGRKKGSKGK SSC++RE+IVIPNGGGILDSNR  NQVHMA+GRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNSVEEVR DFDRCKATWGMTGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGNIE VNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQ+ FDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFDGLGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVL+NPP    M LPQND YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

A0A5D3DL06 Putative HAT and BED zinc finger domain-containing protein0.0e+0078.99Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
        AMT EVDAISNHMDMDSSIHLIEVAEPL+T S LLL HEEGTSNKVGRKKGSKGK SSC++RE+IVIPNGGGILDSNR  NQVHMA+GRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNSVEEVR DFDRCKATWGMTGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGNIE VNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQ+ FDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFDGLGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVL+NPP    M LPQND YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

A0A6J1GT79 uncharacterized protein LOC111457243 isoform X20.0e+0076.46Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQ VPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAG KGNASTCHSVPPEVQ IMQESLDGV MKK+KRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV
        AMTGEVDAISNHMDMDSS+ LIEVAEPLET S LL+NHEEGTSNK+GRK+GSKGKSSC+ER +IVIPNGGGILDSN+V+NQVHMAVGRFLYDIGASLEAV
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAV

Query:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY
        NSAYFQPMIESIVSA TGIIPPSY DIRGWILKNSVEEVRSDFDRCKATWG TGCSVMVDQWRTEAGRTMLIFLVYCP+GTVFLESVDASGIMDSPDLLY
Subjt:  NSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLY

Query:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC
        ELLKKVVEQVGVKHV+QVITR EEN+AIAGRKLSDTYPTLYWTPCAASCVDLILGD GNIE VNT+IEQARSITRFVYNNSMVLNMVRRYTYGNDI+EPC
Subjt:  ELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPC

Query:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------
         TRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM                                                        
Subjt:  VTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM--------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDT +QDKIIKEI SYKNA GD  RK AIRAR TLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWST     CP L+RLA RILSQTCSSVG KQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQL++MAAKA E Y +DPLSFDGLGIVDDWVW++DL AE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        D GNLEWTVLDNPPSGST LLP ++D DDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

E5GC76 DNA binding protein0.0e+0078.99Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKC+YCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA
        AMT EVDAISNHMDMDSSIHLIEVAEPL+T S LLL HEEGTSNKVGRKKGSKGK SSC++RE+IVIPNGGGILDSNR  NQVHMA+GRFLYDIGASLEA
Subjt:  AMTGEVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGK-SSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEA

Query:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL
        VNSAYFQPMIESI  AGTGIIPPSY DIRGWILKNSVEEVR DFDRCKATWGMTGCSVMVDQW TEAGRTML FLVYCPKGTVFLESVDASGIMDSPDLL
Subjt:  VNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLL

Query:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP
        YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLIL DIGNIE VNT+IEQARSITRFVYNNSMVLNMVR+ T+GNDIVEP
Subjt:  YELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEP

Query:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------
        C+TRSATNFATLNRMVDLKRCLQ MVTSQEWMDSPYSKRPGGLEM                                                       
Subjt:  CVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGLEM-------------------------------------------------------

Query:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
                                                                   LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE
Subjt:  -----------------------------------------------------------LVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAE

Query:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE
        WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQ+ FDKLHDTRNHIEHQRLSDLVFVR NLQL++MA    EHYP DPLSFDGLGIVDDWVWKKDLSAE
Subjt:  WWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAE

Query:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS
        DCGNLEWTVL+NPP    M LPQND YDDLVAGFDDLEVFKRQRESEDDNIS
Subjt:  DCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17450.1 hAT dimerisation domain-containing protein1.7e-6426.25Show/hide
Query:  DPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTGEVDAISNHMDM
        DP W+H  + ++  + ++KC YC+K+   GGI+R K+HLA   G  + C + P EV   ++E++      KR+ +  D+EM  +   T   D      D 
Subjt:  DPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTGEVDAISNHMDM

Query:  DSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILD-----SNRV--HNQVHMAVGRFLYDIGASLEAVNSAYFQPM
        +         + L   +G     +  + +    +  S+ K+          P+           SNRV     V  ++ +FL+ +G   EA NS YFQ M
Subjt:  DSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILD-----SNRV--HNQVHMAVGRFLYDIGASLEAVNSAYFQPM

Query:  IESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVE
        IE I   G G + PS     G +L+  +  ++S     +++W +TGCS+M D W    G+ M+ FLV CP+G  F  S+DA+ I++    L++ L K+V+
Subjt:  IESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVE

Query:  QVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVR-RYTYGNDIVEPCVTRSATN
         +G ++VVQVIT+    F  AG+ L +    LYWTPCA  C +L+L D   +E V+  +E+A+ ITRF+YN + +LN+++  +T G D++ P V R A+ 
Subjt:  QVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVR-RYTYGNDIVEPCVTRSATN

Query:  FATLNRMVDLKRCLQTMVTSQEWM----------------------------------------------------------------------------
        F TL  ++D K  L+ +  S  W+                                                                            
Subjt:  FATLNRMVDLKRCLQTMVTSQEWM----------------------------------------------------------------------------

Query:  ----------------------------DSPYSKRPGGLEMLVSDTNVQDKIIK-------------EITSYKNASGDFARKTAIRARGTLLPAEWWSTC
                                    +  Y  RP  +        V + I++             +I  Y  A  DF    AI  R  L P+ WW   
Subjt:  ----------------------------DSPYSKRPGGLEMLVSDTNVQDKIIK-------------EITSYKNASGDFARKTAIRARGTLLPAEWWSTC

Query:  GEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLH-DTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVD----DWVWKKDLSAE
        G   C  L R+A RILS TCSSVG +    ++D+++   ++    +   DL +V +NL+LR    K   HY  +P       ++D    DW+   +   E
Subjt:  GEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLH-DTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGIVD----DWVWKKDLSAE

Query:  D
        +
Subjt:  D

AT3G22220.1 hAT transposon superfamily3.3e-15339.58Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        M S L+PV +TPQK D AWKHC+++K GDRVQ++C+YC K+FKGGGI R+KEHLAG+KG  + C  VP EV+  +Q+ +DG + ++RKR+K   E   + 
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTG-EVDA-ISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEG-TSNKVGRKKGSKGKSSCMERELIVIPNGGGILDS----------NRVH-------N
             EV+  ++   D+++              S +++    G T  +  R +    K++  ER  +        +D+          N VH        
Subjt:  AMTG-EVDA-ISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEG-TSNKVGRKKGSKGKSSCMERELIVIPNGGGILDS----------NRVH-------N

Query:  QVHMAVGRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKG
         VHMA+GRFL+DIGA  +A NS   QP I++IVS G G+  P++ D+RGWILK+ VEEV+ + D CK  W  TGCSV+V +  +  G  +L FLVYCP+ 
Subjt:  QVHMAVGRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKG

Query:  TVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNN
         VFL+SVDAS I+DS D LYELLK+VVE++G  +VVQVIT+ E+++A AG+KL D YP+LYW PCAA C+D +L + G ++ +  +IEQAR++TR +YN+
Subjt:  TVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNN

Query:  SMVLNMVRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------
        S VLN++R++T+GNDIV+P  T SATNF T+ R+ DLK  LQ MVTS EW D  YSK  GGL                                      
Subjt:  SMVLNMVRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------

Query:  -------------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDF
                                                                                 E LV D N+QD +IK+I SYKNA G F
Subjt:  -------------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDF

Query:  ARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFD
         R  AIRAR T+LPAEWWST GE  C NL+R A RILSQTC SS+G  +N     ++++++N IE QRL+DLVFV++N++LRR+ ++++    +DPLS  
Subjt:  ARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFD

Query:  GLGIVDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRES
         + +++DWV +  +  E  G+ +W  L+       + +   D+ +DL +GFDD E+FK ++E+
Subjt:  GLGIVDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRES

AT3G22220.2 hAT transposon superfamily3.3e-15339.58Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        M S L+PV +TPQK D AWKHC+++K GDRVQ++C+YC K+FKGGGI R+KEHLAG+KG  + C  VP EV+  +Q+ +DG + ++RKR+K   E   + 
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTG-EVDA-ISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEG-TSNKVGRKKGSKGKSSCMERELIVIPNGGGILDS----------NRVH-------N
             EV+  ++   D+++              S +++    G T  +  R +    K++  ER  +        +D+          N VH        
Subjt:  AMTG-EVDA-ISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEG-TSNKVGRKKGSKGKSSCMERELIVIPNGGGILDS----------NRVH-------N

Query:  QVHMAVGRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKG
         VHMA+GRFL+DIGA  +A NS   QP I++IVS G G+  P++ D+RGWILK+ VEEV+ + D CK  W  TGCSV+V +  +  G  +L FLVYCP+ 
Subjt:  QVHMAVGRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKG

Query:  TVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNN
         VFL+SVDAS I+DS D LYELLK+VVE++G  +VVQVIT+ E+++A AG+KL D YP+LYW PCAA C+D +L + G ++ +  +IEQAR++TR +YN+
Subjt:  TVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNN

Query:  SMVLNMVRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------
        S VLN++R++T+GNDIV+P  T SATNF T+ R+ DLK  LQ MVTS EW D  YSK  GGL                                      
Subjt:  SMVLNMVRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------

Query:  -------------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDF
                                                                                 E LV D N+QD +IK+I SYKNA G F
Subjt:  -------------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDF

Query:  ARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFD
         R  AIRAR T+LPAEWWST GE  C NL+R A RILSQTC SS+G  +N     ++++++N IE QRL+DLVFV++N++LRR+ ++++    +DPLS  
Subjt:  ARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFD

Query:  GLGIVDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRES
         + +++DWV +  +  E  G+ +W  L+       + +   D+ +DL +GFDD E+FK ++E+
Subjt:  GLGIVDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRES

AT4G15020.1 hAT transposon superfamily1.0e-14638.71Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        M + L+PV +TPQK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG  + C  VP +V+  +Q+ +DG + ++RKR K   E  +V 
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTG-EVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELI------VIPNGGGILDSNRVH-------NQVHMAV
        ++   E D +    D++        ++ +     LL    +  + +  +K   +  S+    +LI      +IP     +  N VH       N +HMA+
Subjt:  AMTG-EVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELI------VIPNGGGILDSNRVH-------NQVHMAV

Query:  GRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLES
        GRFL+ IGA  +AVNS  FQPMI++I S G G+  P++ D+RGWILKN VEE+  + D CKA W  TGCS++V++  ++ G  +L FLVYCP+  VFL+S
Subjt:  GRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLES

Query:  VDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNM
        VDAS ++ S D L+ELL ++VE+VG  +VVQVIT+ ++ +  AG++L   YP+LYW PCAA C+D +L + G +  ++  IEQA++ITRFVYN+S VLN+
Subjt:  VDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNM

Query:  VRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------------
        + ++T GNDI+ P  + SATNFATL R+ +LK  LQ MVTS EW +  YS+ P GL                                            
Subjt:  VRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------------

Query:  ---------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDFARKT
                                                                             E LV D  +QDKIIKE+TSYK A G F R  
Subjt:  ---------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDFARKT

Query:  AIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGI
        AIRAR T+LPAEWWST GE  C NL+R A RILSQTC SSV  ++NQI  + ++ ++N IE +RLSDLVFV++N++LR++   + +   +DPLS + + +
Subjt:  AIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGI

Query:  VDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDD
        + +WV       E  G+ +W  L++       + P  DD +DL +GFDD+E+FK ++E  D+
Subjt:  VDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDD

AT4G15020.2 hAT transposon superfamily1.0e-14638.71Show/hide
Query:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN
        M + L+PV +TPQK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG  + C  VP +V+  +Q+ +DG + ++RKR K   E  +V 
Subjt:  MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN

Query:  AMTG-EVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELI------VIPNGGGILDSNRVH-------NQVHMAV
        ++   E D +    D++        ++ +     LL    +  + +  +K   +  S+    +LI      +IP     +  N VH       N +HMA+
Subjt:  AMTG-EVDAISNHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELI------VIPNGGGILDSNRVH-------NQVHMAV

Query:  GRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLES
        GRFL+ IGA  +AVNS  FQPMI++I S G G+  P++ D+RGWILKN VEE+  + D CKA W  TGCS++V++  ++ G  +L FLVYCP+  VFL+S
Subjt:  GRFLYDIGASLEAVNSAYFQPMIESIVSAGTGIIPPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLES

Query:  VDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNM
        VDAS ++ S D L+ELL ++VE+VG  +VVQVIT+ ++ +  AG++L   YP+LYW PCAA C+D +L + G +  ++  IEQA++ITRFVYN+S VLN+
Subjt:  VDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNM

Query:  VRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------------
        + ++T GNDI+ P  + SATNFATL R+ +LK  LQ MVTS EW +  YS+ P GL                                            
Subjt:  VRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPGGL--------------------------------------------

Query:  ---------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDFARKT
                                                                             E LV D  +QDKIIKE+TSYK A G F R  
Subjt:  ---------------------------------------------------------------------EMLVSDTNVQDKIIKEITSYKNASGDFARKT

Query:  AIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGI
        AIRAR T+LPAEWWST GE  C NL+R A RILSQTC SSV  ++NQI  + ++ ++N IE +RLSDLVFV++N++LR++   + +   +DPLS + + +
Subjt:  AIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC-SSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLRRMAAKATEHYPIDPLSFDGLGI

Query:  VDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDD
        + +WV       E  G+ +W  L++       + P  DD +DL +GFDD+E+FK ++E  D+
Subjt:  VDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTGGGTTGCAACCAGTTCCTATAACGCCCCAAAAACACGACCCGGCGTGGAAGCACTGTCAGATGTTTAAGAATGGGGATAGAGTGCAGCTCAAATGTATGTA
CTGTCACAAACTTTTTAAGGGTGGTGGGATTCATAGAATAAAAGAACATCTTGCTGGTCAAAAGGGTAATGCTTCTACTTGCCACAGTGTTCCTCCTGAGGTTCAGAATA
TAATGCAAGAGAGTTTAGATGGGGTAATGATGAAGAAGAGGAAAAGACAGAAGCTTGATGAAGAGATGACTAATGTGAATGCCATGACTGGTGAGGTAGATGCAATTTCC
AATCATATGGATATGGATTCTAGTATTCATTTGATTGAAGTTGCTGAACCACTTGAAACGTGTTCAGGTTTGTTACTAAATCACGAGGAAGGAACAAGTAATAAAGTGGG
AAGGAAAAAGGGTAGTAAAGGTAAGAGTTCTTGCATGGAACGAGAGTTGATTGTTATTCCAAATGGTGGTGGTATATTAGATTCTAATAGGGTCCATAATCAAGTGCATA
TGGCAGTTGGGCGATTTTTGTATGACATTGGGGCATCTCTAGAAGCAGTAAACTCAGCCTATTTCCAGCCAATGATAGAATCAATTGTTTCAGCAGGTACTGGGATTATA
CCACCCTCATACCCTGATATTCGGGGTTGGATATTGAAGAATTCAGTGGAAGAAGTGAGGAGTGATTTTGACAGATGCAAAGCTACATGGGGAATGACTGGTTGTTCTGT
CATGGTTGATCAGTGGCGTACAGAAGCAGGTCGAACCATGCTCATTTTTTTGGTGTATTGCCCTAAGGGAACAGTGTTTTTGGAATCTGTGGATGCATCTGGAATTATGG
ATTCCCCAGATTTGCTTTACGAATTACTCAAAAAGGTGGTTGAACAAGTAGGGGTGAAACATGTAGTGCAGGTGATTACTAGGTTTGAAGAAAATTTTGCTATTGCTGGT
AGAAAGCTTTCGGATACATATCCAACCCTCTATTGGACCCCATGTGCTGCTAGTTGTGTGGATTTGATTCTTGGGGATATTGGAAACATAGAGGGTGTAAATACTCTTAT
TGAGCAGGCTCGATCAATTACAAGATTTGTCTATAACAATAGTATGGTTTTAAACATGGTCAGGAGGTATACCTATGGGAATGATATCGTAGAACCTTGTGTCACAAGAT
CTGCCACAAATTTCGCCACATTGAACCGAATGGTTGATCTGAAACGATGTCTGCAGACCATGGTTACTTCTCAAGAGTGGATGGACAGCCCATATTCAAAGAGGCCAGGG
GGGCTGGAAATGCTGGTTTCTGATACAAATGTTCAAGATAAAATAATAAAAGAAATAACCTCGTACAAGAATGCTAGTGGAGATTTTGCAAGAAAGACGGCTATTAGAGC
AAGAGGGACACTGCTTCCAGCTGAGTGGTGGTCAACATGTGGAGAAGGAGGATGCCCAAATTTAACTCGCTTGGCCACTCGAATTCTAAGTCAAACCTGCTCCTCAGTGG
GGTTTAAGCAAAATCAAATCCTTTTTGACAAGTTACATGACACGAGGAATCACATTGAACATCAACGTCTTAGTGACCTTGTATTTGTGCGCTTCAACTTGCAACTTAGA
CGAATGGCCGCTAAAGCCACTGAACATTATCCAATTGACCCCCTTTCCTTTGATGGCCTCGGTATTGTTGACGACTGGGTTTGGAAAAAGGATTTAAGTGCAGAGGATTG
TGGGAATCTGGAATGGACAGTACTTGATAATCCTCCCTCCGGTTCCACTATGCTTTTGCCCCAGAATGATGACTATGATGACTTGGTTGCAGGGTTTGATGACTTGGAGG
TTTTTAAAAGGCAGAGGGAGAGTGAAGATGACAATATTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTGGGTTGCAACCAGTTCCTATAACGCCCCAAAAACACGACCCGGCGTGGAAGCACTGTCAGATGTTTAAGAATGGGGATAGAGTGCAGCTCAAATGTATGTA
CTGTCACAAACTTTTTAAGGGTGGTGGGATTCATAGAATAAAAGAACATCTTGCTGGTCAAAAGGGTAATGCTTCTACTTGCCACAGTGTTCCTCCTGAGGTTCAGAATA
TAATGCAAGAGAGTTTAGATGGGGTAATGATGAAGAAGAGGAAAAGACAGAAGCTTGATGAAGAGATGACTAATGTGAATGCCATGACTGGTGAGGTAGATGCAATTTCC
AATCATATGGATATGGATTCTAGTATTCATTTGATTGAAGTTGCTGAACCACTTGAAACGTGTTCAGGTTTGTTACTAAATCACGAGGAAGGAACAAGTAATAAAGTGGG
AAGGAAAAAGGGTAGTAAAGGTAAGAGTTCTTGCATGGAACGAGAGTTGATTGTTATTCCAAATGGTGGTGGTATATTAGATTCTAATAGGGTCCATAATCAAGTGCATA
TGGCAGTTGGGCGATTTTTGTATGACATTGGGGCATCTCTAGAAGCAGTAAACTCAGCCTATTTCCAGCCAATGATAGAATCAATTGTTTCAGCAGGTACTGGGATTATA
CCACCCTCATACCCTGATATTCGGGGTTGGATATTGAAGAATTCAGTGGAAGAAGTGAGGAGTGATTTTGACAGATGCAAAGCTACATGGGGAATGACTGGTTGTTCTGT
CATGGTTGATCAGTGGCGTACAGAAGCAGGTCGAACCATGCTCATTTTTTTGGTGTATTGCCCTAAGGGAACAGTGTTTTTGGAATCTGTGGATGCATCTGGAATTATGG
ATTCCCCAGATTTGCTTTACGAATTACTCAAAAAGGTGGTTGAACAAGTAGGGGTGAAACATGTAGTGCAGGTGATTACTAGGTTTGAAGAAAATTTTGCTATTGCTGGT
AGAAAGCTTTCGGATACATATCCAACCCTCTATTGGACCCCATGTGCTGCTAGTTGTGTGGATTTGATTCTTGGGGATATTGGAAACATAGAGGGTGTAAATACTCTTAT
TGAGCAGGCTCGATCAATTACAAGATTTGTCTATAACAATAGTATGGTTTTAAACATGGTCAGGAGGTATACCTATGGGAATGATATCGTAGAACCTTGTGTCACAAGAT
CTGCCACAAATTTCGCCACATTGAACCGAATGGTTGATCTGAAACGATGTCTGCAGACCATGGTTACTTCTCAAGAGTGGATGGACAGCCCATATTCAAAGAGGCCAGGG
GGGCTGGAAATGCTGGTTTCTGATACAAATGTTCAAGATAAAATAATAAAAGAAATAACCTCGTACAAGAATGCTAGTGGAGATTTTGCAAGAAAGACGGCTATTAGAGC
AAGAGGGACACTGCTTCCAGCTGAGTGGTGGTCAACATGTGGAGAAGGAGGATGCCCAAATTTAACTCGCTTGGCCACTCGAATTCTAAGTCAAACCTGCTCCTCAGTGG
GGTTTAAGCAAAATCAAATCCTTTTTGACAAGTTACATGACACGAGGAATCACATTGAACATCAACGTCTTAGTGACCTTGTATTTGTGCGCTTCAACTTGCAACTTAGA
CGAATGGCCGCTAAAGCCACTGAACATTATCCAATTGACCCCCTTTCCTTTGATGGCCTCGGTATTGTTGACGACTGGGTTTGGAAAAAGGATTTAAGTGCAGAGGATTG
TGGGAATCTGGAATGGACAGTACTTGATAATCCTCCCTCCGGTTCCACTATGCTTTTGCCCCAGAATGATGACTATGATGACTTGGTTGCAGGGTTTGATGACTTGGAGG
TTTTTAAAAGGCAGAGGGAGAGTGAAGATGACAATATTTCATAACAAGAAGCTAGCAATGCCTGCAAGCAAGTCACACCATTTCTATTGGTAGGTGTCTATTTATCTTGC
ATGGAAGGAATAACTAGATGATATGATTTAGTGAAGCGCCTCAAGAGCAGATTGAGTGTTGTATGCCTCTGTAGTAGAATAGGTCACCAGGTCACTGTATCTTATAACTA
AGCAAAGTAGTTTTGTAGATTACAAGGCTTTTAGGTCACATATGATTTAGATTATTCAAATTAAAGCTGTATATGCAAACGATGGAGGGAGACACCCCTCAATTGATTTT
TAATGATGATATCTTAGTTGAGCTTTTTATGCTGAAAA
Protein sequenceShow/hide protein sequence
MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCMYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTGEVDAIS
NHMDMDSSIHLIEVAEPLETCSGLLLNHEEGTSNKVGRKKGSKGKSSCMERELIVIPNGGGILDSNRVHNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIVSAGTGII
PPSYPDIRGWILKNSVEEVRSDFDRCKATWGMTGCSVMVDQWRTEAGRTMLIFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAG
RKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTLIEQARSITRFVYNNSMVLNMVRRYTYGNDIVEPCVTRSATNFATLNRMVDLKRCLQTMVTSQEWMDSPYSKRPG
GLEMLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQILFDKLHDTRNHIEHQRLSDLVFVRFNLQLR
RMAAKATEHYPIDPLSFDGLGIVDDWVWKKDLSAEDCGNLEWTVLDNPPSGSTMLLPQNDDYDDLVAGFDDLEVFKRQRESEDDNIS