; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013084 (gene) of Chayote v1 genome

Gene IDSed0013084
OrganismSechium edule (Chayote v1)
DescriptionAT-rich interactive domain-containing protein 4-like
Genome locationLG06:194837..200169
RNA-Seq ExpressionSed0013084
SyntenySed0013084
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001606 - ARID DNA-binding domain
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR036431 - ARID DNA-binding domain superfamily
IPR042293 - AT-rich interactive domain-containing protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606676.1 AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.7Show/hide
Query:  GAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLEDLCELF
        GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPELVSSGRLEV+VLT+PS +EF RIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLEDL  LF
Subjt:  GAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLEDLCELF

Query:  NTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEPQLVGE
        +TALPTTVYLE+PNG  IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV+QSSSTHTWDAFQLAHAAF+LHC G NY+LPG +DD  SDLEPQL+G 
Subjt:  NTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEPQLVGE

Query:  PLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIV
        P KIN+EPP++DA E+ED SLE LP IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIV
Subjt:  PLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIV

Query:  TCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIG
        TCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLVALGIG
Subjt:  TCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIG

Query:  GAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLPSARKR
        G QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+T+SHDSLA ANIPS+RR+GRE+PAPMNG KA L  ARKR
Subjt:  GAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLPSARKR

Query:  LKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEEEFLKD
        LKV +MRPIP VHRNK+  FSG TE DGN GGQ KASLP VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEEEFLKD
Subjt:  LKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEEEFLKD

Query:  VMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
        VMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
Subjt:  VMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE

Query:  CCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        CCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  CCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

XP_022949406.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata]0.0e+0086.79Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHS+GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPELVSSGRLEV+VLT+PSK+EF RIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LF+TALPTTVYLE+PNG +IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV++SSSTHTWDAFQLAHAAF+LHC G NY+LPG +DD  SDLEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
        QL+GEP KINIEPP++DA E+ED SLE +P IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Subjt:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM

Query:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV
        RCDIVTCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLV
Subjt:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV

Query:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP
        ALGIGG QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+T+SHDSLA ANIPS+RR+GRE+PAPMNG KA L 
Subjt:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP

Query:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE
         ARKRLKV +M+PIP VHRNK+  FSG TE DGN GGQ KASLP VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEE
Subjt:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE

Query:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
        EFLKDVMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
Subjt:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD

Query:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        DVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

XP_022998193.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima]0.0e+0086.27Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHS+GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPELVSSGRLEV+VLT+PSK+EFSRIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LFNTALPTTVYLE+PNG  IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV+QSSSTHTWDAFQLAHAAF+L C G NY+LPG +D+  SDLEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
        QL+GEP KI +EPP++DA  +ED SLE LP IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQA SFSRGVVTM
Subjt:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM

Query:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV
        RCDIVTCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLV
Subjt:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV

Query:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP
        ALGIGG QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+ + HDSLA ANIPS+RR+GRE+PAPMNG KA L 
Subjt:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP

Query:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE
         ARKRLKV +MRPIP VHRNK+  FSG TE DGN G Q KASLP+VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEE
Subjt:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE

Query:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
        EFLKDVMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
Subjt:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD

Query:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        DVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

XP_023524673.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0087.05Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHS+GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPEL SSGRLEV+VLT+PSK+EF RIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LF+TALPTTVYLE+PNG  IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV+QSSSTHTWDAFQLAHAAF+LHC G NY+LPG +DD  SDLEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
        QL+GEP KIN+EPP++DA E+ED SLE LP IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Subjt:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM

Query:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV
        RCDIVTCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLV
Subjt:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV

Query:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP
        ALGIGG QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+T+SHDSLA ANIPS+RR+GRE+PAPMNG KA L 
Subjt:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP

Query:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE
         ARKRLKV +MRPIP VHRNK+  FSG TE DGN GGQ KASLP VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEE
Subjt:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE

Query:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
        EFLKDVMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
Subjt:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD

Query:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        DVDGECCLLCRSSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

XP_038883881.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida]0.0e+0087.23Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHSV AARQTCSLLAVTCGSV +IKCEE+V ED L+YPFPELVSSGRLEVRVL +PSKDEFSRIVES  PSF+YLQGEQL NDEIGSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDD-IMSDLE
        LC LFNT LPT VYLEIPNG RIAE LHSKGIPY++YWN+TFSCYAAA FRNALLSV+QSSSTHTWDAFQLA AAFKL+C GSNY LPGI+DD IMSDLE
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDD-IMSDLE

Query:  PQLVGEPLKINIEPPKIDADEEE--DGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV
        PQL+GEPLKIN+EPP++DA E E  DGSLE LPAISIH+N VT+RFLICGVPCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV
Subjt:  PQLVGEPLKINIEPPKIDADEEE--DGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV

Query:  VTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYR
        VTMRCDIVTCSSAHIS+LVSGSA TCFDDQLLEKHIKHEIIEN+QLVHAM+DCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYR
Subjt:  VTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYR

Query:  SLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKA
        SLVALGIGG QGLPVASFEKEDA+RLLFFCS DENDKHS+QLL S LP WFK PTPSRKRVEPSQG RST+SHDSLA ANIPS+RR+ RE+PAPMNG KA
Subjt:  SLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKA

Query:  SLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDC
         L   RKRLKV SMRP+P VHRNKI+ FSGL E D N G  SKASLP+VT  KHVT G STSA  RKSFS+SSQ KQ II LNPLPL+KHGCGRNPIQDC
Subjt:  SLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDC

Query:  SEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL
        SEEEFLKDVMEFLLLRGHSRLIPQGG+EEFP+AILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL
Subjt:  SEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL

Query:  AHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        AHDDVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH++ANG
Subjt:  AHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

TrEMBL top hitse value%identityAlignment
A0A0A0LEG9 ARID domain-containing protein0.0e+0084.52Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHSV AARQTCSLLAVTCG+V ++KCEE+V ED LKYPFPELVS GRLEVRVL +PSKDEFSRIVESC PSF+YLQGEQL NDEIGSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDD-IMSDLE
        LC LFN ALPT VYLEIP+G RIAE LHSKGIPY++YWN+TFSCYAAA FR+ALLSV+QSSSTHTWDAFQLA AAF+L+  GSNY LPGI+DD +MSDLE
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDD-IMSDLE

Query:  PQLVGEPLKINIEPPKIDA--DEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV
        PQL+GEPLKI++EPP++D    E+EDGSLE LPAI+IH+N VTMRFLICGVPCTPD  L++SLEDGL+ALL IE+RGSKLQGKFSAPPPPLQAGSFSRGV
Subjt:  PQLVGEPLKINIEPPKIDA--DEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV

Query:  VTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYR
        VTMRCDIVTCSSAHIS+LVSGSA TCFDDQLLEKHIKHEIIE++QLVHA++DCEGNKHHMH+PRKSAS+ACGATVFEVSMKVPAWASQVLRQLAPD+SYR
Subjt:  VTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYR

Query:  SLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKA
        SLVALGIGG QGLPVASFEKEDA+RLLFFCS D NDKHS+QLL S LP WFK PTPSRKRVEPSQG R+++SHDSL+ A+IP++RR+GREDP PMNG KA
Subjt:  SLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKA

Query:  SLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDC
        SL  ARK+LKV SMRP+P +HRNK++ F+GLTE DGN GG SKASL IVT PKHVT G STSA  RKSFS+SSQ KQ II LNPLPL+KHGCGRNPIQDC
Subjt:  SLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDC

Query:  SEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL
        SEEEFLKDVMEFLLLRGH+RLIPQGG+EEFP+AILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL
Subjt:  SEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYEL

Query:  AHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        AHDDVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH++ANG
Subjt:  AHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

A0A6J1DIE1 AT-rich interactive domain-containing protein 4-like0.0e+0086.08Show/hide
Query:  MMLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLE
        MMLHSVG ARQTCSLLAVTCGSV ++KCEEDVAED LKYPFPELVSSGRLEVRVLT+PSKDEF+RIVESCQPSF+YLQGEQLENDEIGSLVWN V LSLE
Subjt:  MMLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLE

Query:  DLCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLE
        DLC LF+TALP TVYLEIPNG R AE LHSKGIPYV+YWNNT SCYAAA FRN LLSV+QSSSTHTWDAFQLAHAAF+LHCA SNY+LPG  D I  +LE
Subjt:  DLCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLE

Query:  PQLVGEPLKINIEPPKI---DADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG
        PQL+GEPLKI++EPP+I   DA E+ED SL  LPAISIH+N VTMRFLICGVPCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSA PPPLQAGSFSRG
Subjt:  PQLVGEPLKINIEPPKI---DADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG

Query:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY
        VVTMRCD+VTCSSAHI++LVSGSA TCFDDQLLEKHIKHEIIENSQLVHA+ DCEGN+H MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY
Subjt:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY

Query:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK
        RSLVALGIGG QGLPVASFEKEDA+R LFFCSRD NDKHSDQL  S LP WFK P PSRKRVEPSQG  STVSHDSLA ANIPS+RR+G E+ APMNG K
Subjt:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK

Query:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD
        A+L  ARKRLKV +MRPIP VHRNK++ FSGLTEADGN G   KASLP+VT  KHVT G STSA QRKSFS+SSQ KQ II LNPLPL+KHGCGRNPIQ 
Subjt:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD

Query:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE
        CSEEEFLKDVMEFLLLRGHSRLIPQGG+ EFP+AILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE
Subjt:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE

Query:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        LAHDDVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKP+++ANG
Subjt:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

A0A6J1GBY3 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0086.79Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHS+GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPELVSSGRLEV+VLT+PSK+EF RIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LF+TALPTTVYLE+PNG +IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV++SSSTHTWDAFQLAHAAF+LHC G NY+LPG +DD  SDLEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
        QL+GEP KINIEPP++DA E+ED SLE +P IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Subjt:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM

Query:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV
        RCDIVTCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLV
Subjt:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV

Query:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP
        ALGIGG QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+T+SHDSLA ANIPS+RR+GRE+PAPMNG KA L 
Subjt:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP

Query:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE
         ARKRLKV +M+PIP VHRNK+  FSG TE DGN GGQ KASLP VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEE
Subjt:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE

Query:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
        EFLKDVMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
Subjt:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD

Query:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        DVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

A0A6J1GVQ9 AT-rich interactive domain-containing protein 4-like0.0e+0085.94Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHSV AARQTCSLLAVTCGSV + KCEEDV ED LKYPFP LVSSGRLEVR LT+PS DEFSRIVESC PSF+YLQGEQL NDEIGSLVWN V L LED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LFNTALPT VYLEIPNG RIAE LHSKGIPYV+YWN+TFSCYAAA FRNAL SVLQSSSTHTWDAFQLA AAF+LHC GS+++LPGI D I S LEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDA--DEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVV
        Q+ GEPLKIN+EPPK+D    E+EDGSLE L AISIH+N VT+RFLICGVPCTPDA L++SLEDGLNALLNIEIRG KLQGKFSAPPPPLQAGSF+RGVV
Subjt:  QLVGEPLKINIEPPKIDA--DEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVV

Query:  TMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRS
        TMRCDIVTCSSAHIS+LVSGS  TCFDDQLLEKHIKHEIIEN+QLVHAMYDCE NKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRS
Subjt:  TMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRS

Query:  LVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKAS
        LVALGIGG QGLPVASFEKEDA+RLLFFCS+D NDKHSDQLL S LP WFK P PSRKRVEPSQG RST+SHD LA ANIP +RR+GRE+PAPMNG K  
Subjt:  LVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKAS

Query:  LPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCS
        L + RKRLKV SMRPIP VHRNK++ FSGLTEADGN GGQ KA  P+VT  KHVT G STSA QRKSFS+SSQ KQ II LNPLPL+KHGCGRNPIQDCS
Subjt:  LPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCS

Query:  EEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELA
        EEEFLKDVMEFLLLRGHSRLIPQGG+EEFP+AILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRHYETYLLEYELA
Subjt:  EEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELA

Query:  HDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHKIANG
        HDDVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPH++ANG
Subjt:  HDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHKIANG

A0A6J1KBW8 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0086.27Show/hide
Query:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED
        MLHS+GAARQTCSLLAVTCG + ++KCEEDVAE NLKYPFPELVSSGRLEV+VLT+PSK+EFSRIVESCQPSF+YLQGEQLENDE+GSLVWN V LSLED
Subjt:  MLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLED

Query:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP
        LC LFNTALPTTVYLE+PNG  IAE LHSKGIPYV+YWNNTFSCYAAA FRNALLSV+QSSSTHTWDAFQLAHAAF+L C G NY+LPG +D+  SDLEP
Subjt:  LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEP

Query:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
        QL+GEP KI +EPP++DA  +ED SLE LP IS+H+N VTMR LICG+PCTPDA L++SLEDGLNALLNIEIRGSKLQGKFSAPPPPLQA SFSRGVVTM
Subjt:  QLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM

Query:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV
        RCDIVTCSSAHISVLVSGSA TCFDDQLLEKHIKHEIIENSQLVH M+DCEGNKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMS+RSLV
Subjt:  RCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLV

Query:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP
        ALGIGG QG PVASFEKEDA+RLLFFCSRDENDKHSDQLL S LP WFK PTPSRKRVEPSQG R+ + HDSLA ANIPS+RR+GRE+PAPMNG KA L 
Subjt:  ALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLP

Query:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE
         ARKRLKV +MRPIP VHRNK+  FSG TE DGN G Q KASLP+VT  KHVT G STSA QRKSFS+SSQ KQ IIPLNPLPL+KHGCGRNP+QDCSEE
Subjt:  SARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEE

Query:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
        EFLKDVMEFLLLRGHSRLIPQGGVEEFP+A+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD
Subjt:  EFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHD

Query:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
        DVDGECCLLC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANG
Subjt:  DVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

SwissProt top hitse value%identityAlignment
Q6NQ79 AT-rich interactive domain-containing protein 42.9e-25758.12Show/hide
Query:  MLHSVGAARQTCSLLAVTCGS-VHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQL-ENDEIGSLVWNDVYLSL
        M H  G +R  C+++AV  G+ +     + D      KYPFP+L SSGRL+ +VL +P+ +EF   V S    F+YLQGE   ++DE+G LV      S 
Subjt:  MLHSVGAARQTCSLLAVTCGS-VHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQL-ENDEIGSLVWNDVYLSL

Query:  ED-LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMS-
         D L  LF + LPTTVYLE+PNG  +A+ L+SKG+ YV+YW N FS YAA  FR++L SV+QSS + TWD F +A A+F+L+C   N  LP  S+  M+ 
Subjt:  ED-LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMS-

Query:  DLEPQLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG
        ++ P L+GEP KI++  P+ D + EE+ SLE+LP+I I++  VT+RFL+CG PCT D  L+ SL DGLNALL IE+RGSKL  + SAP PPLQAG+F+RG
Subjt:  DLEPQLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG

Query:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY
        VVTMRCD+ TCSSAHIS+LVSG+AQTCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP WA QVLRQLAPD+SY
Subjt:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY

Query:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK
        RSLV LG+   QGL VASFEK+DA+RLLFFC +  ND  +   L S++P W   P P+RKR EP                         RE     NG  
Subjt:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK

Query:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD
           P++RK + V ++RPIP   R+K+  FSG +E     G  +K SLP+   PKH  +G  T    RK+FS S Q KQ II LNPLPL+KH CGR  IQ 
Subjt:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD

Query:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE
        CSEEEFL+DVM+FLL+RGH+RL+P GG+ EFP+A+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE
Subjt:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE

Query:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
         AHDDVDGECCL+CRSS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  K +NG
Subjt:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG

Arabidopsis top hitse value%identityAlignment
AT3G43240.1 ARID/BRIGHT DNA-binding domain-containing protein2.1e-25858.12Show/hide
Query:  MLHSVGAARQTCSLLAVTCGS-VHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQL-ENDEIGSLVWNDVYLSL
        M H  G +R  C+++AV  G+ +     + D      KYPFP+L SSGRL+ +VL +P+ +EF   V S    F+YLQGE   ++DE+G LV      S 
Subjt:  MLHSVGAARQTCSLLAVTCGS-VHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQL-ENDEIGSLVWNDVYLSL

Query:  ED-LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMS-
         D L  LF + LPTTVYLE+PNG  +A+ L+SKG+ YV+YW N FS YAA  FR++L SV+QSS + TWD F +A A+F+L+C   N  LP  S+  M+ 
Subjt:  ED-LCELFNTALPTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMS-

Query:  DLEPQLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG
        ++ P L+GEP KI++  P+ D + EE+ SLE+LP+I I++  VT+RFL+CG PCT D  L+ SL DGLNALL IE+RGSKL  + SAP PPLQAG+F+RG
Subjt:  DLEPQLVGEPLKINIEPPKIDADEEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG

Query:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY
        VVTMRCD+ TCSSAHIS+LVSG+AQTCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP WA QVLRQLAPD+SY
Subjt:  VVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLLEKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSY

Query:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK
        RSLV LG+   QGL VASFEK+DA+RLLFFC +  ND  +   L S++P W   P P+RKR EP                         RE     NG  
Subjt:  RSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQLLASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLK

Query:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD
           P++RK + V ++RPIP   R+K+  FSG +E     G  +K SLP+   PKH  +G  T    RK+FS S Q KQ II LNPLPL+KH CGR  IQ 
Subjt:  ASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAPKHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQD

Query:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE
        CSEEEFL+DVM+FLL+RGH+RL+P GG+ EFP+A+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE
Subjt:  CSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYE

Query:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG
         AHDDVDGECCL+CRSS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  K +NG
Subjt:  LAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKIANG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCTTCATTCCGTAGGAGCTGCTAGACAAACTTGTAGCTTACTTGCTGTAACCTGCGGAAGTGTGCATCGAATAAAATGCGAGGAGGATGTTGCCGAGGATAACTT
GAAGTACCCCTTTCCAGAATTAGTTTCTTCTGGACGATTGGAGGTTCGGGTTCTAACAAGTCCAAGCAAGGATGAGTTTAGTAGAATTGTGGAATCATGTCAACCAAGCT
TCATATACTTGCAAGGGGAACAACTCGAAAATGATGAAATTGGGTCATTGGTTTGGAATGATGTATATTTGTCTCTTGAAGATTTATGTGAACTATTCAACACTGCATTA
CCAACCACTGTTTATCTAGAAATCCCAAATGGTAGCAGAATAGCAGAGGTTCTTCATTCTAAGGGAATTCCTTATGTTGTGTACTGGAACAACACATTTTCATGTTATGC
TGCAGCTCAATTTCGTAATGCATTGCTTTCAGTGCTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAAGCTTCATTGTGCGGGCA
GCAATTATTCCCTTCCCGGGATTTCTGATGACATTATGAGTGATTTAGAGCCTCAGCTTGTTGGGGAACCTCTAAAGATTAACATAGAACCCCCCAAGATAGATGCAGAT
GAAGAGGAAGATGGTTCTTTAGAAAACCTCCCTGCCATAAGTATACACAATAATGCTGTGACTATGAGATTTCTCATCTGTGGAGTGCCTTGCACACCGGATGCACACTT
AATGAAATCGTTGGAGGATGGCCTTAACGCCCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTCAGTGCACCTCCGCCACCTCTTCAAGCAGGATCCT
TTTCTCGTGGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGTTCAGCCCATATCTCAGTTTTGGTGTCTGGTAGTGCTCAAACTTGTTTTGATGATCAGCTGTTG
GAGAAACATATCAAACATGAGATTATTGAAAATAGCCAATTAGTTCACGCCATGTATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAGTCTGCTTCAGT
TGCTTGTGGGGCAACAGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCTTAAGGCAACTAGCACCTGATATGTCATATCGGAGTTTAGTTGCACTTG
GCATTGGGGGAGCTCAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCCAAGCGGTTGCTCTTCTTCTGTTCAAGAGATGAGAATGATAAACATTCAGATCAGTTA
CTTGCAAGTGAATTGCCCGGCTGGTTTAAAGTGCCTACTCCTAGTAGAAAGCGGGTGGAACCAAGCCAAGGAACAAGGAGCACTGTTTCACATGACAGTCTTGCATGTGC
AAACATCCCTTCCCTTAGAAGATTAGGTAGAGAGGATCCTGCACCAATGAATGGCTTAAAGGCATCTTTACCCTCAGCTAGGAAAAGATTAAAAGTACCCTCCATGAGGC
CTATTCCAAGTGTGCATAGGAATAAAATTTCACATTTCTCTGGATTGACCGAAGCAGATGGGAACTATGGAGGCCAATCCAAGGCTAGTTTGCCCATAGTTACCGCGCCA
AAGCATGTCACTACAGGATCATCAACTTCTGCAGCACAGAGGAAATCTTTTTCAAACTCATCGCAGTTTAAGCAGCTGATTATTCCCTTAAATCCACTACCTTTAAGGAA
ACATGGTTGTGGAAGAAACCCGATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTTATGGAGTTTTTACTACTTCGAGGACATTCACGACTTATCCCTCAAGGCG
GTGTTGAGGAGTTCCCAGAAGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTATATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAAT
TGGAAGGGGCAGATCTTCTCTAAGATGCACAACTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAACGACATTACGAGACTTACCTTCTAGAATATGA
ATTGGCTCACGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCGCAGTAGTGCAGCAGGGGATTGGGTTAACTGTGGTGTTTGTGGTGAATGGGCCCATTTTGGGTGCG
ATCGGAGGCAGGGTCTCGGTGCATTTAAGGATTATGCCAAAACTGATGGGTTAGAGTATGTTTGTCCACATTGTAGTATTACTACTTACAAGAAAAAGCCACACAAAATA
GCAAACGGGCCGGATACATTCATTCAGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGCTTCATTCCGTAGGAGCTGCTAGACAAACTTGTAGCTTACTTGCTGTAACCTGCGGAAGTGTGCATCGAATAAAATGCGAGGAGGATGTTGCCGAGGATAACTT
GAAGTACCCCTTTCCAGAATTAGTTTCTTCTGGACGATTGGAGGTTCGGGTTCTAACAAGTCCAAGCAAGGATGAGTTTAGTAGAATTGTGGAATCATGTCAACCAAGCT
TCATATACTTGCAAGGGGAACAACTCGAAAATGATGAAATTGGGTCATTGGTTTGGAATGATGTATATTTGTCTCTTGAAGATTTATGTGAACTATTCAACACTGCATTA
CCAACCACTGTTTATCTAGAAATCCCAAATGGTAGCAGAATAGCAGAGGTTCTTCATTCTAAGGGAATTCCTTATGTTGTGTACTGGAACAACACATTTTCATGTTATGC
TGCAGCTCAATTTCGTAATGCATTGCTTTCAGTGCTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAAGCTTCATTGTGCGGGCA
GCAATTATTCCCTTCCCGGGATTTCTGATGACATTATGAGTGATTTAGAGCCTCAGCTTGTTGGGGAACCTCTAAAGATTAACATAGAACCCCCCAAGATAGATGCAGAT
GAAGAGGAAGATGGTTCTTTAGAAAACCTCCCTGCCATAAGTATACACAATAATGCTGTGACTATGAGATTTCTCATCTGTGGAGTGCCTTGCACACCGGATGCACACTT
AATGAAATCGTTGGAGGATGGCCTTAACGCCCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTCAGTGCACCTCCGCCACCTCTTCAAGCAGGATCCT
TTTCTCGTGGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGTTCAGCCCATATCTCAGTTTTGGTGTCTGGTAGTGCTCAAACTTGTTTTGATGATCAGCTGTTG
GAGAAACATATCAAACATGAGATTATTGAAAATAGCCAATTAGTTCACGCCATGTATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAGTCTGCTTCAGT
TGCTTGTGGGGCAACAGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCTTAAGGCAACTAGCACCTGATATGTCATATCGGAGTTTAGTTGCACTTG
GCATTGGGGGAGCTCAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCCAAGCGGTTGCTCTTCTTCTGTTCAAGAGATGAGAATGATAAACATTCAGATCAGTTA
CTTGCAAGTGAATTGCCCGGCTGGTTTAAAGTGCCTACTCCTAGTAGAAAGCGGGTGGAACCAAGCCAAGGAACAAGGAGCACTGTTTCACATGACAGTCTTGCATGTGC
AAACATCCCTTCCCTTAGAAGATTAGGTAGAGAGGATCCTGCACCAATGAATGGCTTAAAGGCATCTTTACCCTCAGCTAGGAAAAGATTAAAAGTACCCTCCATGAGGC
CTATTCCAAGTGTGCATAGGAATAAAATTTCACATTTCTCTGGATTGACCGAAGCAGATGGGAACTATGGAGGCCAATCCAAGGCTAGTTTGCCCATAGTTACCGCGCCA
AAGCATGTCACTACAGGATCATCAACTTCTGCAGCACAGAGGAAATCTTTTTCAAACTCATCGCAGTTTAAGCAGCTGATTATTCCCTTAAATCCACTACCTTTAAGGAA
ACATGGTTGTGGAAGAAACCCGATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTTATGGAGTTTTTACTACTTCGAGGACATTCACGACTTATCCCTCAAGGCG
GTGTTGAGGAGTTCCCAGAAGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTATATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAAT
TGGAAGGGGCAGATCTTCTCTAAGATGCACAACTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAACGACATTACGAGACTTACCTTCTAGAATATGA
ATTGGCTCACGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCGCAGTAGTGCAGCAGGGGATTGGGTTAACTGTGGTGTTTGTGGTGAATGGGCCCATTTTGGGTGCG
ATCGGAGGCAGGGTCTCGGTGCATTTAAGGATTATGCCAAAACTGATGGGTTAGAGTATGTTTGTCCACATTGTAGTATTACTACTTACAAGAAAAAGCCACACAAAATA
GCAAACGGGCCGGATACATTCATTCAGTTTTAG
Protein sequenceShow/hide protein sequence
MMLHSVGAARQTCSLLAVTCGSVHRIKCEEDVAEDNLKYPFPELVSSGRLEVRVLTSPSKDEFSRIVESCQPSFIYLQGEQLENDEIGSLVWNDVYLSLEDLCELFNTAL
PTTVYLEIPNGSRIAEVLHSKGIPYVVYWNNTFSCYAAAQFRNALLSVLQSSSTHTWDAFQLAHAAFKLHCAGSNYSLPGISDDIMSDLEPQLVGEPLKINIEPPKIDAD
EEEDGSLENLPAISIHNNAVTMRFLICGVPCTPDAHLMKSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVTCSSAHISVLVSGSAQTCFDDQLL
EKHIKHEIIENSQLVHAMYDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIGGAQGLPVASFEKEDAKRLLFFCSRDENDKHSDQL
LASELPGWFKVPTPSRKRVEPSQGTRSTVSHDSLACANIPSLRRLGREDPAPMNGLKASLPSARKRLKVPSMRPIPSVHRNKISHFSGLTEADGNYGGQSKASLPIVTAP
KHVTTGSSTSAAQRKSFSNSSQFKQLIIPLNPLPLRKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPEAILNGKRLDLYNLYKEVVTRGGFHVGNGIN
WKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHKI
ANGPDTFIQF