; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001277 (gene) of Chayote v1 genome

Gene IDSed0001277
OrganismSechium edule (Chayote v1)
DescriptionAT-rich interactive domain-containing protein 4-like
Genome locationLG06:46362607..46371299
RNA-Seq ExpressionSed0001277
SyntenySed0001277
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036396.1 AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.16Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP LVSSGRLEV+VLTNPS  EF R+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
           LF++ALPTTVYLE+PNG  IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVG +Y LP NADD RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVE--------IRGSKLQGKFSAPPPP
        QLIGEP KI++EPPE+D GEDED SLEALP IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+E        IRGSKLQGKFSAPPPP
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVE--------IRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVL

Query:  RQLAPEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGE
        RQLAP+MS+RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLP+WFKPPTPSRKRVEPSQ +R TLSHD+LAYA IPS+RRV  E
Subjt:  RQLAPEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGE

Query:  EPAPMNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKH
        EPAPMNG KAPLLPARKR K ATMRPIP  HRNKM  FSG  E DGN+GGQ KASLP +TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKH
Subjt:  EPAPMNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNP+ DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        YETYLLEYELAHDDVDGECCLLCH SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY CPHCSIT YKKK H VANGS QGI  PRI
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

XP_022949406.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata]0.0e+0087.29Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP LVSSGRLEV+VLTNPSK EF R+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LF++ALPTTVYLE+PNG +IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHCVG +Y LP NADD RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR
        QLIGEP KI+IEPPE+D GEDED SLEA+P IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQAGSFSR
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR

Query:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS
        GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVLRQLAP+MS
Subjt:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS

Query:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL
        +RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLP+WFKPPTPSRKRVEPSQ +R TLSHD+LAYA IPS+RRV  EEPAPMNG 
Subjt:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL

Query:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH
        KAPLLPARKR K ATM+PIP  HRNKM  FSG  E DGN+GGQ KASLP +TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKHGCGRNP+ 
Subjt:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH

Query:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
        DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
Subjt:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY

Query:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK H VANGS QGI  PRI
Subjt:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

XP_022998193.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima]0.0e+0087.04Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP LVSSGRLEV+VLTNPSK EFSR+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LFN+ALPTTVYLE+PNG  IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C+G +Y LP NAD+ RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR
        QLIGEP KI +EPPE+D G DED SLEALP IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQA SFSR
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR

Query:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS
        GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVLRQLAP+MS
Subjt:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS

Query:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL
        +RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLPNWFKPPTPSRKRVEPSQ IR  L HD+LAYA IPS+RRV  EEPAPMNG 
Subjt:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL

Query:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH
        KAPLLPARKR K ATMRPIP  HRNKM  FSG  E DGNNG Q KASLPV+TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKHGCGRNP+ 
Subjt:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH

Query:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
        DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
Subjt:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY

Query:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK H +ANGS QGI  PR+
Subjt:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

XP_023524673.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0087.17Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP L SSGRLEV+VLTNPSK EF R+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LF++ALPTTVYLE+PNG  IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVG +Y LP NADD RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR
        QLIGEP KI++EPPE+D GEDED SLEALP IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQAGSFSR
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR

Query:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS
        GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVLRQLAP+MS
Subjt:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS

Query:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL
        +RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLP+WFKPPTPSRKRVEPSQ +R TLSHD+LAYA IPS+RRV  EEPAPMNG 
Subjt:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL

Query:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH
        KAPLLPARKR K ATMRPIP  HRNKM  FSG  E DGN+GGQ KASLP +TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKHGCGRNP+ 
Subjt:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH

Query:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
        DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
Subjt:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY

Query:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        ELAHDDVDGECCLLC SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK H VANGS QGI  PR+
Subjt:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

XP_038883881.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida]0.0e+0086.96Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHS+ AARQTCSLLAVTCGSVPK+KCEE+V EDKL+YPFP LVSSGRLEVRVL NPSK EFSR+VES  PSF+YLQGEQL NDEIGSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADD-IRSDME
         C LFN+ LPT VYLEIPNG RIA+ALHSKGIPY++YW +TFSCYA AHFRNALLSVVQSSSTHTWDAFQLA AAF+L+CVGS+Y LP  ADD I SD+E
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADD-IRSDME

Query:  PQLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGS
        PQLIGEPLKI++EPPE+D   GED DGSLE LPAISIHDNNVT RFLICGVPCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQAGS
Subjt:  PQLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGS

Query:  FSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAP
        FSRGVVTMRCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHAMHDCE NKH MHEPRKSASVACGATV EVSMKVPAWASQVLRQLAP
Subjt:  FSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAP

Query:  EMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPM
        +MSYRSLVALGIGGVQGLPVASF KEDAERLLFFCS DEND+HS+Q L+SVLP+WFKPPTPSRKRVEPSQ IR TLSHD+LAYA IPSIRRV  EEPAPM
Subjt:  EMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPM

Query:  NGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRN
        NG KAPLLP RKR K A+MRP+PR HRNK+T FSGL E D NNG   KASLPV+TPSKH T+GSTSAT RKSFSSSSQSKQ+  I L PLPLKKHGCGRN
Subjt:  NGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRN

Query:  PIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL
        PI DCSEEEFLKDVMEFLLLRGHSR IPQGG+EEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL
Subjt:  PIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL

Query:  LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK HRVANGS QGI  PRI
Subjt:  LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

TrEMBL top hitse value%identityAlignment
A0A0A0LEG9 ARID domain-containing protein0.0e+0084.81Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHS+ AARQTCSLLAVTCG+VPKVKCEE+V EDKLKYPFP LVS GRLEVRVL NPSK EFSR+VESC PSF+YLQGEQL NDEIGSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADD-IRSDME
         C LFN+ALPT VYLEIP+G RIA+ALHSKGIPY+IYW +TFSCYA AHFR+ALLSVVQSSSTHTWDAFQLA AAFRL+ VGS+Y LP  ADD + SD+E
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADD-IRSDME

Query:  PQLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGS
        PQLIGEPLKI +EPPE+DV  GEDEDGSLEALPAI+IHDNNVT RFLICGVPCTP     D CLLRSLEDGL+ALL +E+RGSKLQGKFSAPPPPLQAGS
Subjt:  PQLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGS

Query:  FSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAP
        FSRGVVTMRCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE++QLVHA+HDCE NKH MH+PRKSAS+ACGATV EVSMKVPAWASQVLRQLAP
Subjt:  FSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAP

Query:  EMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPM
        ++SYRSLVALGIGGVQGLPVASF KEDAERLLFFCS D ND+HS+Q L+SVLP+WFKPPTPSRKRVEPSQ IR +LSHD+L+YA IP+IRRV  E+P PM
Subjt:  EMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPM

Query:  NGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRN
        NG KA L PARK+ K A+MRP+PR HRNKMT F+GL E DGNNGG  KASL ++TP KH T+GSTSAT RKSFSSSSQSKQ+  I L PLPLKKHGCGRN
Subjt:  NGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRN

Query:  PIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL
        PI DCSEEEFLKDVMEFLLLRGH+R IPQGG+EEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL
Subjt:  PIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYL

Query:  LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK HRVANGS QGI  PRI
Subjt:  LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

A0A6J1DIE1 AT-rich interactive domain-containing protein 4-like0.0e+0086.6Show/hide
Query:  MMLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFE
        MMLHS+G ARQTCSLLAVTCGSVPKVKCEEDVAED+LKYPFP LVSSGRLEVRVLTNPSK EF+R+VESCQPSF+YLQGEQLENDEIGSLVWNGVDLS E
Subjt:  MMLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFE

Query:  DSCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDME
        D C LF++ALP TVYLEIPNG R A+ALHSKGIPYV+YW NT SCYA AHFRN LLSVVQSSSTHTWDAFQLAHAAFRLHC  S+Y LP + D I  ++E
Subjt:  DSCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDME

Query:  PQLIGEPLKISIEPPEI---DVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG
        PQLIGEPLKIS+EPPEI   D GEDED SL  LPAISIHDNNVT RFLICGVPCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSA PPPLQAG
Subjt:  PQLIGEPLKISIEPPEI---DVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG

Query:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA
        SFSRGVVTMRCD+VTCSSAHI++LVSGSAHTCFDDQLLEKHIKHEIIENSQLVHA+ DCE N+H MHEPRKSASVACGATV EVSMKVPAWASQVLRQLA
Subjt:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA

Query:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP
        P+MSYRSLVALGIGGVQGLPVASF KEDAER LFFCS+D ND+HSDQ  LSVLP+WFKPP PSRKRVEPSQ I  T+SHD+LAYA IPSIRRV GEE AP
Subjt:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP

Query:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR
        MNG KA LLPARKR K ATMRPIPR HRNKMT FSGL EADGNNG   KASLPV+TPSKH T+GSTSATQRKSFSSSSQSKQ+  I L PLPLKKHGCGR
Subjt:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR

Query:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY
        NPI  CSEEEFLKDVMEFLLLRGHSR IPQGG+ EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY
Subjt:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY

Query:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK +RVANGS QGI  PRI
Subjt:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

A0A6J1GBY3 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0087.29Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP LVSSGRLEV+VLTNPSK EF R+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LF++ALPTTVYLE+PNG +IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHCVG +Y LP NADD RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR
        QLIGEP KI+IEPPE+D GEDED SLEA+P IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQAGSFSR
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR

Query:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS
        GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVLRQLAP+MS
Subjt:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS

Query:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL
        +RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLP+WFKPPTPSRKRVEPSQ +R TLSHD+LAYA IPS+RRV  EEPAPMNG 
Subjt:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL

Query:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH
        KAPLLPARKR K ATM+PIP  HRNKM  FSG  E DGN+GGQ KASLP +TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKHGCGRNP+ 
Subjt:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH

Query:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
        DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
Subjt:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY

Query:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK H VANGS QGI  PRI
Subjt:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

A0A6J1GVQ9 AT-rich interactive domain-containing protein 4-like0.0e+0085.91Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHS+ AARQTCSLLAVTCGSV K KCEEDV EDKLKYPFP LVSSGRLEVR LTNPS  EFSR+VESC PSF+YLQGEQL NDEIGSLVWNGVDL  ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LFN+ALPT VYLEIPNG RIA+ALHSKGIPYV+YW +TFSCYA AHFRNAL SV+QSSSTHTWDAFQLA AAFRLHC+GS + LP   D I S +EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSF
        Q+ GEPLKI++EPP++DV  GEDEDGSLE L AISIHDNNVT RFLICGVPCTP     DACLLRSLEDGLNALLN+EIRG KLQGKFSAPPPPLQAGSF
Subjt:  QLIGEPLKISIEPPEIDV--GEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSF

Query:  SRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPE
        +RGVVTMRCDIVTCSSAHIS+LVSGS HTCFDDQLLEKHIKHEIIEN+QLVHAM+DCEDNKH MHEPRKSASVACGATV EVSMKVPAWASQVLRQLAP+
Subjt:  SRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPE

Query:  MSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMN
        MSYRSLVALGIGGVQGLPVASF KEDAERLLFFCSKD ND+HSDQ L+SVLP+WFKPP PSRKRVEPSQ IR TLSHD LAYA IP IRRV  EEPAPMN
Subjt:  MSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMN

Query:  GLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNP
        G K PLL  RKR K A+MRPIPR HRNKMT FSGL EADGNNGGQ KA  PV+TPSKH T+GSTSATQRKSFSSSSQSKQ+  I L PLPLKKHGCGRNP
Subjt:  GLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNP

Query:  IHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLL
        I DCSEEEFLKDVMEFLLLRGHSR IPQGG+EEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRHYETYLL
Subjt:  IHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLL

Query:  EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNY-KKKTHRVANGSSQGILYP
        EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT Y KKK HRVANGS QG+  P
Subjt:  EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNY-KKKTHRVANGSSQGILYP

A0A6J1KBW8 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0087.04Show/hide
Query:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED
        MLHSIGAARQTCSLLAVTCG +PKVKCEEDVAE  LKYPFP LVSSGRLEV+VLTNPSK EFSR+VESCQPSF+YLQGEQLENDE+GSLVWNGVDLS ED
Subjt:  MLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFED

Query:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP
         C LFN+ALPTTVYLE+PNG  IA+ LHSKGIPYVIYW NTFSCYA AHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C+G +Y LP NAD+ RSD+EP
Subjt:  SCELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEP

Query:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR
        QLIGEP KI +EPPE+D G DED SLEALP IS+HDNNVT R LICG+PCTP     DACLLRSLEDGLNALLN+EIRGSKLQGKFSAPPPPLQA SFSR
Subjt:  QLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSR

Query:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS
        GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCE NKH MH+PRKSASVACGATV EVSMKVPAWASQVLRQLAP+MS
Subjt:  GVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMS

Query:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL
        +RSLVALGIGGVQG PVASF KEDAERLLFFCS+DEND+HSDQ L+SVLPNWFKPPTPSRKRVEPSQ IR  L HD+LAYA IPS+RRV  EEPAPMNG 
Subjt:  YRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGL

Query:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH
        KAPLLPARKR K ATMRPIP  HRNKM  FSG  E DGNNG Q KASLPV+TPSKH T+GSTSATQRKSFSSSSQSKQ + IPL PLPLKKHGCGRNP+ 
Subjt:  KAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIH

Query:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
        DCSEEEFLKDVMEFLLLRGHSR IPQGGVEEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY
Subjt:  DCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEY

Query:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI
        ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSIT YKKK H +ANGS QGI  PR+
Subjt:  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYPRI

SwissProt top hitse value%identityAlignment
Q6NQ79 AT-rich interactive domain-containing protein 42.8e-26058.05Show/hide
Query:  MLHSIGAARQTCSLLAVTCGS-VPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQL-ENDEIGSLVWNGVDLSF
        M H  G +R  C+++AV  G+ +     + D    + KYPFP L SSGRL+ +VL NP+  EF   V S    F+YLQGE   ++DE+G LV    D S 
Subjt:  MLHSIGAARQTCSLLAVTCGS-VPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQL-ENDEIGSLVWNGVDLSF

Query:  EDS-CELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENAD-DIRS
         D+   LF S LPTTVYLE+PNG+ +A+AL+SKG+ YVIYWKN FS YA  HFR++L SV+QSS + TWD F +A A+FRL+C   +  LP N++  +  
Subjt:  EDS-CELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENAD-DIRS

Query:  DMEPQLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG
        +M P L+GEP KI +  PE D  E+E+ SLE+LP+I I+D +VT RFL+CG PCT      D  LL SL DGLNALL +E+RGSKL  + SAP PPLQAG
Subjt:  DMEPQLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG

Query:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA
        +F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E+ K    EPR+SAS+ACGA+V EVSM+VP WA QVLRQLA
Subjt:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA

Query:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP
        P++SYRSLV LG+  +QGL VASF K+DAERLLFFC +  ND  +   LLS +PNW  PP P+RKR EP +                        E    
Subjt:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP

Query:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR
         NG      P  ++   A +RPIP   R+KM  FSG +E    +G   K SLP+  P KH   G T  T RK+FS S Q KQ+  I L PLPLKKH CGR
Subjt:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR

Query:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY
          I  CSEEEFL+DVM+FLL+RGH+R +P GG+ EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETY
Subjt:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY

Query:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYP
        LLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++NY+KK+ + +NG   G+L P
Subjt:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYP

Arabidopsis top hitse value%identityAlignment
AT3G43240.1 ARID/BRIGHT DNA-binding domain-containing protein2.0e-26158.05Show/hide
Query:  MLHSIGAARQTCSLLAVTCGS-VPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQL-ENDEIGSLVWNGVDLSF
        M H  G +R  C+++AV  G+ +     + D    + KYPFP L SSGRL+ +VL NP+  EF   V S    F+YLQGE   ++DE+G LV    D S 
Subjt:  MLHSIGAARQTCSLLAVTCGS-VPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQL-ENDEIGSLVWNGVDLSF

Query:  EDS-CELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENAD-DIRS
         D+   LF S LPTTVYLE+PNG+ +A+AL+SKG+ YVIYWKN FS YA  HFR++L SV+QSS + TWD F +A A+FRL+C   +  LP N++  +  
Subjt:  EDS-CELFNSALPTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENAD-DIRS

Query:  DMEPQLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG
        +M P L+GEP KI +  PE D  E+E+ SLE+LP+I I+D +VT RFL+CG PCT      D  LL SL DGLNALL +E+RGSKL  + SAP PPLQAG
Subjt:  DMEPQLIGEPLKISIEPPEIDVGEDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAG

Query:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA
        +F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E+ K    EPR+SAS+ACGA+V EVSM+VP WA QVLRQLA
Subjt:  SFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLA

Query:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP
        P++SYRSLV LG+  +QGL VASF K+DAERLLFFC +  ND  +   LLS +PNW  PP P+RKR EP +                        E    
Subjt:  PEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQHSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAP

Query:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR
         NG      P  ++   A +RPIP   R+KM  FSG +E    +G   K SLP+  P KH   G T  T RK+FS S Q KQ+  I L PLPLKKH CGR
Subjt:  MNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLPVITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGR

Query:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY
          I  CSEEEFL+DVM+FLL+RGH+R +P GG+ EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETY
Subjt:  NPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETY

Query:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYP
        LLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++NY+KK+ + +NG   G+L P
Subjt:  LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKKKTHRVANGSSQGILYP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCTTCATTCCATAGGAGCTGCTAGGCAGACTTGCAGCTTACTTGCTGTCACCTGCGGAAGCGTGCCTAAAGTAAAGTGCGAAGAGGATGTTGCTGAGGATAAGTT
GAAATACCCCTTTCCGGTATTAGTTTCTTCGGGACGATTGGAGGTCCGAGTTCTGACGAATCCAAGTAAGGGTGAGTTTAGTAGAGTTGTAGAATCATGCCAACCGAGCT
TCATCTACTTACAAGGGGAACAACTCGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTTTTGAAGATTCATGTGAACTATTCAATTCTGCATTA
CCAACCACTGTTTATTTAGAAATCCCGAATGGGGACAGAATAGCGAAGGCTCTTCATTCTAAGGGAATTCCTTATGTCATTTATTGGAAGAACACATTTTCTTGTTATGC
TGGAGCTCATTTTCGAAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGGGGA
GCGACTATACCCTTCCGGAGAATGCTGATGATATTAGGAGTGATATGGAGCCTCAGCTTATAGGGGAACCTCTAAAGATTAGCATAGAACCCCCCGAGATAGATGTAGGT
GAAGACGAAGATGGTTCTTTAGAAGCCCTCCCTGCCATAAGTATACATGATAATAATGTGACTGCCAGATTTCTCATCTGTGGAGTTCCTTGCACACCAGTAAGCAGTTT
TCAGGATGCTTGCTTATTGAGATCATTGGAGGATGGCCTTAATGCCCTTTTGAACGTTGAAATCCGTGGGAGTAAACTTCAGGGGAAGTTCAGTGCTCCCCCACCACCTC
TTCAAGCAGGATCCTTTTCTCGTGGTGTTGTGACGATGCGATGTGATATTGTGACCTGTAGTTCAGCCCACATCTCAGTATTGGTGTCTGGTAGTGCTCATACTTGTTTT
GACGATCAGCTGTTGGAGAAACATATCAAACATGAGATTATTGAAAACAGCCAATTAGTCCATGCCATGCATGATTGTGAGGATAACAAACATCGCATGCACGAGCCTCG
AAAATCTGCTTCAGTTGCTTGTGGGGCAACTGTATTGGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCTTGAGGCAGCTAGCACCTGAGATGTCATATCGGA
GTTTAGTGGCACTTGGCATCGGGGGAGTTCAGGGTTTGCCCGTTGCTTCTTTTGTGAAAGAGGATGCTGAGCGATTGCTCTTCTTTTGTTCAAAGGATGAGAATGATCAA
CATTCAGATCAGTTTCTTTTAAGTGTATTGCCCAACTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGTTGAACCAAGCCAAGTAATAAGGAAAACTCTTTCACATGA
CACTCTTGCATATGCAAAGATTCCTTCCATTAGAAGAGTACCTGGAGAGGAGCCTGCACCAATGAATGGGTTGAAGGCACCCTTACTCCCAGCAAGGAAAAGATCAAAAG
GAGCCACCATGAGGCCGATTCCACGCGCGCATAGGAATAAAATGACATCTTTTTCTGGATTGAATGAAGCAGATGGGAACAATGGAGGCCAACACAAGGCTAGTTTGCCT
GTCATTACCCCATCAAAGCATGCAACTCTAGGATCAACTTCTGCAACACAAAGAAAATCGTTTTCCAGCTCATCTCAGTCTAAGCAGCTGCTGACTATTCCCTTAATTCC
ACTACCTTTAAAGAAACATGGTTGTGGAAGAAACCCAATTCATGATTGCTCCGAGGAGGAGTTCTTGAAGGACGTTATGGAGTTTTTACTACTAAGAGGACATTCGCGAC
ATATTCCTCAAGGGGGAGTTGAGGAGTTCCCGGATGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTGTACAAGGAGGTGGTCACTCGAGGAGGCTTTCATGTT
GGCAATGGTATCAATTGGAAGGGGCAGATCTTCTCTAAGATGCACAACTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTACGAGACTTA
CCTTTTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGTATTTGTGGTGAATGGG
CTCATTTCGGGTGTGACCGGAGGCAAGGGCTCGGTGCATTCAAGGATTATGCCAAAACAGATGGGTTAGAATATGTTTGTCCACATTGTAGCATTACAAATTACAAGAAG
AAAACACACAGAGTGGCAAACGGGTCTTCACAAGGAATATTGTATCCGCGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
GCCCCATCACATCAATAAATTTCTCTTTACGTTCATCTTAAAAAAGAATAGAAAAGAAAATAAAACTGAACATTGAAATAATTTATTTTCTTCCAATTTTCGGAAAGTAC
GTAGAGATTCTGTATAAACTCTATAAACCGAACAACTTGTACCTCTCAATTGAAAATTCAGCCGAGCGATCTTCGGCAGTGCAAGTTTACTGCACTGCCCGATTCGGCAT
GCGTACACAGATACACTGCCCCATGAGGAGGAAAGAAGCAAAAGCTCTTGACGAAACCAGAAGAAGAATCTGCTGTCAATCTAATCGACCATGAAAAAAGCGAAGAGAAA
CCCAAATTACGAACAAGACAAGGCCTGAAAATACATACAAGAAGAATCTCGACAAAAGAAAATAAAATCCAGTGTTTGTGGGCACCCAATTGCCGAAGATTTCTGGATTT
GGATCTTAGAAGACAAACGAAACATGGGCATTACAAGTTTGAATCAAAACCAGAGGGAAGTGAAATGGAAATCGACGAGTTTTCTTGATTGGGTTCGAAAGGGGTTTCAT
ATACTTTTGTTGGGTTTAATGTGTTTTCAAATGTTTCGAGATGTGGGAATGATTTTATCTGACACCCACTTATCTTACCAAAAGGAACTTATTATTTGGATTTAATTTTG
GCGACTGTGTGACCCTTTTGTTTGATTTTCGTGTTTATTGCGTGGAATGTGGATAGGGAGAAACAAAAGAAAGCGCTCGATGTTCAAATTTCCTTGATGGGTTTTTCGAT
TTTTGTCATCTGAGGCGGTGAATTCATCCCGGTTCTTCTGGGTGTGGCCGATTCGTTTGATTTTGAGGTTTTCTCGGATGGGTTTTTGCTTTGGAGCTTCGTTTATGATG
GGTTTTGCCTTGAAGCGCTTTTGCTCAACTAATCGCTGAATAAATTTGTGCTGAACTGGGTGAGTTTTAAGGTTTAGTTGTTGGTGCGGTAAAATTAGAGGTTTTGGATG
GCTGTGATTTTGATCTTTAAGTAGTTTCTGCGGGGCTTTAGAATGATGGCATTGTTTCTACATCAGAATAAGAATTCAGTTTCTTTACTTCAATGACATGGCTTGTGGCT
GTTTGTTTCTGAGATTTGTAGCTGATGCCAGGGCATTTGGCTCTGTTTTTGGTGAATGTTTGTGTTTTAACGGCTCCACAAGTTAAGTTGAAAGGTCTCGTTGCATGGGG
AAGTTCGCTTAATGCAATTTACTCCCGGTTCTCTTGTAAATGATGCTTCATTCCATAGGAGCTGCTAGGCAGACTTGCAGCTTACTTGCTGTCACCTGCGGAAGCGTGCC
TAAAGTAAAGTGCGAAGAGGATGTTGCTGAGGATAAGTTGAAATACCCCTTTCCGGTATTAGTTTCTTCGGGACGATTGGAGGTCCGAGTTCTGACGAATCCAAGTAAGG
GTGAGTTTAGTAGAGTTGTAGAATCATGCCAACCGAGCTTCATCTACTTACAAGGGGAACAACTCGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTG
TCTTTTGAAGATTCATGTGAACTATTCAATTCTGCATTACCAACCACTGTTTATTTAGAAATCCCGAATGGGGACAGAATAGCGAAGGCTCTTCATTCTAAGGGAATTCC
TTATGTCATTTATTGGAAGAACACATTTTCTTGTTATGCTGGAGCTCATTTTCGAAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTC
AGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGGGGAGCGACTATACCCTTCCGGAGAATGCTGATGATATTAGGAGTGATATGGAGCCTCAGCTTATAGGGGAACCT
CTAAAGATTAGCATAGAACCCCCCGAGATAGATGTAGGTGAAGACGAAGATGGTTCTTTAGAAGCCCTCCCTGCCATAAGTATACATGATAATAATGTGACTGCCAGATT
TCTCATCTGTGGAGTTCCTTGCACACCAGTAAGCAGTTTTCAGGATGCTTGCTTATTGAGATCATTGGAGGATGGCCTTAATGCCCTTTTGAACGTTGAAATCCGTGGGA
GTAAACTTCAGGGGAAGTTCAGTGCTCCCCCACCACCTCTTCAAGCAGGATCCTTTTCTCGTGGTGTTGTGACGATGCGATGTGATATTGTGACCTGTAGTTCAGCCCAC
ATCTCAGTATTGGTGTCTGGTAGTGCTCATACTTGTTTTGACGATCAGCTGTTGGAGAAACATATCAAACATGAGATTATTGAAAACAGCCAATTAGTCCATGCCATGCA
TGATTGTGAGGATAACAAACATCGCATGCACGAGCCTCGAAAATCTGCTTCAGTTGCTTGTGGGGCAACTGTATTGGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCAC
AGGTCTTGAGGCAGCTAGCACCTGAGATGTCATATCGGAGTTTAGTGGCACTTGGCATCGGGGGAGTTCAGGGTTTGCCCGTTGCTTCTTTTGTGAAAGAGGATGCTGAG
CGATTGCTCTTCTTTTGTTCAAAGGATGAGAATGATCAACATTCAGATCAGTTTCTTTTAAGTGTATTGCCCAACTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGT
TGAACCAAGCCAAGTAATAAGGAAAACTCTTTCACATGACACTCTTGCATATGCAAAGATTCCTTCCATTAGAAGAGTACCTGGAGAGGAGCCTGCACCAATGAATGGGT
TGAAGGCACCCTTACTCCCAGCAAGGAAAAGATCAAAAGGAGCCACCATGAGGCCGATTCCACGCGCGCATAGGAATAAAATGACATCTTTTTCTGGATTGAATGAAGCA
GATGGGAACAATGGAGGCCAACACAAGGCTAGTTTGCCTGTCATTACCCCATCAAAGCATGCAACTCTAGGATCAACTTCTGCAACACAAAGAAAATCGTTTTCCAGCTC
ATCTCAGTCTAAGCAGCTGCTGACTATTCCCTTAATTCCACTACCTTTAAAGAAACATGGTTGTGGAAGAAACCCAATTCATGATTGCTCCGAGGAGGAGTTCTTGAAGG
ACGTTATGGAGTTTTTACTACTAAGAGGACATTCGCGACATATTCCTCAAGGGGGAGTTGAGGAGTTCCCGGATGCCATACTCAACGGGAAGCGTCTTGACCTCTATAAC
TTGTACAAGGAGGTGGTCACTCGAGGAGGCTTTCATGTTGGCAATGGTATCAATTGGAAGGGGCAGATCTTCTCTAAGATGCACAACTACACAATGACCAATAGAATGAC
TGGTGTTGGAAATACACTGAAAAGACATTACGAGACTTACCTTTTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCACAGTAGTGCAG
CAGGGGATTGGGTGAACTGTGGTATTTGTGGTGAATGGGCTCATTTCGGGTGTGACCGGAGGCAAGGGCTCGGTGCATTCAAGGATTATGCCAAAACAGATGGGTTAGAA
TATGTTTGTCCACATTGTAGCATTACAAATTACAAGAAGAAAACACACAGAGTGGCAAACGGGTCTTCACAAGGAATATTGTATCCGCGAATATCTTGATTTAGTTCTGG
TCTCCGTCTCAAAGTTTCTGACTGCGTTGTTACTTTTTCTGCCTATTCTTATCCTACACGGGAAGTTACTCATTCTATCAGCTCGTAGTTGCATTTTATTTGAGGTGGCG
TGCTGTTACTGAAGAGTATATTTTCGAGACTCAGGATGAAAACGTGAGATGAAGAGAAGAGCACACAAAGAGAGCATGACTCGTTCATAGGTGTGGATCAGAATTTTGCT
ACACTTGTGGATGTAAGTGGAGTCATGCCCATGATCAAGCGCCACAAAATCTGTGGAGTTCAATGGGCAAGGGGAGAAATCTGCAAATACAAAGAGGAAGAATCAATTCA
GGGGAATAAGTGGGCTGCTGAGATTCATGACCCAAGGAAAGGGGCCCCTGTATGGCTTGGTACTTTCAACACTGCAGAGGAAGCTGCAAGAGCGTACGATGCTGAGACGC
GGAGAATTCGAGGCAAGAAGGCCAAGGTGAATTTCTCTGAGGAACCACCTTTGCATAATACAATACCCAGAAGCCAAAAAACTCGCAGAAAATACATCTAAAGACGAACT
TGAAAGCCAACCAACATTCTCAGTTTTCTAATAATCCACATCAGAACTACTATAGAACCTGTAGTTTTCTGGAAGTGAAACCTCATACTAACCAACTTAGATACATGGTC
TCATTGCCTGCTAGTATGGGGAGTGCTCCATCTGAGGATATGCACCTGGGTTAGAATATGTTATATGTTACCTGTATTGATGTGTACCATGAAACTTGGCTCTTGTCTTT
GCTATTGAG
Protein sequenceShow/hide protein sequence
MMLHSIGAARQTCSLLAVTCGSVPKVKCEEDVAEDKLKYPFPVLVSSGRLEVRVLTNPSKGEFSRVVESCQPSFIYLQGEQLENDEIGSLVWNGVDLSFEDSCELFNSAL
PTTVYLEIPNGDRIAKALHSKGIPYVIYWKNTFSCYAGAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSDYTLPENADDIRSDMEPQLIGEPLKISIEPPEIDVG
EDEDGSLEALPAISIHDNNVTARFLICGVPCTPVSSFQDACLLRSLEDGLNALLNVEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVTCSSAHISVLVSGSAHTCF
DDQLLEKHIKHEIIENSQLVHAMHDCEDNKHRMHEPRKSASVACGATVLEVSMKVPAWASQVLRQLAPEMSYRSLVALGIGGVQGLPVASFVKEDAERLLFFCSKDENDQ
HSDQFLLSVLPNWFKPPTPSRKRVEPSQVIRKTLSHDTLAYAKIPSIRRVPGEEPAPMNGLKAPLLPARKRSKGATMRPIPRAHRNKMTSFSGLNEADGNNGGQHKASLP
VITPSKHATLGSTSATQRKSFSSSSQSKQLLTIPLIPLPLKKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRHIPQGGVEEFPDAILNGKRLDLYNLYKEVVTRGGFHV
GNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITNYKK
KTHRVANGSSQGILYPRIS