; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G14030 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G14030
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAT-rich interactive domain-containing protein 4-like
Genome locationClcChr07:28718067..28725228
RNA-Seq ExpressionClc07G14030
SyntenyClc07G14030
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001606 - ARID DNA-binding domain
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR036431 - ARID DNA-binding domain superfamily
IPR042293 - AT-rich interactive domain-containing protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053551.1 AT-rich interactive domain-containing protein 4-like [Cucumis melo var. makuwa]0.0e+0089.32Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+ EDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYC+GSNY L  IADD++M DLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP+V+ GEGEDEDGSLE LPAISIHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE      S R L++   +      APP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNK H+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNK+ PF+GLTEVDGNNGGLSK SL VVT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

KAG7018239.1 AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.97Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSV+AARQTCSLLAVTCGSV K KCEE+V EDKL+YPFP LVSSGRLEVR L NPS DEFSRIVESCLPSF+YLQGEQLGNDEIGSLVWNGVDL LED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPT+VYLEIPNGGRIAEALHSKGIPY++YWNSTFSCYAAAHFRNAL SV+QSSSTHTWDAFQLARAAFRL+CMGS++ LPGI  DSI S LE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQ++GEPLKINVEPP+VD GEGEDEDGSLETL AISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSF+RGVVTMRCDMVTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIEN+QLVHAM+DCE NK HMHEPRKSASVACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPDMSYRSLVALG+GGVQGLPVASFEKEDAERLLFFCS D NDKHS+QLLVSVLPSWFKPP PSRKRVEPSQGIRSTLSHD  AYA IPSIRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         EEPAPMNGFK P+LP+RKRLKVASMRP+PR++RNKM PFSGLTE DGNNGG  K   PVVTPSKHVTVGSTSAT RKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGIMN-PRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRVANGSPQG+ N PRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGIMN-PRIP

XP_008460343.1 PREDICTED: AT-rich interactive domain-containing protein 4-like [Cucumis melo]0.0e+0089.2Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+ EDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYC+GSNY L  IADD++M DLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP+V+ GEGEDEDGSLE LPAISIHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNK H+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNK+ PF+GLTEVDGNNGGLSK SL VVT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

XP_011651722.1 AT-rich interactive domain-containing protein 4 isoform X1 [Cucumis sativus]0.0e+0090.2Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKVKCEEEV EDKL+YPFPELVS GRLEVRVLANPSKDEFSRIVESCLPSF+YLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFN ALPT VYLEIP+GGRIAEALHSKGIPYLIYWNSTFSCYAAAHFR+ALLSVVQSSSTHTWDAFQLARAAFRLY +GSNYGLPGIADDS+MSDLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP++D GEGEDEDGSLE LPAI+IHDNNVT+RFLICGVPCTPD CLLRSLEDGL+ALL IE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE+NQLVHA+HDCEGNK HMH+PRKSAS+ACGATVFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNKM PF+GLTEVDGNNGGLSK SL +VTP KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

XP_038883881.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida]0.0e+0093.72Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCGSVPK+KCEEEV EDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNT LPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAF+LYC+GSNYGLPGIADDSIMSDLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKINVEPP+VDAGEGED DGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNK HMHEPRKSASVACGATVFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDS AYA IPSIRR+ 
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         EEPAPMNGFKAP+LPTRKRLKVASMRPVPR++RNK+ PFSGL EVD NNG LSK SLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

TrEMBL top hitse value%identityAlignment
A0A0A0LEG9 ARID domain-containing protein0.0e+0090.2Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKVKCEEEV EDKL+YPFPELVS GRLEVRVLANPSKDEFSRIVESCLPSF+YLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFN ALPT VYLEIP+GGRIAEALHSKGIPYLIYWNSTFSCYAAAHFR+ALLSVVQSSSTHTWDAFQLARAAFRLY +GSNYGLPGIADDS+MSDLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP++D GEGEDEDGSLE LPAI+IHDNNVT+RFLICGVPCTPD CLLRSLEDGL+ALL IE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE+NQLVHA+HDCEGNK HMH+PRKSAS+ACGATVFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNKM PF+GLTEVDGNNGGLSK SL +VTP KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

A0A1S3CCC9 AT-rich interactive domain-containing protein 4-like0.0e+0089.2Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+ EDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYC+GSNY L  IADD++M DLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP+V+ GEGEDEDGSLE LPAISIHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNK H+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNK+ PF+GLTEVDGNNGGLSK SL VVT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

A0A5A7UJ97 AT-rich interactive domain-containing protein 4-like0.0e+0089.32Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+ EDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYC+GSNY L  IADD++M DLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP+V+ GEGEDEDGSLE LPAISIHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE      S R L++   +      APP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNK H+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNK+ PF+GLTEVDGNNGGLSK SL VVT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

A0A5D3D6I3 AT-rich interactive domain-containing protein 4-like0.0e+0089.2Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+ EDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSF+YLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYL+YWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYC+GSNY L  IADD++M DLE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQL+GEPLKI+VEPP+V+ GEGEDEDGSLE LPAISIHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCD+VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNK H+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPD+SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVEPSQGIR++LSHDS +YA IP+IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         E+P PMNGFKA + P RK+LKVASMRPVPR++RNK+ PF+GLTEVDGNNGGLSK SL VVT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI NPRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIMNPRIP

A0A6J1GVQ9 AT-rich interactive domain-containing protein 4-like0.0e+0089.1Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSV+AARQTCSLLAVTCGSV K KCEE+V EDKL+YPFP LVSSGRLEVR L NPS DEFSRIVESCLPSF+YLQGEQLGNDEIGSLVWNGVDL LED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE
        LCGLFNTALPT+VYLEIPNGGRIAEALHSKGIPY++YWNSTFSCYAAAHFRNAL SV+QSSSTHTWDAFQLARAAFRL+CMGS++ LPGI  DSI S LE
Subjt:  LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLE

Query:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP
        PQ+ GEPLKINVEPP+VD GEGEDEDGSLETL AISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE    +   +            FSAPP
Subjt:  PQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPP

Query:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSF+RGVVTMRCD+VTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIENNQLVHAM+DCE NK HMHEPRKSASVACGATVFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG
        VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCS D NDKHS+QLLVSVLPSWFKPP PSRKRVEPSQGIRSTLSHD  AYA IP IRR+G
Subjt:  VLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIG

Query:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
         EEPAPMNGFK P+L TRKRLKVASMRP+PR++RNKM PFSGLTE DGNNGG  K   PVVTPSKHVTVGSTSAT RKSFSSSSQSKQIISLNPLPLKKH
Subjt:  GEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPI DCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRH
Subjt:  GCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGIMN-PRIP
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRVANGSPQG+ N PRIP
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGIMN-PRIP

SwissProt top hitse value%identityAlignment
Q6NQ79 AT-rich interactive domain-containing protein 44.4e-26157.87Show/hide
Query:  MLHSVVAARQTCSLLAVTCGS-VPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLG-NDEIGSLVWNGVDLSL
        M H    +R  C+++AV  G+ +     + +    + +YPFP+L SSGRL+ +VL NP+ +EF   V S    F+YLQGE  G +DE+G LV    D S 
Subjt:  MLHSVVAARQTCSLLAVTCGS-VPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLG-NDEIGSLVWNGVDLSL

Query:  ED-LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMS
         D L  LF + LPT VYLE+PNG  +A+AL+SKG+ Y+IYW + FS YAA HFR++L SV+QSS + TWD F +A A+FRLYC   N  LP  ++  +  
Subjt:  ED-LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMS

Query:  DLEPQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFS
        ++ P L+GEP KI+V  P+ D  E   E+ SLE+LP+I I+D +VTVRFL+CG PCT D  LL SL DGLNALL IE    +   R             S
Subjt:  DLEPQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFS

Query:  APPPPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAW
        AP PPLQAG+F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP W
Subjt:  APPPPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAW

Query:  ASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIR
        A QVLRQLAPD+SYRSLV LG+  +QGL VASFEK+DAERLLFFC    ND  +   L+S +P+W  PP P+RKR EP +                    
Subjt:  ASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIR

Query:  RIGGEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPL
            E     NG      PT +++ VA++RP+P   R+KM+PFSG +E+   +G  +KGSLP+  P KH   G T  THRK+FS S Q KQIISLNPLPL
Subjt:  RIGGEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPL

Query:  KKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTL
        KKH CGR  I  CSEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTL
Subjt:  KKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTL

Query:  KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG
        KRHYETYLLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  + +NG
Subjt:  KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG

Arabidopsis top hitse value%identityAlignment
AT3G43240.1 ARID/BRIGHT DNA-binding domain-containing protein3.1e-26257.87Show/hide
Query:  MLHSVVAARQTCSLLAVTCGS-VPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLG-NDEIGSLVWNGVDLSL
        M H    +R  C+++AV  G+ +     + +    + +YPFP+L SSGRL+ +VL NP+ +EF   V S    F+YLQGE  G +DE+G LV    D S 
Subjt:  MLHSVVAARQTCSLLAVTCGS-VPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLG-NDEIGSLVWNGVDLSL

Query:  ED-LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMS
         D L  LF + LPT VYLE+PNG  +A+AL+SKG+ Y+IYW + FS YAA HFR++L SV+QSS + TWD F +A A+FRLYC   N  LP  ++  +  
Subjt:  ED-LCGLFNTALPTIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMS

Query:  DLEPQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFS
        ++ P L+GEP KI+V  P+ D  E   E+ SLE+LP+I I+D +VTVRFL+CG PCT D  LL SL DGLNALL IE    +   R             S
Subjt:  DLEPQLVGEPLKINVEPPQVDAGEGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFS

Query:  APPPPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAW
        AP PPLQAG+F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP W
Subjt:  APPPPLQAGSFSRGVVTMRCDMVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAW

Query:  ASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIR
        A QVLRQLAPD+SYRSLV LG+  +QGL VASFEK+DAERLLFFC    ND  +   L+S +P+W  PP P+RKR EP +                    
Subjt:  ASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIR

Query:  RIGGEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPL
            E     NG      PT +++ VA++RP+P   R+KM+PFSG +E+   +G  +KGSLP+  P KH   G T  THRK+FS S Q KQIISLNPLPL
Subjt:  RIGGEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNNGGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPL

Query:  KKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTL
        KKH CGR  I  CSEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTL
Subjt:  KKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTL

Query:  KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG
        KRHYETYLLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  + +NG
Subjt:  KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCATTCCGTAGTAGCTGCTAGACAAACTTGTAGCCTACTTGCTGTCACCTGCGGAAGTGTACCTAAAGTAAAATGCGAGGAGGAAGTTGTTGAGGATAAGTTGAG
ATATCCATTTCCGGAATTAGTTTCTTCCGGACGATTGGAGGTTCGAGTTTTGGCAAATCCAAGCAAGGATGAATTTAGTAGAATTGTGGAATCATGTCTACCGAGCTTCA
TCTACTTGCAAGGGGAACAACTCGGAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGTGGACTATTCAATACTGCACTACCA
ACCATTGTGTATTTAGAAATTCCTAATGGAGGCAGAATAGCGGAGGCTCTTCATTCTAAGGGAATTCCTTATCTCATATATTGGAACAGCACATTTTCATGTTATGCTGC
AGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACGTGCTGCTTTTAGGCTTTATTGTATGGGGAGCA
ATTATGGTCTGCCTGGGATTGCTGATGACAGTATTATGAGTGATCTAGAGCCACAGCTTGTTGGGGAACCCCTAAAGATTAACGTAGAACCCCCCCAGGTAGATGCAGGT
GAAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCAGCCATAAGTATTCATGACAATAATGTGACCGTGAGATTTCTTATCTGTGGAGTGCCTTGCACGCCGGATGC
CTGCTTATTGAGATCATTGGAGGATGGCCTTAATGCGCTTTTGAACATTGAAGAATTAAATCTTCAGTTCAGTGCGCGTCAATTGTATATGCTATTAATGACTTTAAAAA
TATGGTTCAGTGCTCCTCCACCACCTCTTCAAGCAGGATCCTTTTCTCGTGGAGTTGTGACAATGCGGTGTGATATGGTGACTTGTAGTTCAGCCCACATCTCAATATTG
GTGTCTGGTAGCGCTCATACTTGTTTTGATGATCAGCTCTTGGAGAAACATATAAAGCATGAGATTATTGAAAATAACCAGTTAGTTCATGCCATGCACGACTGTGAGGG
CAACAAACCTCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCCTGTGGGGCAACAGTATTTGAGGTTTCTATGAAGGTTCCCGCTTGGGCATCACAGGTCTTGAGGC
AACTTGCACCTGATATGTCATATCGGAGTTTAGTTGCACTCGGTATTGGGGGAGTTCAAGGTTTACCTGTTGCTTCTTTTGAGAAAGAAGATGCCGAGCGATTGCTCTTC
TTTTGTTCAGGGGATGAGAATGATAAACATTCAGAGCAGTTGCTTGTAAGTGTATTACCCAGCTGGTTTAAGCCACCTACTCCTAGTAGAAAGAGAGTAGAACCAAGCCA
AGGAATAAGAAGCACTCTTTCACATGACAGTCCTGCATATGCAAAGATTCCTTCCATTAGAAGAATAGGTGGAGAGGAGCCTGCACCAATGAATGGGTTCAAGGCACCCA
TACTCCCAACGAGGAAAAGATTAAAAGTAGCCTCCATGAGGCCTGTTCCACGTATGAATAGGAATAAAATGATGCCTTTCTCTGGATTGACTGAAGTAGATGGGAATAAC
GGAGGCCTATCCAAGGGTAGTTTACCTGTTGTTACCCCGTCAAAGCATGTAACTGTAGGATCAACTTCTGCAACACACAGAAAATCTTTTTCAAGCTCATCTCAGTCTAA
GCAGATTATTTCCTTGAACCCACTGCCTCTAAAGAAGCATGGTTGTGGAAGAAACCCAATTCATGATTGCTCAGAGGAAGAGTTCTTGAAAGATGTTATGGAGTTTTTAC
TGCTTAGAGGACATTCACGACTTATTCCTCAAGGTGGACTTGAGGAGTTTCCAGATGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTGTATAAGGAGGTGGTC
ACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAGGGGCAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACT
GAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCACGATGATGTAGATGGAGAATGCTGTCTTCTGTGCCACAGTAGTGCAGCAGGGGATTGGGTGAACT
GTGGTATTTGTGGTGAATGGGCCCATTTTGGGTGCGATCGAAGGCAGGGTCTAGGAGCATTTAAGGATTATGCCAAAACAGATGGGCTAGAGTATGTTTGTCCACATTGT
AGCATTACAACTTACAAGAAGAAACCACACAGAGTAGCAAACGGGTCTCCACAAGGAATAATGAATCCACGGATACCTTGA
mRNA sequenceShow/hide mRNA sequence
GAAGAAGTTAAAAGTTCCACCCACATGATGCTCAAGAAAAACAATATAAGATGAAGAGTGTATACAGAAAATGTTGAATGTGGCGCCATCCTCATTTCTTTATCCTACAA
ATCCACCAATTTCTTGTTAAGTCCGGTCAATTTTTCATTTCTTTGATGTTGATTATTGATGGACCCTCCTCATATTCAACATTCAAATCATTTGTTTTCATCCAAATCAC
CTAAGATACGTAGAGATTCTGTATATATACACTATAAACCAAACAATTAACTGAAAATTCAGCCGAGCGATCTTCGCCAGTGCAAGTTTACTCCACTTCCTGGTTCTGCA
TTCGTGCAGATGCGGCCGTATGAGGAGAAAAGAACAAGCTTCAGCTATGCTTCTTGACGAAACTTGAAGATTCTGTTCATCAATCCAATCCAAGATAATAAAAGCCAACA
GAAAAAGAAACCCAAATTACGAACAAGACAAGGGTTGAGATCTTTTACGAACAATTTTCGTTAAATCCGTGCTTTGTGGGCACCAAATTAACGCAAATTTCTGGATTTGG
GATGTGTTGGTATCTGATTTTGTTTACAAATTTCAAAGAAGACAAAGAATAAACAGCCGAGGGGAAGTAAATTTGGAAGGCCATGAGTTGTTTTCTTGATTGGATTGGAA
TCGCATTTTGTGGGAAGGACTGTGATATAAACCTTTCCCTTTTCTTGTTGGGTTTAACGTGTTTTGAATCTTTTCATATGTTTGACTTCCGGTTTCGAGATGTGGGAATG
TTTTTATCTGACACCCATTTATTTATCAGAAAGGAACTAATTATTTGATTTAATTTTTGGCGACTGTGTGAGCTTTTGATTTTCTTGTTTATTGCGTGGAATGTGGATAG
GGAGAAAGGTAAGAATATTTCTTATTTCTATTTTTTTTTCAGCGATTTTTTATTGATGTGAATTTTAAGGCCCTAGGGTTCACTTTTCGGGCTCCAGTCTTCTATCGCTG
CCCAAATTTCTTCCCCTTTCGTATGATTTTTAGGGGTTACGGACTCTGTCTTGTCTTGTCGTCGATTTTTCGTCATCTCATCGTGCTGGGTGTTCAAATTTCCTTGATTT
TTCCCATTCCCTTGGTTGGTTCTCCGGTTTCTGTTGTCTGAGGCGCTGAATTTCCTCTCGCTCCTTCTGCTGCCGACTGTACATGCTGGTCATAGTTGGAGATTCTTAAC
GGATAGGGTTTGAGTTCTAAATTAGTCCTGCATTCACTTTTGCAGAATTTTTTTGCTGAAGTGGGTGACTTTTGGGTTTAATTGTTGGTCCGGTAAGAATTAAAGGTTCC
GGACGGCTGTGATTTTGATCTTAAAGCCCGTCCCCTTGAGGGGCTTTAGGATGATGGCATTGTTTCAACCAGATTCTTTAGTTCTATGTCATGGCATATGGGTAGTGCTC
TCTGAGATTTTTAACTGATTCCAGGGCATTGGGGCTCTGTTTTAGCGAATGTTTGTGTTCTAATGGTTCCACAAGCAACTTGAAAGATCTCATTGCTTGAGGAAGTCCGC
TTAATGCAATTTTCTCCCGGTTTTTCACGTAAACGATGCTTCATTCCGTAGTAGCTGCTAGACAAACTTGTAGCCTACTTGCTGTCACCTGCGGAAGTGTACCTAAAGTA
AAATGCGAGGAGGAAGTTGTTGAGGATAAGTTGAGATATCCATTTCCGGAATTAGTTTCTTCCGGACGATTGGAGGTTCGAGTTTTGGCAAATCCAAGCAAGGATGAATT
TAGTAGAATTGTGGAATCATGTCTACCGAGCTTCATCTACTTGCAAGGGGAACAACTCGGAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTG
AAGATTTATGTGGACTATTCAATACTGCACTACCAACCATTGTGTATTTAGAAATTCCTAATGGAGGCAGAATAGCGGAGGCTCTTCATTCTAAGGGAATTCCTTATCTC
ATATATTGGAACAGCACATTTTCATGTTATGCTGCAGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGC
ACGTGCTGCTTTTAGGCTTTATTGTATGGGGAGCAATTATGGTCTGCCTGGGATTGCTGATGACAGTATTATGAGTGATCTAGAGCCACAGCTTGTTGGGGAACCCCTAA
AGATTAACGTAGAACCCCCCCAGGTAGATGCAGGTGAAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCAGCCATAAGTATTCATGACAATAATGTGACCGTGAGA
TTTCTTATCTGTGGAGTGCCTTGCACGCCGGATGCCTGCTTATTGAGATCATTGGAGGATGGCCTTAATGCGCTTTTGAACATTGAAGAATTAAATCTTCAGTTCAGTGC
GCGTCAATTGTATATGCTATTAATGACTTTAAAAATATGGTTCAGTGCTCCTCCACCACCTCTTCAAGCAGGATCCTTTTCTCGTGGAGTTGTGACAATGCGGTGTGATA
TGGTGACTTGTAGTTCAGCCCACATCTCAATATTGGTGTCTGGTAGCGCTCATACTTGTTTTGATGATCAGCTCTTGGAGAAACATATAAAGCATGAGATTATTGAAAAT
AACCAGTTAGTTCATGCCATGCACGACTGTGAGGGCAACAAACCTCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCCTGTGGGGCAACAGTATTTGAGGTTTCTAT
GAAGGTTCCCGCTTGGGCATCACAGGTCTTGAGGCAACTTGCACCTGATATGTCATATCGGAGTTTAGTTGCACTCGGTATTGGGGGAGTTCAAGGTTTACCTGTTGCTT
CTTTTGAGAAAGAAGATGCCGAGCGATTGCTCTTCTTTTGTTCAGGGGATGAGAATGATAAACATTCAGAGCAGTTGCTTGTAAGTGTATTACCCAGCTGGTTTAAGCCA
CCTACTCCTAGTAGAAAGAGAGTAGAACCAAGCCAAGGAATAAGAAGCACTCTTTCACATGACAGTCCTGCATATGCAAAGATTCCTTCCATTAGAAGAATAGGTGGAGA
GGAGCCTGCACCAATGAATGGGTTCAAGGCACCCATACTCCCAACGAGGAAAAGATTAAAAGTAGCCTCCATGAGGCCTGTTCCACGTATGAATAGGAATAAAATGATGC
CTTTCTCTGGATTGACTGAAGTAGATGGGAATAACGGAGGCCTATCCAAGGGTAGTTTACCTGTTGTTACCCCGTCAAAGCATGTAACTGTAGGATCAACTTCTGCAACA
CACAGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGATTATTTCCTTGAACCCACTGCCTCTAAAGAAGCATGGTTGTGGAAGAAACCCAATTCATGATTGCTCAGA
GGAAGAGTTCTTGAAAGATGTTATGGAGTTTTTACTGCTTAGAGGACATTCACGACTTATTCCTCAAGGTGGACTTGAGGAGTTTCCAGATGCCATACTCAACGGGAAGC
GTCTTGACCTCTATAACTTGTATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAGGGGCAGATCTTCTCTAAGATGCACAATTACACA
ATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCACGATGATGTAGATGGAGAATGCTGTCTTCT
GTGCCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGTATTTGTGGTGAATGGGCCCATTTTGGGTGCGATCGAAGGCAGGGTCTAGGAGCATTTAAGGATTATGCCA
AAACAGATGGGCTAGAGTATGTTTGTCCACATTGTAGCATTACAACTTACAAGAAGAAACCACACAGAGTAGCAAACGGGTCTCCACAAGGAATAATGAATCCACGGATA
CCTTGATTCAGTTTTGGCCTCCCCATCTCGAGGTTTTGGCTGCATTGTTTTTCTTTCTCTTTTTTTGCCTATTCTTATCCTACACGAGAAATTATTCGTTCCCATCGGCT
CGTAGATGAGTTTTGTTTGAGGTGGTGTGCTGCTAAACCATTAGGTGAAGGATTAAAGGATGAAAACTTTGAAGTGGAGAAGAGAAGTAAAGAGACTAGAGCACACAAAG
AGGACTTCATAATAGGTTCTTTTTTTGACCTCTCTTTAGGTCTAGGTAATAGTTTAGTAGTAAAGTTAGCAGCAGCATGGCAGTTAACTTTAGTAGGGTGCTGTAAAATA
CTTCTATTCTTGTAGCTAAGCCATGACTGATGTATTTTAATGAAGATAA
Protein sequenceShow/hide protein sequence
MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVVEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFIYLQGEQLGNDEIGSLVWNGVDLSLEDLCGLFNTALP
TIVYLEIPNGGRIAEALHSKGIPYLIYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCMGSNYGLPGIADDSIMSDLEPQLVGEPLKINVEPPQVDAG
EGEDEDGSLETLPAISIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEELNLQFSARQLYMLLMTLKIWFSAPPPPLQAGSFSRGVVTMRCDMVTCSSAHISIL
VSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKPHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKEDAERLLF
FCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSPAYAKIPSIRRIGGEEPAPMNGFKAPILPTRKRLKVASMRPVPRMNRNKMMPFSGLTEVDGNN
GGLSKGSLPVVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIHDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVV
TRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHC
SITTYKKKPHRVANGSPQGIMNPRIP