; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G006470 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G006470
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionAT-rich interactive domain-containing protein 4-like
Genome locationchr07:6887838..6895049
RNA-Seq ExpressionLsi07G006470
SyntenyLsi07G006470
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001606 - ARID DNA-binding domain
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR036431 - ARID DNA-binding domain superfamily
IPR042293 - AT-rich interactive domain-containing protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053551.1 AT-rich interactive domain-containing protein 4-like [Cucumis melo var. makuwa]0.0e+0090.44Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+DEDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCV SNY L  I
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE--IRGSKLQGKFSAPP
        ADD++M DL PQLIGEPLKI+VEPPEV+ GEGEDEDGSLE LPAI+IHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE  +R   L  K  APP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE--IRGSKLQGKFSAPP

Query:  PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNKHH+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVG
        VLRQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVE SQG R++LSHDSL+YANIP+IRRVG
Subjt:  VLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVG

Query:  REEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
        RE+P PMNGFKA L PARK+LKVASMRPVPR++RNK++PF+GLTEVDGNNGGLSKA LS+VT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  REEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

XP_008460343.1 PREDICTED: AT-rich interactive domain-containing protein 4-like [Cucumis melo]0.0e+0091.68Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+DEDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCV SNY L  I
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADD++M DL PQLIGEPLKI+VEPPEV+ GEGEDEDGSLE LPAI+IHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IEIRGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNKHH+HEPRKSAS+ACGA VFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVE SQG R++LSHDSL+YANIP+IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        +P PMNGFKA L PARK+LKVASMRPVPR++RNK++PF+GLTEVDGNNGGLSKA LS+VT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

XP_011651722.1 AT-rich interactive domain-containing protein 4 isoform X1 [Cucumis sativus]0.0e+0092.81Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKVKCEEEVDEDKL+YPFPELVS GRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFN ALPT VYLEIP+GGR+AEALHSK          GIPYL+YWNSTFSCYAAAHFR+ALLSVVQSSSTHTWDAFQLARAAFRLY V SNYGLPGI
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADDS+MSDL PQLIGEPLKI+VEPPE+D GEGEDEDGSLE LPAINIHDNNVT+RFLICGVPCTPD CLLRSLEDGL+ALL IE+RGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE+NQLVHA+HDCEGNKHHMH+PRKSAS+ACGATVFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDKHSEQLLVSVLPSWFKPPTPSRKRVE SQG R++LSHDSL+YA+IP+IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        +P PMNGFKA L PARK+LKVASMRPVPR++RNKMTPF+GLTEVDGNNGGLSKA LSIVTP KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

XP_022956131.1 AT-rich interactive domain-containing protein 4-like [Cucurbita moschata]0.0e+0090.78Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSV+AARQTCSLLAVTCGSV K KCEE+VDEDKL+YPFP LVSSGRLEVR L NPS DEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDL LED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPT+VYLEIPNGGR+AEALHSK          GIPY+MYWNSTFSCYAAAHFRNAL SV+QSSSTHTWDAFQLARAAFRL+C+ S++ LPGI
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
          DSI S L PQ+ GEPLKINVEPP+VD GEGEDEDGSLETL AI+IHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRG KLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSF+RGVVTMRCDIVTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIENNQLVHAM+DCE NKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCS D NDKHS+QLLVSVLPSWFKPP PSRKRVE SQG RSTLSHD LAYANIP IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        EPAPMNGFK PLL  RKRLKVASMRP+PRV+RNKMTPFSGLTE DGNNGG  KA   +VTPSKHVTVGSTSAT RKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVVNGSPQGITNP
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRV NGSPQG+TNP
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVVNGSPQGITNP

XP_038883881.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida]0.0e+0095.71Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCGSVPK+KCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNT LPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAF+LYCV SNYGLPGI
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADDSIMSDL PQLIGEPLKINVEPPEVDAGEGED DGSLETLPAI+IHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD+NDKHSEQLLVSVLPSWFKPPTPSRKRVE SQG RSTLSHDSLAYANIPSIRRV RE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        EPAPMNGFKAPLLP RKRLKVASMRPVPRV+RNK+TPFSGL EVD NNG LSKA L +VTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

TrEMBL top hitse value%identityAlignment
A0A0A0LEG9 ARID domain-containing protein0.0e+0092.81Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKVKCEEEVDEDKL+YPFPELVS GRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFN ALPT VYLEIP+GGR+AEALHSK          GIPYL+YWNSTFSCYAAAHFR+ALLSVVQSSSTHTWDAFQLARAAFRLY V SNYGLPGI
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADDS+MSDL PQLIGEPLKI+VEPPE+D GEGEDEDGSLE LPAINIHDNNVT+RFLICGVPCTPD CLLRSLEDGL+ALL IE+RGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE+NQLVHA+HDCEGNKHHMH+PRKSAS+ACGATVFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDKHSEQLLVSVLPSWFKPPTPSRKRVE SQG R++LSHDSL+YA+IP+IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        +P PMNGFKA L PARK+LKVASMRPVPR++RNKMTPF+GLTEVDGNNGGLSKA LSIVTP KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

A0A1S3CCC9 AT-rich interactive domain-containing protein 4-like0.0e+0091.68Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+DEDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCV SNY L  I
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADD++M DL PQLIGEPLKI+VEPPEV+ GEGEDEDGSLE LPAI+IHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IEIRGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNKHH+HEPRKSAS+ACGA VFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVE SQG R++LSHDSL+YANIP+IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        +P PMNGFKA L PARK+LKVASMRPVPR++RNK++PF+GLTEVDGNNGGLSKA LS+VT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

A0A5A7UJ97 AT-rich interactive domain-containing protein 4-like0.0e+0090.44Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+DEDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCV SNY L  I
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE--IRGSKLQGKFSAPP
        ADD++M DL PQLIGEPLKI+VEPPEV+ GEGEDEDGSLE LPAI+IHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IE  +R   L  K  APP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIE--IRGSKLQGKFSAPP

Query:  PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQ
        PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNKHH+HEPRKSAS+ACGA VFEVSMKVPAWASQ
Subjt:  PPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQ

Query:  VLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVG
        VLRQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVE SQG R++LSHDSL+YANIP+IRRVG
Subjt:  VLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVG

Query:  REEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
        RE+P PMNGFKA L PARK+LKVASMRPVPR++RNK++PF+GLTEVDGNNGGLSKA LS+VT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH
Subjt:  REEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKH

Query:  GCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
        GCGRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH
Subjt:  GCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRH

Query:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

A0A5D3D6I3 AT-rich interactive domain-containing protein 4-like0.0e+0091.68Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSVVAARQTCSLLAVTCG+VPKV+CEEE+DEDKL+YPFPELVSSGRLEVRVLANPSKDEFSRIVES LPSFVYLQGEQLG+DEIGSLVWN VDLSLED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPTIVYLEIPNGGR+AEALHSK          GIPYLMYWNSTFS YAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCV SNY L  I
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
        ADD++M DL PQLIGEPLKI+VEPPEV+ GEGEDEDGSLE LPAI+IHDNNVT+RFLICG+PCTPDACLLRSLEDGLNALL IEIRGSKLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHA+HDCEGNKHH+HEPRKSAS+ACGA VFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGD NDK+SEQLLVSVLPSWFKPPTPSR+RVE SQG R++LSHDSL+YANIP+IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        +P PMNGFKA L PARK+LKVASMRPVPR++RNK++PF+GLTEVDGNNGGLSKA LS+VT  KHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRV NGSPQGITNPRI
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNGSPQGITNPRI

A0A6J1GVQ9 AT-rich interactive domain-containing protein 4-like0.0e+0090.78Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED
        MLHSV+AARQTCSLLAVTCGSV K KCEE+VDEDKL+YPFP LVSSGRLEVR L NPS DEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDL LED
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLED

Query:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI
        LCGLFNTALPT+VYLEIPNGGR+AEALHSK          GIPY+MYWNSTFSCYAAAHFRNAL SV+QSSSTHTWDAFQLARAAFRL+C+ S++ LPGI
Subjt:  LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGI

Query:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP
          DSI S L PQ+ GEPLKINVEPP+VD GEGEDEDGSLETL AI+IHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRG KLQGKFSAPPPP
Subjt:  ADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPP

Query:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
        LQAGSF+RGVVTMRCDIVTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIENNQLVHAM+DCE NKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL
Subjt:  LQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVL

Query:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE
        RQLAPD SYRSLVALGIGGVQGLPVASFEKEDAERLLFFCS D NDKHS+QLLVSVLPSWFKPP PSRKRVE SQG RSTLSHD LAYANIP IRRVGRE
Subjt:  RQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGRE

Query:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC
        EPAPMNGFK PLL  RKRLKVASMRP+PRV+RNKMTPFSGLTE DGNNGG  KA   +VTPSKHVTVGSTSAT RKSFSSSSQSKQIISLNPLPLKKHGC
Subjt:  EPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGC

Query:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE
        GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRHYE
Subjt:  GRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYE

Query:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVVNGSPQGITNP
        TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRV NGSPQG+TNP
Subjt:  TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVVNGSPQGITNP

SwissProt top hitse value%identityAlignment
Q6NQ79 AT-rich interactive domain-containing protein 44.7e-26358.45Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVD--EDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLG-NDEIGSLVWNGVDLS
        M H    +R  C+++AV  G+        ++D    + +YPFP+L SSGRL+ +VL NP+ +EF   V S    FVYLQGE  G +DE+G LV    D S
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVD--EDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLG-NDEIGSLVWNGVDLS

Query:  LED-LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYG
          D L  LF + LPT VYLE+PNG  +A+AL+SK          G+ Y++YW + FS YAA HFR++L SV+QSS + TWD F +A A+FRLYC   N  
Subjt:  LED-LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYG

Query:  LPGIADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSA
        LP  ++  +  ++GP L+GEP KI+V  PE D  E   E+ SLE+LP+I I+D +VTVRFL+CG PCT D  LL SL DGLNALL IE+RGSKL  + SA
Subjt:  LPGIADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSA

Query:  PPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWA
        P PPLQAG+F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP WA
Subjt:  PPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWA

Query:  SQVLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRR
         QVLRQLAPD SYRSLV LG+  +QGL VASFEK+DAERLLFFC    ND  +   L+S +P+W  PP P+RKR E                        
Subjt:  SQVLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRR

Query:  VGREEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLK
          RE     NG      P  +++ VA++RP+P   R+KM PFSG +E+   +G  +K   S+  P KH   G T  THRK+FS S Q KQIISLNPLPLK
Subjt:  VGREEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLK

Query:  KHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLK
        KH CGR  IQ CSEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLK
Subjt:  KHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLK

Query:  RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNG
        RHYETYLLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  +  NG
Subjt:  RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNG

Arabidopsis top hitse value%identityAlignment
AT3G43240.1 ARID/BRIGHT DNA-binding domain-containing protein3.3e-26458.45Show/hide
Query:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVD--EDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLG-NDEIGSLVWNGVDLS
        M H    +R  C+++AV  G+        ++D    + +YPFP+L SSGRL+ +VL NP+ +EF   V S    FVYLQGE  G +DE+G LV    D S
Subjt:  MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVD--EDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLG-NDEIGSLVWNGVDLS

Query:  LED-LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYG
          D L  LF + LPT VYLE+PNG  +A+AL+SK          G+ Y++YW + FS YAA HFR++L SV+QSS + TWD F +A A+FRLYC   N  
Subjt:  LED-LCGLFNTALPTIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYG

Query:  LPGIADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSA
        LP  ++  +  ++GP L+GEP KI+V  PE D  E   E+ SLE+LP+I I+D +VTVRFL+CG PCT D  LL SL DGLNALL IE+RGSKL  + SA
Subjt:  LPGIADDSIMSDLGPQLIGEPLKINVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSA

Query:  PPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWA
        P PPLQAG+F+RGVVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E  QLVH++ + E  K    EPR+SAS+ACGA+V EVSM+VP WA
Subjt:  PPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWA

Query:  SQVLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRR
         QVLRQLAPD SYRSLV LG+  +QGL VASFEK+DAERLLFFC    ND  +   L+S +P+W  PP P+RKR E                        
Subjt:  SQVLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFCSGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRR

Query:  VGREEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLK
          RE     NG      P  +++ VA++RP+P   R+KM PFSG +E+   +G  +K   S+  P KH   G T  THRK+FS S Q KQIISLNPLPLK
Subjt:  VGREEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGGLSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLK

Query:  KHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLK
        KH CGR  IQ CSEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLK
Subjt:  KHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLK

Query:  RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNG
        RHYETYLLEYE AHDDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  +  NG
Subjt:  RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVVNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCATTCCGTAGTAGCTGCTAGACAAACTTGTAGCCTACTTGCTGTCACCTGTGGAAGTGTACCTAAAGTAAAATGCGAGGAGGAAGTTGATGAGGATAAGTTGAG
ATATCCGTTTCCAGAATTAGTTTCTTCTGGACGATTGGAGGTTCGAGTTTTGGCAAATCCAAGCAAGGATGAATTTAGTAGAATTGTGGAATCATGTCTACCGAGCTTCG
TCTACTTGCAAGGGGAACAACTTGGAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGTGGACTATTCAATACTGCACTACCA
ACCATTGTGTATTTAGAAATCCCAAATGGAGGCAGAATGGCAGAGGCTCTTCATTCTAAGGTGACTTATATGAGATTCATCATTGTGCAGGGAATTCCTTATCTCATGTA
TTGGAACAGCACATTTTCATGTTATGCTGCAGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACGCG
CTGCTTTTAGGCTTTATTGTGTGAGGAGCAATTATGGTCTGCCCGGGATTGCTGATGACAGTATTATGAGTGATCTAGGGCCACAGCTTATTGGGGAACCCCTAAAGATT
AACGTAGAACCTCCCGAGGTAGATGCAGGTGAAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCAGCCATAAATATTCATGATAATAATGTGACCGTGAGATTTCT
TATCTGTGGAGTGCCTTGCACACCGGATGCCTGCTTATTGAGATCATTGGAGGATGGCCTTAATGCGCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGT
TTAGTGCTCCTCCACCACCTCTTCAAGCAGGATCCTTTTCTCGTGGAGTTGTGACAATGCGGTGTGATATAGTGACTTGTAGTTCAGCCCACATCTCAATATTGGTGTCT
GGTAGCGCTCATACTTGTTTTGATGACCAGCTCTTGGAGAAACACATCAAACATGAGATTATTGAAAATAACCAATTAGTTCATGCCATGCATGATTGTGAGGGCAACAA
ACATCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCCTGTGGGGCAACAGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCTTGAGGCAACTTG
CACCTGATACGTCGTATCGGAGTTTAGTTGCACTCGGCATTGGGGGAGTTCAGGGTTTACCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGATTGCTCTTCTTTTGT
TCAGGGGATGATAATGATAAACATTCAGAGCAGTTGCTTGTAAGTGTATTACCCAGCTGGTTTAAGCCACCTACTCCTAGTAGAAAGAGAGTAGAATCAAGCCAAGGAAA
AAGGAGCACTCTTTCACATGACAGTCTTGCATATGCAAACATCCCTTCCATTAGAAGAGTAGGTAGAGAGGAACCTGCACCAATGAATGGGTTCAAGGCACCCTTACTCC
CAGCGAGGAAAAGATTAAAAGTAGCCTCCATGAGGCCTGTTCCACGTGTGAATAGGAATAAAATGACGCCTTTCTCTGGATTGACTGAAGTAGATGGGAATAACGGAGGT
CTATCCAAGGCTATTTTATCCATTGTTACTCCATCAAAGCATGTAACTGTAGGATCAACTTCTGCAACACACCGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGAT
TATTTCCTTAAATCCATTGCCTCTAAAGAAGCATGGTTGTGGAAGAAACCCAATTCAAGATTGCTCAGAGGAGGAGTTCTTAAAGGATGTCATGGAGTTTTTATTACTTA
GAGGACATTCACGACTTATTCCTCAAGGTGGACTTGAGGAGTTTCCAGATGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTGTATAAGGAGGTGGTCACCCGA
GGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAGGGGCAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACATTGAAAAG
ACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGCCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGCA
TTTGTGGTGAATGGGCTCATTTCGGGTGCGATCGAAGGCAGGGTCTAGGTGCATTTAAGGATTATGCCAAAACAGATGGGCTAGAGTATGTTTGTCCACATTGTAGCATT
ACAACTTACAAGAAGAAACCACACAGAGTAGTAAACGGGTCTCCGCAAGGAATAACGAATCCACGGATATCTTGA
mRNA sequenceShow/hide mRNA sequence
CAAATCTCAAACATGAAGAAGTTAAAGTTTAGAATCTTCCACCCACTTGATGTTTAAGAAAAACAATATAAGATGAAGATTGTTGTATTGAAAATGTTGAATGTGGCGCC
ATCCTCATCTCTTTATCTTACAAACCCACCAATTTCTTGTCGAGTCCGGTCAATTTCTCATTTCTTTTATCTTGATTCTTGACAGACCCTCTTCATATTCAACATTCAAA
TAAATACGTCGAGATTCTGTATATACACATTATAAACCAAACAATTAACTGAAAATTCAGCCGATCCATCTTCGCCAGTGCATAAGTTTACTCCACTTCCTGCTTCTGCA
TTCGTGAAGATGCGGCCGTATGAGGAGAAAAGAACAAGCTTCAGCTTTGCTTCTTTGACGAAACTTGAAGAATCTGTTCATCAATTCAATCCAAGTTAATTAAAGCCAAC
AGAAAAGGAAACCCAAATTACGAACAAGACAAGATAAGGGTTGAAATCTTTCACCAACAATTTTCGCTAAATCCGTGCTTTGTGGGCACCCAATTAACGCAAATTTCTGG
ATTTGGGATGTGTTGTGATCTGATTTTGTTTACAAATTTCATTGAAGACAAAGAATAAAAAGCCGAGGGGAAGTAAATTTGGAAAGTTGTTTACTTGATTGGATTGGAAT
CGCAGTTTGTGGGAAGGACTGTGATATAAACATTTCCCCTTTCTTGTTGGGTTTAATGTGTTTTGAATCTCTTGATATGTTTGACTTCCGGTTTCGAGATGTGGGAATGT
TTTTATCTGACACCCATTTATTTAACAGAAAGGAACTAATTATTTGATTTAATTTTTGGCAACTGTGTGAGCTTTTGATTTTCTTGTTTATTGCGTGGAATGTGGATAGG
GAGAAAGGTAAGAATATTTCTTATTTCTATTTTTTTTTCAGCGATTTTTTATTGATGTGAATTTTAAGGCCCTAGGGTTCACTTTTCGGGCTCCAGTCTTCTATCGCTGC
CCGAATTTCTTCCCCTTTCGTATGATTTTTAGGGGTTACGGACTCTGTCTTGTCTTTTCGTCGATTTTTCTTCATCTCATCGCGCTGGGTGTTCAAATTTCCTTGATTTT
CCCCATTCGCTTGGTGGGTTCTCCGGTTTCTGTTGTCTGAGGTGCTGAATTTCCTCTCGCTCCTTCTGGTGGTGACTGGTTTGTTTAATTTTGAGGTTTTCTCTTATGGG
TTTTGCTTTGGAGCTTCGATTATGATGGTTTTTTTCCTTTTATGAATCCTTGAAGCGCTTTTGCTCAACTAGCAGCTGAATATATATGCTTTGCTCTTGCAGTACATGTT
GGTCATAGTTGGTGATTCTTAACGGATAGGGTTTGAGTTTTAAATTAGTCCTGCATTCAGTTTTGCAGAAATTTTGTGCTGAAGTGGGTGGTTTTTAGGGTTTAATTGTT
GGTCCAGTAAGAATTAAAGGTTCCGGACGGCTGTGATTTTGATCTTAAAGCCCATCCCCCTGAAGGGCTTTAGGGTGATGGCATTGTTTCAACCTCAGAATGAAAATTCA
GGTTCTTTAGTTCTATGTCATGGCATATGGGTATTGGTCTCTGAGATTTTTAACTGATGCCAGGGCATTAGGGCTCTGTTTTAGCGAATGTTTGTGTTCTAATGGTTCCA
CAAGCCAGTTGAAAGATCTCATTGCTTGAGGAAGTTCGCTTAATGCAATTTTCTCCCGGTTCTTCTTATAAACGATGCTTCATTCCGTAGTAGCTGCTAGACAAACTTGT
AGCCTACTTGCTGTCACCTGTGGAAGTGTACCTAAAGTAAAATGCGAGGAGGAAGTTGATGAGGATAAGTTGAGATATCCGTTTCCAGAATTAGTTTCTTCTGGACGATT
GGAGGTTCGAGTTTTGGCAAATCCAAGCAAGGATGAATTTAGTAGAATTGTGGAATCATGTCTACCGAGCTTCGTCTACTTGCAAGGGGAACAACTTGGAAATGATGAAA
TTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGTGGACTATTCAATACTGCACTACCAACCATTGTGTATTTAGAAATCCCAAATGGAGGCAGA
ATGGCAGAGGCTCTTCATTCTAAGGTGACTTATATGAGATTCATCATTGTGCAGGGAATTCCTTATCTCATGTATTGGAACAGCACATTTTCATGTTATGCTGCAGCTCA
TTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACGCGCTGCTTTTAGGCTTTATTGTGTGAGGAGCAATTATG
GTCTGCCCGGGATTGCTGATGACAGTATTATGAGTGATCTAGGGCCACAGCTTATTGGGGAACCCCTAAAGATTAACGTAGAACCTCCCGAGGTAGATGCAGGTGAAGGT
GAAGATGAAGATGGTTCTTTAGAAACCCTCCCAGCCATAAATATTCATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAGTGCCTTGCACACCGGATGCCTGCTT
ATTGAGATCATTGGAGGATGGCCTTAATGCGCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTTAGTGCTCCTCCACCACCTCTTCAAGCAGGATCCT
TTTCTCGTGGAGTTGTGACAATGCGGTGTGATATAGTGACTTGTAGTTCAGCCCACATCTCAATATTGGTGTCTGGTAGCGCTCATACTTGTTTTGATGACCAGCTCTTG
GAGAAACACATCAAACATGAGATTATTGAAAATAACCAATTAGTTCATGCCATGCATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAATCTGCTTCAGT
TGCCTGTGGGGCAACAGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCTTGAGGCAACTTGCACCTGATACGTCGTATCGGAGTTTAGTTGCACTCG
GCATTGGGGGAGTTCAGGGTTTACCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGATTGCTCTTCTTTTGTTCAGGGGATGATAATGATAAACATTCAGAGCAGTTG
CTTGTAAGTGTATTACCCAGCTGGTTTAAGCCACCTACTCCTAGTAGAAAGAGAGTAGAATCAAGCCAAGGAAAAAGGAGCACTCTTTCACATGACAGTCTTGCATATGC
AAACATCCCTTCCATTAGAAGAGTAGGTAGAGAGGAACCTGCACCAATGAATGGGTTCAAGGCACCCTTACTCCCAGCGAGGAAAAGATTAAAAGTAGCCTCCATGAGGC
CTGTTCCACGTGTGAATAGGAATAAAATGACGCCTTTCTCTGGATTGACTGAAGTAGATGGGAATAACGGAGGTCTATCCAAGGCTATTTTATCCATTGTTACTCCATCA
AAGCATGTAACTGTAGGATCAACTTCTGCAACACACCGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGATTATTTCCTTAAATCCATTGCCTCTAAAGAAGCATGG
TTGTGGAAGAAACCCAATTCAAGATTGCTCAGAGGAGGAGTTCTTAAAGGATGTCATGGAGTTTTTATTACTTAGAGGACATTCACGACTTATTCCTCAAGGTGGACTTG
AGGAGTTTCCAGATGCCATACTCAACGGGAAGCGTCTTGACCTCTATAACTTGTATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAG
GGGCAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACATTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGC
TCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGCCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGCATTTGTGGTGAATGGGCTCATTTCGGGTGCGATCGAA
GGCAGGGTCTAGGTGCATTTAAGGATTATGCCAAAACAGATGGGCTAGAGTATGTTTGTCCACATTGTAGCATTACAACTTACAAGAAGAAACCACACAGAGTAGTAAAC
GGGTCTCCGCAAGGAATAACGAATCCACGGATATCTTGATTCAGTTTTGGCCTCCCCATCTTGAGGTTTTGGCTGCGTTGTTTTTCTTTTTTTCTTTTTGCCTATTCTTA
TCCTACAGGAGAAATTATTTGTTCCATCGGCTCGTAGATGAGTTTTGTTTGAGGTGGTGTGCTGCTAAACCATTAGCTGAAGATTAAATTTATGAGACTTGAGGATGAAA
ACTCTGAAGTGAAGAAGAGAACTAAAGAGTAGAGCACAGAAAGAGGACTTCATCATAGGGTTCTTTTTTGACCTCTTCTTTGTCTAGGTAATAGTTTAGGAGTAAAGTTA
GCAGCAGCATGGCAGTTAACTTTAGTAGGGTGCTGTAAAATACTTCTATTCTTGTAGCTAAGCCATGACTGATGTATTTTAATGAAGATATCTAATTTACAATTTTTTGT
ATATGATGATCCAAGAAAAGTAGATCAATAATCACTACTTCACATGCATTTTAAAACTTATATACAAGTTATCAAACCATC
Protein sequenceShow/hide protein sequence
MLHSVVAARQTCSLLAVTCGSVPKVKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKDEFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLEDLCGLFNTALP
TIVYLEIPNGGRMAEALHSKVTYMRFIIVQGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFRLYCVRSNYGLPGIADDSIMSDLGPQLIGEPLKI
NVEPPEVDAGEGEDEDGSLETLPAINIHDNNVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVS
GSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDTSYRSLVALGIGGVQGLPVASFEKEDAERLLFFC
SGDDNDKHSEQLLVSVLPSWFKPPTPSRKRVESSQGKRSTLSHDSLAYANIPSIRRVGREEPAPMNGFKAPLLPARKRLKVASMRPVPRVNRNKMTPFSGLTEVDGNNGG
LSKAILSIVTPSKHVTVGSTSATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTR
GGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSI
TTYKKKPHRVVNGSPQGITNPRIS