; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025565 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025565
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAT-rich interactive domain-containing protein 4-like
Genome locationtig00007935:1100293..1107199
RNA-Seq ExpressionSgr025565
SyntenySgr025565
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001606 - ARID DNA-binding domain
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR036431 - ARID DNA-binding domain superfamily
IPR042293 - AT-rich interactive domain-containing protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152656.1 AT-rich interactive domain-containing protein 4-like [Momordica charantia]0.0e+0088.14Show/hide
Query:  ARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFHT
        ARQTCSLLAVTCGSVPKVK E+DVAED+LKYPFPELVSSGRLEVRVLTNPSKDEF+RIVESCQPSFVYLQGEQLENDEIGSLVWNGV+LSLEDLCGLFHT
Subjt:  ARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFHT

Query:  ALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDHL
        ALP TVYLEIPNG R AEALHSKGIPYV+YW NT SCYAAAHFRN LLSVVQSSSTHTWDAFQLAHAAFRLHC  SNYALPG+ D IS +LEPQLIG+ L
Subjt:  ALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDHL

Query:  KINVEPIEI---DAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI
        KI+VEP EI   DAGEDED SLGTLPAISIHDNNVT+RFLICGVPCT DA LL  LEDGL+ALLN EIRGSKLQGKFSA  PPLQAGSFSRGVVTMRCD+
Subjt:  KINVEPIEI---DAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI

Query:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------
        VTCSSAHI+ILVSGSAHTCFDDQLLEKHIKHEIIE SQLVHALRDCEGN+H MHE RKSASVACGATVFE                              
Subjt:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------

Query:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKR
            GLPVASFEKEDAER LFFCSRDGNDKHSDQL LSVLPSWFKPP PSRKRVEPSQGISTVS DSLAYANI SIRRVGGEE APMNGFKA LLPARKR
Subjt:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKR

Query:  LKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        LKVATMRPIPRVHRNKMTPFSG+TEADGNNG  PKA+LP+VTPSKH TVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQ CSEEEFLKDVM
Subjt:  LKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGGL+EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKP+RVANGSPQGITNPRIP
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

XP_022949406.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata]0.0e+0084.79Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG +PKVK E+DVAE  LKYPFPELVSSGRLEV+VLTNPSK+EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGV+LSLEDLCGLF 
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPTTVYLE+PNG ++AE LHSKGIPYVIYW NTFSCYAAAHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHCV  NYALPGNAD   SDLEPQLIG+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT
         KIN+EP E+DAGEDED SL  +P IS+HDNNVT+R LICG+PCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQAGSFSRGVVTMRCDIVT
Subjt:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT

Query:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------
        CSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE SQLVH + DCEGNKHHMH+ RKSASVACGATVFE                                
Subjt:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------

Query:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL
          G PVASFEKEDAERLLFFCSRD NDKHSDQL++SVLP WFKPPTPSRKRVEPSQG+ +T+S DSLAYANI S+RRVG EE APMNGFKAPLLPARKRL
Subjt:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL

Query:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        KVATM+PIP VHRNKM  FSG TE DGN+GGQPKA+LP VTPSKH TVGSTSATQRKSFSSSSQSK QII LNPLPLKKHGCGRNP+QDCSEEEFLKDVM
Subjt:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGG+ EFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPRIP
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

XP_022998193.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima]0.0e+0084.41Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG +PKVK E+DVAE  LKYPFPELVSSGRLEV+VLTNPSK+EFSRIVESCQPSFVYLQGEQLENDE+GSLVWNGV+LSLEDLCGLF+
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPTTVYLE+PNG  +AE LHSKGIPYVIYW NTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C+  NYALPGNAD+  SDLEPQLIG+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT
         KI VEP E+DAG DED SL  LP IS+HDNNVT+R LICG+PCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQA SFSRGVVTMRCDIVT
Subjt:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT

Query:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------
        CSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE SQLVH + DCEGNKHHMH+ RKSASVACGATVFE                                
Subjt:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------

Query:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSL-DSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL
          G PVASFEKEDAERLLFFCSRD NDKHSDQL++SVLP+WFKPPTPSRKRVEPSQGI    L DSLAYANI S+RRVG EE APMNGFKAPLLPARKRL
Subjt:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSL-DSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL

Query:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        KVATMRPIP VHRNKM  FSG TE DGNNG QPKA+LP+VTPSKH T+GSTSATQRKSFSSSSQSK QII LNPLPLKKHGCGRNP+QDCSEEEFLKDVM
Subjt:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGG+ EFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANGSPQGITNPR+P
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

XP_023524673.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0084.92Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG +PKVK E+DVAE  LKYPFPEL SSGRLEV+VLTNPSK+EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGV+LSLEDLCGLF 
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPTTVYLE+PNG  +AE LHSKGIPYVIYW NTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCV  NYALPGNAD   SDLEPQLIG+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT
         KINVEP E+DAGEDED SL  LP IS+HDNNVT+R LICG+PCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQAGSFSRGVVTMRCDIVT
Subjt:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT

Query:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------
        CSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE SQLVH + DCEGNKHHMH+ RKSASVACGATVFE                                
Subjt:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------

Query:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL
          G PVASFEKEDAERLLFFCSRD NDKHSDQL++SVLP WFKPPTPSRKRVEPSQG+ +T+S DSLAYANI S+RRVG EE APMNGFKAPLLPARKRL
Subjt:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL

Query:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        KVATMRPIP VHRNKM  FSG TE DGN+GGQPKA+LP VTPSKH TVGSTSATQRKSFSSSSQSK QII LNPLPLKKHGCGRNP+QDCSEEEFLKDVM
Subjt:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGG+ EFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLC SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPR+P
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

XP_038883881.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida]0.0e+0085.86Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCGSVPK+K E++V EDKL+YPFPELVSSGRLEVRVL NPSKDEFSRIVES  PSFVYLQGEQL NDEIGSLVWNGV+LSLEDLCGLF+
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNA-DSISSDLEPQLIGD
        T LPT VYLEIPNG R+AEALHSKGIPY++YW +TFSCYAAAHFRNALLSVVQSSSTHTWDAFQLA AAF+L+CV SNY LPG A DSI SDLEPQLIG+
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNA-DSISSDLEPQLIGD

Query:  HLKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD
         LKINVEP E+DA  GED DGSL TLPAISIHDNNVTVRFLICGVPCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQAGSFSRGVVTMRCD
Subjt:  HLKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD

Query:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------
        IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE +QLVHA+ DCEGNKHHMHE RKSASVACGATVFE                             
Subjt:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------

Query:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPAR
             GLPVASFEKEDAERLLFFCS D NDKHS+QL++SVLPSWFKPPTPSRKRVEPSQGI ST+S DSLAYANI SIRRV  EE APMNGFKAPLLP R
Subjt:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPAR

Query:  KRLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD
        KRLKVA+MRP+PRVHRNK+TPFSG+ E D NNG   KA+LP+VTPSKH TVGSTSAT RKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD
Subjt:  KRLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD

Query:  VMEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
        VMEFLLLRGHSRLIPQGGL EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
Subjt:  VMEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE

Query:  CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
Subjt:  CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

TrEMBL top hitse value%identityAlignment
A0A0A0LEG9 ARID domain-containing protein0.0e+0083.68Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG+VPKVK E++V EDKLKYPFPELVS GRLEVRVL NPSKDEFSRIVESC PSFVYLQGEQL NDEIGSLVWNGV+LSLEDLCGLF+
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNA-DSISSDLEPQLIGD
         ALPT VYLEIP+G R+AEALHSKGIPY+IYW +TFSCYAAAHFR+ALLSVVQSSSTHTWDAFQLA AAFRL+ V SNY LPG A DS+ SDLEPQLIG+
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNA-DSISSDLEPQLIGD

Query:  HLKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD
         LKI+VEP E+D   GEDEDGSL  LPAI+IHDNNVT+RFLICGVPCT D  LL  LEDGL ALL  E+RGSKLQGKFSAP PPLQAGSFSRGVVTMRCD
Subjt:  HLKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD

Query:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------
        IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE +QLVHA+ DCEGNKHHMH+ RKSAS+ACGATVFE                             
Subjt:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------

Query:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPAR
             GLPVASFEKEDAERLLFFCS DGNDKHS+QL++SVLPSWFKPPTPSRKRVEPSQGI +++S DSL+YA+I +IRRVG E+  PMNGFKA L PAR
Subjt:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPAR

Query:  KRLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD
        K+LKVA+MRP+PR+HRNKMTPF+G+TE DGNNGG  KA+L IVTP KH TVGSTSAT RKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD
Subjt:  KRLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKD

Query:  VMEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
        VMEFLLLRGH+RLIPQGGL EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
Subjt:  VMEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE

Query:  CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
Subjt:  CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

A0A6J1DIE1 AT-rich interactive domain-containing protein 4-like0.0e+0088.14Show/hide
Query:  ARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFHT
        ARQTCSLLAVTCGSVPKVK E+DVAED+LKYPFPELVSSGRLEVRVLTNPSKDEF+RIVESCQPSFVYLQGEQLENDEIGSLVWNGV+LSLEDLCGLFHT
Subjt:  ARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFHT

Query:  ALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDHL
        ALP TVYLEIPNG R AEALHSKGIPYV+YW NT SCYAAAHFRN LLSVVQSSSTHTWDAFQLAHAAFRLHC  SNYALPG+ D IS +LEPQLIG+ L
Subjt:  ALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDHL

Query:  KINVEPIEI---DAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI
        KI+VEP EI   DAGEDED SLGTLPAISIHDNNVT+RFLICGVPCT DA LL  LEDGL+ALLN EIRGSKLQGKFSA  PPLQAGSFSRGVVTMRCD+
Subjt:  KINVEPIEI---DAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI

Query:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------
        VTCSSAHI+ILVSGSAHTCFDDQLLEKHIKHEIIE SQLVHALRDCEGN+H MHE RKSASVACGATVFE                              
Subjt:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------

Query:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKR
            GLPVASFEKEDAER LFFCSRDGNDKHSDQL LSVLPSWFKPP PSRKRVEPSQGISTVS DSLAYANI SIRRVGGEE APMNGFKA LLPARKR
Subjt:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKR

Query:  LKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        LKVATMRPIPRVHRNKMTPFSG+TEADGNNG  PKA+LP+VTPSKH TVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQ CSEEEFLKDVM
Subjt:  LKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGGL+EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKP+RVANGSPQGITNPRIP
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

A0A6J1GBY3 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0084.79Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG +PKVK E+DVAE  LKYPFPELVSSGRLEV+VLTNPSK+EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGV+LSLEDLCGLF 
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPTTVYLE+PNG ++AE LHSKGIPYVIYW NTFSCYAAAHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHCV  NYALPGNAD   SDLEPQLIG+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT
         KIN+EP E+DAGEDED SL  +P IS+HDNNVT+R LICG+PCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQAGSFSRGVVTMRCDIVT
Subjt:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT

Query:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------
        CSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE SQLVH + DCEGNKHHMH+ RKSASVACGATVFE                                
Subjt:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------

Query:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL
          G PVASFEKEDAERLLFFCSRD NDKHSDQL++SVLP WFKPPTPSRKRVEPSQG+ +T+S DSLAYANI S+RRVG EE APMNGFKAPLLPARKRL
Subjt:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL

Query:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        KVATM+PIP VHRNKM  FSG TE DGN+GGQPKA+LP VTPSKH TVGSTSATQRKSFSSSSQSK QII LNPLPLKKHGCGRNP+QDCSEEEFLKDVM
Subjt:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGG+ EFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPRIP
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

A0A6J1GVQ9 AT-rich interactive domain-containing protein 4-like0.0e+0084.72Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCGSV K K E+DV EDKLKYPFP LVSSGRLEVR LTNPS DEFSRIVESC PSFVYLQGEQL NDEIGSLVWNGV+L LEDLCGLF+
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPT VYLEIPNG R+AEALHSKGIPYV+YW +TFSCYAAAHFRNAL SV+QSSSTHTWDAFQLA AAFRLHC+ S++ALPG  DSI+S LEPQ+ G+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI
        LKINVEP ++D   GEDEDGSL TL AISIHDNNVTVRFLICGVPCT DA LL  LEDGL+ALLN EIRG KLQGKFSAP PPLQAGSF+RGVVTMRCDI
Subjt:  LKINVEPIEIDA--GEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDI

Query:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------
        VTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIE +QLVHA+ DCE NKHHMHE RKSASVACGATVFE                              
Subjt:  VTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE------------------------------

Query:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK
            GLPVASFEKEDAERLLFFCS+D NDKHSDQL++SVLPSWFKPP PSRKRVEPSQGI ST+S D LAYANI  IRRVG EE APMNGFK PLL  RK
Subjt:  ----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGI-STVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK

Query:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV
        RLKVA+MRPIPRVHRNKMTPFSG+TEADGNNGGQPKA  P+VTPSKH TVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV
Subjt:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV

Query:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC
        MEFLLLRGHSRLIPQGGL EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTM+NRMTGVGNTLKRHYETYLLEYELAHDDVDGEC
Subjt:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC

Query:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGITN-PRIP
        CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRVANGSPQG+TN PRIP
Subjt:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGITN-PRIP

A0A6J1KBW8 AT-rich interactive domain-containing protein 4-like isoform X10.0e+0084.41Show/hide
Query:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH
        AARQTCSLLAVTCG +PKVK E+DVAE  LKYPFPELVSSGRLEV+VLTNPSK+EFSRIVESCQPSFVYLQGEQLENDE+GSLVWNGV+LSLEDLCGLF+
Subjt:  AARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVNLSLEDLCGLFH

Query:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH
        TALPTTVYLE+PNG  +AE LHSKGIPYVIYW NTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C+  NYALPGNAD+  SDLEPQLIG+ 
Subjt:  TALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNADSISSDLEPQLIGDH

Query:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT
         KI VEP E+DAG DED SL  LP IS+HDNNVT+R LICG+PCT DA LL  LEDGL+ALLN EIRGSKLQGKFSAP PPLQA SFSRGVVTMRCDIVT
Subjt:  LKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCDIVT

Query:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------
        CSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIE SQLVH + DCEGNKHHMH+ RKSASVACGATVFE                                
Subjt:  CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE--------------------------------

Query:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSL-DSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL
          G PVASFEKEDAERLLFFCSRD NDKHSDQL++SVLP+WFKPPTPSRKRVEPSQGI    L DSLAYANI S+RRVG EE APMNGFKAPLLPARKRL
Subjt:  --GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSL-DSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRL

Query:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM
        KVATMRPIP VHRNKM  FSG TE DGNNG QPKA+LP+VTPSKH T+GSTSATQRKSFSSSSQSK QII LNPLPLKKHGCGRNP+QDCSEEEFLKDVM
Subjt:  KVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVM

Query:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
        EFLLLRGHSRLIPQGG+ EFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Subjt:  EFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC

Query:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
        LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANGSPQGITNPR+P
Subjt:  LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP

SwissProt top hitse value%identityAlignment
Q6NQ79 AT-rich interactive domain-containing protein 48.2e-24456.53Show/hide
Query:  ARQTCSLLAVTCGS-VPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVNLSLED-LCGL
        +R  C+++AV  G+ +     + D    + KYPFP+L SSGRL+ +VL NP+ +EF   V S    FVYLQGE   ++DE+G LV    + S  D L  L
Subjt:  ARQTCSLLAVTCGS-VPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVNLSLED-LCGL

Query:  FHTALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNAD-SISSDLEPQLI
        F + LPTTVYLE+PNG  LA+AL+SKG+ YVIYWKN FS YAA HFR++L SV+QSS + TWD F +A A+FRL+C   N  LP N++  ++ ++ P L+
Subjt:  FHTALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNAD-SISSDLEPQLI

Query:  GDHLKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD
        G+  KI+V   E D  E+E+ SL +LP+I I+D +VTVRFL+CG PCT D  LL  L DGL+ALL  E+RGSKL  + SAPAPPLQAG+F+RGVVTMRCD
Subjt:  GDHLKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD

Query:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------
        + TCSSAHIS+LVSG+A TCF DQLLE HIKHE++EK QLVH++ + E  K    E R+SAS+ACGA+V E                             
Subjt:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------

Query:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK
             GL VASFEK+DAERLLFFC +  ND  +   +LS +P+W  PP P+RKR EP +                       E     NG      P  +
Subjt:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK

Query:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV
        ++ VA +RPIP   R+KM PFSG +E    +G   K +LP+  P KH   G T  T RK+FS S Q KQIISLNPLPLKKH CGR  IQ CSEEEFL+DV
Subjt:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV

Query:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC
        M+FLL+RGH+RL+P GGLAEFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE AHDDVDGEC
Subjt:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC

Query:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG
        CL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  + +NG
Subjt:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG

Arabidopsis top hitse value%identityAlignment
AT3G43240.1 ARID/BRIGHT DNA-binding domain-containing protein5.8e-24556.53Show/hide
Query:  ARQTCSLLAVTCGS-VPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVNLSLED-LCGL
        +R  C+++AV  G+ +     + D    + KYPFP+L SSGRL+ +VL NP+ +EF   V S    FVYLQGE   ++DE+G LV    + S  D L  L
Subjt:  ARQTCSLLAVTCGS-VPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVNLSLED-LCGL

Query:  FHTALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNAD-SISSDLEPQLI
        F + LPTTVYLE+PNG  LA+AL+SKG+ YVIYWKN FS YAA HFR++L SV+QSS + TWD F +A A+FRL+C   N  LP N++  ++ ++ P L+
Subjt:  FHTALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNYALPGNAD-SISSDLEPQLI

Query:  GDHLKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD
        G+  KI+V   E D  E+E+ SL +LP+I I+D +VTVRFL+CG PCT D  LL  L DGL+ALL  E+RGSKL  + SAPAPPLQAG+F+RGVVTMRCD
Subjt:  GDHLKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSRGVVTMRCD

Query:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------
        + TCSSAHIS+LVSG+A TCF DQLLE HIKHE++EK QLVH++ + E  K    E R+SAS+ACGA+V E                             
Subjt:  IVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFE-----------------------------

Query:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK
             GL VASFEK+DAERLLFFC +  ND  +   +LS +P+W  PP P+RKR EP +                       E     NG      P  +
Subjt:  -----GLPVASFEKEDAERLLFFCSRDGNDKHSDQLVLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARK

Query:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV
        ++ VA +RPIP   R+KM PFSG +E    +G   K +LP+  P KH   G T  T RK+FS S Q KQIISLNPLPLKKH CGR  IQ CSEEEFL+DV
Subjt:  RLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSKHATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDV

Query:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC
        M+FLL+RGH+RL+P GGLAEFPDA+LN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE AHDDVDGEC
Subjt:  MEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGEC

Query:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG
        CL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK  + +NG
Subjt:  CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATTCAGCCCAGCGCGCCACTTGTCTTGAACTTCCACTGCGCTCTCCATTTCCTGAGAAGCATGTTTTACTTCCTGGTTTCAGCGCCCTGCTTCTGCATGCGTA
CAGATGCGAGCGAGCTGCTAGACAAACTTGTAGTCTACTTGCTGTCACCTGCGGAAGTGTACCTAAAGTAAAACGTGAGAAGGATGTTGCTGAGGATAAGTTGAAATATC
CCTTTCCAGAATTAGTTTCTTCGGGACGATTGGAGGTCCGAGTTCTGACTAATCCAAGCAAGGATGAGTTTAGTAGAATTGTAGAATCATGTCAACCAAGCTTCGTCTAC
TTGCAAGGGGAACAACTTGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTAATTTGTCTCTTGAAGATTTATGTGGACTATTCCATACTGCACTACCAACTAC
CGTGTATTTAGAAATCCCAAATGGAAGCAGATTAGCAGAGGCTCTTCATTCTAAGGGAATTCCTTATGTCATATATTGGAAAAACACATTTTCATGTTATGCTGCAGCTC
ATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGTGGAGCAATTAT
GCCCTTCCCGGGAATGCTGACAGTATTAGCAGTGATCTAGAGCCACAACTTATTGGGGACCATTTAAAGATTAACGTAGAACCTATTGAGATAGATGCAGGTGAAGATGA
AGACGGTTCTTTAGGAACCCTTCCTGCCATAAGTATACATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAGTGCCCTGCACATGGGATGCGGGCTTGTTGACAT
TGTTGGAGGATGGCCTTAGTGCCCTTTTGAACACTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTTAGTGCTCCTGCGCCGCCTCTTCAAGCAGGATCCTTTTCTCGT
GGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGCTCAGCCCACATCTCTATATTGGTGTCGGGTAGTGCTCATACTTGTTTTGACGATCAGCTGCTGGAAAAACA
TATCAAACATGAGATTATCGAAAAGAGCCAATTAGTTCATGCCCTGCGTGATTGTGAGGGCAACAAACACCATATGCACGAGCTTCGAAAATCTGCTTCAGTTGCTTGCG
GGGCAACAGTATTTGAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGGTTGCTCTTCTTCTGTTCAAGGGATGGGAATGATAAACATTCAGATCAGTTG
GTTTTAAGTGTACTGCCCAGCTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGTGGAACCAAGCCAAGGAATAAGCACAGTTTCACTCGACAGTCTTGCATATGCAAA
TATCTCTTCCATTAGAAGAGTAGGTGGAGAGGAGTCTGCACCAATGAATGGGTTCAAGGCACCCTTACTCCCAGCTAGAAAACGACTAAAAGTAGCCACCATGAGGCCTA
TTCCACGTGTTCATCGTAATAAAATGACACCCTTCTCTGGAATTACAGAAGCAGATGGGAACAATGGAGGCCAACCCAAGGCCAATCTCCCCATCGTTACCCCATCAAAG
CATGCAACTGTAGGATCAACTTCTGCAACGCAAAGAAAATCGTTTTCAAGCTCATCTCAGTCTAAGCAGATTATTTCCTTAAATCCACTACCTTTAAAGAAACATGGTTG
TGGAAGAAACCCAATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTAATGGAATTTTTACTACTTAGAGGACATTCACGACTTATTCCTCAAGGTGGACTTGCCG
AGTTCCCAGATGCCATACTCAATGGGAAGCGTCTTGACCTCTATAACTTGTATAAGGAGGTGGTAACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAGGGG
CAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAACTGGCGCA
TGATGATGTAGATGGAGAATGCTGCTTGTTGTGTCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGAATTTGCGGTGAATGGGCCCATTTTGGGTGCGACCGAAGGC
AGGGTCTTGGTGCATTTAAGGATTATGCCAAAACAGATGGGCTAGAGTATGTTTGTCCACATTGTAGCATCACAACCTACAAGAAGAAACCACACAGAGTAGCAAACGGG
TCTCCGCAAGGAATAACGAATCCACGGATACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATTCAGCCCAGCGCGCCACTTGTCTTGAACTTCCACTGCGCTCTCCATTTCCTGAGAAGCATGTTTTACTTCCTGGTTTCAGCGCCCTGCTTCTGCATGCGTA
CAGATGCGAGCGAGCTGCTAGACAAACTTGTAGTCTACTTGCTGTCACCTGCGGAAGTGTACCTAAAGTAAAACGTGAGAAGGATGTTGCTGAGGATAAGTTGAAATATC
CCTTTCCAGAATTAGTTTCTTCGGGACGATTGGAGGTCCGAGTTCTGACTAATCCAAGCAAGGATGAGTTTAGTAGAATTGTAGAATCATGTCAACCAAGCTTCGTCTAC
TTGCAAGGGGAACAACTTGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTAATTTGTCTCTTGAAGATTTATGTGGACTATTCCATACTGCACTACCAACTAC
CGTGTATTTAGAAATCCCAAATGGAAGCAGATTAGCAGAGGCTCTTCATTCTAAGGGAATTCCTTATGTCATATATTGGAAAAACACATTTTCATGTTATGCTGCAGCTC
ATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGTGGAGCAATTAT
GCCCTTCCCGGGAATGCTGACAGTATTAGCAGTGATCTAGAGCCACAACTTATTGGGGACCATTTAAAGATTAACGTAGAACCTATTGAGATAGATGCAGGTGAAGATGA
AGACGGTTCTTTAGGAACCCTTCCTGCCATAAGTATACATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAGTGCCCTGCACATGGGATGCGGGCTTGTTGACAT
TGTTGGAGGATGGCCTTAGTGCCCTTTTGAACACTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTTAGTGCTCCTGCGCCGCCTCTTCAAGCAGGATCCTTTTCTCGT
GGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGCTCAGCCCACATCTCTATATTGGTGTCGGGTAGTGCTCATACTTGTTTTGACGATCAGCTGCTGGAAAAACA
TATCAAACATGAGATTATCGAAAAGAGCCAATTAGTTCATGCCCTGCGTGATTGTGAGGGCAACAAACACCATATGCACGAGCTTCGAAAATCTGCTTCAGTTGCTTGCG
GGGCAACAGTATTTGAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGGTTGCTCTTCTTCTGTTCAAGGGATGGGAATGATAAACATTCAGATCAGTTG
GTTTTAAGTGTACTGCCCAGCTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGTGGAACCAAGCCAAGGAATAAGCACAGTTTCACTCGACAGTCTTGCATATGCAAA
TATCTCTTCCATTAGAAGAGTAGGTGGAGAGGAGTCTGCACCAATGAATGGGTTCAAGGCACCCTTACTCCCAGCTAGAAAACGACTAAAAGTAGCCACCATGAGGCCTA
TTCCACGTGTTCATCGTAATAAAATGACACCCTTCTCTGGAATTACAGAAGCAGATGGGAACAATGGAGGCCAACCCAAGGCCAATCTCCCCATCGTTACCCCATCAAAG
CATGCAACTGTAGGATCAACTTCTGCAACGCAAAGAAAATCGTTTTCAAGCTCATCTCAGTCTAAGCAGATTATTTCCTTAAATCCACTACCTTTAAAGAAACATGGTTG
TGGAAGAAACCCAATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTAATGGAATTTTTACTACTTAGAGGACATTCACGACTTATTCCTCAAGGTGGACTTGCCG
AGTTCCCAGATGCCATACTCAATGGGAAGCGTCTTGACCTCTATAACTTGTATAAGGAGGTGGTAACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAACTGGAAGGGG
CAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAACTGGCGCA
TGATGATGTAGATGGAGAATGCTGCTTGTTGTGTCACAGTAGTGCAGCAGGGGATTGGGTGAACTGTGGAATTTGCGGTGAATGGGCCCATTTTGGGTGCGACCGAAGGC
AGGGTCTTGGTGCATTTAAGGATTATGCCAAAACAGATGGGCTAGAGTATGTTTGTCCACATTGTAGCATCACAACCTACAAGAAGAAACCACACAGAGTAGCAAACGGG
TCTCCGCAAGGAATAACGAATCCACGGATACCTTGA
Protein sequenceShow/hide protein sequence
MENSAQRATCLELPLRSPFPEKHVLLPGFSALLLHAYRCERAARQTCSLLAVTCGSVPKVKREKDVAEDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVY
LQGEQLENDEIGSLVWNGVNLSLEDLCGLFHTALPTTVYLEIPNGSRLAEALHSKGIPYVIYWKNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVWSNY
ALPGNADSISSDLEPQLIGDHLKINVEPIEIDAGEDEDGSLGTLPAISIHDNNVTVRFLICGVPCTWDAGLLTLLEDGLSALLNTEIRGSKLQGKFSAPAPPLQAGSFSR
GVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEKSQLVHALRDCEGNKHHMHELRKSASVACGATVFEGLPVASFEKEDAERLLFFCSRDGNDKHSDQL
VLSVLPSWFKPPTPSRKRVEPSQGISTVSLDSLAYANISSIRRVGGEESAPMNGFKAPLLPARKRLKVATMRPIPRVHRNKMTPFSGITEADGNNGGQPKANLPIVTPSK
HATVGSTSATQRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKG
QIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG
SPQGITNPRIP