CuGenDBv2

Lsi07G002740 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G002740
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: AT-rich interactive domain-containing protein 2
Genome location: chr07:2968833..2973908
RNA-Seq Expression: Lsi07G002740
Synteny: Lsi07G002740
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR001606 - ARID DNA-binding domain; IPR036431 - ARID DNA-binding domain superfamily
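The genome location in this record (chr07:2968833..2973908) can be split into a sequence name and two coordinates. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple

# Pattern for locations written as "<seqid>:<start>..<end>", as in this record.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("chr07:2968833..2973908")
print(seqid, start, end, span_length(start, end))  # chr07 2968833 2973908 5076
```

Under that assumption the gene model spans 5,076 bp on chr07. If the coordinates were 0-based or half-open, the length would differ by one.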


Homology
GenBank top hits (e value, % identity, alignment)
KAG7015291.1 AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.1e-286 | % identity: 74.89
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRW +SSNASILDCNKDVDPNPSNGCCI  DCL      NV+YDDCKA IR YFEKILWVFLKEIGRRGF+RP+PAL+GEGG+LDLFELF+VVRDKGG 
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEK+LWSSVV+ELGLDLGLSASVKLIYSKYLSDLE WLMVRCG TKLENG+SDY  +KS PFLSEL AKI GMLYGV RQ SIYDEC GFKSNK NG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        NVNV AAAAVEKE KFP+IKK+EHDLHGD+TPIQQ+CT+T         I VIED + LDAVNVE EI+S G+YRESLLRMLKWVRKTAKHP +P NGT+
Subjt:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER
        PG S+WK Y SDDALWLQVIR+KDALL RK VDK AEK+                                  ++  KKV+MHPSIYED IDNHHLSTER
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER

Query:  ICGSKRSTASA------CNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP
        I  SKRS AS       C+NSCPTV+SN ISSLTTE+GKGL NQA+LNGD+PSEMED+ PNEDS E+ VP GAL QA +PEWTGN SDSDSKWLGTRSWP
Subjt:  ICGSKRSTASA------CNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP

Query:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF
         QH NSNSV DR  IGRGRPDSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISLQWT EEEKRFKELA+S FNN ++CFW+YS RWF
Subjt:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF

Query:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
        PMKSRKNL+SYYFNVFLL+ RSYQ+RV+PNSIDSDDED EFG +SG FG KAMEILGSKS+ECS NRQ TDVE
Subjt:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
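In the alignment blocks of this record, the unlabeled middle line follows the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of how one block could be tallied from the query line and that middle line. The function name `midline_stats` is hypothetical, and the result covers only the block passed in, so it is not expected to reproduce the whole-alignment % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one alignment block."""
    if not query:
        raise ValueError("query block is empty")
    # Pad or trim the midline to the query length, since trailing spaces
    # are easily lost when the page is extracted as text.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / len(query), 2),
    }


# First ten columns of the first GenBank alignment shown above.
stats = midline_stats("MGRWPISSNA", "MGRW +SSNA")
print(stats)
# {'length': 10, 'identities': 8, 'positives': 9, 'percent_identity': 80.0}
```

Summing `identities` and `length` over all blocks of a hit before dividing would give a figure closer to the reported value, assuming the blocks shown cover the full alignment.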

XP_004146560.2 AT-rich interactive domain-containing protein 2 [Cucumis sativus] | e value: 1.3e-303 | % identity: 78.75
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRWPISSN SILDCNKDVDPNPS G CI PDCL      NV++DDCKATIRCYFEK+LWVFLKE  RRGFIRPVPALLGEG SLDLFELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN
        QVVSEKELWSSVV+ELGLDLGLSASVKLIY KYLSDLE WLMVR GGTKLENGNSD Y+ RK+FP L+ELEAKIK +LYGVLRQKSIYDE SGFKSNKPN
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN

Query:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        GNVNVA  A EKE K PKI+K+EHDLH D+TPIQQNCT+TP+DNG  +QI VI DCR  DAVNVETE DSHG  RESL RMLKWVRKTAKHPANPSNGTV
Subjt:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE
        PG+SKWK+Y S+DALWLQVI++KDALLNRKDVDKTAEK+                                  ++  KKVRMHP IYEDNI DNHHLSTE
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE

Query:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW
        RIC S+RS A       ACNNSCP VQSN I SLTTEIGKGL NQALLNGDL SEMEDNQ NEDSVEKPVP GA FQAV+PEWTGNISDSDSKWLGTRSW
Subjt:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW

Query:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW
        PSQH N+ SVSDRNPI RGR D CSCQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEISLQWTAEEE RFKELAIS+FNNQ+QCFWN+S +W
Subjt:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW

Query:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV
        FPMKSRKNL+SYYFNVFLL+QRSYQ+RV+PN IDSD EDVEFGCISGDFGAKAME+LGSK VECSEN+QF  +
Subjt:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV

XP_008452043.1 PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo] | e value: 4.6e-309 | % identity: 80.24
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRWPISSN SILDCNKDVDPNPSNG CI PDCL      NV++DDCKATIRCYFEKILWVFLKEI RRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN
        QVVSEKELWSSVV+ELGLDLGLSASVKLIY KYLS+LE WLMVR GGTKLENGNSD Y+ RKSFP L+ELEAKIK MLYGVLRQKSIYDE  GFKSNKPN
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN

Query:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        GNVNVA  A EKE KFPKI+K+EHDLH D+TPIQQNCT+TP+ NG  +QI VI DCR LDAVNVETE DSHGR RESLLRMLKWVRKTAKHPANPSNGTV
Subjt:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE
        P +SKWK+Y SDDALWLQVI++KDALLNRKDVDKTAEK+                                  ++  KKVRMHP IYEDNI DNHHLSTE
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE

Query:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW
        RIC S+RS A       A NNSCP V+SN I SLTTEIGKGL NQALLNGDL SEMEDNQ NEDSVEKPVP GALFQA +PEWTGNISDSDSKWLGTR W
Subjt:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW

Query:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW
        PSQH N+ SVS+RNPIGRGR DSCSCQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEISLQWTAEEEKRFKELAIS+FNNQ+QCFWN+S +W
Subjt:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW

Query:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV
        FPMKSRKNL+SYYFNVFLL+QRSYQ+RV+PN IDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN+QF D+
Subjt:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV

XP_022931395.1 AT-rich interactive domain-containing protein 2-like isoform X1 [Cucurbita moschata] | e value: 5.0e-292 | % identity: 73.5
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRW +SSNASILDCNKDVDPNPSNGCCI  DCL      NV+YDDCKA IRCYFEKILWVFLKEIGRRGF+RP+PAL+GEGG+LDLFELF+VVRDKGG 
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEK+LWSSVV+ELGLDLGLSASVKLIYSKYLSDLE WLMVRCG TKLENG+SDY  +KS PFLSEL AKI GMLYGV RQ SIYDEC GFKSNK NG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        NVNV AAAAVEKE KFP+IKK+EHDLHGD+T IQQ+CT+T         I VIED + LDAVNVE EI+S G+YRESLLRMLKWVRKTAKHP +P NGT+
Subjt:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQ-----WEKLPPAIRTSGLHHHKQAAYDFY------------------------VTLSCVLH
        PG S+WK Y SDDALWLQVIR+KDALL RK VDK AEK+        L    R     H+++    +                         V+LSCVL 
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQ-----WEKLPPAIRTSGLHHHKQAAYDFY------------------------VTLSCVLH

Query:  DYMMDTKKVRMHPSIYEDNIDNHHLSTERICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPT
        D MMDTKKV+MHPSIYEDNIDNHHLSTERI  SKRS A      + C+NSCPTV+SN ISSLTTE+GKGL NQA+LNGD+PSEMED+ PNEDS E+ VP 
Subjt:  DYMMDTKKVRMHPSIYEDNIDNHHLSTERICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPT

Query:  GALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTA
        GA+ QA +PEWTGN SDSDSKWLGTRSWP QH NSNSV DR  IGRGRPDSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISLQWT 
Subjt:  GALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTA

Query:  EEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTD
        EEEKRFKELA+S FNN ++CFW+YS RWFPMKSRKNL+SYYFNVFLL+ RSYQ+RV+PNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TD
Subjt:  EEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTD

Query:  VE
        VE
Subjt:  VE

XP_038893741.1 AT-rich interactive domain-containing protein 2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 82.46
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRWPISSNASI+DCNKDVDPNPSNGCCI PDCL      NVNYDDCKATIRCYFEKILWVFLKEIGRRG IRPV ALLGEGGSLDLFELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLE WLMVR GGTKLENGNSDYH RKSFPFLSELEAK+K ML         YDECSGFKSNKPNG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVP
        NVNVA AA+EKE KFPK+KKEEHDLHGD+TPIQQNCT+TP+DNG  DQI VIEDCR L AVN+ETE+D+HGRYRESLLRMLKW RKTAKHP NPSN TVP
Subjt:  NVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVP

Query:  GASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTERI
        GASKWK+Y SDDALWLQVIR+KDALL RKDVD+ AEK+                                  ++  KK RMHPSIYEDNIDNH LSTERI
Subjt:  GASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTERI

Query:  CGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSN
        C SK+S ASACNNS PT+QSN ISSLTTEIGKGL NQAL NGDLPS+MEDNQPNEDSVEKPVPTGALFQAV+PEWTGNISDSDSKWLGT+SWPSQHGN N
Subjt:  CGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSN

Query:  S-VSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRK
        S VSD+NPIG+GRPDSCSCQFPGSVECFRFHIAEARM LKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELA+S+FNNQS+CFWNYS +WFPMKSRK
Subjt:  S-VSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRK

Query:  NLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
        NL+SYYFNVFLL+QRSYQ+R +PNSIDSDDED+EFGCISGDFGAKAMEILGSKSVEC+ENRQFTDVE
Subjt:  NLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KZM1 ARID domain-containing protein | e value: 6.2e-304 | % identity: 78.75
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRWPISSN SILDCNKDVDPNPS G CI PDCL      NV++DDCKATIRCYFEK+LWVFLKE  RRGFIRPVPALLGEG SLDLFELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN
        QVVSEKELWSSVV+ELGLDLGLSASVKLIY KYLSDLE WLMVR GGTKLENGNSD Y+ RK+FP L+ELEAKIK +LYGVLRQKSIYDE SGFKSNKPN
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN

Query:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        GNVNVA  A EKE K PKI+K+EHDLH D+TPIQQNCT+TP+DNG  +QI VI DCR  DAVNVETE DSHG  RESL RMLKWVRKTAKHPANPSNGTV
Subjt:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE
        PG+SKWK+Y S+DALWLQVI++KDALLNRKDVDKTAEK+                                  ++  KKVRMHP IYEDNI DNHHLSTE
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE

Query:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW
        RIC S+RS A       ACNNSCP VQSN I SLTTEIGKGL NQALLNGDL SEMEDNQ NEDSVEKPVP GA FQAV+PEWTGNISDSDSKWLGTRSW
Subjt:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW

Query:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW
        PSQH N+ SVSDRNPI RGR D CSCQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEISLQWTAEEE RFKELAIS+FNNQ+QCFWN+S +W
Subjt:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW

Query:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV
        FPMKSRKNL+SYYFNVFLL+QRSYQ+RV+PN IDSD EDVEFGCISGDFGAKAME+LGSK VECSEN+QF  +
Subjt:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV

A0A1S3BSW2 AT-rich interactive domain-containing protein 2 | e value: 2.2e-309 | % identity: 80.24
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRWPISSN SILDCNKDVDPNPSNG CI PDCL      NV++DDCKATIRCYFEKILWVFLKEI RRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN
        QVVSEKELWSSVV+ELGLDLGLSASVKLIY KYLS+LE WLMVR GGTKLENGNSD Y+ RKSFP L+ELEAKIK MLYGVLRQKSIYDE  GFKSNKPN
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSD-YHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPN

Query:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        GNVNVA  A EKE KFPKI+K+EHDLH D+TPIQQNCT+TP+ NG  +QI VI DCR LDAVNVETE DSHGR RESLLRMLKWVRKTAKHPANPSNGTV
Subjt:  GNVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE
        P +SKWK+Y SDDALWLQVI++KDALLNRKDVDKTAEK+                                  ++  KKVRMHP IYEDNI DNHHLSTE
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNI-DNHHLSTE

Query:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW
        RIC S+RS A       A NNSCP V+SN I SLTTEIGKGL NQALLNGDL SEMEDNQ NEDSVEKPVP GALFQA +PEWTGNISDSDSKWLGTR W
Subjt:  RICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSW

Query:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW
        PSQH N+ SVS+RNPIGRGR DSCSCQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEISLQWTAEEEKRFKELAIS+FNNQ+QCFWN+S +W
Subjt:  PSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRW

Query:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV
        FPMKSRKNL+SYYFNVFLL+QRSYQ+RV+PN IDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN+QF D+
Subjt:  FPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV

A0A6J1ETI2 AT-rich interactive domain-containing protein 2-like isoform X3 | e value: 2.0e-286 | % identity: 74.59
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRW +SSNASILDCNKDVDPNPSNGCCI  DCL      NV+YDDCKA IRCYFEKILWVFLKEIGRRGF+RP+PAL+GEGG+LDLFELF+VVRDKGG 
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEK+LWSSVV+ELGLDLGLSASVKLIYSKYLSDLE WLMVRCG TKLENG+SDY  +KS PFLSEL AKI GMLYGV RQ SIYDEC GFKSNK NG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        NVNV AAAAVEKE KFP+IKK+EHDLHGD+T IQQ+CT+T         I VIED + LDAVNVE EI+S G+YRESLLRMLKWVRKTAKHP +P NGT+
Subjt:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER
        PG S+WK Y SDDALWLQVIR+KDALL RK VDK AEK+                                  ++  KKV+MHPSIYEDNIDNHHLSTER
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER

Query:  ICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP
        I  SKRS A      + C+NSCPTV+SN ISSLTTE+GKGL NQA+LNGD+PSEMED+ PNEDS E+ VP GA+ QA +PEWTGN SDSDSKWLGTRSWP
Subjt:  ICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP

Query:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF
         QH NSNSV DR  IGRGRPDSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISLQWT EEEKRFKELA+S FNN ++CFW+YS RWF
Subjt:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF

Query:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
        PMKSRKNL+SYYFNVFLL+ RSYQ+RV+PNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TDVE
Subjt:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE

A0A6J1EZB1 AT-rich interactive domain-containing protein 2-like isoform X1 | e value: 2.4e-292 | % identity: 73.5
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRW +SSNASILDCNKDVDPNPSNGCCI  DCL      NV+YDDCKA IRCYFEKILWVFLKEIGRRGF+RP+PAL+GEGG+LDLFELF+VVRDKGG 
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEK+LWSSVV+ELGLDLGLSASVKLIYSKYLSDLE WLMVRCG TKLENG+SDY  +KS PFLSEL AKI GMLYGV RQ SIYDEC GFKSNK NG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
        NVNV AAAAVEKE KFP+IKK+EHDLHGD+T IQQ+CT+T         I VIED + LDAVNVE EI+S G+YRESLLRMLKWVRKTAKHP +P NGT+
Subjt:  NVNV-AAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQ-----WEKLPPAIRTSGLHHHKQAAYDFY------------------------VTLSCVLH
        PG S+WK Y SDDALWLQVIR+KDALL RK VDK AEK+        L    R     H+++    +                         V+LSCVL 
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQ-----WEKLPPAIRTSGLHHHKQAAYDFY------------------------VTLSCVLH

Query:  DYMMDTKKVRMHPSIYEDNIDNHHLSTERICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPT
        D MMDTKKV+MHPSIYEDNIDNHHLSTERI  SKRS A      + C+NSCPTV+SN ISSLTTE+GKGL NQA+LNGD+PSEMED+ PNEDS E+ VP 
Subjt:  DYMMDTKKVRMHPSIYEDNIDNHHLSTERICGSKRSTA------SACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPT

Query:  GALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTA
        GA+ QA +PEWTGN SDSDSKWLGTRSWP QH NSNSV DR  IGRGRPDSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISLQWT 
Subjt:  GALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTA

Query:  EEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTD
        EEEKRFKELA+S FNN ++CFW+YS RWFPMKSRKNL+SYYFNVFLL+ RSYQ+RV+PNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TD
Subjt:  EEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTD

Query:  VE
        VE
Subjt:  VE

A0A6J1J644 AT-rich interactive domain-containing protein 2-like isoform X1 | e value: 5.8e-286 | % identity: 74.59
Query:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        MGRW +SSNASILDCNKDVDPNPSNGCCI  DCL      NV+YDDCKA IRCYFEKILWVFLKEIGRRGF+RP+PAL+GEGG+LDLFELF+VVRDKGG 
Subjt:  MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCL------NVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
        QVVSEK+LWSSVV+ELGLDLGLSASVKLIYSKYLSDLE WLMVRCG TKLENG+SDY  +KS PFLSEL AKI GMLYGV RQ SIYDEC GFKSNK NG
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVP
        NVNVAAAAVEKE KF +IKK+EHDLHGD+TPIQQ+CT+T         I VIED + LDAVNVE EI+S G+YRESLLRMLKWVRKTAKHP +P NGT+ 
Subjt:  NVNVAAAAVEKETKFPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVP

Query:  GASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTERI
        GAS+WK Y SDDALWLQVI +KDALL RK VDK AEK+                                  ++  KKV+MHPSIYEDNIDNH LSTERI
Subjt:  GASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTERI

Query:  CGSKRSTAS------ACNNSCPTVQSNWI-SSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP
          SKR  AS       C+NSCPTV+SN I SSLTTE+GKGL NQA+LNGD+PSEMED+ PNEDS E+ VP GAL QA +PEWTGN SDSDSKWLGTR WP
Subjt:  CGSKRSTAS------ACNNSCPTVQSNWI-SSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP

Query:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF
         QH NSNSV DR  IGRGRPDSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISLQWTAEEEKRFKELA+S+FNN ++CFW+YS RWF
Subjt:  SQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWF

Query:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
        PMKSRKNL+SYYFNVFLL+ RSYQ+RV+PNSIDSDDED EFGC+SG FG KAME+LGSKS+ECS NRQ TDVE
Subjt:  PMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE

SwissProt top hits (e value, % identity, alignment)
Q84JT7 AT-rich interactive domain-containing protein 1 | e value: 1.9e-63 | % identity: 30.87
Query:  MGRWPISSNASILDCN---KDVDPNPSNGCCIDPDCLNVNYDDCKATIR---CYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        M  W + ++   +D +   K +D N S      P+ +N      +  I+     F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+
Subjt:  MGRWPISSNASILDCN---KDVDPNPSNGCCIDPDCLNVNYDDCKATIR---CYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
          VSE   W  VV E GL+   SAS KLIY KYL     WL       ++  G++D    +       L A++ G L  V ++  +       +  +P  
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNVAAAAVEKETK-FPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
             A  +  E K F    K  +D H  +     +     +  G K   + +E   IL++V  E       R RE  L  LKW+   AK P +PS G V
Subjt:  NVNVAAAAVEKETK-FPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER
        P  S+W SY S++  W Q++  +    +R + D   EK W+K+                                     +MHP +Y+D+    +   ER
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER

Query:  ICGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP--SQHG
        +           N S                            D+ S  E+++P           G+ FQA VPEWTG   +SDSKWLGTR WP   +  
Subjt:  ICGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP--SQHG

Query:  NSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKS
         +N + +R+ IG+GR D C C  PGS+EC +FHI   R +LKLELG  FY W F  MGE     WT  E K+ K L +S+  + S  F + +    P KS
Subjt:  NSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKS

Query:  RKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV
        R  +VSY++NV LLQ R+ QSR++P+ IDSD + +
Subjt:  RKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV

Q9LDD4 AT-rich interactive domain-containing protein 2 | e value: 1.6e-86 | % identity: 35.44
Query:  DDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVR
        D+C+  +R  F++ L VFL+E    G I+P+PA++G+G ++DLF+LF++VR++ G+  VS K LW  V  +LG D  L  S+ LIY KYL+ +E W +  
Subjt:  DDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVR

Query:  CGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNGNVNVAAAAV--------EKETKFPKIKKEEHDLHGDITPIQQNC
              +N +S+                 KG   G+L +       +GFKS   NG       AV        E  ++F + +K   +   D        
Subjt:  CGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNGNVNVAAAAV--------EKETKFPKIKKEEHDLHGDITPIQQNC

Query:  TDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVPGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAE
             D G      VI +  ++ AV  E   D     R+ L  MLKW+   A  P +P+ G +P +SKWK Y+ +   WLQV R+K++LL ++D    AE
Subjt:  TDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVPGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAE

Query:  KQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYED---NIDNHHLSTERICGSKRSTASACNNSCPTVQSNWISSLTTEIGKG
         ++   P          H+             +H           HPS+YED   +I     S      SK  ++S CN S     S   S+   ++   
Subjt:  KQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYED---NIDNHHLSTERICGSKRSTASACNNSCPTVQSNWISSLTTEIGKG

Query:  LNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNS-NSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIA
         + +A L        + N+   +   + +  G   QA V EWT +  DSDSKWLGTR WP ++  + +     + +G+GRPDSCSC+  G VEC R HIA
Subjt:  LNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNS-NSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIA

Query:  EARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV
        E RM LK ELG  F+ WRF+ MGEE+ L+WT EEEKRFK++ I++     Q FW  +++ FP K R+ LVSYYFNVFL+ +R YQ+RV+P SIDSDDE  
Subjt:  EARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV

Query:  EFGCISGDFGAKAMEILGSKSVECSENRQFTD
         FG + G FG  A+   GS  + C++NRQ  D
Subjt:  EFGCISGDFGAKAMEILGSKSVECSENRQFTD

Arabidopsis top hits (e value, % identity, alignment)
AT1G26580.1 FUNCTIONS IN: molecular_function unknown | e value: 1.3e-19 | % identity: 34.12
Query:  EDSVEKPVPTGALFQAVVPEW----TGNISDS-------------DSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMR
        +   +K VP G   QA +PEW    TGNI  S               K  GT   P   G +      + +G+GR   C C+   SV C   HI EAR  
Subjt:  EDSVEKPVPTGALFQAVVPEW----TGNISDS-------------DSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMR

Query:  LKLELG-LTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGC
        L    G  TF +     MGE+ +L+W+ E+ + F E+  SN     Q FW +    F  +++K +VS+YFNVF+L++R+ Q+R     IDSDD++   GC
Subjt:  LKLELG-LTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGC

Query:  ISGDFGAKAME
          G  G + +E
Subjt:  ISGDFGAKAME

AT2G03470.1 ELM2 domain-containing protein | e value: 6.4e-19 | % identity: 40.91
Query:  SVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGL-TFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRK
        S SD    G+GR + C C   GS+ C R HI EAR  L   +G   F +     MGEE++  WT EEE  F ++  SN  +  + FW      FP ++ K
Subjt:  SVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGL-TFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRK

Query:  NLVSYYFNVFLLQQRSYQSRVSPNSIDSDDED
         LVSYYFNVF+L++R  Q+R     +DSDD++
Subjt:  NLVSYYFNVFLLQQRSYQSRVSPNSIDSDDED

AT2G46040.1 ARID/BRIGHT DNA-binding domain; ELM2 domain protein | e value: 1.3e-64 | % identity: 30.87
Query:  MGRWPISSNASILDCN---KDVDPNPSNGCCIDPDCLNVNYDDCKATIR---CYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY
        M  W + ++   +D +   K +D N S      P+ +N      +  I+     F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+
Subjt:  MGRWPISSNASILDCN---KDVDPNPSNGCCIDPDCLNVNYDDCKATIR---CYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGY

Query:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG
          VSE   W  VV E GL+   SAS KLIY KYL     WL       ++  G++D    +       L A++ G L  V ++  +       +  +P  
Subjt:  QVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNG

Query:  NVNVAAAAVEKETK-FPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV
             A  +  E K F    K  +D H  +     +     +  G K   + +E   IL++V  E       R RE  L  LKW+   AK P +PS G V
Subjt:  NVNVAAAAVEKETK-FPKIKKEEHDLHGDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTV

Query:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER
        P  S+W SY S++  W Q++  +    +R + D   EK W+K+                                     +MHP +Y+D+    +   ER
Subjt:  PGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAEKQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTER

Query:  ICGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP--SQHG
        +           N S                            D+ S  E+++P           G+ FQA VPEWTG   +SDSKWLGTR WP   +  
Subjt:  ICGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWP--SQHG

Query:  NSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKS
         +N + +R+ IG+GR D C C  PGS+EC +FHI   R +LKLELG  FY W F  MGE     WT  E K+ K L +S+  + S  F + +    P KS
Subjt:  NSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKS

Query:  RKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV
        R  +VSY++NV LLQ R+ QSR++P+ IDSD + +
Subjt:  RKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV

AT4G11400.1 ARID/BRIGHT DNA-binding domain; ELM2 domain protein | e value: 1.1e-87 | % identity: 35.44
Query:  DDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVR
        D+C+  +R  F++ L VFL+E    G I+P+PA++G+G ++DLF+LF++VR++ G+  VS K LW  V  +LG D  L  S+ LIY KYL+ +E W +  
Subjt:  DDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYSKYLSDLENWLMVR

Query:  CGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNGNVNVAAAAV--------EKETKFPKIKKEEHDLHGDITPIQQNC
              +N +S+                 KG   G+L +       +GFKS   NG       AV        E  ++F + +K   +   D        
Subjt:  CGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNGNVNVAAAAV--------EKETKFPKIKKEEHDLHGDITPIQQNC

Query:  TDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVPGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAE
             D G      VI +  ++ AV  E   D     R+ L  MLKW+   A  P +P+ G +P +SKWK Y+ +   WLQV R+K++LL ++D    AE
Subjt:  TDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVPGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAE

Query:  KQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYED---NIDNHHLSTERICGSKRSTASACNNSCPTVQSNWISSLTTEIGKG
         ++   P          H+             +H           HPS+YED   +I     S      SK  ++S CN S     S   S+   ++   
Subjt:  KQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYED---NIDNHHLSTERICGSKRSTASACNNSCPTVQSNWISSLTTEIGKG

Query:  LNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNS-NSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIA
         + +A L        + N+   +   + +  G   QA V EWT +  DSDSKWLGTR WP ++  + +     + +G+GRPDSCSC+  G VEC R HIA
Subjt:  LNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNS-NSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIA

Query:  EARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV
        E RM LK ELG  F+ WRF+ MGEE+ L+WT EEEKRFK++ I++     Q FW  +++ FP K R+ LVSYYFNVFL+ +R YQ+RV+P SIDSDDE  
Subjt:  EARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDV

Query:  EFGCISGDFGAKAMEILGSKSVECSENRQFTD
         FG + G FG  A+   GS  + C++NRQ  D
Subjt:  EFGCISGDFGAKAMEILGSKSVECSENRQFTD

AT5G04110.1 DNA GYRASE B3    e value: 2.5e-31    %identity: 34.68
Query:  LSTERICGSKRSTASACNNSCPTVQSNWI-SSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWT---------GNISDSDS
        +S+  +C S R +A   N+  P   SN     +T    K ++N+        +  +      +     +P G  FQA +P W          G+  DS++
Subjt:  LSTERICGSKRSTASACNNSCPTVQSNWI-SSLTTEIGKGLNNQALLNGDLPSEMEDNQPNEDSVEKPVPTGALFQAVVPEWT---------GNISDSDS

Query:  -KWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQ-WTAEEEKRFKELAISNFNNQS
         +WLGT  WP+ +    +V  +  +G GR DSCSC  P S  C + H  EA+  L+ E+   F  W F  MGEEI L+ WTA+EE+RF+ L   N  + S
Subjt:  -KWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQ-WTAEEEKRFKELAISNFNNQS

Query:  QCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDE
          FW ++S  FP KS+K+L+SYY+NVFL+++       + N+IDSDD+
Subjt:  QCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDE


Sequences
CDS sequence
ATGGGGAGATGGCCTATTTCATCCAATGCTTCCATTTTAGATTGCAACAAAGATGTTGATCCTAATCCCAGTAATGGATGTTGCATTGACCCGGATTGTTTGAATGTTAA
CTATGATGATTGCAAAGCCACGATTAGATGCTATTTTGAGAAAATTCTTTGGGTTTTTCTAAAGGAGATTGGTCGTAGAGGATTTATTAGGCCAGTGCCAGCGTTACTAG
GTGAAGGGGGATCTTTGGATTTGTTTGAACTGTTCATGGTAGTAAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTGGTTTTGGAATTA
GGTTTGGATCTTGGCCTTTCGGCTTCAGTGAAATTGATTTATTCCAAGTACTTAAGTGACCTAGAGAATTGGCTTATGGTGAGATGTGGAGGCACAAAACTGGAAAATGG
GAACTCTGATTATCACTGCAGGAAAAGCTTTCCATTTTTGTCGGAACTGGAGGCAAAGATTAAGGGTATGTTATATGGTGTGCTGAGACAAAAGAGCATATATGATGAAT
GCTCTGGATTCAAATCTAACAAACCGAATGGGAACGTTAATGTTGCTGCCGCTGCAGTGGAGAAGGAAACAAAATTCCCTAAAATAAAGAAGGAAGAACACGATCTTCAT
GGGGACATTACACCAATTCAACAAAATTGTACTGACACACCTCAGGATAATGGCGGAAAAGATCAAATCCAAGTTATTGAAGATTGTAGAATTTTGGATGCTGTTAATGT
TGAAACTGAAATAGACTCTCATGGGCGATATCGAGAATCGTTATTACGAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGCAAATCCATCAAACGGTACAGTAC
CAGGGGCATCCAAGTGGAAATCATATGATAGCGACGATGCATTATGGCTTCAAGTTATCAGGTCAAAGGATGCTCTTTTAAATAGGAAGGATGTTGACAAAACCGCTGAG
AAACAATGGGAAAAGTTACCTCCTGCTATTAGAACCTCAGGTCTCCATCATCATAAGCAAGCAGCATACGACTTCTATGTCACCCTGTCTTGTGTCCTACATGATTACAT
GATGGACACAAAGAAAGTAAGGATGCATCCATCCATTTATGAGGATAATATTGATAACCATCACCTCTCTACAGAAAGGATCTGTGGCAGCAAAAGATCTACTGCTTCGG
CGTGTAATAATTCATGTCCAACTGTTCAAAGCAATTGGATTAGTAGTCTAACAACAGAAATTGGGAAGGGACTCAACAATCAAGCACTCTTGAATGGTGATTTACCATCT
GAAATGGAAGACAATCAGCCAAATGAAGATTCTGTTGAGAAGCCAGTTCCCACGGGTGCTTTATTTCAGGCAGTGGTACCTGAATGGACTGGTAATATTTCCGATAGTGA
CTCTAAATGGCTAGGGACACGGTCGTGGCCTTCTCAACACGGAAATAGTAATTCCGTAAGTGATAGAAATCCCATTGGCAGAGGGAGACCGGATTCATGTAGTTGCCAAT
TTCCAGGATCTGTTGAATGTTTTAGATTTCACATTGCGGAAGCAAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTACGATTGGAGATTTCATCATATGGGGGAG
GAAATATCTCTGCAGTGGACTGCTGAAGAGGAAAAGAGATTTAAGGAGTTGGCAATATCCAATTTTAACAATCAAAGTCAGTGCTTCTGGAACTATTCCTCGAGGTGGTT
CCCAATGAAATCAAGGAAAAATTTGGTAAGCTATTACTTCAATGTGTTTCTTTTACAGCAGAGAAGCTATCAGAGTCGTGTGAGTCCAAATAGCATTGATAGTGATGATG
AAGATGTAGAGTTTGGTTGCATCAGTGGTGATTTTGGGGCTAAGGCAATGGAAATTTTAGGCTCAAAATCTGTAGAATGTTCTGAAAATAGACAGTTCACAGATGTGGAG
TAG
mRNA sequence
ATGGGGAGATGGCCTATTTCATCCAATGCTTCCATTTTAGATTGCAACAAAGATGTTGATCCTAATCCCAGTAATGGATGTTGCATTGACCCGGATTGTTTGAATGTTAA
CTATGATGATTGCAAAGCCACGATTAGATGCTATTTTGAGAAAATTCTTTGGGTTTTTCTAAAGGAGATTGGTCGTAGAGGATTTATTAGGCCAGTGCCAGCGTTACTAG
GTGAAGGGGGATCTTTGGATTTGTTTGAACTGTTCATGGTAGTAAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTGGTTTTGGAATTA
GGTTTGGATCTTGGCCTTTCGGCTTCAGTGAAATTGATTTATTCCAAGTACTTAAGTGACCTAGAGAATTGGCTTATGGTGAGATGTGGAGGCACAAAACTGGAAAATGG
GAACTCTGATTATCACTGCAGGAAAAGCTTTCCATTTTTGTCGGAACTGGAGGCAAAGATTAAGGGTATGTTATATGGTGTGCTGAGACAAAAGAGCATATATGATGAAT
GCTCTGGATTCAAATCTAACAAACCGAATGGGAACGTTAATGTTGCTGCCGCTGCAGTGGAGAAGGAAACAAAATTCCCTAAAATAAAGAAGGAAGAACACGATCTTCAT
GGGGACATTACACCAATTCAACAAAATTGTACTGACACACCTCAGGATAATGGCGGAAAAGATCAAATCCAAGTTATTGAAGATTGTAGAATTTTGGATGCTGTTAATGT
TGAAACTGAAATAGACTCTCATGGGCGATATCGAGAATCGTTATTACGAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGCAAATCCATCAAACGGTACAGTAC
CAGGGGCATCCAAGTGGAAATCATATGATAGCGACGATGCATTATGGCTTCAAGTTATCAGGTCAAAGGATGCTCTTTTAAATAGGAAGGATGTTGACAAAACCGCTGAG
AAACAATGGGAAAAGTTACCTCCTGCTATTAGAACCTCAGGTCTCCATCATCATAAGCAAGCAGCATACGACTTCTATGTCACCCTGTCTTGTGTCCTACATGATTACAT
GATGGACACAAAGAAAGTAAGGATGCATCCATCCATTTATGAGGATAATATTGATAACCATCACCTCTCTACAGAAAGGATCTGTGGCAGCAAAAGATCTACTGCTTCGG
CGTGTAATAATTCATGTCCAACTGTTCAAAGCAATTGGATTAGTAGTCTAACAACAGAAATTGGGAAGGGACTCAACAATCAAGCACTCTTGAATGGTGATTTACCATCT
GAAATGGAAGACAATCAGCCAAATGAAGATTCTGTTGAGAAGCCAGTTCCCACGGGTGCTTTATTTCAGGCAGTGGTACCTGAATGGACTGGTAATATTTCCGATAGTGA
CTCTAAATGGCTAGGGACACGGTCGTGGCCTTCTCAACACGGAAATAGTAATTCCGTAAGTGATAGAAATCCCATTGGCAGAGGGAGACCGGATTCATGTAGTTGCCAAT
TTCCAGGATCTGTTGAATGTTTTAGATTTCACATTGCGGAAGCAAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTACGATTGGAGATTTCATCATATGGGGGAG
GAAATATCTCTGCAGTGGACTGCTGAAGAGGAAAAGAGATTTAAGGAGTTGGCAATATCCAATTTTAACAATCAAAGTCAGTGCTTCTGGAACTATTCCTCGAGGTGGTT
CCCAATGAAATCAAGGAAAAATTTGGTAAGCTATTACTTCAATGTGTTTCTTTTACAGCAGAGAAGCTATCAGAGTCGTGTGAGTCCAAATAGCATTGATAGTGATGATG
AAGATGTAGAGTTTGGTTGCATCAGTGGTGATTTTGGGGCTAAGGCAATGGAAATTTTAGGCTCAAAATCTGTAGAATGTTCTGAAAATAGACAGTTCACAGATGTGGAG
TAGAGTCCATGGAGGCAACAGACAGTTTGAAGAAAGGGAGAAAATTGAAGAATGAAGTCTAGAACTCAAGTCATTCCTGCATAAACGTAATTTTTCTCAACATTTTGCTC
CTGCAGCCTGCAGCAGATTCTGAAGTCATCCAAAAAAGAGGGGAAAAGAAACCCAATTTTGGCGAGAACAAGTTTTAGCTACACAACACCATTTTTGTTGGGGGGATCTG
TATGCTATCGGCTGGTGAGACTAAATATGAGGACCAAAGAGAGCCAGCGGTACATTTGTTTGTTTTGAACTTGTGTGTGTGTGTGGAACTCAATCTGATAGTTCTTGGTT
TTGTATTTGGAGAGTGTTGTAATCATTTGAAGGGGCTCCGAGAGAATTTTACCGAATATTTAAAAGTCAACTTGAGTATAGTTCAACAGGTTGAAGACATTTGCCTTAAC
CAAAAAGGTTAGAGGTTTGAAATATCACTCATATGTTTTAGAACTCGGAGGAAAAAAAAATGTTTAAACTATCCTACTTTGGGAGAATTCGTCATTTTTCTCTCCAAAAT
TGGTACATATTATGGGGTTGGAGGATCAAACAGATCGAACTTCTAATCTCTAGGGAAAAAGAT
Protein sequence
MGRWPISSNASILDCNKDVDPNPSNGCCIDPDCLNVNYDDCKATIRCYFEKILWVFLKEIGRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLEL
GLDLGLSASVKLIYSKYLSDLENWLMVRCGGTKLENGNSDYHCRKSFPFLSELEAKIKGMLYGVLRQKSIYDECSGFKSNKPNGNVNVAAAAVEKETKFPKIKKEEHDLH
GDITPIQQNCTDTPQDNGGKDQIQVIEDCRILDAVNVETEIDSHGRYRESLLRMLKWVRKTAKHPANPSNGTVPGASKWKSYDSDDALWLQVIRSKDALLNRKDVDKTAE
KQWEKLPPAIRTSGLHHHKQAAYDFYVTLSCVLHDYMMDTKKVRMHPSIYEDNIDNHHLSTERICGSKRSTASACNNSCPTVQSNWISSLTTEIGKGLNNQALLNGDLPS
EMEDNQPNEDSVEKPVPTGALFQAVVPEWTGNISDSDSKWLGTRSWPSQHGNSNSVSDRNPIGRGRPDSCSCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGE
EISLQWTAEEEKRFKELAISNFNNQSQCFWNYSSRWFPMKSRKNLVSYYFNVFLLQQRSYQSRVSPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE