; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010739 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010739
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionintegrator complex subunit 3 homolog
Genome locationChr06:25395692..25403878
RNA-Seq ExpressionHG10010739
SyntenyHG10010739
Gene Ontology termsNA
InterPro domainsIPR019333 - Integrator complex subunit 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056027.1 integrator complex subunit 3-like protein [Cucumis melo var. makuwa]5.5e-20872.01Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEI AKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V+
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR

TYK27714.1 integrator complex subunit 3-like protein [Cucumis melo var. makuwa]1.1e-20872.19Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V+
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR

XP_016900895.1 PREDICTED: integrator complex subunit 3 homolog [Cucumis melo]6.5e-20970.93Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDH
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V N       S+V  L++H
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDH

XP_038879691.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida]7.9e-22376.29Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDVERKDKIALGVSSAFSALI+KGVISSLDTLISFDGLSP LRDRLR+LSSGKKFQVPNELQL IPN SV PLPSSSKSC   G 
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNT
        IYSESHP+ +V + +AT  G+SVPIVVDVSA  HSVVTDVQQ DNIEILVKNLG VT KSYKMGLKTLEELLVLFLSL+DN QAGRTINTEILSSRIVNT
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNT

Query:  YNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL
        Y+L GYKLFCALELPPNG  Y+DEIESATALIIRTFIF+HE NIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGL GNVEFEN DSAEIDSKPQLLL
Subjt:  YNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL

Query:  FHLNGYFSFRN----------------------------------------------------------------------------------DLCICKE
        FHLNGY+ FRN                                                                                  DLCICKE
Subjt:  FHLNGYFSFRN----------------------------------------------------------------------------------DLCICKE

Query:  EIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCC
        EIVKL VTLLDDTDLVNMQFEIIAKKF VFGKDTKSIFLL+KSSLNW CLEQRKLWGLIRSELIVS+V+VES+VLKLFCLGVLDASKHAIAIEGLLNLCC
Subjt:  EIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCC

Query:  YNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN
        YNAPSPELVEAIMLLPND+FH FSAAVLASWVVSNESMLF SLVDFAEKLGKMSESE V N
Subjt:  YNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN

XP_038879698.1 uncharacterized protein LOC120071465 isoform X2 [Benincasa hispida]4.8e-20471.66Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDVERKDKIALGVSSAFSALI+KGVISSLDTLISFDGLSP LRDRLR+LSS                                G 
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNT
        IYSESHP+ +V + +AT  G+SVPIVVDVSA  HSVVTDVQQ DNIEILVKNLG VT KSYKMGLKTLEELLVLFLSL+DN QAGRTINTEILSSRIVNT
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNT

Query:  YNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL
        Y+L GYKLFCALELPPNG  Y+DEIESATALIIRTFIF+HE NIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGL GNVEFEN DSAEIDSKPQLLL
Subjt:  YNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL

Query:  FHLNGYFSFRN----------------------------------------------------------------------------------DLCICKE
        FHLNGY+ FRN                                                                                  DLCICKE
Subjt:  FHLNGYFSFRN----------------------------------------------------------------------------------DLCICKE

Query:  EIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCC
        EIVKL VTLLDDTDLVNMQFEIIAKKF VFGKDTKSIFLL+KSSLNW CLEQRKLWGLIRSELIVS+V+VES+VLKLFCLGVLDASKHAIAIEGLLNLCC
Subjt:  EIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCC

Query:  YNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN
        YNAPSPELVEAIMLLPND+FH FSAAVLASWVVSNESMLF SLVDFAEKLGKMSESE V N
Subjt:  YNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN

TrEMBL top hitse value%identityAlignment
A0A0A0LXR4 Uncharacterized protein5.2e-20470.71Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        +VHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLD LISF G+SP LRDRLR+LSS KKFQV NE+QL +P+ S KPLPS +KSC   G+
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I SESHP+ +V +A +TS G SVPIV D SAS HS  T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI   EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TYN SG+KLFCALELPPNGP YDDEIESATALIIRTFIFHHE NI +LLLFCSRNGLPVGARLLSYV+RLAYEANKAGL  NVEFENS+ AE+DS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLC+CK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEIIAKKF VFGKD KSIFLL+KSSLNW CLEQRKLWGLIRSEL+VSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFV
        CY+APSPE VEAIML+PND+FH FSAAVLASWVVSNESMLF SLVDF+ KLGKM+ESE V
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFV

A0A1S4DYU4 integrator complex subunit 3 homolog3.2e-20970.93Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDH
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V N       S+V  L++H
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDH

A0A5A7UN45 Integrator complex subunit 3-like protein2.7e-20872.01Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEI AKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V+
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR

A0A5D3DW40 Integrator complex subunit 3-like protein5.4e-20972.19Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI
        MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDRLRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGI
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGI

Query:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN
        I  ESHP+R+V + ++TS GASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDDNAQ   TI + EILSSRI+N
Subjt:  IYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTI-NTEILSSRIVN

Query:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL
        TY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFHHE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Subjt:  TYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL

Query:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK
        LFH+NGYFSFRN                                                                                  DLCICK
Subjt:  LFHLNGYFSFRN----------------------------------------------------------------------------------DLCICK

Query:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC
        EEIVKL VTLLDDTDLVNMQFEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFCLGVLDASKHAIAIEGLLNLC
Subjt:  EEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLC

Query:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR
        CYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESMLFHSLVDFA KLGKM+ESE V+
Subjt:  CYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVR

A0A6J1ERY7 integrator complex subunit 3 isoform X11.5e-19867.48Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLV-IPNQSVKPLPSSSKSCVETG
        MVHTLL+FLFLLVDNYD++RKDKIALGVSSAFSAL+EK VI SLD LISFDGLSP LRDRLRILSSG+K QVP E QL  +P+ S+KP    SKSC ETG
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLV-IPNQSVKPLPSSSKSCVETG

Query:  IIYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVV-------------TDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGR
        +IYSE  P+ +VAH SATS GASVP+VVDVSAS HSVV              DV+Q DN+EILVK LGEV  KSYKMGLKTLEELLVLFLSLDDNAQA R
Subjt:  IIYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVV-------------TDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGR

Query:  TINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFEN
        TINTEILSSRIVNTY LSGY LF ALEL PN P YDDEI SATALIIRTFIF H   +QELLLFCSRNGLPVGARLLSYVSRLAYE NKAGL GN + +N
Subjt:  TINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFEN

Query:  SDSAEIDSKPQLLLFHLNGYFSFRN---------------------------------------------------------------------------
        SD AEIDSK Q L+FH+NGY+SFRN                                                                           
Subjt:  SDSAEIDSKPQLLLFHLNGYFSFRN---------------------------------------------------------------------------

Query:  -------DLCICKEEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDAS
               D+CICKEEIVKL VT LDDTDLVNMQFEII KKF VFGKD +SIFLL+KSSLNW C EQ KLWGLIRSELIVS+V+V+SIVLKLFC GV+D S
Subjt:  -------DLCICKEEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDAS

Query:  KHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN
         HAIA+EGLLNLCCYNAPSP+LVEAIMLLPND+F  FSAAVLA+WVVSNESMLFHSLVDFAEKL KMSESE V N
Subjt:  KHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRN

SwissProt top hitse value%identityAlignment
F4IDQ5 Protein JASON2.2e-0529.78Show/hide
Query:  RHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQKENVELEQESSVEEFPYSSSKPSRELC--VDASLSNWLASSEATPVSKITATTALEATITPVKSSI
        R R+ +V SV N +EN S +   K   E      +E +E E  SS E +     + S E     +AS S WL       ++ +   T     ITP    I
Subjt:  RHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQKENVELEQESSVEEFPYSSSKPSRELC--VDASLSNWLASSEATPVSKITATTALEATITPVKSSI

Query:  LQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSF---KGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG
                            +G+   Q   ++  + +       GIPN+T+KY+ED+ V+WH+TPFE RLE+AL+  G
Subjt:  LQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSF---KGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG

Q55EZ4 Integrator complex subunit 3 homolog5.3e-0436.51Show/hide
Query:  MVHTLLDFLF-LLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLR
        M   L++F+   L+D+YD++RKD I  G+ ++F+ ++EKGV+ SL  +   D L P L ++++
Subjt:  MVHTLLDFLF-LLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLR

Arabidopsis top hitse value%identityAlignment
AT1G04030.1 unknown protein1.1e-2833.76Show/hide
Query:  IPKPSQSSVHTLLDHSNNKLLHPVQAGRDGSAVTGKAHEDEHVVLPQEDAASVHGKKQVLKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRN
        IPK S   +  + D +  K   P    R       K    EHVV   E++  +  +    K EE   E  S   S ++D ++ +  N   S P  HRY+N
Subjt:  IPKPSQSSVHTLLDHSNNKLLHPVQAGRDGSAVTGKAHEDEHVVLPQEDAASVHGKKQVLKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRN

Query:  CKDSDD---EDEVFDSHLASDENDEFGMVESMGEDS---------------SVAESSMNVCRLNPTNVRHRTAY-VCSVLNPVENLSQWNAVKSK---KE
        C++SDD   EDE   S    DE++E+       EDS                  E    + R N T VR    Y    VLNPVENL+QW + KSK   K+
Subjt:  CKDSDD---EDEVFDSHLASDENDEFGMVESMGEDS---------------SVAESSMNVCRLNPTNVRHRTAY-VCSVLNPVENLSQWNAVKSK---KE

Query:  FPSTLQKENVELEQESSVEEFPYS----------SSKP----------SRELCVDASLSNWLASSEA------------TPVSKITATTALEATI-----
          S  +  N   +QE   +   +           S KP          ++EL VDASLS WL++SE+            TP  K+ +T+     +     
Subjt:  FPSTLQKENVELEQESSVEEFPYS----------SSKP----------SRELCVDASLSNWLASSEA------------TPVSKITATTALEATI-----

Query:  -TPVKSSI-------LQGSSLPKRS---SHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNS
          PV  ++          +S P++S   S  E P + TVG Y    +   D  SASSFKGIPNT+SKYREDK+VNWHSTPFE RLE+ALN+
Subjt:  -TPVKSSI-------LQGSSLPKRS---SHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNS

AT2G30820.1 unknown protein9.0e-0736.71Show/hide
Query:  KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG
        K  I + SS P   +  + P +  V  +  +    +         GIPN+T+KY+ED+ V+WH+TPFE RLE+AL+  G
Subjt:  KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG

AT2G30820.2 unknown protein9.0e-0736.71Show/hide
Query:  KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG
        K  I + SS P   +  + P +  V  +  +    +         GIPN+T+KY+ED+ V+WH+TPFE RLE+AL+  G
Subjt:  KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG

AT4G14590.1 embryo defective 27398.1e-0836.73Show/hide
Query:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSS---GKKFQVPNELQLVIPNQSVKPLPSSSKSC
        + H+LL+FL  LV+ YD+ R+D I  G++SAF  +  KGVI SLD  ++   L+P L+ +L  L S    K   V      V+  Q+V    ++ K C
Subjt:  MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSS---GKKFQVPNELQLVIPNQSVKPLPSSSKSC

AT5G44040.1 unknown protein3.2e-1232.74Show/hide
Query:  LKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDDEDE----VFDSHLASDENDEFGMVES--MGEDS---------------SVAES
        L EE+     S +   SSE   V    N   S P  HRY+NC++SDDE+E      DS L   ++D+ G+++     +D+                +A++
Subjt:  LKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDDEDE----VFDSHLASDENDEFGMVES--MGEDS---------------SVAES

Query:  SMNVCRL---NPTNVRHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQ--KENV---ELEQESSVEEF--PYSSSKPSR---------ELCVDASLSNW
         M++ R+      +VR R+ YV +VLNP+ENLSQW AVK+K    +  Q  KENV       ES V++    +S ++ SR         E+ VDASLS W
Subjt:  SMNVCRL---NPTNVRHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQ--KENV---ELEQESSVEEF--PYSSSKPSR---------ELCVDASLSNW

Query:  LASSEATPVSKITATTALEATITPVK
        L++S+ T     +  +++E T++  K
Subjt:  LASSEATPVSKITATTALEATITPVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCACACTCTCCTTGACTTTCTATTCCTTCTTGTGGATAACTATGATGTTGAAAGGAAGGATAAAATAGCTTTGGGTGTGTCTTCAGCTTTTAGTGCACTTATCGA
AAAAGGAGTAATTTCCTCATTGGACACTTTGATTTCTTTTGACGGTCTTTCTCCATTTCTACGAGACAGGCTTAGGATACTTTCATCAGGTAAGAAGTTTCAGGTTCCAA
ATGAATTGCAATTAGTTATACCTAATCAATCTGTGAAGCCTCTGCCTTCTTCGAGTAAATCCTGTGTAGAAACTGGCATAATATATTCGGAAAGCCATCCTAACCGCGTT
GTAGCCCATGCAAGTGCTACATCTGCTGGTGCTTCTGTTCCTATTGTAGTTGATGTATCTGCCTCTCGTCATTCAGTTGTGACGGATGTACAGCAATTTGACAATATAGA
AATTTTGGTGAAAAATCTTGGTGAAGTTACTAGCAAATCCTATAAAATGGGCCTCAAAACTCTGGAAGAACTTCTAGTTTTATTTCTCTCGCTTGATGACAATGCACAAG
CTGGCAGAACAATAAACACTGAAATACTGTCTTCCAGAATAGTAAATACCTATAACTTGAGTGGGTATAAACTATTCTGTGCTCTTGAATTACCTCCAAATGGTCCCGAT
TATGATGATGAGATAGAATCTGCTACTGCCTTAATAATCCGTACCTTCATCTTTCATCATGAAAGCAATATACAAGAATTGCTTCTATTTTGTTCTAGGAATGGTTTGCC
TGTGGGAGCACGATTGTTATCTTATGTATCTCGTCTGGCTTATGAGGCGAACAAAGCAGGTTTAATAGGTAATGTTGAGTTTGAGAACAGTGATAGTGCGGAAATTGATT
CGAAGCCGCAGTTATTGTTGTTCCATCTGAATGGGTATTTTTCTTTCAGGAATGATTTATGCATATGCAAGGAGGAGATTGTTAAATTATTTGTAACCCTGTTGGATGAC
ACTGATCTTGTTAATATGCAGTTTGAGATTATTGCAAAGAAATTTTCTGTGTTTGGTAAAGACACTAAATCTATTTTTCTTTTACTTAAGAGCTCTCTGAATTGGAGTTG
TCTCGAACAACGTAAACTCTGGGGCTTGATAAGGTCAGAGCTTATAGTTTCACAGGTTCGGGTCGAGAGCATAGTTTTGAAACTTTTCTGCTTGGGTGTCTTAGATGCAA
GCAAGCATGCCATTGCCATTGAAGGTCTTCTAAACTTGTGCTGTTATAATGCACCATCACCTGAGCTTGTTGAGGCAATCATGTTATTACCCAATGATTCATTTCACGAC
TTCTCCGCTGCGGTCTTGGCTTCCTGGGTTGTATCAAACGAGTCAATGCTATTTCATAGCCTGGTTGATTTTGCTGAGAAACTTGGCAAGATGAGTGAGAGTGAGTTTGT
GAGGAATGCGATTCCTAAACCCAGCCAATCTTCTGTTCATACGTTGCTTGATCACTCTAACAACAAACTCTTGCATCCGGTTCAAGCTGGGCGTGATGGGTCAGCAGTGA
CTGGGAAAGCCCATGAGGATGAGCATGTTGTTTTACCCCAAGAAGATGCTGCTTCAGTTCATGGCAAGAAACAAGTCCTGAAGGAGGAGGAAGACAGTGTGGAGAATTCA
AGCAAGCCTGAATCTTCATCTGAAGATTTTGTTGTTCCTCTCAATCCCAACTTCAAATCATCTTGCCCTCCGATTCATCGATACAGGAACTGTAAGGATAGTGATGATGA
AGATGAGGTTTTTGATAGTCATCTTGCTAGTGATGAAAATGATGAATTTGGAATGGTTGAATCAATGGGGGAAGATTCTTCTGTGGCTGAGAGTTCAATGAATGTTTGTA
GGCTAAACCCAACAAATGTGAGGCATAGGACTGCTTATGTTTGCTCTGTACTGAATCCAGTTGAAAATCTTTCTCAATGGAATGCTGTGAAATCAAAGAAGGAATTTCCA
TCAACACTTCAGAAAGAGAATGTGGAGTTAGAACAAGAATCAAGTGTTGAGGAATTTCCATACAGCAGTTCCAAGCCGAGTCGAGAACTGTGTGTTGATGCCAGCCTTTC
TAACTGGTTGGCTTCATCAGAAGCAACACCAGTCAGTAAGATTACTGCAACAACGGCTTTAGAAGCCACCATTACTCCGGTGAAAAGCAGCATATTGCAAGGATCCAGCT
TGCCGAAAAGAAGCAGCCACAGGGAGATGCCTGAAGTCAGAACGGTTGGTATGTATTGTAGGCAAGGAGCGAGTGATAAGGATCGTGATTCAGCTTCTTCGTTTAAAGGA
ATACCGAATACAACTAGCAAGTATAGAGAGGATAAGACAGTGAATTGGCACTCTACACCATTTGAAACAAGGTTGGAGAGAGCTTTGAATAGTAGAGGAGTTGTAGCTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGTTCACACTCTCCTTGACTTTCTATTCCTTCTTGTGGATAACTATGATGTTGAAAGGAAGGATAAAATAGCTTTGGGTGTGTCTTCAGCTTTTAGTGCACTTATCGA
AAAAGGAGTAATTTCCTCATTGGACACTTTGATTTCTTTTGACGGTCTTTCTCCATTTCTACGAGACAGGCTTAGGATACTTTCATCAGGTAAGAAGTTTCAGGTTCCAA
ATGAATTGCAATTAGTTATACCTAATCAATCTGTGAAGCCTCTGCCTTCTTCGAGTAAATCCTGTGTAGAAACTGGCATAATATATTCGGAAAGCCATCCTAACCGCGTT
GTAGCCCATGCAAGTGCTACATCTGCTGGTGCTTCTGTTCCTATTGTAGTTGATGTATCTGCCTCTCGTCATTCAGTTGTGACGGATGTACAGCAATTTGACAATATAGA
AATTTTGGTGAAAAATCTTGGTGAAGTTACTAGCAAATCCTATAAAATGGGCCTCAAAACTCTGGAAGAACTTCTAGTTTTATTTCTCTCGCTTGATGACAATGCACAAG
CTGGCAGAACAATAAACACTGAAATACTGTCTTCCAGAATAGTAAATACCTATAACTTGAGTGGGTATAAACTATTCTGTGCTCTTGAATTACCTCCAAATGGTCCCGAT
TATGATGATGAGATAGAATCTGCTACTGCCTTAATAATCCGTACCTTCATCTTTCATCATGAAAGCAATATACAAGAATTGCTTCTATTTTGTTCTAGGAATGGTTTGCC
TGTGGGAGCACGATTGTTATCTTATGTATCTCGTCTGGCTTATGAGGCGAACAAAGCAGGTTTAATAGGTAATGTTGAGTTTGAGAACAGTGATAGTGCGGAAATTGATT
CGAAGCCGCAGTTATTGTTGTTCCATCTGAATGGGTATTTTTCTTTCAGGAATGATTTATGCATATGCAAGGAGGAGATTGTTAAATTATTTGTAACCCTGTTGGATGAC
ACTGATCTTGTTAATATGCAGTTTGAGATTATTGCAAAGAAATTTTCTGTGTTTGGTAAAGACACTAAATCTATTTTTCTTTTACTTAAGAGCTCTCTGAATTGGAGTTG
TCTCGAACAACGTAAACTCTGGGGCTTGATAAGGTCAGAGCTTATAGTTTCACAGGTTCGGGTCGAGAGCATAGTTTTGAAACTTTTCTGCTTGGGTGTCTTAGATGCAA
GCAAGCATGCCATTGCCATTGAAGGTCTTCTAAACTTGTGCTGTTATAATGCACCATCACCTGAGCTTGTTGAGGCAATCATGTTATTACCCAATGATTCATTTCACGAC
TTCTCCGCTGCGGTCTTGGCTTCCTGGGTTGTATCAAACGAGTCAATGCTATTTCATAGCCTGGTTGATTTTGCTGAGAAACTTGGCAAGATGAGTGAGAGTGAGTTTGT
GAGGAATGCGATTCCTAAACCCAGCCAATCTTCTGTTCATACGTTGCTTGATCACTCTAACAACAAACTCTTGCATCCGGTTCAAGCTGGGCGTGATGGGTCAGCAGTGA
CTGGGAAAGCCCATGAGGATGAGCATGTTGTTTTACCCCAAGAAGATGCTGCTTCAGTTCATGGCAAGAAACAAGTCCTGAAGGAGGAGGAAGACAGTGTGGAGAATTCA
AGCAAGCCTGAATCTTCATCTGAAGATTTTGTTGTTCCTCTCAATCCCAACTTCAAATCATCTTGCCCTCCGATTCATCGATACAGGAACTGTAAGGATAGTGATGATGA
AGATGAGGTTTTTGATAGTCATCTTGCTAGTGATGAAAATGATGAATTTGGAATGGTTGAATCAATGGGGGAAGATTCTTCTGTGGCTGAGAGTTCAATGAATGTTTGTA
GGCTAAACCCAACAAATGTGAGGCATAGGACTGCTTATGTTTGCTCTGTACTGAATCCAGTTGAAAATCTTTCTCAATGGAATGCTGTGAAATCAAAGAAGGAATTTCCA
TCAACACTTCAGAAAGAGAATGTGGAGTTAGAACAAGAATCAAGTGTTGAGGAATTTCCATACAGCAGTTCCAAGCCGAGTCGAGAACTGTGTGTTGATGCCAGCCTTTC
TAACTGGTTGGCTTCATCAGAAGCAACACCAGTCAGTAAGATTACTGCAACAACGGCTTTAGAAGCCACCATTACTCCGGTGAAAAGCAGCATATTGCAAGGATCCAGCT
TGCCGAAAAGAAGCAGCCACAGGGAGATGCCTGAAGTCAGAACGGTTGGTATGTATTGTAGGCAAGGAGCGAGTGATAAGGATCGTGATTCAGCTTCTTCGTTTAAAGGA
ATACCGAATACAACTAGCAAGTATAGAGAGGATAAGACAGTGAATTGGCACTCTACACCATTTGAAACAAGGTTGGAGAGAGCTTTGAATAGTAGAGGAGTTGTAGCTTG
A
Protein sequenceShow/hide protein sequence
MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRV
VAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNTYNLSGYKLFCALELPPNGPD
YDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLLFHLNGYFSFRNDLCICKEEIVKLFVTLLDD
TDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHD
FSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDHSNNKLLHPVQAGRDGSAVTGKAHEDEHVVLPQEDAASVHGKKQVLKEEEDSVENS
SKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDDEDEVFDSHLASDENDEFGMVESMGEDSSVAESSMNVCRLNPTNVRHRTAYVCSVLNPVENLSQWNAVKSKKEFP
STLQKENVELEQESSVEEFPYSSSKPSRELCVDASLSNWLASSEATPVSKITATTALEATITPVKSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKG
IPNTTSKYREDKTVNWHSTPFETRLERALNSRGVVA