CuGenDBv2

Lsi06G013350 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G013350
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: carbon catabolite repressor protein 4 homolog 5
Genome location: chr06:23930451..23939032
RNA-Seq Expression: Lsi06G013350
Synteny: Lsi06G013350
Gene Ontology terms: GO:0000175 - 3'-5'-exoribonuclease activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
TYK28421.1 carbon catabolite repressor protein 4-like protein 5 [Cucumis melo var. makuwa] | e value: 2.0e-163 | % identity: 71.79
Query:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEK LDET   + I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S PQKLA SN  K+IRS  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID  MVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQS+GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDF+SSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+ T LQH LKLSSAYYG+PGS KTRDTNGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        MGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
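
The % identity column can be related to the alignment blocks with a short calculation. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and a hypothetical helper name, `midline_identity`. It scores a single block only, so it will not reproduce the whole-alignment figure reported for a hit.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity of one alignment block from its midline.

    Assumption: letters mark identical residues; '+' and spaces
    (conservative substitutions, mismatches, gaps) are not counted.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Midline of the final block of the first GenBank alignment above.
midline = "MGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE"
print(round(midline_identity(midline, len(midline)), 2))
```

The aligned length is passed explicitly because trailing spaces in a midline are easily lost when the page is copied as text.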

XP_008453389.1 PREDICTED: carbon catabolite repressor protein 4 homolog 5 [Cucumis melo] | e value: 4.0e-164 | % identity: 72.02
Query:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEK LDET   + I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID  MVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQS+GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSAYYG+PGS KTRDTNGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        +GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

XP_022921598.1 carbon catabolite repressor protein 4 homolog 5 [Cucurbita moschata] | e value: 1.2e-163 | % identity: 70.57
Query:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD
        MIE+SLDE   MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K +RS  RTSRK+EK RSS++DGHRRWVYS RD
Subjt:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD

Query:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL
        CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYK VYKARTGEANDGCA+FWI+KL
Subjt:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL

Query:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------
        F LLHQE+IEFQS+GLRNNVAQLCV K                                                                         
Subjt:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------

Query:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM
           LDIQLHDRRKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLSSAYYGVPGSCKTRD NGEPLVTSFHS FM
Subjt:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM

Query:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        GTVDYIWHSEKLAPVRVLETLP+DAL RTGGLPNE
Subjt:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

XP_023516129.1 carbon catabolite repressor protein 4 homolog 5 [Cucurbita pepo subsp. pepo] | e value: 1.4e-161 | % identity: 69.89
Query:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD
        MIE+S DE   MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K + S  RTSRK+EK RS ++DGHR WVYS RD
Subjt:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD

Query:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL
        CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA+FWI+KL
Subjt:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL

Query:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------
        F LLHQE+IEFQS+GLRNNVAQLCV K                                                                         
Subjt:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------

Query:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM
           LDIQLHDRRKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLSSAYYGVPGSCKTRD NGEPLVTSFHS FM
Subjt:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM

Query:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        GTVDYIWHSEKLAPVRVLETLP+DAL RTGGLPNE
Subjt:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

XP_038901965.1 carbon catabolite repressor protein 4 homolog 5 [Benincasa hispida] | e value: 2.3e-175 | % identity: 75.23
Query:  MIEKSLDETMDIRAKTEKNKRKPSTNA---AHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEKSLDETMDIRAK EKNKRKPSTNA   AH++ NDHRKKRRRLA  SETTIPT + PQKLAESNSF+SIRS PRTSRKH+K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDETMDIRAKTEKNKRKPSTNA---AHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID+IMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA+FWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
         F+LLHQETIEFQS GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDFSSSH   RFC  GTK SNVSASRSFRWSDEEIRIASGSE+VT LQHHLKLSSAYYGVPGSCKTRD NGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        MGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LUW1 Endo/exonuclease/phosphatase domain-containing protein | e value: 9.0e-162 | % identity: 71.33
Query:  MIEKSLDE---TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MI K+ DE    MDI AKT+KNKRKPST+A   A NDHRKKRRRLA SSET IP  S PQKLA S+  K+I S  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDE---TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFIDK MVASYNILGVENAL HPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRF+DLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQ++GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSAYYG+PGS KTRDTNGEPLVTSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        MGTVDYIWHSEKLAPVRVLETLPVDAL +TGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

A0A1S3BW46 carbon catabolite repressor protein 4 homolog 5 | e value: 1.9e-164 | % identity: 72.02
Query:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEK LDET   + I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID  MVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQS+GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSAYYG+PGS KTRDTNGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        +GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

A0A5A7UX51 Carbon catabolite repressor protein 4-like protein 5 | e value: 1.9e-164 | % identity: 72.02
Query:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEK LDET   + I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID  MVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQS+GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSAYYG+PGS KTRDTNGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        +GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

A0A5D3DXW9 Carbon catabolite repressor protein 4-like protein 5 | e value: 9.6e-164 | % identity: 71.79
Query:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR
        MIEK LDET   + I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S PQKLA SN  K+IRS  RTSRKH K RSSQTDGHRRWVYSAR
Subjt:  MIEKSLDET---MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSAR

Query:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
        DCSRFID  MVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK
Subjt:  DCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDK

Query:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------
        LF+LLHQETIEFQS+GLRNNVAQLCVLK                                                                        
Subjt:  LFALLHQETIEFQSYGLRNNVAQLCVLK------------------------------------------------------------------------

Query:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF
            LDIQLHDRRKISGQLDF+SSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+ T LQH LKLSSAYYG+PGS KTRDTNGEPL TSFHSKF
Subjt:  ----LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKF

Query:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        MGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLPNE
Subjt:  MGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

A0A6J1E4C1 carbon catabolite repressor protein 4 homolog 5 | e value: 5.6e-164 | % identity: 70.57
Query:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD
        MIE+SLDE   MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K +RS  RTSRK+EK RSS++DGHRRWVYS RD
Subjt:  MIEKSLDE--TMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARD

Query:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL
        CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYK VYKARTGEANDGCA+FWI+KL
Subjt:  CSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKL

Query:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------
        F LLHQE+IEFQS+GLRNNVAQLCV K                                                                         
Subjt:  FALLHQETIEFQSYGLRNNVAQLCVLK-------------------------------------------------------------------------

Query:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM
           LDIQLHDRRKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLSSAYYGVPGSCKTRD NGEPLVTSFHS FM
Subjt:  ---LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFM

Query:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE
        GTVDYIWHSEKLAPVRVLETLP+DAL RTGGLPNE
Subjt:  GTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNE

SwissProt top hits (e value, % identity, alignment)
A6H7I3 Protein angel homolog 2 | e value: 1.6e-11 | % identity: 35.2
Query:  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRF-NDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQ
        V SYNIL  +    +  LY       L WSFR   I   IK ++A +LCLQEV  D +  ++    ++ GY   YK RTG   DGCA+ +    F+LL  
Subjt:  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRF-NDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQ

Query:  ETIEFQSYGL----RNNVAQLCVLK
          +EF    +    R+NV  + +L+
Subjt:  ETIEFQSYGL----RNNVAQLCVLK

Q0WKY2 Carbon catabolite repressor protein 4 homolog 5 | e value: 2.8e-80 | % identity: 40.33
Query:  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGV
        ++   + + ++ N + K  R+    S T  P    TP   Q+  +    +  +SS R  R+ ++  SS  +  R WV+SA +     DK+++ SYN+LGV
Subjt:  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGV

Query:  ENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRN
        +NA  H DLY+ VP K L+WS RK LIC  I  YNA ILCLQEVDRF+DLD L +N G++GV+K+RTGEA+DGCA+FW + LF LL  + IEF  +G+RN
Subjt:  ENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRN

Query:  NVAQLCVLK-----------------------------------------------------------------------------------LDIQLHDR
        NVAQLCVL+                                                                                   LD QLHDR
Subjt:  NVAQLCVLK-----------------------------------------------------------------------------------LDIQLHDR

Query:  RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEK
        R+ISGQ +      +FR   A +  +++S S    WS EE+++A+G +  T +QH LKL+SAY GVPG+ +TRD  GEPL T++HS+F+GTVDYIWH+++
Subjt:  RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEK

Query:  LAPVRVLETLPVDALNRTGGLPNE
        L PVRVLETLP D L RTGGLP+E
Subjt:  LAPVRVLETLPVDALNRTGGLPNE

Q5VTE6 Protein angel homolog 2 | e value: 2.8e-11 | % identity: 38.1
Query:  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRFN-DLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQ
        V SYNIL  +    +  LY       L WSFR   I   IK ++A +LCLQEV  D +  ++    ++ GY   YK RTG   DGCA+ +    F+LL  
Subjt:  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRFN-DLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQ

Query:  ETIEF
          +EF
Subjt:  ETIEF

Q8VYU4 Carbon catabolite repressor protein 4 homolog 6 | e value: 1.6e-30 | % identity: 37.5
Query:  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKF
        P P +  +++     +S R  PR          S+   +R W Y+    S   +K +V SYNIL    A  H   LY  +P   L W +RK  +   +  
Subjt:  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKF

Query:  YNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKLDIQLHDR
        ++A I+CLQEVD+F DL+E  ++ GY  ++K RTG A DGCA+FW    F L+H+E+I+F   GLR+NVAQ+CVL+  +  H +
Subjt:  YNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKLDIQLHDR

Q8VYU4 Carbon catabolite repressor protein 4 homolog 6 | e value: 1.8e-18 | % identity: 40.94
Query:  QLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYI
        +LHD  +   Q     S    ++         ++ S S  W+  EI  A+G    T ++H L+L S Y  V G   TRD NGEP+VTS+H  FMGTVDYI
Subjt:  QLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYI

Query:  WHSEKLAPVRVLETLPVDALNRTGGLP
        W SE L  VRVL  +P  A+  T G P
Subjt:  WHSEKLAPVRVLETLPVDALNRTGGLP

Q9LS39 Carbon catabolite repressor protein 4 homolog 3 | e value: 6.4e-56 | % identity: 35.11
Query:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL
        SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ +++  V SYNILG  N+  H +LY  V   +L W +RK L
Subjt:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL

Query:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------
        IC  +   N  I+ +QEVD++ DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL VL+L                
Subjt:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------

Query:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW
               D++L                                                 HD++++SGQ +   +    +    G+K SN ++ S    W
Subjt:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW

Query:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD
        + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W+S+ L P RVL+TLP+D L +T GLP + +  D
Subjt:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD

Arabidopsis top hits (e value, % identity, alignment)
AT1G73875.1 DNAse I-like superfamily protein | e value: 2.0e-81 | % identity: 40.33
Query:  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGV
        ++   + + ++ N + K  R+    S T  P    TP   Q+  +    +  +SS R  R+ ++  SS  +  R WV+SA +     DK+++ SYN+LGV
Subjt:  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGV

Query:  ENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRN
        +NA  H DLY+ VP K L+WS RK LIC  I  YNA ILCLQEVDRF+DLD L +N G++GV+K+RTGEA+DGCA+FW + LF LL  + IEF  +G+RN
Subjt:  ENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRN

Query:  NVAQLCVLK-----------------------------------------------------------------------------------LDIQLHDR
        NVAQLCVL+                                                                                   LD QLHDR
Subjt:  NVAQLCVLK-----------------------------------------------------------------------------------LDIQLHDR

Query:  RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEK
        R+ISGQ +      +FR   A +  +++S S    WS EE+++A+G +  T +QH LKL+SAY GVPG+ +TRD  GEPL T++HS+F+GTVDYIWH+++
Subjt:  RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEK

Query:  LAPVRVLETLPVDALNRTGGLPNE
        L PVRVLETLP D L RTGGLP+E
Subjt:  LAPVRVLETLPVDALNRTGGLPNE

AT3G18500.1 DNAse I-like superfamily protein | e value: 7.3e-47 | % identity: 32.57
Query:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL
        SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ +++  V SYNILG  N+  H +LY  V   +L W +RK L
Subjt:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL

Query:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------
        IC  +   N  I+ +Q                       RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL VL+L                
Subjt:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------

Query:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW
               D++L                                                 HD++++SGQ +   +    +    G+K SN ++ S    W
Subjt:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW

Query:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD
        + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W+S+ L P RVL+TLP+D L +T GLP + +  D
Subjt:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD

AT3G18500.2 DNAse I-like superfamily protein | e value: 4.5e-57 | % identity: 35.11
Query:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL
        SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ +++  V SYNILG  N+  H +LY  V   +L W +RK L
Subjt:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL

Query:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------
        IC  +   N  I+ +QEVD++ DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL VL+L                
Subjt:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------

Query:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW
               D++L                                                 HD++++SGQ +   +    +    G+K SN ++ S    W
Subjt:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW

Query:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD
        + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W+S+ L P RVL+TLP+D L +T GLP + +  D
Subjt:  SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD

AT3G18500.3 DNAse I-like superfamily protein | e value: 3.1e-58 | % identity: 35.53
Query:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL
        SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ +++  V SYNILG  N+  H +LY  V   +L W +RK L
Subjt:  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKEL

Query:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------
        IC  +   N  I+ +QEVD++ DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL VL+L                
Subjt:  ICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKL----------------

Query:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSF--R
               D++L                                                 HD++++SGQ +   +    +    G+K SN    RSF   
Subjt:  -------DIQL-------------------------------------------------HDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSF--R

Query:  WSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD
        W+ EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W+S+ L P RVL+TLP+D L +T GLP + +  D
Subjt:  WSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKD

AT5G11350.1 DNAse I-like superfamily protein | e value: 1.1e-31 | % identity: 37.5
Query:  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKF
        P P +  +++     +S R  PR          S+   +R W Y+    S   +K +V SYNIL    A  H   LY  +P   L W +RK  +   +  
Subjt:  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKF

Query:  YNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKLDIQLHDR
        ++A I+CLQEVD+F DL+E  ++ GY  ++K RTG A DGCA+FW    F L+H+E+I+F   GLR+NVAQ+CVL+  +  H +
Subjt:  YNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKLDIQLHDR

AT5G11350.1 DNAse I-like superfamily protein | e value: 1.3e-19 | % identity: 40.94
Query:  QLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYI
        +LHD  +   Q     S    ++         ++ S S  W+  EI  A+G    T ++H L+L S Y  V G   TRD NGEP+VTS+H  FMGTVDYI
Subjt:  QLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYI

Query:  WHSEKLAPVRVLETLPVDALNRTGGLP
        W SE L  VRVL  +P  A+  T G P
Subjt:  WHSEKLAPVRVLETLPVDALNRTGGLP


Sequences
CDS sequence
ATGATCGAAAAAAGCCTCGATGAAACCATGGACATTCGCGCCAAAACAGAGAAGAATAAGCGCAAACCCTCGACGAACGCCGCACACCGCGCTCGCAACGATCACCGGAA
GAAGCGGCGGAGATTAGCATTCAGTTCAGAAACCACAATCCCAACACCTAGCCATCCTCAAAAGCTTGCCGAATCGAATAGCTTCAAGTCAATTCGTTCTTCCCCTCGAA
CTTCACGAAAGCACGAAAAAGGAAGGTCGAGTCAAACAGATGGTCATCGTCGATGGGTGTACTCTGCTCGTGATTGCTCGAGATTTATAGATAAGATTATGGTTGCTTCA
TATAACATACTAGGAGTGGAAAATGCATTGAAGCATCCAGATTTGTATCATAGAGTGCCTTCCAAATTCTTGGATTGGAGTTTCCGGAAAGAGCTTATATGCAATGCAAT
TAAATTTTACAATGCAGGCATCTTATGCTTGCAGGAGGTTGACCGTTTTAATGATTTAGATGAACTTTTCCAAAATTATGGCTACAAAGGTGTTTACAAGGCTAGAACTG
GTGAAGCAAATGATGGATGTGCTGTATTTTGGATCGACAAACTATTTGCCCTTTTGCATCAAGAAACTATAGAGTTCCAGAGTTATGGGCTACGTAACAATGTTGCTCAA
CTATGTGTTTTGAAGCTAGATATACAACTGCATGATCGCAGAAAGATTTCAGGGCAGCTTGATTTTTCATCATCACATGCAGCCTTCAGATTTTGTCGTGCGGGCACAAA
ATGTTCCAATGTTTCAGCCTCAAGGTCCTTCAGATGGAGCGATGAGGAAATAAGGATTGCATCTGGTAGCGAGCATGTTACCTGTCTTCAGCACCATTTAAAGCTTTCCA
GTGCGTACTATGGGGTTCCTGGAAGTTGTAAAACAAGAGATACTAATGGAGAACCTTTAGTAACTTCATTCCACTCCAAGTTTATGGGAACTGTTGATTATATATGGCAC
TCGGAAAAACTTGCCCCTGTTAGAGTTTTGGAAACATTGCCTGTTGATGCATTAAACAGGACTGGAGGACTTCCAAATGAGGTAATGGTAAAAGACAGTAAAGAGTACTG
TGGGAAAAAGCAGGAGCTGGCGTTGCCTGGTTTAGCTGTTACCCAGTCATTTCCAAATTGA
mRNA sequence
GCAAAGGCTAACTACGATTGCGGTGTAACTTCAGTCACCGCCTGTGACGTTGGACGATGATCGAAAAAAGCCTCGATGAAACCATGGACATTCGCGCCAAAACAGAGAAG
AATAAGCGCAAACCCTCGACGAACGCCGCACACCGCGCTCGCAACGATCACCGGAAGAAGCGGCGGAGATTAGCATTCAGTTCAGAAACCACAATCCCAACACCTAGCCA
TCCTCAAAAGCTTGCCGAATCGAATAGCTTCAAGTCAATTCGTTCTTCCCCTCGAACTTCACGAAAGCACGAAAAAGGAAGGTCGAGTCAAACAGATGGTCATCGTCGAT
GGGTGTACTCTGCTCGTGATTGCTCGAGATTTATAGATAAGATTATGGTTGCTTCATATAACATACTAGGAGTGGAAAATGCATTGAAGCATCCAGATTTGTATCATAGA
GTGCCTTCCAAATTCTTGGATTGGAGTTTCCGGAAAGAGCTTATATGCAATGCAATTAAATTTTACAATGCAGGCATCTTATGCTTGCAGGAGGTTGACCGTTTTAATGA
TTTAGATGAACTTTTCCAAAATTATGGCTACAAAGGTGTTTACAAGGCTAGAACTGGTGAAGCAAATGATGGATGTGCTGTATTTTGGATCGACAAACTATTTGCCCTTT
TGCATCAAGAAACTATAGAGTTCCAGAGTTATGGGCTACGTAACAATGTTGCTCAACTATGTGTTTTGAAGCTAGATATACAACTGCATGATCGCAGAAAGATTTCAGGG
CAGCTTGATTTTTCATCATCACATGCAGCCTTCAGATTTTGTCGTGCGGGCACAAAATGTTCCAATGTTTCAGCCTCAAGGTCCTTCAGATGGAGCGATGAGGAAATAAG
GATTGCATCTGGTAGCGAGCATGTTACCTGTCTTCAGCACCATTTAAAGCTTTCCAGTGCGTACTATGGGGTTCCTGGAAGTTGTAAAACAAGAGATACTAATGGAGAAC
CTTTAGTAACTTCATTCCACTCCAAGTTTATGGGAACTGTTGATTATATATGGCACTCGGAAAAACTTGCCCCTGTTAGAGTTTTGGAAACATTGCCTGTTGATGCATTA
AACAGGACTGGAGGACTTCCAAATGAGGTAATGGTAAAAGACAGTAAAGAGTACTGTGGGAAAAAGCAGGAGCTGGCGTTGCCTGGTTTAGCTGTTACCCAGTCATTTCC
AAATTGAGTTATGAAAGATTCGGTTCCCCTAATCTTGCGGTAATCTGTCCCTAACTTGTAGCTCCCACATGTCTGTGTTAGCTGTATTTATTATTAGGATAAAATGTTCA
ATTTCCATATACCAATTTTCACTCTTTAATTTTAATTTTTTCTAATTAGACCTTAGATTTAGGTAAAGGTTGTGATTCTCATGCTTGGTTCAATTTTCTGTTTATTCCCT
AATTTTAACC
Protein sequence
MIEKSLDETMDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVAS
YNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQ
LCVLKLDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWH
SEKLAPVRVLETLPVDALNRTGGLPNEVMVKDSKEYCGKKQELALPGLAVTQSFPN
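
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names and are not part of CuGenDB. Only the first nine codons of the CDS are used here; pasting in the full CDS should allow a comparison with the full protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence above.
print(translate("ATGATCGAAAAAAGCCTCGATGAAACC"))
```

Whitespace is stripped before translation so that the sequence can be pasted with its original line breaks.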