CuGenDBv2

Lag0005313 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005313
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:14091685..14096622
RNA-Seq Expression: Lag0005313
Synteny: Lag0005313
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR025724 - GAG-pre-integrase domain
    IPR036875 - Zinc finger, CCHC-type superfamily
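
As a minimal sketch, the genome location string in the record above can be split into chromosome, start, and end with a few lines of Python. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr6:14091685..14096622")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` line needs to change.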


Homology
GenBank top hits | e value | % identity | Alignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] | e value: 4.5e-135 | % identity: 50.5
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF+LW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AY TI+L+L+D VLR V +  +  ELW KL+SLY++KS  N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y  DSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DYI SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 7.4e-138 | % identity: 50.89
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDFSLW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AYSTI+L+L+D VLR V +  +  ELW KL+SLY++KS +N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DY+ SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

KAA0067607.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] | e value: 7.7e-127 | % identity: 49
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF++W+KK++ +LVQHKVAK ++D  KLP  +T+ EK+DM E+ YSTI+L+L+D VLR V +  +  ELW KL+SLY++K   N   
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
         KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVS-----ANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMS
         K +  K   ++SRS  KG  ++     C  CHK GH K +CP+   K +     E NV+     A + +GYESAEVLMVS  +    WIMDSGCTFHM+
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVS-----ANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMS

Query:  PHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLM
        PH+ F T  ++ DGGK ++G+N  CD+K  GSV+    D   +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L++GLYVL 
Subjt:  PHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLM

Query:  GSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGAR
        G++VSG     S K++ +  LWH R+ H+S++GL  L +Q L+   K  +  FC+HC++GK TR+ F    H T+G +DY+ SDLWGP+K   +GG+R
Subjt:  GSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGAR

RVW99173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 2.5e-109 | % identity: 45.14
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        +AKF++++F GK DF LW+ KM+ALLVQ  +  AL+    LP TM + +K  + E A+S IIL L D VLR V+  ES +E+W KL+SLYM+KS  NR++
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
         K +L++FKM     +EE++D FN+I +DL N    +  E++A++LL SL   Y  +K  I Y RDSLT D                             
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV--SANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHK
         +G+ SK     SRS  K      +K  C  CHKEGH K DCP  ++     IKK  N   +A + +GY+SAEVL V+  +  KEWI+DSGC+FHM P K
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV--SANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHK

Query:  HFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLMGSS
         +F   KE DGG  ++GNN+ C I G G+V+ +  DG  ++L  VRY+P LKRNLISLG LDKSG+++KSE  +L V + S+  MK  +KNGLY L+G +
Subjt:  HFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLMGSS

Query:  VSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGA
        V  + + V  +    T+LWH+R+ H+S KGL  L KQ ++GN+K +  PFC+HCV GKATR+ F+ A H T+ ++DYI SDLWGPS+V S+GGA
Subjt:  VSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGA

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 4.8e-137 | % identity: 50.69
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF+LW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AYSTI+L+L+D VLR V +  +  ELW KL+SLY++KS  N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DY+ SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

TrEMBL top hits | e value | % identity | Alignment
A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.2e-109 | % identity: 45.14
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        +AKF++++F GK DF LW+ KM+ALLVQ  +  AL+    LP TM + +K  + E A+S IIL L D VLR V+  ES +E+W KL+SLYM+KS  NR++
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
         K +L++FKM     +EE++D FN+I +DL N    +  E++A++LL SL   Y  +K  I Y RDSLT D                             
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV--SANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHK
         +G+ SK     SRS  K      +K  C  CHKEGH K DCP  ++     IKK  N   +A + +GY+SAEVL V+  +  KEWI+DSGC+FHM P K
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV--SANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHK

Query:  HFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLMGSS
         +F   KE DGG  ++GNN+ C I G G+V+ +  DG  ++L  VRY+P LKRNLISLG LDKSG+++KSE  +L V + S+  MK  +KNGLY L+G +
Subjt:  HFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLMGSS

Query:  VSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGA
        V  + + V  +    T+LWH+R+ H+S KGL  L KQ ++GN+K +  PFC+HCV GKATR+ F+ A H T+ ++DYI SDLWGPS+V S+GGA
Subjt:  VSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGA

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | e value: 2.2e-135 | % identity: 50.5
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF+LW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AY TI+L+L+D VLR V +  +  ELW KL+SLY++KS  N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y  DSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DYI SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

A0A5A7UB25 Putative gag-pol polyprotein | e value: 3.6e-138 | % identity: 50.89
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDFSLW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AYSTI+L+L+D VLR V +  +  ELW KL+SLY++KS +N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DY+ SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass | e value: 3.7e-127 | % identity: 49
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF++W+KK++ +LVQHKVAK ++D  KLP  +T+ EK+DM E+ YSTI+L+L+D VLR V +  +  ELW KL+SLY++K   N   
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
         KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVS-----ANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMS
         K +  K   ++SRS  KG  ++     C  CHK GH K +CP+   K +     E NV+     A + +GYESAEVLMVS  +    WIMDSGCTFHM+
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVS-----ANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMS

Query:  PHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLM
        PH+ F T  ++ DGGK ++G+N  CD+K  GSV+    D   +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L++GLYVL 
Subjt:  PHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGLYVLM

Query:  GSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGAR
        G++VSG     S K++ +  LWH R+ H+S++GL  L +Q L+   K  +  FC+HC++GK TR+ F    H T+G +DY+ SDLWGP+K   +GG+R
Subjt:  GSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGAR

A0A5D3DNU1 Putative gag-pol polyprotein | e value: 2.3e-137 | % identity: 50.69
Query:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY
        S +FE+ KF+G GDF+LW+KK++A+LVQHKVAK ++D  +LP  +T+ EK+DM E+AYSTI+L+L+D VLR V +  +  ELW KL+SLY++KS  N+IY
Subjt:  SAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIY

Query:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ
        +KE+ F +KMD SK LEEN+D+F +I +DL N GE++  ENQAVILLNSLPE Y+EVK+ I Y RDSLT+ IVLD+L+++ LE++ E+K    L  + R 
Subjt:  LKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQ

Query:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS
         K +  K   ++ RS  KG  ++     C  CHKEGH K +CP+   K +     E NV     SA + +       GYESAEVLMVS  +    WIMDS
Subjt:  LKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNV-----SANVFE-------GYESAEVLMVSGSNDIKEWIMDS

Query:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK
        GCTFHM+PH+ F T  ++ DGGK ++G+N  CD+KG GSV+    DG  +IL++VRYVP LKRNLISLG LD+SG + KSE G + V K S+VK++G L+
Subjt:  GCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLK

Query:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS
        +GLYVL G++VSG     S K++ ++ LWHKR+ H+S++GL  L +Q L+G  K  + PFC+HC++GK+TR+ F    H T+G +DY+ SDLWGP+K  S
Subjt:  NGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKS

Query:  LGGAR
        +GG+R
Subjt:  LGGAR

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | e value: 1.4e-25 | % identity: 25.54
Query:  AKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIYL
        AK  +  FDG+  +++WK +++ALL +  V K +     L P   DD  +     A STII +L+D+ L   +   +  ++   LD++Y  KS  +++ L
Subjt:  AKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIYL

Query:  KERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTI-NYDRDSLTLDIVLDSLRSKELELRIEKKKTT------AL
        ++RL S K+     L  +   F+E+  +L+ +G +++  ++   LL +LP  Y  + + I     ++LTL  V + L  +E++++ +   T+       +
Subjt:  KERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTI-NYDRDSLTLDIVLDSLRSKELELRIEKKKTT------AL

Query:  YTKNRQLKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKR--KGKLPIKKENNVSANVFEGYESAEVLMVSGSNDIKE---WIMDSG
        +  N   K    K      +   KGN K K K  C++C +EGH+K DC   KR    K    KEN         +  A ++    +  + +   +++DSG
Subjt:  YTKNRQLKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKR--KGKLPIKKENNVSANVFEGYESAEVLMVSGSNDIKE---WIMDSG

Query:  CTFHMSPHKHFFT---------KIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDS-
         + H+   +  +T         KI     G+ +    +       G V+ + D      L  V +      NL+S+  L ++G S + +   + + K+  
Subjt:  CTFHMSPHKHFFT---------KIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDS-

Query:  -VVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQ-KGLDILYK-----QDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGK
         VVK  G+L N         ++ +  +++ K     +LWH+R  HIS  K L+I  K     Q L+ N + S    C+ C+ GK  RL F     +T  K
Subjt:  -VVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQ-KGLDILYK-----QDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGK

Query:  --VDYILSDLWGP
          +  + SD+ GP
Subjt:  --VDYILSDLWGP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.6e-72 | % identity: 34.6
Query:  KFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIYLK
        K+E+ KF+G   FS W+++M+ LL+Q  + K L   SK P TM  ++  D+ E A S I L L+D+V+  + D ++   +W +L+SLYMSK+  N++YLK
Subjt:  KFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIYLK

Query:  ERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQLK
        ++L++  M        +++ FN +   L N G +++ E++A++LLNSLP  Y  + +TI + + ++ L  V  +L   E   +  + +  AL T+ R   
Subjt:  ERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNRQLK

Query:  GQQSKQNRQNSRSNQKGNQKEKEKM-TCNYCHKEGHLKWDCPILKRKGKLPI---KKENNVSANVFEG-------YESAEVLMVSGSNDIKEWIMDSGCT
         Q+S  N   S +  K   + K ++  C  C++ GH K DCP   RKGK      K ++N +A V           E  E + +SG     EW++D+  +
Subjt:  GQQSKQNRQNSRSNQKGNQKEKEKM-TCNYCHKEGHLKWDCPILKRKGKLPI---KKENNVSANVFEG-------YESAEVLMVSGSNDIKEWIMDSGCT

Query:  FHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGL
         H +P +  F +    D G   MGN     I GIG +  + + G   +L  VR+VP L+ NLIS   LD+ G+          + K S+V  KGV +  L
Subjt:  FHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNGL

Query:  YVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGG
        Y        GELNA  D+IS    LWHKRM H+S+KGL IL K+ LI   K +    CD+C+ GK  R+SF ++S R    +D + SD+ GP +++S+GG
Subjt:  YVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGG

P93293 Uncharacterized mitochondrial protein AtMg00300 | e value: 7.2e-19 | % identity: 38.79
Query:  GNLMVCKDSVVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTE
        G L V K     +KG   + LY+L GS  +GE N +++     T+LWH R+ H+SQ+G+++L K+  + + K S   FC+ C+ GK  R++FS+  H T+
Subjt:  GNLMVCKDSVVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTE

Query:  GKVDYILSDLWGPSKV
          +DY+ SDLWG   V
Subjt:  GKVDYILSDLWGPSKV

Arabidopsis top hits | e value | % identity | Alignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein | e value: 1.8e-04 | % identity: 20.9
Query:  DFSLWKKKMKALLVQHKVAKALM-----DPSKLPPTMTDDEKQDMAE---------IAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRI
        D+ +W    K+ L++  +   ++     DPSK P      + +++++          A   +   L D+V R+     S  ++W   D L     Q    
Subjt:  DFSLWKKKMKALLVQHKVAKALM-----DPSKLPPTMTDDEKQDMAE---------IAYSTIILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRI

Query:  YLKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNR
         L++          + LE+ ++D     +D  +    LD   +A+ +L  L     E            TL    D L S   EL    K T+    +  
Subjt:  YLKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTLDIVLDSLRSKELELRIEKKKTTALYTKNR

Query:  QLKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVSANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHKH
          +  +S          +    K K +  C  C+K  H + DC     K ++   KE      V +        + + + D   WI+      +M+P+  
Subjt:  QLKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVSANVFEGYESAEVLMVSGSNDIKEWIMDSGCTFHMSPHKH

Query:  FFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKS-EGGNLMVC
        +FT +           +     ++G G VK +M +G  K + +V +VPGL RN++S G +    +S  +   G  +VC
Subjt:  FFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKS-EGGNLMVC

ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 5.1e-20 | % identity: 38.79
Query:  GNLMVCKDSVVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTE
        G L V K     +KG   + LY+L GS  +GE N +++     T+LWH R+ H+SQ+G+++L K+  + + K S   FC+ C+ GK  R++FS+  H T+
Subjt:  GNLMVCKDSVVKMKGVLKNGLYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTE

Query:  GKVDYILSDLWGPSKV
          +DY+ SDLWG   V
Subjt:  GKVDYILSDLWGPSKV
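
The middle line of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal Python sketch for tallying these from a midline string follows. The helper name `midline_stats` is hypothetical, and the input is assumed to have its eight-character left margin already removed and any trailing spaces preserved.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style midline string.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline for the first nine columns of the first GenBank alignment.
identities, positives, length = midline_stats("S +FE+ KF")
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

Summing these counts over every block of one hit gives an identity fraction for the displayed region only, which need not equal the % identity reported in the hit tables if the page shows a partial alignment or trims trailing whitespace.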


Sequences
CDS sequence
ATGATGAATCCCGACCGGGATTTCTTGGTCAAACGACCGGGATTTCTTGAATTCGACCCGGAAAAAAACCGGTTTTCCCTTGCGGGAAAACTAGTTTTTCCGGGTTTCGA
TCCCGACCGAGAAGTGACCGGGAAGTTTCGTCTTCTCCGGCACCGATCGTTCCTTCTACTCCGAAAACCCCTCCATAAAACCCGCAAACTCCGTGAAGACGCCGGAGAAG
AAGGGAGAAGAAGACGCTTGAAGACGCTCGAAGAAGTCGCTGGAAAAGAAGCCGTCGGAGAAGAAGGGAGAAGAAGACGCTTGAAGAAGTCGCCGGAACCGGAGCCGTCG
CCGGAAAAGAAAGACGAACGGAGAAGAAGACGCCGGACGGAGAAGAAGGTTGTTCGAATAGAATGGAGGAGAGGAAAGAGTTTTACTGAATGGGGCCCAGCCGCCGCCGT
CATCTCTCTTCGTGTGCTTTCTCTCTCTCCGTTTCCGAGTGAGAAGCCCGCCGCCGCCGTTCCTCCTTCTTCTCCCTTCGCGCCGCCGTCGGGTTTTGCAGGAAAACAAG
GACCCACGCACCGCCGCCGCCCAGCCCCTTGCGCAATCCCTCTCTCTCCGCGCGTTCTTCTCTGTCCCGCCACCTGGGAATCGTTTGGGTTTGAGTTTGCTTTAGCTTGG
AGACCCACTGCCCAAGAACGTTTTGGGTTGCTGTCCGTGGTTCGCAAGGCCCGTTTCAGTGTGGTTTCGGCTCCGTTTGGGAGCTTTCTTAGACTGCTCTGGATCGCTGA
AGTTGCTTTAGGAGCGCTTAGTTTTGTTGACAAATCAATGTCGGCTAAATTTGAATTAGACAAGTTTGATGGCAAAGGAGATTTCAGCCTCTGGAAGAAGAAGATGAAGG
CGTTACTTGTTCAACATAAAGTTGCAAAAGCACTAATGGATCCAAGTAAGTTGCCTCCAACTATGACTGATGATGAAAAACAAGATATGGCTGAAATCGCATACAGTACC
ATCATTTTATTCTTAGCTGACAATGTCTTGAGGAGAGTTAGCGATGTTGAAAGTGTGTCTGAACTTTGGGCGAAATTAGATTCACTCTATATGTCAAAATCTCAAATGAA
TAGGATCTATTTGAAAGAGAGACTCTTTAGTTTTAAAATGGATGTTTCGAAAGGGCTTGAAGAGAACATAGATGATTTCAATGAGATTTGTATCGACTTAGTGAATTCAG
GAGAACAGTTAGATACAGAGAATCAAGCTGTGATTCTTCTGAACTCATTGCCCGAAAAATATAAGGAGGTTAAGTCAACCATAAACTATGACAGAGACTCCCTAACATTA
GATATAGTCTTAGACTCCTTACGATCCAAAGAGCTGGAGCTTAGAATTGAAAAGAAAAAAACAACAGCCCTGTATACTAAGAACAGACAATTGAAAGGACAGCAAAGCAA
ACAAAACAGACAAAATTCTAGATCAAATCAAAAGGGGAATCAAAAGGAAAAAGAAAAAATGACATGCAATTATTGCCACAAAGAAGGGCATTTGAAATGGGATTGCCCAA
TTCTGAAAAGAAAAGGTAAACTTCCAATTAAGAAAGAAAATAACGTTTCAGCAAACGTTTTTGAGGGTTATGAATCAGCTGAAGTCCTCATGGTAAGTGGATCAAATGAC
ATAAAAGAGTGGATAATGGATTCAGGGTGCACCTTTCACATGTCACCACACAAACATTTCTTCACCAAAATAAAGGAATTTGATGGAGGAAAGGCTGTCATGGGCAACAA
CCAGCAATGTGATATTAAAGGCATTGGCTCTGTTAAATTTCAAATGGATGATGGGTCTTTTAAAATTCTATCGTCCGTCAGATATGTACCAGGTTTGAAAAGGAATTTGA
TTTCTTTGGGGACTTTAGATAAATCTGGATTTAGTTACAAATCAGAAGGTGGAAATCTCATGGTTTGTAAAGACTCAGTAGTTAAAATGAAAGGAGTTCTAAAAAATGGT
CTGTATGTTCTTATGGGAAGCTCAGTTTCAGGGGAACTAAATGCTGTTTCAGATAAGATCAGTAAGGTTACTCAATTATGGCACAAGAGAATGTGTCACATTAGTCAGAA
AGGCCTTGATATTTTGTATAAGCAGGATCTTATTGGCAATCACAAGCCTTCACAAAGGCCTTTCTGCGATCATTGTGTTTTGGGAAAAGCCACTAGACTAAGCTTCAGTA
GTGCCTCACATAGAACAGAAGGAAAGGTTGACTACATACTTTCGGACCTTTGGGGTCCCTCAAAAGTCAAATCACTTGGTGGTGCACGGTAA
mRNA sequence
ATGATGAATCCCGACCGGGATTTCTTGGTCAAACGACCGGGATTTCTTGAATTCGACCCGGAAAAAAACCGGTTTTCCCTTGCGGGAAAACTAGTTTTTCCGGGTTTCGA
TCCCGACCGAGAAGTGACCGGGAAGTTTCGTCTTCTCCGGCACCGATCGTTCCTTCTACTCCGAAAACCCCTCCATAAAACCCGCAAACTCCGTGAAGACGCCGGAGAAG
AAGGGAGAAGAAGACGCTTGAAGACGCTCGAAGAAGTCGCTGGAAAAGAAGCCGTCGGAGAAGAAGGGAGAAGAAGACGCTTGAAGAAGTCGCCGGAACCGGAGCCGTCG
CCGGAAAAGAAAGACGAACGGAGAAGAAGACGCCGGACGGAGAAGAAGGTTGTTCGAATAGAATGGAGGAGAGGAAAGAGTTTTACTGAATGGGGCCCAGCCGCCGCCGT
CATCTCTCTTCGTGTGCTTTCTCTCTCTCCGTTTCCGAGTGAGAAGCCCGCCGCCGCCGTTCCTCCTTCTTCTCCCTTCGCGCCGCCGTCGGGTTTTGCAGGAAAACAAG
GACCCACGCACCGCCGCCGCCCAGCCCCTTGCGCAATCCCTCTCTCTCCGCGCGTTCTTCTCTGTCCCGCCACCTGGGAATCGTTTGGGTTTGAGTTTGCTTTAGCTTGG
AGACCCACTGCCCAAGAACGTTTTGGGTTGCTGTCCGTGGTTCGCAAGGCCCGTTTCAGTGTGGTTTCGGCTCCGTTTGGGAGCTTTCTTAGACTGCTCTGGATCGCTGA
AGTTGCTTTAGGAGCGCTTAGTTTTGTTGACAAATCAATGTCGGCTAAATTTGAATTAGACAAGTTTGATGGCAAAGGAGATTTCAGCCTCTGGAAGAAGAAGATGAAGG
CGTTACTTGTTCAACATAAAGTTGCAAAAGCACTAATGGATCCAAGTAAGTTGCCTCCAACTATGACTGATGATGAAAAACAAGATATGGCTGAAATCGCATACAGTACC
ATCATTTTATTCTTAGCTGACAATGTCTTGAGGAGAGTTAGCGATGTTGAAAGTGTGTCTGAACTTTGGGCGAAATTAGATTCACTCTATATGTCAAAATCTCAAATGAA
TAGGATCTATTTGAAAGAGAGACTCTTTAGTTTTAAAATGGATGTTTCGAAAGGGCTTGAAGAGAACATAGATGATTTCAATGAGATTTGTATCGACTTAGTGAATTCAG
GAGAACAGTTAGATACAGAGAATCAAGCTGTGATTCTTCTGAACTCATTGCCCGAAAAATATAAGGAGGTTAAGTCAACCATAAACTATGACAGAGACTCCCTAACATTA
GATATAGTCTTAGACTCCTTACGATCCAAAGAGCTGGAGCTTAGAATTGAAAAGAAAAAAACAACAGCCCTGTATACTAAGAACAGACAATTGAAAGGACAGCAAAGCAA
ACAAAACAGACAAAATTCTAGATCAAATCAAAAGGGGAATCAAAAGGAAAAAGAAAAAATGACATGCAATTATTGCCACAAAGAAGGGCATTTGAAATGGGATTGCCCAA
TTCTGAAAAGAAAAGGTAAACTTCCAATTAAGAAAGAAAATAACGTTTCAGCAAACGTTTTTGAGGGTTATGAATCAGCTGAAGTCCTCATGGTAAGTGGATCAAATGAC
ATAAAAGAGTGGATAATGGATTCAGGGTGCACCTTTCACATGTCACCACACAAACATTTCTTCACCAAAATAAAGGAATTTGATGGAGGAAAGGCTGTCATGGGCAACAA
CCAGCAATGTGATATTAAAGGCATTGGCTCTGTTAAATTTCAAATGGATGATGGGTCTTTTAAAATTCTATCGTCCGTCAGATATGTACCAGGTTTGAAAAGGAATTTGA
TTTCTTTGGGGACTTTAGATAAATCTGGATTTAGTTACAAATCAGAAGGTGGAAATCTCATGGTTTGTAAAGACTCAGTAGTTAAAATGAAAGGAGTTCTAAAAAATGGT
CTGTATGTTCTTATGGGAAGCTCAGTTTCAGGGGAACTAAATGCTGTTTCAGATAAGATCAGTAAGGTTACTCAATTATGGCACAAGAGAATGTGTCACATTAGTCAGAA
AGGCCTTGATATTTTGTATAAGCAGGATCTTATTGGCAATCACAAGCCTTCACAAAGGCCTTTCTGCGATCATTGTGTTTTGGGAAAAGCCACTAGACTAAGCTTCAGTA
GTGCCTCACATAGAACAGAAGGAAAGGTTGACTACATACTTTCGGACCTTTGGGGTCCCTCAAAAGTCAAATCACTTGGTGGTGCACGGTAA
Protein sequence
MMNPDRDFLVKRPGFLEFDPEKNRFSLAGKLVFPGFDPDREVTGKFRLLRHRSFLLLRKPLHKTRKLREDAGEEGRRRRLKTLEEVAGKEAVGEEGRRRRLKKSPEPEPS
PEKKDERRRRRRTEKKVVRIEWRRGKSFTEWGPAAAVISLRVLSLSPFPSEKPAAAVPPSSPFAPPSGFAGKQGPTHRRRPAPCAIPLSPRVLLCPATWESFGFEFALAW
RPTAQERFGLLSVVRKARFSVVSAPFGSFLRLLWIAEVALGALSFVDKSMSAKFELDKFDGKGDFSLWKKKMKALLVQHKVAKALMDPSKLPPTMTDDEKQDMAEIAYST
IILFLADNVLRRVSDVESVSELWAKLDSLYMSKSQMNRIYLKERLFSFKMDVSKGLEENIDDFNEICIDLVNSGEQLDTENQAVILLNSLPEKYKEVKSTINYDRDSLTL
DIVLDSLRSKELELRIEKKKTTALYTKNRQLKGQQSKQNRQNSRSNQKGNQKEKEKMTCNYCHKEGHLKWDCPILKRKGKLPIKKENNVSANVFEGYESAEVLMVSGSND
IKEWIMDSGCTFHMSPHKHFFTKIKEFDGGKAVMGNNQQCDIKGIGSVKFQMDDGSFKILSSVRYVPGLKRNLISLGTLDKSGFSYKSEGGNLMVCKDSVVKMKGVLKNG
LYVLMGSSVSGELNAVSDKISKVTQLWHKRMCHISQKGLDILYKQDLIGNHKPSQRPFCDHCVLGKATRLSFSSASHRTEGKVDYILSDLWGPSKVKSLGGAR
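
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly, and the names `CODON_TABLE` and `translate` are illustrative rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nucleotides and last 15 nucleotides of the CDS listed above.
print(translate("ATGATGAATCCCGACCGGGATTTC"))
print(translate("GGTGGTGCACGGTAA"))
```

To translate the whole record, join the CDS lines into one string (with newlines removed) before passing it to `translate`.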