; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G05860 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G05860
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description30S ribosomal protein S17
Genome locationClcChr07:9216803..9234839
RNA-Seq ExpressionClc07G05860
SyntenyClc07G05860
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000266 - Ribosomal protein S17/S11
IPR012337 - Ribonuclease H-like superfamily
IPR012340 - Nucleic acid-binding, OB-fold
IPR019984 - 30S ribosomal protein S17
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026280.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]6.2e-4430.86Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLL--------------------------KARVGDLFQEYLERGALNPDPTDS
        +I D+  +CKRF+EGLR+EI +PV A  +W DF+KLV AA+   RV K L                          K R G        RG        S
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLL--------------------------KARVGDLFQEYLERGALNPDPTDS

Query:  HFPRL--------------------GQEVMCKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGV-EQRVISQTVSQPKLEATGGEGSGGVKQK
         F +                     G  +   DRV    V    +  V        H+RRDCP L+ G  +   R   +  +  + EA            
Subjt:  HFPRL--------------------GQEVMCKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGV-EQRVISQTVSQPKLEATGGEGSGGVKQK

Query:  GPARRPRSKLELLTDEMLVHTPVGNVVIIDHVYQECEKEDSFY---------FLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPE
              +  + L  D +  H    N    + V++   K +  +         ++I  +KA  +L KGC A+LAHV++ +++KLK E++ VV E+ DVFPE
Subjt:  GPARRPRSKLELLTDEMLVHTPVGNVVIIDHVYQECEKEDSFY---------FLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPE

Query:  ELSRLPSNREVEFTIDLIPDTTPVSQTPY-----------------------------------------------------------------------
        ELS LP +RE+EF+IDL     P+SQ PY                                                                       
Subjt:  ELSRLPSNREVEFTIDLIPDTTPVSQTPY-----------------------------------------------------------------------

Query:  -------PGYYQWFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV--KKGRSGSHKPDPCQSH
                GYY+ FV+ FSK+ALPL  LTKKNVKFE ++  ERSFQELKKRLV+APVLT+   G EFE+YCDAS QGLGCV  +KG+  ++     + H
Subjt:  -------PGYYQWFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV--KKGRSGSHKPDPCQSH

KAA0037490.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]1.3e-4630.69Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR
        I+A E++  +RF+ GLR EI +PV    +WT+F++LVE A+R  +     K+ V       L RG      T S F      R   E+    R      +
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR

Query:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT
        ++   +  +     GHF++DCPQL      +QRV SQTV Q ++     EG+ G +QKG   RPR +                             ++L 
Subjt:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT

Query:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT
        D    H+ V ++ +  ++ + +   K  + Y  +  +   +++++GC  FLAH+V V+  KLKPE++LVV E+LDVF ++LS LP +RE+EFTI+L+P+T
Subjt:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT

Query:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------
         P+SQ PY                                                GY+Q                                        
Subjt:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------

Query:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
                   F++ FS++ALPL  LT+KN KFE S+  E+SFQELKKRLV+AP+L + V GK++ +YCDASR GLGCV
Subjt:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

KAA0037490.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]2.3e-1474.19Show/hide
Query:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT
        V+IVSDRDPRFT KF  +LQ+A  T L+FST+FHPQTDGQSE TIQTLED LRAC LQLK +
Subjt:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT

KAA0037490.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]1.3e-4630.69Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR
        I+A E++  +RF+ GLR EI +PV    +WT+F++LVE A+R  +     K+ V       L RG      T S F      R   E+    R      +
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR

Query:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT
        ++   +  +     GHF++DCPQL      +QRV SQTV Q ++     EG+ G +QKG   RPR +                             ++L 
Subjt:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT

Query:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT
        D    H+ V ++ +  ++ + +   K  + Y  +  +   +++++GC  FLAH+V V+  KLKPE++LVV E+LDVF ++LS LP +RE+EFTI+L+P+T
Subjt:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT

Query:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------
         P+SQ PY                                                GY+Q                                        
Subjt:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------

Query:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
                   F++ FS++ALPL  LT+KN KFE S+  E+SFQELKKRLV+AP+L + V GK++ +YCDASR GLGCV
Subjt:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

KAA0037766.1 putative polyprotein [Cucumis melo var. makuwa]3.1e-4323.99Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAM----------------RGCRVSKLLKARVGDLF---------QEYLERG---ALNPDPT
        I+A E+++C+RF  GLR EI +PV A  +WT+F++LVE A+                RG       + R    F         Q++  R     L     
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAM----------------RGCRVSKLLKARVGDLF---------QEYLERG---ALNPDPT

Query:  DSHFPRLGQEVMCKDRVKGDLVRHQRRLKVPKQGNLTG--------------------------------HFRRDCPQLMSGSGVEQRVISQTVSQPKLE
         S F R GQ      R+    +R   RL+ P Q ++                                  HF++DCPQL     ++Q V SQTV Q ++ 
Subjt:  DSHFPRLGQEVMCKDRVKGDLVRHQRRLKVPKQGNLTG--------------------------------HFRRDCPQLMSGSGVEQRVISQTVSQPKLE

Query:  ATGGEGSGGVKQKGPARRPRSK------------LELLTDEMLVHTPVGNVVIIDHVYQECE--------------------------------------
            EG+   +QKG   RPR +            LE L++E+ ++TPVG+V++++ V + CE                                      
Subjt:  ATGGEGSGGVKQKGPARRPRSK------------LELLTDEMLVHTPVGNVVIIDHVYQECE--------------------------------------

Query:  --KEDSF------------------YFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDTTPVSQ
          KE  F                    LIS +KA  +L KGC AFLAH+V V+  KLKPE++ +V E+LDVF  +LS LP +RE+EFTI+L+P+  P+SQ
Subjt:  --KEDSF------------------YFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDTTPVSQ

Query:  TPY-------------------------------------------------------------------------------------------------
         PY                                                                                                 
Subjt:  TPY-------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------PGYYQ
                                                                                                        GYY+
Subjt:  -----------------------------------------------------------------------------------------------PGYYQ

Query:  WFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLG---------------------------------
         F++ FS++A PL TLT+KNVKFE SN  E+SFQELKKRLV+ P+L + V GK++ +YCDASR GL                                  
Subjt:  WFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLG---------------------------------

Query:  ---------------------------------------------------------------CVKKGR------------------SGSHKPDPCQ---
                                                                        VK+GR                  S ++   P +   
Subjt:  ---------------------------------------------------------------CVKKGR------------------SGSHKPDPCQ---

Query:  ----------------------------SHSEEDLLEK-----------TTVTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLE
                                    S   +D L +             V+IVSD DPRFTSKF  +LQ+A  T L+FST+FHPQTDGQSE TIQTLE
Subjt:  ----------------------------SHSEEDLLEK-----------TTVTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLE

Query:  DTLRACALQLKAT
        D LRAC LQLK +
Subjt:  DTLRACALQLKAT

TYK01965.1 hypothetical protein E5676_scaffold808G00660 [Cucumis melo var. makuwa]2.3e-1474.19Show/hide
Query:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT
        V+IVSDRDPRFT KF  +LQ+A  T L+FST+FHPQTDGQSE TIQTLED LRAC LQLK +
Subjt:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT

TYK01965.1 hypothetical protein E5676_scaffold808G00660 [Cucumis melo var. makuwa]3.6e-4431.66Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHFPRLGQEVMCKDRVKGDLVRHQRRL
        I+A E+++C+RF+ G   EI +P+ A  +WT+F++LV+ A+R      + ++  G+     L RG            R      C       LV     L
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHFPRLGQEVMCKDRVKGDLVRHQRRL

Query:  KVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPR--SKLELLTDEMLVHTPVGNVVIIDHVYQ-----------
         V  Q    GHF++DCPQL      +Q V SQTV Q ++     EG+ G +QKG   RPR   K+  +T +    +P G V    H++            
Subjt:  KVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPR--SKLELLTDEMLVHTPVGNVVIIDHVYQ-----------

Query:  ---ECE----------------------------------KEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLP
            C                                   ++  F  LIS +KA  +L KGC   LAH+V V+  KLKP+++ VV E+ +VF ++LS LP
Subjt:  ---ECE----------------------------------KEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLP

Query:  SNREVEFTIDLIPDTTPVSQTPYP---------------------------------------------GYYQW----------------FVQGFSKIAL
         +RE+EFTI+L+P T P+SQ PY                                              GY+Q                 F++ FS++AL
Subjt:  SNREVEFTIDLIPDTTPVSQTPYP---------------------------------------------GYYQW----------------FVQGFSKIAL

Query:  PLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
        PL  LT+KN KFE S+   + FQELKKRLV+AP+L + V GK++ +YCDASR GLG V
Subjt:  PLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

TrEMBL top hitse value%identityAlignment
A0A5A7SJ99 DNA/RNA polymerases superfamily protein3.0e-4430.86Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLL--------------------------KARVGDLFQEYLERGALNPDPTDS
        +I D+  +CKRF+EGLR+EI +PV A  +W DF+KLV AA+   RV K L                          K R G        RG        S
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLL--------------------------KARVGDLFQEYLERGALNPDPTDS

Query:  HFPRL--------------------GQEVMCKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGV-EQRVISQTVSQPKLEATGGEGSGGVKQK
         F +                     G  +   DRV    V    +  V        H+RRDCP L+ G  +   R   +  +  + EA            
Subjt:  HFPRL--------------------GQEVMCKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGV-EQRVISQTVSQPKLEATGGEGSGGVKQK

Query:  GPARRPRSKLELLTDEMLVHTPVGNVVIIDHVYQECEKEDSFY---------FLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPE
              +  + L  D +  H    N    + V++   K +  +         ++I  +KA  +L KGC A+LAHV++ +++KLK E++ VV E+ DVFPE
Subjt:  GPARRPRSKLELLTDEMLVHTPVGNVVIIDHVYQECEKEDSFY---------FLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPE

Query:  ELSRLPSNREVEFTIDLIPDTTPVSQTPY-----------------------------------------------------------------------
        ELS LP +RE+EF+IDL     P+SQ PY                                                                       
Subjt:  ELSRLPSNREVEFTIDLIPDTTPVSQTPY-----------------------------------------------------------------------

Query:  -------PGYYQWFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV--KKGRSGSHKPDPCQSH
                GYY+ FV+ FSK+ALPL  LTKKNVKFE ++  ERSFQELKKRLV+APVLT+   G EFE+YCDAS QGLGCV  +KG+  ++     + H
Subjt:  -------PGYYQWFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV--KKGRSGSHKPDPCQSH

A0A5A7T292 Retrotransposon protein, putative, Ty3-gypsy subclass6.4e-4730.69Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR
        I+A E++  +RF+ GLR EI +PV    +WT+F++LVE A+R  +     K+ V       L RG      T S F      R   E+    R      +
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR

Query:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT
        ++   +  +     GHF++DCPQL      +QRV SQTV Q ++     EG+ G +QKG   RPR +                             ++L 
Subjt:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT

Query:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT
        D    H+ V ++ +  ++ + +   K  + Y  +  +   +++++GC  FLAH+V V+  KLKPE++LVV E+LDVF ++LS LP +RE+EFTI+L+P+T
Subjt:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT

Query:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------
         P+SQ PY                                                GY+Q                                        
Subjt:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------

Query:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
                   F++ FS++ALPL  LT+KN KFE S+  E+SFQELKKRLV+AP+L + V GK++ +YCDASR GLGCV
Subjt:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

A0A5A7T292 Retrotransposon protein, putative, Ty3-gypsy subclass1.1e-1474.19Show/hide
Query:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT
        V+IVSDRDPRFT KF  +LQ+A  T L+FST+FHPQTDGQSE TIQTLED LRAC LQLK +
Subjt:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT

A0A5A7T292 Retrotransposon protein, putative, Ty3-gypsy subclass6.4e-4730.69Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR
        I+A E++  +RF+ GLR EI +PV    +WT+F++LVE A+R  +     K+ V       L RG      T S F      R   E+    R      +
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHF-----PRLGQEVMCKDRVKGDLVR

Query:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT
        ++   +  +     GHF++DCPQL      +QRV SQTV Q ++     EG+ G +QKG   RPR +                             ++L 
Subjt:  HQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK----------------------------LELLT

Query:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT
        D    H+ V ++ +  ++ + +   K  + Y  +  +   +++++GC  FLAH+V V+  KLKPE++LVV E+LDVF ++LS LP +RE+EFTI+L+P+T
Subjt:  DEMLVHTPVGNVVI--IDHVYQECEKEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDT

Query:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------
         P+SQ PY                                                GY+Q                                        
Subjt:  TPVSQTPY-----------------------------------------------PGYYQW---------------------------------------

Query:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
                   F++ FS++ALPL  LT+KN KFE S+  E+SFQELKKRLV+AP+L + V GK++ +YCDASR GLGCV
Subjt:  -----------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

A0A5A7V8M7 Putative Retrotransposon protein8.1e-4227.44Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMR----------GCRVSKLLKARVG-------DLFQEYLERGALNPDPTDSHFPRLGQEV
        I+A E+N+C+RF+ GLR EI +PV A  +W +F++LVE A+R             +S+      G        +FQ   +R    P  +     + GQE 
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMR----------GCRVSKLLKARVG-------DLFQEYLERGALNPDPTDSHFPRLGQEV

Query:  M--------CKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK-------------
        +        C    +    +    + V  Q    GHF++DC QL      +Q V SQTV Q ++     EG+ G ++KG   RPR +             
Subjt:  M--------CKDRVKGDLVRHQRRLKVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSK-------------

Query:  ----------------------------------------LELLTDEMLVHTPVGNVVIIDHVYQECE-------------------KEDSF--------
                                                LE L++ + ++TPVG+V+++  V   CE                   KE  F        
Subjt:  ----------------------------------------LELLTDEMLVHTPVGNVVIIDHVYQECE-------------------KEDSF--------

Query:  ----------YFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDTTPVSQTPY------------
                    LIS +KA  +L KGC AFLAH+V V+  KLK E++ VV E++DVFP++LS LP +RE+EFTI+L+P TTP+SQ PY            
Subjt:  ----------YFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDTTPVSQTPY------------

Query:  --------------------------------------------------------------------------PGYYQW--------------------
                                                                                   GY+Q                     
Subjt:  --------------------------------------------------------------------------PGYYQW--------------------

Query:  --------------------------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
                                  F++ FS++ALPL  LT+KNVKFE S+  E+SFQELKKRLV AP+L + V GK++ +YCDASR GLGCV
Subjt:  --------------------------FVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

A0A5D3BS00 Integrase catalytic domain-containing protein1.1e-1474.19Show/hide
Query:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT
        V+IVSDRDPRFT KF  +LQ+A  T L+FST+FHPQTDGQSE TIQTLED LRAC LQLK +
Subjt:  VTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTIQTLEDTLRACALQLKAT

A0A5D3BS00 Integrase catalytic domain-containing protein1.7e-4431.66Show/hide
Query:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHFPRLGQEVMCKDRVKGDLVRHQRRL
        I+A E+++C+RF+ G   EI +P+ A  +WT+F++LV+ A+R      + ++  G+     L RG            R      C       LV     L
Subjt:  IIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHFPRLGQEVMCKDRVKGDLVRHQRRL

Query:  KVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPR--SKLELLTDEMLVHTPVGNVVIIDHVYQ-----------
         V  Q    GHF++DCPQL      +Q V SQTV Q ++     EG+ G +QKG   RPR   K+  +T +    +P G V    H++            
Subjt:  KVPKQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPR--SKLELLTDEMLVHTPVGNVVIIDHVYQ-----------

Query:  ---ECE----------------------------------KEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLP
            C                                   ++  F  LIS +KA  +L KGC   LAH+V V+  KLKP+++ VV E+ +VF ++LS LP
Subjt:  ---ECE----------------------------------KEDSFYFLISGMKARIILSKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLP

Query:  SNREVEFTIDLIPDTTPVSQTPYP---------------------------------------------GYYQW----------------FVQGFSKIAL
         +RE+EFTI+L+P T P+SQ PY                                              GY+Q                 F++ FS++AL
Subjt:  SNREVEFTIDLIPDTTPVSQTPYP---------------------------------------------GYYQW----------------FVQGFSKIAL

Query:  PLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV
        PL  LT+KN KFE S+   + FQELKKRLV+AP+L + V GK++ +YCDASR GLG V
Subjt:  PLMTLTKKNVKFEGSNDYERSFQELKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCV

SwissProt top hitse value%identityAlignment
A8ZV66 30S ribosomal protein S172.5e-1656.94Show/hide
Query:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW
        R   +Q  G VVS+KM K+V+VAV+RL  H+ Y +YV+RT+KF AHDE NQC +GD+V +  SRPLS+ KHW
Subjt:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW

B1I1J7 30S ribosomal protein S171.9e-1659.21Show/hide
Query:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWTKAK
        R   K   G VVS+KM K+VVVAVD L  H LY R V+RT KFMAHDE+NQC IGD+VK+  +RPLS+ K W  A+
Subjt:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWTKAK

P38519 30S ribosomal protein S173.3e-1658.82Show/hide
Query:  KQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW
        K++ G+VVS+KM K+VVVAV++L  H LY +YVKRT K+ AHDE N+C IGD V+++ +RPLSK K W
Subjt:  KQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW

Q605C1 30S ribosomal protein S178.7e-1754.93Show/hide
Query:  KMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWT
        +++ V G VVSNKM +++VVA++R   H LY +Y++RT+K +AHDENN+C+IGD V L  SRP+SK+K WT
Subjt:  KMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWT

Q8R7W3 30S ribosomal protein S174.3e-1658.33Show/hide
Query:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW
        R + K  +G VVSNKMQK++VVAV+  F H LY + VKRT K+  HDENN CN+GD VK+  +RPLSK K W
Subjt:  RAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW

Arabidopsis top hitse value%identityAlignment
AT1G49400.1 Nucleic acid-binding, OB-fold-like protein6.6e-2880.82Show/hide
Query:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWTKAK
        MK V+G VVSNKMQKSVVVAVDRLF++K+YNRYVKRTSKFMAHD+ + CNIGDRVKLDPSRPLSK KHW  A+
Subjt:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWTKAK

AT1G79850.1 ribosomal protein S175.8e-0846.27Show/hide
Query:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRK
        MK + G VV     K+V V V RL  H  Y R V+   K+ AHD +NQ  +GD V+L+ SRP+SK K
Subjt:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRK

AT3G18880.1 Nucleic acid-binding, OB-fold-like protein5.6e-2782.61Show/hide
Query:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW
        MK V+G VVSNKMQ SVVVAVDRLF++ +YNRYVKRTSKFMAHDE + CNIGDRVKLDPSRPLSK KHW
Subjt:  MKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRTSKFMAHDENNQCNIGDRVKLDPSRPLSKRKHW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATCATAGCTGATGAGACTAATCAGTGTAAACGGTTTAAAGAAGGTCTACGGAAAGAGATATGTAGTCCAGTGATAGCGAGTGTAGAATGGACAGATTTC
ACTAAGTTAGTAGAAGCAGCCATGAGGGGGTGTCGTGTGAGCAAGCTACTAAAAGCGAGAGTAGGAGATTTGTTCCAAGAGTATCTGGAAAGGGGAGCTTTAAAC
CCTGACCCAACAGACAGTCATTTTCCAAGACTAGGCCAGGAGGTAATGTGCAAAGATAGAGTCAAAGGAGATTTAGTCAGACATCAGAGGCGATTGAAGGTTCCC
AAGCAGGGCAACCTAACAGGACATTTCAGGAGAGATTGTCCCCAGCTGATGTCAGGAAGTGGAGTAGAGCAGCGTGTAATTTCTCAGACCGTAAGTCAACCGAAG
CTTGAGGCAACAGGTGGTGAAGGCAGTGGTGGTGTGAAACAGAAAGGGCCAGCAAGAAGACCTCGATCGAAGTTAGAACTCTTGACTGATGAAATGTTAGTTCAT
ACACCTGTTGGTAATGTTGTGATAATTGATCATGTTTATCAAGAATGTGAGAAGGAAGATTCTTTCTACTTTTTGATCTCTGGGATGAAGGCTAGAATAATATTG
AGTAAAGGTTGTGAAGCCTTTCTAGCACATGTTGTAGAAGTGAAACTGGCCAAGCTGAAGCCTGAGGAGATGCTAGTGGTGTGTGAATATTTAGATGTATTTCCA
GAGGAGTTGTCAAGATTACCTTCTAATCGAGAAGTGGAGTTTACGATTGATTTAATACCAGATACAACTCCCGTTTCTCAAACTCCTTATCCTGGATACTATCAA
TGGTTCGTCCAGGGATTTTCAAAGATAGCATTACCACTCATGACTTTAACAAAGAAGAATGTTAAGTTTGAGGGGAGTAACGATTATGAGCGAAGTTTCCAGGAA
TTGAAGAAAAGGTTGGTGTCAGCACCGGTATTAACAATTTTGGTACTAGGAAAAGAGTTTGAAGTATACTGTGATGCATCTCGACAGGGATTAGGTTGTGTAAAG
AAGGGAAGAAGCGGTTCCCACAAACCGGATCCTTGCCAGAGTCATAGCGAGGAAGATCTCTTGGAGAAGACTACTGTGACAATTGTATCTGATCGGGATCCAAGA
TTTACTTCTAAGTTCTTGATCAACTTACAACAAGCTTTCGAAACCAAGTTGCAATTTAGTACAACGTTTCATCCTCAAACAGATGGACAGTCGGAAATGACTATT
CAGACTCTAGAAGACACGTTACGAGCTTGTGCATTACAGCTCAAAGCGACCTCTGCCCCCCCCGTTACCAGCTACGGTACTCCGCCCCACCGCGCGCCGTTCCTA
TCCAGACAGTTTCTGAATCCTCGAGTTTGTAGCAGCTTGAAGCTTAAAAGGGCTGTTGGGTTTTCTTTTGGGGGTTGGGTTTTCAATTGGAGGGCCAAGATGAAG
CAGGTAGTCGGGATGGTGGTGTCAAACAAAATGCAGAAATCGGTGGTCGTTGCAGTGGACAGGTTGTTTTATCACAAGCTCTATAATCGCTACGTTAAGCGCACT
TCCAAGTTCATGGCTCATGACGAGAACAACCAGTGCAATATTGGTGACAGGGTTAAACTAGACCCTTCTAGGCCGTTGAGCAAGCGTAAACATTGGACAAAAGCC
AAGTTTCAGCTCCTTCAACCACCTAAGGGGAGATGCAAGCACCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAATCATAGCTGATGAGACTAATCAGTGTAAACGGTTTAAAGAAGGTCTACGGAAAGAGATATGTAGTCCAGTGATAGCGAGTGTAGAATGGACAGATTTC
ACTAAGTTAGTAGAAGCAGCCATGAGGGGGTGTCGTGTGAGCAAGCTACTAAAAGCGAGAGTAGGAGATTTGTTCCAAGAGTATCTGGAAAGGGGAGCTTTAAAC
CCTGACCCAACAGACAGTCATTTTCCAAGACTAGGCCAGGAGGTAATGTGCAAAGATAGAGTCAAAGGAGATTTAGTCAGACATCAGAGGCGATTGAAGGTTCCC
AAGCAGGGCAACCTAACAGGACATTTCAGGAGAGATTGTCCCCAGCTGATGTCAGGAAGTGGAGTAGAGCAGCGTGTAATTTCTCAGACCGTAAGTCAACCGAAG
CTTGAGGCAACAGGTGGTGAAGGCAGTGGTGGTGTGAAACAGAAAGGGCCAGCAAGAAGACCTCGATCGAAGTTAGAACTCTTGACTGATGAAATGTTAGTTCAT
ACACCTGTTGGTAATGTTGTGATAATTGATCATGTTTATCAAGAATGTGAGAAGGAAGATTCTTTCTACTTTTTGATCTCTGGGATGAAGGCTAGAATAATATTG
AGTAAAGGTTGTGAAGCCTTTCTAGCACATGTTGTAGAAGTGAAACTGGCCAAGCTGAAGCCTGAGGAGATGCTAGTGGTGTGTGAATATTTAGATGTATTTCCA
GAGGAGTTGTCAAGATTACCTTCTAATCGAGAAGTGGAGTTTACGATTGATTTAATACCAGATACAACTCCCGTTTCTCAAACTCCTTATCCTGGATACTATCAA
TGGTTCGTCCAGGGATTTTCAAAGATAGCATTACCACTCATGACTTTAACAAAGAAGAATGTTAAGTTTGAGGGGAGTAACGATTATGAGCGAAGTTTCCAGGAA
TTGAAGAAAAGGTTGGTGTCAGCACCGGTATTAACAATTTTGGTACTAGGAAAAGAGTTTGAAGTATACTGTGATGCATCTCGACAGGGATTAGGTTGTGTAAAG
AAGGGAAGAAGCGGTTCCCACAAACCGGATCCTTGCCAGAGTCATAGCGAGGAAGATCTCTTGGAGAAGACTACTGTGACAATTGTATCTGATCGGGATCCAAGA
TTTACTTCTAAGTTCTTGATCAACTTACAACAAGCTTTCGAAACCAAGTTGCAATTTAGTACAACGTTTCATCCTCAAACAGATGGACAGTCGGAAATGACTATT
CAGACTCTAGAAGACACGTTACGAGCTTGTGCATTACAGCTCAAAGCGACCTCTGCCCCCCCCGTTACCAGCTACGGTACTCCGCCCCACCGCGCGCCGTTCCTA
TCCAGACAGTTTCTGAATCCTCGAGTTTGTAGCAGCTTGAAGCTTAAAAGGGCTGTTGGGTTTTCTTTTGGGGGTTGGGTTTTCAATTGGAGGGCCAAGATGAAG
CAGGTAGTCGGGATGGTGGTGTCAAACAAAATGCAGAAATCGGTGGTCGTTGCAGTGGACAGGTTGTTTTATCACAAGCTCTATAATCGCTACGTTAAGCGCACT
TCCAAGTTCATGGCTCATGACGAGAACAACCAGTGCAATATTGGTGACAGGGTTAAACTAGACCCTTCTAGGCCGTTGAGCAAGCGTAAACATTGGACAAAAGCC
AAGTTTCAGCTCCTTCAACCACCTAAGGGGAGATGCAAGCACCTGTAGCACCTTATTCTTAAAAGTTGGAGGGTGCAAAGCAAACATTTTTGCAAATGAGGCTAT
AAAGGTACTGCTGAAGCTTTTTCCCAAATCATGAGATATCTGCTAAACATATTAATGCTGACCCATTTTTCTTCTAATTAGCCTCCCTTTTTTATATAGTGGCGT
GATTAAATGGAAGAAAGGTTTTTACTAAGACTATTGCTCACCTTCTGGTAGAAAGGTTCAATTAGCTCCATCTTTCAAGTTCATTGGGTCTAGTGGAGTGGACAT
ATATCGGATATAAGTTTTGATCTTCAACCTTTGAGAGAGATCTTTAGCCAACCTGCATATTGAAGACTCAAGTTCTGTAGAAGATAAACGTTTCCTAATAAAGCT
TTTTCAAGTCAGGATGAAGTTGTAGGTTTGATTCAACTTCTTGTAAGAACATTATGATTTGAATATTATGATGGATTGGTTTTCAACCTGATGTGTCGGATGGAA
ACTTGAAGTTGAATCCATGTAGGGAAAGAGGTTTTGGTCTTTAATGAACTGTAGGAAATGGTTTCAAGAAGTAATGAGGAGATGGTTGGGCCTGATGGGTTTTGT
CTGAAGTCTGGTTTTGTTGAATGCTAACACCAACATAATTTTTGCTTGGCTCCAATTATTAAATCACAAATGATAGTAAGCGCTCTATGGGTGTGAAACCTTATC
ACTTCCCTTAATGATGATAAAAGGTATTTTGTGCGCTTTATATGGGTAGCAATGTTGTTAAATGAAGATTTGGAGCTGAAATACAAGTACACATTTGAGGGATAG
AGCTCAACGCAGTTTGGTTATTCACTCTATTTTGAAATCTAATCCTAATATTGTGATTGTCCAAGAAATACAATTCATATTAGTTGATGGAAGATTGGTTGAACC
TTCATGGAGTTCCAAGAACATTGCAAATGACTTCTTTCTATGCATTGGGTCTATCAGGTCTTTTTGTTTTATTGCACAGCCCTTATTTATGTTTTGGCAGGTTTG
TATGATTGTAACCGGGTCTTGGGAGTAATCTCGATGTGGTAAGATGATCAAGAGGATAGCCTCATGTGGCAATAGATTTATTTCTAGCTCGAAGAACAATTTCTC
TTTTTTACCCTCTAGAAAAACAGTCTTTTACATGTTCAAACACAAGAGAAAGCTCATCTTTCTTGTTTGGTTAGATCTCTCTCCTATACATTGTTGTAAAATTTT
CTAAAATTTATAAGAGTCGGGTCAAATGAGTAGGAAAACATCTATTATCCAATTGTCCTTGAAGTTGGCAATTTGAAAGCTCATTTTCGTCTGTACTGCAAGGCA
CGCTCGGATGTGCTCCTAGACATGCTAGGCGCATGTTTAGCACCTTGCTTTGAATAGGCGAGGCGCAATTAATAAGATGCGTGCCTTTGCAGTGCCTATTGTGAG
GCACACAGGTGCATTAGGCGCGAGCCTTCAACCAAAATGCAAGGAATTCTGCATTTGGGTTTTTTCCCCTTGATCTGCTGTATTTTAATCCTAAACACATTTATT
AAACCTAAACATTACATTACAGCTTTTAATTTACTTCAAAAAAATAGAAACATGAATGAAGCAGGAAGCACCACGCCTCTTCGTCTTCTTCTTGTATGGCATAAA
TGAGGCTTTTCATCTCTGTTTCTTATTCAAAAAACAGAGACTTCCTCTTTGTCTTCTTCTTTGCTATTTCTTACTAAAAAACAGAAGACTTTTTGGGGCTTTTTA
AACAACAGTTCAAGAAATTCTTTAGTCTCTGTGTTTCATTTAAAACACAGGACTCCACACAGCATTTTTGTCTTCTTCACTGGACTGGCCGTTGTTCAGTAATCA
GTTTTCTCCTTGAAGTTTCTCCTTCTTAAACTTCACGGCACCAACACCCACCCTGTCTGTATTTTTCCCCTAGCTCCTGTAAGCTATTTTCCCATTGTCTTCTCT
GTTTCTTTTTCAAAAAATAGAGACTGCCTTTTGTCTTCTTCTTTGCTAATTCTTAATAAATAAACAGAGGACTTTTTGGGGCATTTTAAACCACGACTCAAGAAA
TTTAGGTTTACTAGAGCAAGGGTGTGCTAGTGGACCCCTCCCCCCTCTCTTTCCTGCCTCTTTCCTTTCACCAACATCTCCTCCTCCTCGTCAACTCAGTAAAGC
TGCTTTGCTCCTCTCCCTATCGGTCGAACTGCTTTACTCCTCTTGTCTCCTTTCTTTTATGCCTTCAATCAAGTTTGTCAGTTCATGCACACTTTCACCACCCAT
TGTACATCTTCAAGCTGTAAAATGCATCTTATGGTATCTAAAGGGTTCTCTTGGCCTCGGCCTAACCATCACTCTGGGACCATTAACCACCTTCTTTGCATTCTA
CGATGCCGACTAGGCTGGCTGTCCTGATAGCAGACAGTCCACTATTGTCTTGATTTTGTGTTGAAAGACTAACATTATAACATTGGTTATCTTTTATATATTGTA
GTTAAGCCTCATTTCTTCCAATGAATGTTTTGTAGTGTGTATATACGCATGTATATATTTTTATGTTTTTTTATACATAGTGCGGTTTTAAAATAAATCCGCACC
TTTTTGTGCACCTTGCGCCTAGGCTTCAGACAGTCATTGCGCATTATTGTGCCTTGAGCTTCAAAAACACTGATTATATTTGAACCTTATATTTGGTAGGTTCAT
GCTAAGTTCCTTCGTTTATGCAACAGATGTGGAACTTTTCACTAGATAGTAGATATAGGTTATTTGGGCCATTCTATCATAAAGCTGAAAGAATTTAAAGGCAAA
CTTAGAATGTGGAAAAAAGATTTTTGGAAATGTTGATTGTATTGCCAGCCTGGATGTGCTTTTGATAATGAAAGAGACTTGTCTCAAAATGAAAGTAGCTATACA
ACATCAATGAAAATAAATTGCTAGGGATTATCCTCAAAGAGGATATGTTATGAAGATCAAAGTGCCAACCGAAGTGGCGAAAGAAGGGTGAGGAAAACTTCTACT
TCTTTCACACAATAACTTGGCTACTAATACAAAGAGTTTCCTTTAAGAAATTCATTTGGAGGACGACAGATGTTTGGTCATTGTTTAGCAAGTGGAAAGGAAATC
AATTTTTCTGTCTACAAAATGGATTCTCATCTTTATTCCCAAAGGGCAGTCCAGTTTTTTTTCAATTCCTTATTCAAACGAGCTCCATCGTATTGCTGTGATTGA
GTTTGGCTTAGACAAGGAAGAGAATGTCATTGGGCTTCATCCCATCAACGAATTTATAAGAGATAGCACTTCCTGACTCTGGTTTTTTTTTGGTTTTGTTCCACA
ACTTTATGCATAAAGGTATTATAAACTCTAACATCAATGTGATTTCTTTTTCCTTTGGTTCCTAAAAATTTAATGCCAAATGTTTGAATTCAAGCTTCCGCTTCC
CTATTGCAGAAATTTTGGCTAGTGGACTTCTTTCGGTATTTTCCGGTACTTTTGGTATTGGTCTGAGCTTAAACGGCTTTGATCCTAAAAATTCCAGTGAACATA
CCTTGGCTTCAAGTGTTGCTTCTTTAAGAAAAATGTGGAACGGAAGATTACTATATGTTCTTACGGTGCCCTTTTGTCTCTATCTGGAGAAGGTTTTGTGGTACT
TTCAGTTCAACTAATTGGATTGCAGTTCCTACAAGATCTTAGATGACTCCTAATGCAAATATATATGTAGGAATAGCTACCTTGAGCATAGTTCGATGGGCTAGG
GCATTATAATCTATGACTTTTAAGTCATAGGTTCGAGTTTCATCCTACATATTGTTCAACAAGTTATTTTGTTTTGTTGTGTATTTTTTTTTTTTCTCTCGGCAG
TAACATGTCTGAACTGAAAACATTAGCTGCTTTGTTTCTTTTAGTACAAGTGGAGTGAGGTAGGAGGATTTGGACTTTTGACCTTGAGAGGAGTACATGCTAATA
CCATTGAATTATGTGATTTGTTTACGTGGTTTTCATGCTAGCTACCCTTTTGGGAGTTAAGTGGAATGAAATGATAGAATCTTTAAAGACAGAGTGATGTGGAAT
GAATAGGGAATCCTTAAGTTGAGTGATCGATGGATGATTCTGTAATTCATTATTGTTTCTCCTTTTTGTGGCCTTATCATTTGTTTTGATAATTATTCAGTTTTT
TATGGTTCAACCTGATCATGCATGTATATAATGTCTTTTCAACTCAGTTAAGTTGGCTCCCTCTCCCTTTCTATTGATTTCTTTCATCAATGTAAAAAACTGCAA
ATCTGTCCCACATTTATGTAAAGACACTTACAATAATATTTGTTTGCTTTGCACAACCAATTTAATAGGGGAAACTACTGCAGACCACATCGAGACCGAGTCAGA
TCGAAAAGTTCGCAATGCACTGGAAAACCAAAGAACAAAAACTCTGGAAAATGAAGTTTTGTAAGCAGTATGTTTCTCATGTCAAAGATTTGTAGTTACATTATT
ATTACCGTCTTGTACTGTTTTGTAAACCTTCTTTTCCTATGAGCATTTATCCCCACGATTTGTTGTGAGGTTGTTAAAAATGCATATTCAGAAAGGTTCCGTCCA
TTGGTAGGCATGTCTCCAGGCGGAACAAAGAAGAAGAGATCTTAGGATCTATTTGGTAGAGAATTTTAAAACAAAGAATTTAATAAACATAATTGTATTTCATGT
TTTC
Protein sequenceShow/hide protein sequence
MTIIADETNQCKRFKEGLRKEICSPVIASVEWTDFTKLVEAAMRGCRVSKLLKARVGDLFQEYLERGALNPDPTDSHFPRLGQEVMCKDRVKGDLVRHQRRLKVP
KQGNLTGHFRRDCPQLMSGSGVEQRVISQTVSQPKLEATGGEGSGGVKQKGPARRPRSKLELLTDEMLVHTPVGNVVIIDHVYQECEKEDSFYFLISGMKARIIL
SKGCEAFLAHVVEVKLAKLKPEEMLVVCEYLDVFPEELSRLPSNREVEFTIDLIPDTTPVSQTPYPGYYQWFVQGFSKIALPLMTLTKKNVKFEGSNDYERSFQE
LKKRLVSAPVLTILVLGKEFEVYCDASRQGLGCVKKGRSGSHKPDPCQSHSEEDLLEKTTVTIVSDRDPRFTSKFLINLQQAFETKLQFSTTFHPQTDGQSEMTI
QTLEDTLRACALQLKATSAPPVTSYGTPPHRAPFLSRQFLNPRVCSSLKLKRAVGFSFGGWVFNWRAKMKQVVGMVVSNKMQKSVVVAVDRLFYHKLYNRYVKRT
SKFMAHDENNQCNIGDRVKLDPSRPLSKRKHWTKAKFQLLQPPKGRCKHL