; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031960 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031960
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:20953499..20961896
RNA-Seq ExpressionLag0031960
SyntenyLag0031960
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.5e-15740.62Show/hide
Query:  RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNS---
        R   ++ Y++P F+ ++  IARP I A NF +     Q ++   +F+GLS+EDP+ HL++FL + D+F + GVP + +RL LF  SL   A+ WL+S   
Subjt:  RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNS---

Query:  -----------------FAPG------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEI
                         F+P        EI  F+Q + E+  EA+ERFK+LLRKCPHHGL    ++ TFYNGL       VD +A G+L  +  ++A+EI
Subjt:  -----------------FAPG------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEI

Query:  LERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANAL--KNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCP------SNPTSVF
        LE I+ N+C+  D R +SKKV  V EVD +++    +     AL  K  ++   +Q P      V N      C  CGE H  + CP      +  +SV 
Subjt:  LERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANAL--KNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCP------SNPTSVF

Query:  FVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQ---PGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALEL
        +VG   NQ+NNP+SN YNPGWRNHPNFSW    NNV+      Q   PGF +    PQ+ +    +PN+    E ++ ++M +TDA IQS  ASMRALE+
Subjt:  FVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQ---PGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALEL

Query:  Q-GVGGSKENAGASGSMP-------------------------------------------DVEPPYVSPPP---------YVPPLPFPQRQKPKNQDGQ
        Q G   S  N   SGS+P                                           + EP  +S P            PP PFPQR + + QD Q
Subjt:  Q-GVGGSKENAGASGSMP-------------------------------------------DVEPPYVSPPP---------YVPPLPFPQRQKPKNQDGQ

Query:  FKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSV
        F+KF+++ K+L INIP  EA+EQM +Y KF+KDIL++K+RL EFETV+LTEECSAIL+  LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS+
Subjt:  FKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP
        + KLG+GE +PT+V LQLADRS+ YP                                   GRPFLATGR LIDVQKGELTMRV +E+V FNVF AMK+ 
Subjt:  YRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP

Query:  DEMEDC----SFT------RILE---STVIETTIQDSADKHSEKHGE-----------------------------------------------------
        ++ E C    +FT       +LE   +  +E +++++ D+ +E+  E                                                     
Subjt:  DEMEDC----SFT------RILE---STVIETTIQDSADKHSEKHGE-----------------------------------------------------

Query:  -----TLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS
             TLP+I+ S L  EHEE L+++L++Y++AIGW +ADI+GISPSFCMHKI++E+    +IE QRRLNP  KEVVKKE+IKWLDAGIIYPI+DS
Subjt:  -----TLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]3.1e-18750.6Show/hide
Query:  QNPLFEQNEQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGV-------SDSFV
        +NP    N  Q     NPI +A+DR RAIRAY  P   ELNP I RP+IQ   FE+K VMFQMLQT+GQFHGL  EDPHLHLKSFLGV       SDSF 
Subjt:  QNPLFEQNEQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGV-------SDSFV

Query:  IQGVPRDALRLTLFPYSLRDGAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFPY LRDGAKSWLN+ APG                          +EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDALRLTLFPYSLRDGAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPA-MEPTAVVNQV
        YNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+    LA + N L+N+ +       A +   A +NQ 
Subjt:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPA-MEPTAVVNQV

Query:  TDEACVYCGEDHNYEFCPSNPTSVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKTQVLPQQSKSALP
          E+CVYCGE+H ++ CPSNP S+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q   +   
Subjt:  TDEACVYCGEDHNYEFCPSNPTSVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKTQVLPQQSKSALP

Query:  QPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-----------------VSPPP--------------YVPPLP
        Q  S  S+E+++KE+MA+ DA IQS QAS+R LE+Q  G      G S S    +                    V   P              Y P  P
Subjt:  QPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-----------------VSPPP--------------YVPPLP

Query:  FPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALC
        FPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY KFLKD+L  +++  EF+ VSL EECSAILKN +P K KDPGSFTIPVSIG KELGRALC
Subjt:  FPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALC

Query:  DLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNE
        DLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPE                                  GRPFL  GR L+DV KG +T+R+  +
Subjt:  DLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNE

Query:  EVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGE
        +V+FN+  +MKYP  +E+C       S V E T Q + ++  ++  E
Subjt:  EVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGE

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.5e-15744.42Show/hide
Query:  QAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRD
        Q  +PI++ +DR RAIR Y  PMFNELNPGI RP+IQA  FE+K VMFQMLQT+GQF  + +EDPHLHL+SFL +SDSF IQGV  +  RL LFP+SLRD
Subjt:  QAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRD

Query:  GAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL
         A+SWLN+ +P                           SEI+ F QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+
Subjt:  GAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL

Query:  LAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP
        L+K+++EA+EILE I+ N+ QWS+ R   S+KV  VLEVD ++     +A + N LKN+++ + +    ++P A + Q  D +CV+C E H +E CPSNP
Subjt:  LAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP

Query:  TSVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQG--NNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMR
         SV ++GNQ     N  +SN YN  W+NHPN SWG +    +    ++   PGF++    PQ ++++ P     +SLE++M+++MA+ DA IQS  A +R
Subjt:  TSVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQG--NNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMR

Query:  ALELQ-GVGGSKENAGASGSMP-DVEPP-----------------------------------------------------------------YVSPP--
         LELQ G   ++  A   GS+P D E P                                                                   S P  
Subjt:  ALELQ-GVGGSKENAGASGSMP-DVEPP-----------------------------------------------------------------YVSPP--

Query:  -PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGR
            PPLPFPQR + + QDGQFKKFL++LKQLHINIPLVEA+EQMPNY KFLKDILTKK+RLGEFE+  LTE   A+LKN +PPK KDPGSFTIP+SIG 
Subjt:  -PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGR

Query:  KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEG----------------------------------RPFLATGRALIDVQKGE
                              R LGIGEARPTTVTLQLADRS+ +P+G                                  RPFLATGR LIDV+KGE
Subjt:  KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEG----------------------------------RPFLATGRALIDVQKGE

Query:  LTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGETLPIIVESDLMPEHE
        LTMR  +E+  F VF+ ++ PD + +C     ++  ++E    +S  K S+K  + +P+    ++  +HE
Subjt:  LTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGETLPIIVESDLMPEHE

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]8.0e-15138.57Show/hide
Query:  NDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y VP  N   P I RP + A NFE+K  + QM+Q   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  APGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY
        AP S                          +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ +EAY
Subjt:  APGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY

Query:  EILERISINSCQWSDVRGTSKKVKSVLEVDGVS--TSRVD-LAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPTSVFFVG
        E+LE ++ N+ QWS+ R   KKV  + +VDG++   ++VD L  +   L NV  +S    P +            +C  CG  H    C      V FV 
Subjt:  EILERISINSCQWSDVRGTSKKVKSVLEVDGVS--TSRVD-LAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPTSVFFVG

Query:  N-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ-GV
        N     Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF   Q  P Q +S      +   L     E   R +A +    +S R +E+Q G 
Subjt:  N-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ-GV

Query:  GGSKENAGASGSMP---DVEP-----------------------------------------------PYVSPP--PYVPPLPFPQRQKPKNQDGQFKKF
          +  N+   G++P   +V P                                               P   PP  PYVPP+PFPQR K    D QF+KF
Subjt:  GGSKENAGASGSMP---DVEP-----------------------------------------------PYVSPP--PYVPPLPFPQRQKPKNQDGQFKKF

Query:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKL
        L++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKL
Subjt:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKL

Query:  GIGEARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----
        G+ E +PTT++LQLADRS+ YP                                   GRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP    
Subjt:  GIGEARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----

Query:  ----------------------DEMEDC------SFTRILESTVIETTIQDSADKHSEK---------------------------------------HG
                              + +E C      S    LE   +   ++ +  K  ++                                         
Subjt:  ----------------------DEMEDC------SFTRILESTVIETTIQDSADKHSEK---------------------------------------HG

Query:  ETLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS
         TLP+IV   L  E  + LI++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP  KEVV+ EV+KWLDAGIIYPI+DS
Subjt:  ETLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]6.1e-15138.57Show/hide
Query:  NDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y VP  N   P I RP + A NFE+K  + QM+Q   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  APGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY
        AP S                          +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ +EAY
Subjt:  APGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAY

Query:  EILERISINSCQWSDVRGTSKKVKSVLEVDGVS--TSRVD-LAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPTSVFFVG
        E+LE ++ N+ QWS+ R   KKV  + +VDG++   ++VD L  + + L NV  +S    P +            +C  CG  H    C      V FV 
Subjt:  EILERISINSCQWSDVRGTSKKVKSVLEVDGVS--TSRVD-LAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPTSVFFVG

Query:  N-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ-GV
        N     Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF   Q  P Q +S      +   L     E   R +A +    +S R +E+Q G 
Subjt:  N-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ-GV

Query:  GGSKENAGASGSMP---DVEP-----------------------------------------------PYVSPP--PYVPPLPFPQRQKPKNQDGQFKKF
          +  N+   G++P   +V P                                               P   PP  PYVPP+PFPQR K    D QF+KF
Subjt:  GGSKENAGASGSMP---DVEP-----------------------------------------------PYVSPP--PYVPPLPFPQRQKPKNQDGQFKKF

Query:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKL
        L++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKL
Subjt:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKL

Query:  GIGEARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----
        G+ E +PTT++LQLADRS+ YP                                   GRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP    
Subjt:  GIGEARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----

Query:  ----------------------DEMEDC------SFTRILESTVIETTIQDSADKHSEK---------------------------------------HG
                              + +E C      S    LE   +   ++ +  K  ++                                         
Subjt:  ----------------------DEMEDC------SFTRILESTVIETTIQDSADKHSEK---------------------------------------HG

Query:  ETLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS
         TLP+IV   L  E  + LI++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP  KEVV+ EV+KWLDAGIIYPI+DS
Subjt:  ETLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase9.3e-13737.86Show/hide
Query:  LFEQNEQQNNQAKNPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDA
        L E  EQ+    +N I+I  D      +R   +P   E    +  P++ A   ++++ M +M+Q   QF GLS E+P+ H+ +FL + D+   +GV +DA
Subjt:  LFEQNEQQNNQAKNPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDA

Query:  LRLTLFPYSLRDGAKSWLNS--------------------FAP------GSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVT
        LRL LF +SL   A  W  S                    F+P       +EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    
Subjt:  LRLTLFPYSLRDGAKSWLNS--------------------FAP------GSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVT

Query:  QGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGE
        +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+     +  +  ++KN  V   Q  P               C  CGE
Subjt:  QGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGE

Query:  DHNYEFCPSNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAI
         H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P      SLE  + +FMA      
Subjt:  DHNYEFCPSNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAI

Query:  QSNQASMRALELQ-GVGGSKENAGASGSMPDVEPPYVSPPPYVPPLPFPQRQKPKNQD-------GQFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLK
         S  A+ + +E Q G   +  N+    S+P    P               R   + Q+        + K+ +   K   I  PL V+A+EQMP+Y KF+K
Subjt:  QSNQASMRALELQ-GVGGSKENAGASGSMPDVEPPYVSPPPYVPPLPFPQRQKPKNQD-------GQFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLK

Query:  DILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----
        DIL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP      
Subjt:  DILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----

Query:  -----------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTV------------IETT
                                     GRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C    + +               +E  
Subjt:  -----------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTV------------IETT

Query:  IQDSADKHSEK--------------------------------------------------------HGETLPIIVESDLMPEHEEALIKLLQQYRKAIG
        + D  D+ +E+                                                          +TLP+I+ S L     E L+++L+ ++ AIG
Subjt:  IQDSADKHSEK--------------------------------------------------------HGETLPIIVESDLMPEHEEALIKLLQQYRKAIG

Query:  WTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS
        WT+ADI+GISPSFCMHKI LE+G   S+E QRRLNP  KEVVKKE+IKWLDAGIIYPI+DS
Subjt:  WTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIADS

A0A2G9HWF8 Reverse transcriptase6.4e-13837.76Show/hide
Query:  LFEQNEQQNNQAKNPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDA
        L E  EQ+    +N I++  D      +R   +P   E    +  P++ A   ++++ M QM+Q   QF GLS E+P+ H+ +FL + D+   +GV +DA
Subjt:  LFEQNEQQNNQAKNPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDA

Query:  LRLTLFPYSLRDGAKSWLNS--------------------FAP------GSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVT
        LRL LF +SL   A  W  S                    F+P       +EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    
Subjt:  LRLTLFPYSLRDGAKSWLNS--------------------FAP------GSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVT

Query:  QGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGE
        +  +D   G + L+ T  E + +L  + +N  +    R T  K   V+EVD V+     +  +  ++KN                            CGE
Subjt:  QGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGE

Query:  DHNYEFCPSNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ---QKMNQPGFAKTQVLPQQSKSALPQP---NSGNSLEAMMKEFMA
         H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q    +  Q G   T+  P+Q   A  Q     +G  L+ ++KE   
Subjt:  DHNYEFCPSNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ---QKMNQPGFAKTQVLPQQSKSALPQP---NSGNSLEAMMKEFMA

Query:  RTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-VSPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDI
          +  + S +              KE         +VE P  VS P  + P PFPQR + +    QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDI
Subjt:  RTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-VSPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDI

Query:  LTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-------
        L+KK+RLG++ETV+LTEECSAI++N LPPK KDPG              RALCDLGASINLMP S+YR LG+ EA+PT++TLQLADRS+TYP+       
Subjt:  LTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-------

Query:  ---------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTV------------IETTIQ
                                   GRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C    + ++              +E  + 
Subjt:  ---------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRILESTV------------IETTIQ

Query:  DSADKHSEK--------------------------------------------------------HGETLPIIVESDLMPEHEEALIKLLQQYRKAIGWT
        D  D+ +E+                                                          +TLP+I+ S L     E L+++L+ ++ AIGWT
Subjt:  DSADKHSEK--------------------------------------------------------HGETLPIIVESDLMPEHEEALIKLLQQYRKAIGWT

Query:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIAD
        +ADI+GISPSFCMHKI LE+    S+E QRRLNP  KEVVKKE+IKWLDAGIIYPI+D
Subjt:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVIKWLDAGIIYPIAD

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129456.4e-13841.41Show/hide
Query:  EQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFP
        E  NN   N I +  +  RA+R YVVP+   L+  I RP I A NFE+K    QM+Q+  QF GL S+DP+ HL +FL + D+F   GV  DA+RL LFP
Subjt:  EQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFP

Query:  YSLRDGAKSWLNSFAPGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS
        +SLRD AKSWLNS   GS                          +I  F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA+
Subjt:  YSLRDGAKSWLNSFAPGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS

Query:  AGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFC
        AGGAL++K   +AY +LE ++ N+ QW   R  S+K     E+D + T    +A ++  L  + V            AV N +    C  CG+ H+Y+ C
Subjt:  AGGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFC

Query:  PSNPTSVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASM
        P N  SV FVGN   Q+NNPYSN YNPGWRNHPNFSW         +  M  PGF       QQ++  +P+  S   LE ++ +++++TDA IQS  AS+
Subjt:  PSNPTSVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASM

Query:  RALELQ-GVGGSKENAGASGSMP--------------------------------DVEPPYVSPPPYV------------------------PPLPFPQR
        R LE Q G   +  N    GS+P                                + E  +V                              PP PFPQR
Subjt:  RALELQ-GVGGSKENAGASGSMP--------------------------------DVEPPYVSPPPYV------------------------PPLPFPQR

Query:  QKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGA
         + +  + QF+KFL + K+LHINIP  EA+EQMP+Y KFLKDIL+KK++LGEFETV LTEECSAIL+N LPPK KDPGSFTIP +IG     +AL DLGA
Subjt:  QKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGA

Query:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKF
        SINLMP S++ KLG+GE +PT+VTLQLADRS  YP                                   GRPFLAT  A+IDV++G+++ +V  E V+F
Subjt:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNEEVKF

Query:  NVFKAMKYPDEMEDCSFTRILE
        N+F A K+P     C    +++
Subjt:  NVFKAMKYPDEMEDCSFTRILE

A0A6J1DY39 uncharacterized protein LOC1110256537.9e-13639.87Show/hide
Query:  NQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLR
        N   NPI +A+ R RA+R Y   +  +LN  +         FE K +M QML  +GQF GL  EDP  HLKSF+ V+++F + G+  DALRLTLFP+S+ 
Subjt:  NQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLR

Query:  DGAKSWLNSFAPGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGA
          A +WLN+F   +                          EI+ FRQ E+E  + AWERFK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G 
Subjt:  DGAKSWLNSFAPGS--------------------------EIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGA

Query:  LLAKTFDEAYEILERISINSCQWSDVRGTSKKVKS----VLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPT--AVVNQVTDEACVYCGEDHNYE
          +K+F+E  EIL+++S ++ QW   +  ++  ++    VL +D +++ +  +  I   LKN+   +     A   T  + V Q+ +  C YCG+ H  E
Subjt:  LLAKTFDEAYEILERISINSCQWSDVRGTSKKVKS----VLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPT--AVVNQVTDEACVYCGEDHNYE

Query:  FCPSNPTSVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQ------GNNVQAQQKMNQPGFAKTQVLP-------QQSKSALPQPNSGNSLEAMMKEF
         CPSNP+S+++VG    Q+ NPYSN YNPGW+ HPNFSW GQ      G+N Q ++    PGF  +   P       QQ     P   + +++E +MKE 
Subjt:  FCPSNPTSVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQ------GNNVQAQQKMNQPGFAKTQVLP-------QQSKSALPQPNSGNSLEAMMKEF

Query:  MARTDAAIQS---------------------NQASMRALELQ------------------------GVGGSKENAGASGS--------MPD---------
        + + DA ++                      N  ++R LE+Q                         +G    N+ A+ S        MPD         
Subjt:  MARTDAAIQS---------------------NQASMRALELQ------------------------GVGGSKENAGASGS--------MPD---------

Query:  ---------VEPPYVSPP--PYV----PPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI
                 +  P VS P  P V    PP PFPQR   KNQD  F+KFL+ILKQLHINIP VEA+EQMP YAKF+KDI+T+KK+LGE+ETV+LTE  S +
Subjt:  ---------VEPPYVSPP--PYV----PPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI

Query:  LKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----------------------------
         K+ +PPK KDPGSFTIP  IG K++GRALCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT PE                             
Subjt:  LKNGLPPKAKDPGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----------------------------

Query:  -----GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRI
             GRPFLATG  LIDV+KGELTMRV +++V FN+  AMKY D+ME+C+   I
Subjt:  -----GRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFTRI

A0A6J1EQ90 uncharacterized protein LOC1114364111.5e-18750.6Show/hide
Query:  QNPLFEQNEQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGV-------SDSFV
        +NP    N  Q     NPI +A+DR RAIRAY  P   ELNP I RP+IQ   FE+K VMFQMLQT+GQFHGL  EDPHLHLKSFLGV       SDSF 
Subjt:  QNPLFEQNEQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDPHLHLKSFLGV-------SDSFV

Query:  IQGVPRDALRLTLFPYSLRDGAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFPY LRDGAKSWLN+ APG                          +EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDALRLTLFPYSLRDGAKSWLNSFAPG--------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPA-MEPTAVVNQV
        YNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+    LA + N L+N+ +       A +   A +NQ 
Subjt:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-SKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPA-MEPTAVVNQV

Query:  TDEACVYCGEDHNYEFCPSNPTSVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKTQVLPQQSKSALP
          E+CVYCGE+H ++ CPSNP S+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q   +   
Subjt:  TDEACVYCGEDHNYEFCPSNPTSVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKTQVLPQQSKSALP

Query:  QPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-----------------VSPPP--------------YVPPLP
        Q  S  S+E+++KE+MA+ DA IQS QAS+R LE+Q  G      G S S    +                    V   P              Y P  P
Subjt:  QPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQGVGGSKENAGASGSMPDVEPPY-----------------VSPPP--------------YVPPLP

Query:  FPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALC
        FPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY KFLKD+L  +++  EF+ VSL EECSAILKN +P K KDPGSFTIPVSIG KELGRALC
Subjt:  FPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGRKELGRALC

Query:  DLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNE
        DLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPE                                  GRPFL  GR L+DV KG +T+R+  +
Subjt:  DLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRALIDVQKGELTMRVCNE

Query:  EVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGE
        +V+FN+  +MKYP  +E+C       S V E T Q + ++  ++  E
Subjt:  EVKFNVFKAMKYPDEMEDCSFTRILESTVIETTIQDSADKHSEKHGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAAGCTAACTGTGCCTCTAGACGACGGAAAGGCCTCGCCTCTCTACACTGTTGGCTCTGATGCTGGCGCAGTTGTGCTAGCCTCATTTGCTCCATCCTCT
CCTCCAACGCCTCAAGCGGGAGAGGTGGATGCTCATGGTGGGGCTATGTTGCCTTCTAGAGTGGGAGTTCTTAGTGGGAACATTTTGGGGAGCTTTGGGGAGCAC
TCTAGGAGGCTCAAGGCGTGCTCGGAGGGACTCCAAGTGCTAGGGACGCGTGTTGGGTACAAAAAAGTTTTTGGCGCCGTTGCCGGGGAGAGAGAAAAGACTAAC
GGAAGCAATGGCCCATTAAGAAGAAGCTCCCAAGACAATGAGAGACATTTTGCAGCCAGTCCTCCCACCGAAAATTCAGGCATCGTTTACGCCCCGATCCAAGCA
AATAATTTCGAGTTAAAGACAGGACTGATCCAGATGACGCGAGACAATTCTTTTAAGGGACATCCTTCAGAGGATCCACATTCGCATCTGCGATCATTCTTGGAG
TTATGCGGGACGGTAAAGATGAATGGAGTTCCGGCCGACGCGATAAGATTAATATTATGCTGCTGGGCGACTGAAGGGAGCAAATTCTATGCTGAAGCAAAACTG
GGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTGCATGAGCGATCCGCCTGGGCAAAACCCACTGTTTGAGCAAAAT
GAGCAGCAAAATAATCAGGCTAAGAATCCTATCTTGATAGCGAACGATAGGACTAGAGCCATTCGAGCATATGTTGTCCCAATGTTTAATGAGTTGAATCCAGGA
ATTGCACGTCCCCAAATCCAAGCGGAAAATTTTGAAATGAAACTGGTAATGTTTCAGATGTTGCAAACCATGGGGCAATTCCATGGTTTGTCATCTGAAGACCCT
CATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTCAGATTAACTTTGTTCCCGTATTCTCTTAGAGAT
GGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAG
CTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGGTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCT
GGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAGAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAGTAAAAAG
GTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCAGTAGGGTTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCA
CCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGTAATCCAACATCT
GTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAATTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTA
CAAGCGCAACAAAAGATGAACCAGCCAGGATTTGCTAAAACGCAGGTATTGCCCCAGCAAAGTAAGTCGGCTTTGCCCCAGCCAAATTCGGGAAACTCTCTCGAA
GCGATGATGAAAGAATTTATGGCTCGCACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCACTGGAATTGCAAGGTGTTGGAGGCAGCAAAGAGAAT
GCTGGAGCATCTGGTTCTATGCCAGATGTAGAACCACCATATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAG
GATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAG
GATATTTTGACTAAAAAGAAAAGGTTAGGTGAATTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGAT
CCAGGGTCATTTACCATACCCGTGTCTATAGGTAGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGG
AAGTTGGGTATTGGTGAAGCTCGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTCGTCCATTTTTGGCTACTGGTAGGGCA
TTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAGATGGAGGATTGC
TCCTTCACCAGGATTCTGGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGACGTTGCCCATTATTGTTGAA
TCAGATTTAATGCCAGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCT
TTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAACGAAAGAGGTTGTTAAAAAGGAGGTGATT
AAATGGTTGGATGCTGGGATCATCTATCCAATTGCCGATAGCTGTTGCACGATTTTAATCCGCAAGCGCACGGGTCGTCACAAGCTCAGCGTCGAGACGCTACAA
GGACAGCGTCGAGACGCTCTGACGTCATCTTTTCTTGGGCTCCAAGACGTCGCAGTGTCGAGACGCTACTTTGGACAGCGTCGCGACGCTCTCGCGATTTGGATC
TTTCCACGCAGCTGCCTAGCGTCGAGACGCTCCAGGCTTAGCGTCGAGACGCTAGGCCCTGAATCTTCTAGAATGGCCCGATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTAAGCTAACTGTGCCTCTAGACGACGGAAAGGCCTCGCCTCTCTACACTGTTGGCTCTGATGCTGGCGCAGTTGTGCTAGCCTCATTTGCTCCATCCTCT
CCTCCAACGCCTCAAGCGGGAGAGGTGGATGCTCATGGTGGGGCTATGTTGCCTTCTAGAGTGGGAGTTCTTAGTGGGAACATTTTGGGGAGCTTTGGGGAGCAC
TCTAGGAGGCTCAAGGCGTGCTCGGAGGGACTCCAAGTGCTAGGGACGCGTGTTGGGTACAAAAAAGTTTTTGGCGCCGTTGCCGGGGAGAGAGAAAAGACTAAC
GGAAGCAATGGCCCATTAAGAAGAAGCTCCCAAGACAATGAGAGACATTTTGCAGCCAGTCCTCCCACCGAAAATTCAGGCATCGTTTACGCCCCGATCCAAGCA
AATAATTTCGAGTTAAAGACAGGACTGATCCAGATGACGCGAGACAATTCTTTTAAGGGACATCCTTCAGAGGATCCACATTCGCATCTGCGATCATTCTTGGAG
TTATGCGGGACGGTAAAGATGAATGGAGTTCCGGCCGACGCGATAAGATTAATATTATGCTGCTGGGCGACTGAAGGGAGCAAATTCTATGCTGAAGCAAAACTG
GGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTGCATGAGCGATCCGCCTGGGCAAAACCCACTGTTTGAGCAAAAT
GAGCAGCAAAATAATCAGGCTAAGAATCCTATCTTGATAGCGAACGATAGGACTAGAGCCATTCGAGCATATGTTGTCCCAATGTTTAATGAGTTGAATCCAGGA
ATTGCACGTCCCCAAATCCAAGCGGAAAATTTTGAAATGAAACTGGTAATGTTTCAGATGTTGCAAACCATGGGGCAATTCCATGGTTTGTCATCTGAAGACCCT
CATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTCAGATTAACTTTGTTCCCGTATTCTCTTAGAGAT
GGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAG
CTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGGTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCT
GGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAGAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAGTAAAAAG
GTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCAGTAGGGTTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCA
CCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGTAATCCAACATCT
GTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAATTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTA
CAAGCGCAACAAAAGATGAACCAGCCAGGATTTGCTAAAACGCAGGTATTGCCCCAGCAAAGTAAGTCGGCTTTGCCCCAGCCAAATTCGGGAAACTCTCTCGAA
GCGATGATGAAAGAATTTATGGCTCGCACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCACTGGAATTGCAAGGTGTTGGAGGCAGCAAAGAGAAT
GCTGGAGCATCTGGTTCTATGCCAGATGTAGAACCACCATATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAG
GATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAG
GATATTTTGACTAAAAAGAAAAGGTTAGGTGAATTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGAT
CCAGGGTCATTTACCATACCCGTGTCTATAGGTAGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGG
AAGTTGGGTATTGGTGAAGCTCGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTCGTCCATTTTTGGCTACTGGTAGGGCA
TTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAGATGGAGGATTGC
TCCTTCACCAGGATTCTGGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGACGTTGCCCATTATTGTTGAA
TCAGATTTAATGCCAGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCT
TTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAACGAAAGAGGTTGTTAAAAAGGAGGTGATT
AAATGGTTGGATGCTGGGATCATCTATCCAATTGCCGATAGCTGTTGCACGATTTTAATCCGCAAGCGCACGGGTCGTCACAAGCTCAGCGTCGAGACGCTACAA
GGACAGCGTCGAGACGCTCTGACGTCATCTTTTCTTGGGCTCCAAGACGTCGCAGTGTCGAGACGCTACTTTGGACAGCGTCGCGACGCTCTCGCGATTTGGATC
TTTCCACGCAGCTGCCTAGCGTCGAGACGCTCCAGGCTTAGCGTCGAGACGCTAGGCCCTGAATCTTCTAGAATGGCCCGATTTTGA
Protein sequenceShow/hide protein sequence
MTKLTVPLDDGKASPLYTVGSDAGAVVLASFAPSSPPTPQAGEVDAHGGAMLPSRVGVLSGNILGSFGEHSRRLKACSEGLQVLGTRVGYKKVFGAVAGEREKTN
GSNGPLRRSSQDNERHFAASPPTENSGIVYAPIQANNFELKTGLIQMTRDNSFKGHPSEDPHSHLRSFLELCGTVKMNGVPADAIRLILCCWATEGSKFYAEAKL
GAETATSQLVSQLHEPTSVECMSDPPGQNPLFEQNEQQNNQAKNPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAENFEMKLVMFQMLQTMGQFHGLSSEDP
HLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASA
GGALLAKTFDEAYEILERISINSCQWSDVRGTSKKVKSVLEVDGVSTSRVDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPTS
VFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKTQVLPQQSKSALPQPNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQGVGGSKEN
AGASGSMPDVEPPYVSPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKD
PGSFTIPVSIGRKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC
SFTRILESTVIETTIQDSADKHSEKHGETLPIIVESDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPATKEVVKKEVI
KWLDAGIIYPIADSCCTILIRKRTGRHKLSVETLQGQRRDALTSSFLGLQDVAVSRRYFGQRRDALAIWIFPRSCLASRRSRLSVETLGPESSRMARF