; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038691 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038691
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:23429469..23433677
RNA-Seq ExpressionLag0038691
SyntenyLag0038691
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.5e-11335.42Show/hide
Query:  QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEA
        +F+GLS+EDP+ HL++FL + D+F + GVP + +RL LF  SL   A+ W                                  EI  F+Q + E+  EA
Subjt:  QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEA

Query:  WERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADL-------
        +ERFK+LLRKCPHHGL    ++ TFYNGL       VD +A G+L  +  ++A+EILE I+ N+C+  D R ++KKV  V EVD +++  A +       
Subjt:  WERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADL-------

Query:  --------------AMIANALKNVTVI-----------------------------------EENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQ--
                         A+ ++N+ V                                    + NP+SN YNPGWRNHPNFSW    NNV+      Q  
Subjt:  --------------AMIANALKNVTVI-----------------------------------EENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQ--

Query:  -PGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-GHPRREGKERNNNVVVEEELESGQ
         PGF       QQN +    +   N+ E ++ ++M +TDA IQS  ASMRALE+QVGQLA+ +  RP G LPS+T  +P+ + +E    +     L SG+
Subjt:  -PGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-GHPRREGKERNNNVVVEEELESGQ

Query:  GAGGSNKNAGASGFVPDI--EPPYVPPPPYA----------PPLPFPQRQKPKNQD--------------------------------------------
           G+ K     G    +  E P V   P A          PP PFPQR + + QD                                            
Subjt:  GAGGSNKNAGASGFVPDI--EPPYVPPPPYA----------PPLPFPQRQKPKNQD--------------------------------------------

Query:  -----------------------------------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF
                                           G +  G+ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADRS+ YP G +EDVLVKVDKFIF
Subjt:  -----------------------------------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF

Query:  PVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRIL-------------ESTVIEITIQ---DSAN
        P DFI+LD E D D+P++LGRPFLATGR LIDVQKGELTMRV +E V FNVF AMK+ ++ E C  +                 +  +E++++   D +N
Subjt:  PVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRIL-------------ESTVIEITIQ---DSAN

Query:  KHL-----------------EKFRS----KKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIR
        + +                 ++F S     K    KPS+ E P L+LK LP HLK+ +LGE  TLP+I++S L  EHEE L+++L++Y +AIGW +ADIR
Subjt:  KHL-----------------EKFRS----KKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIR

Query:  ELAHLFQKQRSNLPEN
         ++  F   + ++ ++
Subjt:  ELAHLFQKQRSNLPEN

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.1e-12741.06Show/hide
Query:  QNPLFEQNEQRNNQAENPILIANDRTRAIEHVAN--------------------------------VGQFHGLSSEDPHLHLKSFLGV-------SDSFV
        +NP    N++R     NPI +A+DR RAI   A+                                +GQFHGL  EDPHLHLKSFLGV       SDSF 
Subjt:  QNPLFEQNEQRNNQAENPILIANDRTRAIEHVAN--------------------------------VGQFHGLSSEDPHLHLKSFLGV-------SDSFV

Query:  IQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFPY LRDGAKSW                                 +EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------------
        YNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +                   
Subjt:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------------

Query:  -----------------------------------IEENPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKAQVMPQQNKQALP
                                           ++ NP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q       
Subjt:  -----------------------------------IEENPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKAQVMPQQNKQALP

Query:  QQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREG---KERNNNVVVEEELESGQGAGGSNKNAGASGFVPD
        Q  SE S+E+++KE+MA+ DA IQS QAS+R LE+Q+G   N  +     +  +DT     E    KE + +    EE    Q    S + +        
Subjt:  QQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREG---KERNNNVVVEEELESGQGAGGSNKNAGASGFVPD

Query:  IEPPYVPPPPYAPPLPFPQRQKPKNQD-------------------------------------------------------------------------
                  Y P  PFPQR K K ++                                                                         
Subjt:  IEPPYVPPPPYAPPLPFPQRQKPKNQD-------------------------------------------------------------------------

Query:  ------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRA
              GGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGKIED+L++VDKFIF  DFIILDYE D DVPIILGRPFL  GR 
Subjt:  ------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRA

Query:  LIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFI
        L+DV KG +T+R+  + V+FN+  +MKYP  +E+CS +
Subjt:  LIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFI

XP_024042858.1 uncharacterized protein LOC112099671 [Citrus clementina]5.5e-11938.93Show/hide
Query:  RNNQAENPILIANDRTRAIEHVANVG--QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----------------------
        RN     P+   N +    EHV N    + +GL +EDPHLHLK FL +SD+F   G  +DALRL LFPYSLRD A++W                      
Subjt:  RNNQAENPILIANDRTRAIEHVANVG--QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----------------------

Query:  -----------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRG
                   +EI  F QLEDE+  + WERFKELLR+CPHHG+P CIQ+ETFYNGLN  T+ MVDASA  ALL K++ EAYEILERI+ N+ QW   R 
Subjt:  -----------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRG

Query:  T-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVI-----------------------------------------EENPYSNFYNPGWRNHPNFSWGGQ
        T  ++   V  +D ++T+ A +  + N +K +T                                           + NPY N YN GW+ HPNFSW  Q
Subjt:  T-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVI-----------------------------------------EENPYSNFYNPGWRNHPNFSWGGQ

Query:  GNNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHS-ENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKE----
          N       N+    +   + QQN+     +H    SLE ++KE++A+ +A +QS   S+R LE Q+GQLA  + +R QG LPS+T  PRREG E    
Subjt:  GNNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHS-ENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKE----

Query:  ------RNNNVVVEE-----ELESGQGA--------GGSNKNAGASGFVPDI-----------EPPYVPPPPYAPPLPFPQRQKPKN------QDGGKEL
              +N +++V+      EL S Q            S+++ G SG    I           E   +    +      P++ K           G +  
Subjt:  ------RNNNVVVEE-----ELESGQGA--------GGSNKNAGASGFVPDI-----------EPPYVPPPPYAPPLPFPQRQKPKN------QDGGKEL

Query:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTM
        GRALCDLGA+INLM LSV+++L + E RPTTVTLQLA+RS  YPE KIEDVLVKVDKFIFPVDFI+LD+EAD++VPIILGRPFLA  + LIDVQK ELTM
Subjt:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTM

Query:  RVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTV------------------------------IEITIQDSANKHLE-----KFRSKKGSPIKPSLIEA
        R+ ++ V FNV +AMK  DE +DC+F+ +++  V                              IE   +  +N+H           ++     PS+   
Subjt:  RVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTV------------------------------IEITIQDSANKHLE-----KFRSKKGSPIKPSLIEA

Query:  PTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAI
        P L+LK LP HLK+ YLG++ TLPII+   L    E +L+ +L    K +
Subjt:  PTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAI

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]4.8e-11538.96Show/hide
Query:  QAENPILIANDRTRAIEHVA--------------------------------NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRD
        Q  +PI++ +DR RAI   A                                 VGQF  + +EDPHLHL+SFL +SDSF IQGV  +  RL LFP+SLRD
Subjt:  QAENPILIANDRTRAIEHVA--------------------------------NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRD

Query:  GAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL
         A+SW                                 SEI+ F QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+
Subjt:  GAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL

Query:  LAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV------------------------------IEENP----
        L+K+++EA+EILE I+ N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+++                                 NP    
Subjt:  LAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV------------------------------IEENP----

Query:  -------------YSNFYNPGWRNHPNFSWGGQG--NNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALEL
                     +SN YN  W+NHPN SWG +    +    ++   PGF++    PQ  + + P     +SLE++M+++MA+ DA IQS  A +R LEL
Subjt:  -------------YSNFYNPGWRNHPNFSWGGQG--NNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALEL

Query:  QVGQLANELKARPQGKLPSDTGHPRREGKERNNNVVV---------EEEL-----------------ESGQGAGGSNKNAGASGFVPDIEPPYVPPPPYA
        Q+G LANELKARPQG LPSDT +PRR+GKE+  ++ +         EEE+                 ++ Q    + +   A+G   + +          
Subjt:  QVGQLANELKARPQGKLPSDTGHPRREGKERNNNVVV---------EEEL-----------------ESGQGAGGSNKNAGASGFVPDIEPPYVPPPPYA

Query:  PPLPFPQRQKPKNQDGG-KELGRALCDLGASINL------------------------------------------------------MPLSV-YRKLGI
        PPLPFPQR + + QDG  K+    L  L  +I L                                                      +P+S+  R LGI
Subjt:  PPLPFPQRQKPKNQDGG-KELGRALCDLGASINL------------------------------------------------------MPLSV-YRKLGI

Query:  GEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDC
        GEARPTTVTLQLADRS+ +P+GKIEDVLV+VDKFIFP DFIILDYE DR+VPIIL RPFLATGR LIDV+KGELTMR  +E   F VF+ ++ PD + +C
Subjt:  GEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDC

Query:  SFIRILESTVIEITIQDSANKHLEKFRSK
          I  ++  ++E    +S  K  +K R K
Subjt:  SFIRILESTVIEITIQDSANKHLEKFRSK

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]1.3e-11735.98Show/hide
Query:  NPQQNPLLQQNPLFEQNEQRNNQAENPILIAND---RTRAIEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--
        NP+Q+    ++    Q     +    P + AN+   +   ++ +    QF+GL S+DP+ H+ SFL + D+F   GV  DA+RL LFP+SLRD AK+W  
Subjt:  NPQQNPLLQQNPLFEQNEQRNNQAENPILIAND---RTRAIEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--

Query:  -------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDE
                                        EI  F QLE E   E WER+K+LLR+CPHHGLP  +Q++ FYNGLN  T+ ++DA++GGA ++K+ D+
Subjt:  -------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDE

Query:  AYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVI----------------------------------------EENPYS
        AY +LE +++N+ QW + R   KK   V E+D ++ + A +  +   LK   +                                         + NPYS
Subjt:  AYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVI----------------------------------------EENPYS

Query:  NFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKL
          YNPGWRNHPNFSW    NN    +    P  ++      + ++A+ Q  +  S       FM  T+  +Q+  AS+R LE+QVGQLAN L  R QG L
Subjt:  NFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKL

Query:  PSDTGHPRREGKERNNNVVVEEELESGQGAGGSNKNAGASGFVPDIEP-------PYVPPP--PYAPPLPFPQRQK------------------------
        PS T       KE+   + +    E  Q AG  NK+AG       +EP         +P P       +PFPQR K                        
Subjt:  PSDTGHPRREGKERNNNVVVEEELESGQGAGGSNKNAGASGFVPDIEP-------PYVPPP--PYAPPLPFPQRQK------------------------

Query:  ------------------------------PKNQD----------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIED
                                      PK +D          G     +ALCDLGASINLMPLS+++KLG+GEARPTTVTLQLADRS+ +P G IED
Subjt:  ------------------------------PKNQD----------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIED

Query:  VLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIE--------------
        VLVKV KFIFP DFIILD E D ++PI+LGRPFLATG ALIDV+KGEL +RV  E V FNVFKA+K PD  E C  I++++S + E              
Subjt:  VLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIE--------------

Query:  -------------ITIQDSANKHLEKFRSKKGSP---IKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLA
                         DS   +  ++    G P   IK +  +AP L+L+PLP+HL++ YLGE+ T P+IV++ L    EE L+++L+++  A+GW LA
Subjt:  -------------ITIQDSANKHLEKFRSKKGSP---IKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLA

Query:  DIRELA
        DI+ ++
Subjt:  DIRELA

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase8.0e-10836.3Show/hide
Query:  IEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQL
        I  + N  QF GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W                                 +EI+ FRQ 
Subjt:  IEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQL

Query:  EDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRAD
          ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ + A 
Subjt:  EDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRAD

Query:  LAMIANALKNVTV-------------------------------------IEENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQN
        +  +  ++KN  V                                      + NPYSN YNPGWR HPNFSW    NN Q Q      G A       Q 
Subjt:  LAMIANALKNVTV-------------------------------------IEENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQN

Query:  KQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-GHPRREG-----------------------KERNNNVV
        +   P Q  + SLE  + +FMA       S  A+ + +E Q+GQLAN + +RP+  LPS+T  +PR++                        K +   V+
Subjt:  KQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-GHPRREG-----------------------KERNNNVV

Query:  VEE---ELESGQGAGGSNKNAGASGFVPDI-----------------EPPYVPPPPYAPPLPFPQRQKPKNQDGGKELGRALCDLGASINLMPLSVYRKL
         EE   E+E+        +      F+ DI                 E   +      P L  P         G    GRALCDLGASINLMP S+YR L
Subjt:  VEE---ELESGQGAGGSNKNAGASGFVPDI-----------------EPPYVPPPPYAPPLPFPQRQKPKNQDGGKELGRALCDLGASINLMPLSVYRKL

Query:  GIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEME
        G+GEA+PT++TLQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++ + FNVFKAMK+P+E +
Subjt:  GIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEME

Query:  DCSFIRI-----------------LESTVIEITIQDS-----------ANKHLEK------FRSKKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETL
        +C  + +                 LE  ++++  +++           A+K+ +        R+     +KPS+ E PTL+LKPLP HL + YLGES+TL
Subjt:  DCSFIRI-----------------LESTVIEITIQDS-----------ANKHLEK------FRSKKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETL

Query:  PIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIRELAHLF
        P+I++S L     E L+++L+ +  AIGWT+ADI+ ++  F
Subjt:  PIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIRELAHLF

A0A6A2WLX1 Reverse transcriptase1.1e-11236.44Show/hide
Query:  NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETF
        ++GQF G+ +ED   H+++FL V DSF  +GV  D L+L LFPYSLRD A++W                                 +EI  FRQ +DE+ 
Subjt:  NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETF

Query:  SEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMI
         E W+R+K LL+KC +HG     Q+  FYNG+N  T+ ++DASA G LL K+  EA+ IL+RI+ N  Q+   R G+ ++     E++   ++ A L++I
Subjt:  SEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMI

Query:  ANALKNV---TVIEE--------------------------------------NPYSNFYNPGWRNHPNFSWGGQG----NNVQAQQKMNQP-GFAKAQV
         N LKN+   T ++E                                      NPYSN YN GWR HPNFSW  QG    N    QQ  N+P G+  A  
Subjt:  ANALKNV---TVIEE--------------------------------------NPYSNFYNPGWRNHPNFSWGGQG----NNVQAQQKMNQP-GFAKAQV

Query:  MPQQNKQALPQQHSENSLEAMMKEFMART---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKERNNN
            NK+AL    S +SLEA ++EF++ T                      A IQS+ +S+RALE QVGQ+A  L+ R QG+LPSDT   +  GKE  N 
Subjt:  MPQQNKQALPQQHSENSLEAMMKEFMART---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKERNNN

Query:  VVVEEELESGQGAGGSNKNAGASGFVPD----IEPPYVPPPPYA-PPLPFPQRQK---------------------------------------------
        +     L SG      +K    +  VPD    ++  ++P    A PP PFPQR K                                             
Subjt:  VVVEEELESGQGAGGSNKNAGASGFVPD----IEPPYVPPPPYA-PPLPFPQRQK---------------------------------------------

Query:  --------------------PKNQD----------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF
                            PK  D          G   +G+ALCDLG+S+NL+P S++ KLGIG+ARPT+V LQLAD+S    EG++EDV+V+VDKF+F
Subjt:  --------------------PKNQD----------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF

Query:  PVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIEIT--------IQDSANKHL---EK
         VDF+ILD E D   PIILGRPFLATGR LID +KGELTMRV ++ V  NVF+ +KY D+ E+C  I  L S + E T        IQ + N++L   E 
Subjt:  PVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIEIT--------IQDSANKHL---EK

Query:  FRSKKGSPI---------------------------KPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADI
               PI                           KPSL+ AP L+LK LP HLK+VYLG  ETLP+I++++L    E++L+ +L Q+ KAIGWT+AD+
Subjt:  FRSKKGSPI---------------------------KPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADI

Query:  RELA
        + ++
Subjt:  RELA

A0A6A3B6J9 RT_RNaseH_2 domain-containing protein4.8e-11338.61Show/hide
Query:  NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ
        ++GQF G+ +E    H+++FL V DSF  QGV  D L+L  FPYSLRD A++W          +I  FRQ +DE+  E W+R+K LLRKC +HG     Q
Subjt:  NVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ

Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVIEENPYSNFYNPGWR
        +  FYNG+N  T+ M+D SA G LL K+  EA++IL+ I+ N  Q+   R G  +K     E+D    + A LA I N LKN+    + P          
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVIEENPYSNFYNPGWR

Query:  NHPNFSWGGQGNNVQAQQKMNQPGFAKAQ----VMPQQN-KQALPQQHSENSLEAMMKEFMART--------------DAAIQSNQASMRALELQVGQLA
        +HPNFSW  QG +  A Q   Q  F  +Q     MP  N  +  P   + +SLEA M+EF+A T               A +QS+  S+R LE QVGQ+ 
Subjt:  NHPNFSWGGQGNNVQAQQKMNQPGFAKAQ----VMPQQN-KQALPQQHSENSLEAMMKEFMART--------------DAAIQSNQASMRALELQVGQLA

Query:  NELKARPQGKLPSDTGHPRREGKER--------------------------------NNNVVVEE--------ELESGQGAGGSNKNAGASGFVPDIEPP
          L+ R    LPSDT   +  GKE+                                 +  V+EE        E  S +   G+NKNA  +         
Subjt:  NELKARPQGKLPSDTGHPRREGKER--------------------------------NNNVVVEE--------ELESGQGAGGSNKNAGASGFVPDIEPP

Query:  YVPPPPYAPPLPFPQRQKPKNQD----------------------------------------------------GGKELGRALCDLGASINLMPLSVYR
          PPP      PFPQR K  N+D                                                    G K +G+ALCDLG+S+NLMP S++ 
Subjt:  YVPPPPYAPPLPFPQRQKPKNQD----------------------------------------------------GGKELGRALCDLGASINLMPLSVYR

Query:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDE
        KLGIG  RPT+V LQLADRS   PEG+IED +VKVDKF+FP+DF+ILD E D  VPIILGRPFLATGR LID ++GELTMR+ ++ V  NVF+++KY D+
Subjt:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDE

Query:  MEDCSFIRILESTVIEIT-IQDSANKHLEKFRSKKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTL
         E+C  I        E++ ++ S   H E    ++  P KPSL  APTL+LK LP H K+ YLG +ETLP+I++S L P  E +L+ +L ++ KA+GWT+
Subjt:  MEDCSFIRILESTVIEIT-IQDSANKHLEKFRSKKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTL

Query:  ADIRELA
        AD++ ++
Subjt:  ADIRELA

A0A6J1EQ90 uncharacterized protein LOC1114364115.3e-12841.06Show/hide
Query:  QNPLFEQNEQRNNQAENPILIANDRTRAIEHVAN--------------------------------VGQFHGLSSEDPHLHLKSFLGV-------SDSFV
        +NP    N++R     NPI +A+DR RAI   A+                                +GQFHGL  EDPHLHLKSFLGV       SDSF 
Subjt:  QNPLFEQNEQRNNQAENPILIANDRTRAIEHVAN--------------------------------VGQFHGLSSEDPHLHLKSFLGV-------SDSFV

Query:  IQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFPY LRDGAKSW                                 +EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------------
        YNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +                   
Subjt:  YNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------------

Query:  -----------------------------------IEENPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKAQVMPQQNKQALP
                                           ++ NP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q       
Subjt:  -----------------------------------IEENPYSNFYNPGWRNHPNFSWGGQG-NNVQAQQKMNQP---------GFAKAQVMPQQNKQALP

Query:  QQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREG---KERNNNVVVEEELESGQGAGGSNKNAGASGFVPD
        Q  SE S+E+++KE+MA+ DA IQS QAS+R LE+Q+G   N  +     +  +DT     E    KE + +    EE    Q    S + +        
Subjt:  QQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREG---KERNNNVVVEEELESGQGAGGSNKNAGASGFVPD

Query:  IEPPYVPPPPYAPPLPFPQRQKPKNQD-------------------------------------------------------------------------
                  Y P  PFPQR K K ++                                                                         
Subjt:  IEPPYVPPPPYAPPLPFPQRQKPKNQD-------------------------------------------------------------------------

Query:  ------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRA
              GGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGKIED+L++VDKFIF  DFIILDYE D DVPIILGRPFL  GR 
Subjt:  ------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRA

Query:  LIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFI
        L+DV KG +T+R+  + V+FN+  +MKYP  +E+CS +
Subjt:  LIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFI

A0A6P8DD93 uncharacterized protein LOC1162064531.3e-10533.12Show/hide
Query:  AIEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQ
        A+  +    QF G  +E P  H+  FL   ++  +  V  D +RL LFP+SLRD A++W                                 +EI  F +
Subjt:  AIEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQ

Query:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA
           E+  EAWERFKE +RKCPHHGLP  + +E FY  L+   + +VDA+AGGAL+ K +DEA  ++E ++ ++  W + R +  +V SV ++D ++ +  
Subjt:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA

Query:  DLAMI----------------------------------------------ANALKNVTVIEENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGF
         ++ +                                               N + N     + PYSN YNPGWRNHPNFSW  + N ++       PGF
Subjt:  DLAMI----------------------------------------------ANALKNVTVIEENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGF

Query:  AKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKERNNNVVVEEELESGQGAGGS
         K     Q   Q  P Q S++ +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+T        E N   V    L SG+     
Subjt:  AKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTGHPRREGKERNNNVVVEEELESGQGAGGS

Query:  NKNAGASGFVPD-------IEPP---YVPPPPYAPPLPFPQRQKPKNQD---------------------------------------------------
        N+ A      P+       +E P    +   PY PP+PFP+R K +  D                                                   
Subjt:  NKNAGASGFVPD-------IEPP---YVPPPPYAPPLPFPQRQKPKNQD---------------------------------------------------

Query:  -------------------------------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF
                                       G       L D GASINLMPLS++RKLG+GE + T VTLQLADRSI YP+G +E+VLVKVDKFIFPVDF
Subjt:  -------------------------------GGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF

Query:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIE-------ITIQDSANKHLEKFRS-----
        I+L+ E DR+VP+ILGRPFLATG+ALIDV++G+LT+RV NE + FNV+ A+K  D+ + C  I I++  + E       +   +S  + L+ +       
Subjt:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILESTVIE-------ITIQDSANKHLEKFRS-----

Query:  ----KKGSPIK---------------PSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIRELAHLFQKQR
            +K S IK                SL ++P L+LKPLP HLK+ YLG  +TLPII++S L  + E+ L+ +L+++ +AIGWT+ADI+ ++ L    R
Subjt:  ----KKGSPIK---------------PSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIRELAHLFQKQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACGTGCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCG
AAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTGAGCATGTTGCAAACGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATT
TACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAG
TCATGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCA
TTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATG
AAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCT
GATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTAATCGAGGAAAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGG
AGGTCAAGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCACAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAACATTCGG
AGAATTCTCTCGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCT
AATGAGCTCAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGGACACCCTAGAAGGGAAGGTAAGGAGAGAAATAATAATGTTGTTGTCGAGGAAGAGTTGGAGTC
TGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTTTGTGCCAGATATAGAACCACCATATGTGCCGCCCCCACCTTATGCACCACCTCTACCTTTTC
CACAAAGGCAAAAGCCTAAGAATCAGGATGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTA
GGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCAT
ATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAG
GAGAATTAACAATGAGAGTCTGTAATGAGGTAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAGATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGC
ACAGTTATTGAGATAACAATACAGGATTCGGCTAATAAGCATTTGGAAAAATTTAGATCAAAGAAAGGCTCCCCAATTAAACCATCCCTGATTGAGGCACCCACTTTAGA
TTTGAAACCCTTGCCGGATCATCTAAAGTTTGTGTATCTTGGGGAAAGTGAGACATTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAA
AGTTGCTGCAGCAATACCACAAGGCTATTGGTTGGACATTGGCTGATATTAGGGAATTAGCCCATCTTTTTCAAAAACAGAGATCAAACCTCCCAGAAAATTCCCTTGAA
TTTCCCTTGCTCTCCCACAATCAATCAGCGATTATTCCCCGTGAGAATACTGGTGAAGCTCTTGTGGTCGTGTTCACAAAATCCAGCAAGAAGGATCCTTGGAGGTTGCT
GTTTTTTCTGCATTCCGTTGGAGAATTAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACGTGCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCG
AAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTGAGCATGTTGCAAACGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATT
TACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAG
TCATGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCA
TTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATG
AAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCT
GATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTAATCGAGGAAAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGG
AGGTCAAGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCACAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAACATTCGG
AGAATTCTCTCGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCT
AATGAGCTCAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGGACACCCTAGAAGGGAAGGTAAGGAGAGAAATAATAATGTTGTTGTCGAGGAAGAGTTGGAGTC
TGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTTTGTGCCAGATATAGAACCACCATATGTGCCGCCCCCACCTTATGCACCACCTCTACCTTTTC
CACAAAGGCAAAAGCCTAAGAATCAGGATGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTA
GGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCAT
ATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAG
GAGAATTAACAATGAGAGTCTGTAATGAGGTAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAGATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGC
ACAGTTATTGAGATAACAATACAGGATTCGGCTAATAAGCATTTGGAAAAATTTAGATCAAAGAAAGGCTCCCCAATTAAACCATCCCTGATTGAGGCACCCACTTTAGA
TTTGAAACCCTTGCCGGATCATCTAAAGTTTGTGTATCTTGGGGAAAGTGAGACATTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAA
AGTTGCTGCAGCAATACCACAAGGCTATTGGTTGGACATTGGCTGATATTAGGGAATTAGCCCATCTTTTTCAAAAACAGAGATCAAACCTCCCAGAAAATTCCCTTGAA
TTTCCCTTGCTCTCCCACAATCAATCAGCGATTATTCCCCGTGAGAATACTGGTGAAGCTCTTGTGGTCGTGTTCACAAAATCCAGCAAGAAGGATCCTTGGAGGTTGCT
GTTTTTTCTGCATTCCGTTGGAGAATTAGGCTAA
Protein sequenceShow/hide protein sequence
MENVPQLPQVHEGPAAANPQQNPLLQQNPLFEQNEQRNNQAENPILIANDRTRAIEHVANVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAK
SWSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA
DLAMIANALKNVTVIEENPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLA
NELKARPQGKLPSDTGHPRREGKERNNNVVVEEELESGQGAGGSNKNAGASGFVPDIEPPYVPPPPYAPPLPFPQRQKPKNQDGGKELGRALCDLGASINLMPLSVYRKL
GIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTMRVCNEVVKFNVFKAMKYPDEMEDCSFIRILES
TVIEITIQDSANKHLEKFRSKKGSPIKPSLIEAPTLDLKPLPDHLKFVYLGESETLPIIVASDLMPEHEEALIKLLQQYHKAIGWTLADIRELAHLFQKQRSNLPENSLE
FPLLSHNQSAIIPRENTGEALVVVFTKSSKKDPWRLLFFLHSVGELG