; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031767 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031767
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:14138624..14145269
RNA-Seq ExpressionLag0031767
SyntenyLag0031767
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.3e-20742.9Show/hide
Query:  DLEIERTFRNRRREQRRNQMENVPQLPQVHEGP--APANPQQNPLLQQNPLFEQNEQRN----------NQAENPIVITNDRTRAIRAYAVPMFNELNPG
        DLEIERTF+ RRR+      E+V Q  +V E      A+  +N  ++   L    E+++           + E P++     T      +    N     
Subjt:  DLEIERTFRNRRREQRRNQMENVPQLPQVHEGP--APANPQQNPLLQQNPLFEQNEQRN----------NQAENPIVITNDRTRAIRAYAVPMFNELNPG

Query:  IARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGL
          ++ D+   +GV +DALRL LF +SL   A  W  S    SI TW +L E F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +
Subjt:  IARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGL

Query:  PHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGAT-IRADLAMIANALKNVTVISHQQPPAIEP
        P  IQ+ TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD  T + A +  +  ++KN  V   Q  P    
Subjt:  PHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGAT-IRADLAMIANALKNVTVISHQQPPAIEP

Query:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSEN
                   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q  + 
Subjt:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSEN

Query:  SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIK-PSKTQVIDKNGDRNNNVVVEE
        SLE  + +FMA       S  A+ + +E Q+GQLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E +K P+K+        +   V+ EE
Subjt:  SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIK-PSKTQVIDKNGDRNNNVVVEE

Query:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKEL
                    K + A   V  ++      P YV           +   DIL+K +RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    
Subjt:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKEL

Query:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTT
        GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELT 
Subjt:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTT

Query:  RVCNEELKFNVFKAMKYPDEMEDCSFIRILE-----NTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKP
        RV ++++ FNVFKAMK+P+E ++C  + + +      +I E  +       L+   E + ED EV  +   +  + F+   V ESL   +R AP   +KP
Subjt:  RVCNEELKFNVFKAMKYPDEMEDCSFIRILE-----NTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKP

Query:  SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKK
        S+ E PTL+LKPLP HL Y +LGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+G   S+E QRRLNP MKEVVKK
Subjt:  SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKK

Query:  E---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQIT
        E                                                                           M+DRLAG+ +YCFLDGYSGY QI 
Subjt:  E---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQIT

Query:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
        I PEDQEKTTFTCPYGTF FR+MPFGLCNAPATFQRCM+AIF+DM+E+ +E+ M +   + DS
Subjt:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.6e-20842.2Show/hide
Query:  LDLEIERTFRNRRR--EQRRNQMENVPQLPQVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITND---RTRAIRAYAVPMFNELNP-------
        LDLEIE+T +  RR  ++ RN+   +            AN     L  ++ +    +  ++    P +  N+    +  ++A     FN L+        
Subjt:  LDLEIERTFRNRRR--EQRRNQMENVPQLPQVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITND---RTRAIRAYAVPMFNELNP-------

Query:  -GIARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHH
             + D+F + GVP + +RL LF  SL   A+ WL+S    SI TW++L E FL+KYF P +  +L  EI  F+Q + E+  EA+ERFK+LLRKCPHH
Subjt:  -GIARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHH

Query:  GLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVD-----GATIRADLAMIANALKNVTVISHQQ
        GL    ++ TFYNGL       VD +A G+L  +  ++A+EILE I+ N+C+  D R ++KK+  V EVD      A + +    +   L++++V   QQ
Subjt:  GLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVD-----GATIRADLAMIANALKNVTVISHQQ

Query:  PPAIEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQS---GFAKAQVM
        P  ++ A  V  +    C  CGE H  + CP      +  +SV +VG   NQ+NNP+SN YNPGWRNHPNFSW    NNV+      Q+   GF      
Subjt:  PPAIEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQS---GFAKAQVM

Query:  PQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIKPSKTQVI
         QQN +    +   N+ E ++ ++M +TDA IQS  ASMRALE+QVGQLA+ +  RP G LPS+TE +P+ + +E  KA+TLRSGK +E       T+ +
Subjt:  PQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIKPSKTQVI

Query:  DKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DI
        D  GD    +  E  + S         N  A  S P     +V PPP     PFPQ+                 K L I +                 DI
Subjt:  DKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DI

Query:  LTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVL
        L++ +RL EFETV+LTEECSAIL+  LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADRS+ YP G +EDVL
Subjt:  LTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVL

Query:  VKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILEN-----TIVETAIQDSANKHL
        VKVDKFIFP DFI+LD E D D+P++LGRPFLATGR LIDVQKGELT RV +E++ FNVF AMK+ ++ E C  +           ++E    D     L
Subjt:  VKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILEN-----TIVETAIQDSANKHL

Query:  ENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKA
           G+ S E++  C  +        R    FES ++   K+   KPS+ E P L+LK LP HLKY  LGE  TLP+I++S L  EHEE L+++L++Y++A
Subjt:  ENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKA

Query:  IGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE-----------------------------------------------------
        IGW +ADI+GISPSFCMHKI++E+    +IE QRRLNP MKEVVKKE                                                     
Subjt:  IGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE-----------------------------------------------------

Query:  ----------------------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICM
                              M+DRLAG+ +YCFLDGYSGY QI IAPEDQEKTTFTCP+GTFAFR++ FGLCNAP+TFQRCM+AIFSDMIE  VE+ M
Subjt:  ----------------------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICM

Query:  GESGQFADS
         +     DS
Subjt:  GESGQFADS

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]5.7e-20944.71Show/hide
Query:  QRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIA---RVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRN
        + NN    P ++   +T +++ Y +P  ++ N  IA    + D+F   GV  DA+RL LFP+SLRD AK+WLNS    S+ +W++LA+ FL+K+FPP + 
Subjt:  QRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIA---RVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRN

Query:  AKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKS
        AK+R EI  F QLE E   E WER+K+LLR+CPHHGLP  +Q++ FYNGLN  T+ ++DA++GGA ++K+ D+AY +LE +++N+ QW + R   KK   
Subjt:  AKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKS

Query:  VLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQVTDEACVYCGEDHNYEFCP-SNPAS----VFFVGN--QRNNPYSNFYNPGWRNHPNFS
        V E+D             A+  +T   H     ++   +        C +C  +H  E C   NP S      FV N  ++NNPYS  YNPGWRNHPNFS
Subjt:  VLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQVTDEACVYCGEDHNYEFCP-SNPAS----VFFVGN--QRNNPYSNFYNPGWRNHPNFS

Query:  WGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSENSLEAM---MKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG
        W             N     K   MP  ++    +   E ++  +      FM  T+  +Q+  AS+R LE+QVGQLAN L  R QG LPS TE      
Subjt:  WGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSENSLEAM---MKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG

Query:  KEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK----------------
        KEQ KA+TLRSGK +E+          +K+  R     + E ++          N+  S  +P+      P    +  +PFPQ+                
Subjt:  KEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK----------------

Query:  -KSLRIRMDILTKTKRLG--EFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS
         K L+I +      +++   E+ETV+LTEECSAIL+  LPPK KDPGSFTIP SIG     +ALCDLGASINLMPLS+++KLG+GEARPTTVTLQLADRS
Subjt:  -KSLRIRMDILTKTKRLG--EFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS

Query:  ITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETA--
        + +P G IEDVLVKV KFIFP DFIILD E D ++PI+LGRPFLATG ALIDV+KGEL  RV  EE+ FNVFKA+K PD  E C  I+++++ I E    
Subjt:  ITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETA--

Query:  IQDSANKHLENHGEVSVEDLEV----CSLD--RKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPE
          D     L N+  V  ED E+    C +D    N +  F         DL Q  A  IK +  +AP L+L+PLP+HL+Y +LGE+ T P+IV++ L   
Subjt:  IQDSANKHLENHGEVSVEDLEV----CSLD--RKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPE

Query:  HEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKE------VVK----------------------------------
         EE L+++L++++ A+GW LADI+GISPS CMHKI LE+    ++E QRRLNP MKE      VVK                                  
Subjt:  HEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKE------VVK----------------------------------

Query:  ----KEMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
             +M+DRLAG  YYCFLDGYSGY QI IAPEDQEKTTFTCPYGTF FRRMPFGLCNAPATFQRCM+AIFSDM+E  +EI M +   F  S
Subjt:  ----KEMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

XP_038976653.1 uncharacterized protein LOC120107448 [Phoenix dactylifera]5.0e-20544.97Show/hide
Query:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC
        + D+  + GV  DA+RL LFP+SL+D AK+WLNS AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWER K+L RKCPHHGLP  
Subjt:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC

Query:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATI---RAD-LAMIANALKNVTVISHQQPPAIEP
        + ++TFYNGL    +  +DA+AGG L++K+ +EAYE+LE I  N+ QWS  RG  KK+  + +VDG  +   + D L  +   L NV  IS    P +  
Subjt:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATI---RAD-LAMIANALKNVTVISHQQPPAIEP

Query:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHS
                  +C  CG  H    C      V FV N     Q+NNPYSN Y PGWRNHPNFSW  QGN   + + ++  GF   Q  P Q +     + +
Subjt:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHS

Query:  ENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEE
           L     E   R +A +    +S R +E+Q+GQLAN +  R Q  LPS TE      KE  KAVTLRSGK L +     +T V DK      N  V E
Subjt:  ENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEE

Query:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVS
        ++E                 +     P  P  PYVPP+PFPQ+                 + L I +                 +I++K ++L +FET++
Subjt:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVS

Query:  LTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFII
        LTEECSAI++N LPPK +DPGSF+I  +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT++LQLADRS+ YP G +E+VL+KV KFI PVDFII
Subjt:  LTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFII

Query:  LDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLE----NHGEVSVEDLEVCS
        L+ E D ++PIILGRPFLAT  A+IDV+ G LT +V  EE++ N+F+A KYP   +    + +++ +  E    ++  + LE    + G    ++LEV  
Subjt:  LDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLE----NHGEVSVEDLEVCS

Query:  LDRKNEKELFRCED---VFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGIS
        +    E   F+ +     FE  D+ + K PP  PS  +AP L+LKPLP HL+Y  LGE+ TLP+IV+  L  E  + LI++L+  +KAIGWT++D++GIS
Subjt:  LDRKNEKELFRCED---VFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGIS

Query:  PSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE-----------------------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTF
        PS CMH+I +E+     +E QRRLNP MKEVV+ E                       +++RLAG AYYCFLDGYSGY QI+I+PEDQEKTTFTCPYGTF
Subjt:  PSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE-----------------------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTF

Query:  AFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
        AFRRMPFGLCNAPATFQRCM+AIFSD +E  +E+ M +   F  S
Subjt:  AFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]3.4e-20643.81Show/hide
Query:  RVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPH
        ++ D+F  +G+  DA RL LFPY+L+  A+ W  S    SI TW+EL E FL ++FPP RNA LR+ I  F+Q++ E+  E WER+  LLRKCP H LP 
Subjt:  RVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPH

Query:  CIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQW--SDVRGTNKKLKSVLEV----DGATIRADLAMIANALKNVTVISHQQPPA
         +++ETFY GL+  T+ +VD SAGGALL K+++E+ EIL+RI+ N+ QW    V  +  K  +  EV        + A+LA I + +KN+   +  +   
Subjt:  CIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQW--SDVRGTNKKLKSVLEV----DGATIRADLAMIANALKNVTVISHQQPPA

Query:  IEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQGNNVQ------AQQKMNQSGF---AKAQVMPQ
        +           + C  C   H+Y  CP NP SVFF+G+Q    +NNPYS  YNPGWRNHPNFSWGGQ   +        Q+  N  GF      QV PQ
Subjt:  IEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQGNNVQ------AQQKMNQSGF---AKAQVMPQ

Query:  ----------QNKQALPQQHSENS-LEAMMKEFMARTD---AAIQSNQA-SMRALELQVGQLANELKARPQGKLPSDTEHPRR-EGKEQVKAVTLRSGKP
                    K A P  + E S  E  M EFM + D    A + NQA +MR LE Q+GQLA  L +R  G LPS+T++P     K Q  A+TLRSGK 
Subjt:  ----------QNKQALPQQHSENS-LEAMMKEFMARTD---AAIQSNQA-SMRALELQVGQLANELKARPQGKLPSDTEHPRR-EGKEQVKAVTLRSGKP

Query:  L------------EERIKPSKT--QVIDKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDV------DPPYVPPPPYVPP--LPFPQKKS------
        L            E+ +   +T  +  +   D    V +E+  +   G    NK  G + S P V         +      V P  LPFP ++S      
Subjt:  L------------EERIKPSKT--QVIDKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDV------DPPYVPPPPYVPP--LPFPQKKS------

Query:  -----------------------------LRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPL
                                      +   D+LTK +  GEFETV++T+ C++I++N LP K  DPGSF +P  I GK     LCDLGASINLMPL
Subjt:  -----------------------------LRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPL

Query:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMK
        S++R+LGIGEARPTT++LQLADRS+ YPEGKIED+++K+D    P DFIILDYEAD D  IILGRPFLAT  ALIDV+KGE+T RV +E+  FN+FKA+K
Subjt:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMK

Query:  YPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLG
         P  ME+CSF+R+++N +     Q S   H        +E++E   +     KE  R   V E LDL  R   P  PS+ +AP L+LK LP HLKY +L 
Subjt:  YPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLG

Query:  ESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEV--------------------------
        + +TLP+I++S L    E+ L+ +L++  KAIGWTLADI+GISP+ CMHKI LEEG+  S+E QRRLNP MK+V                          
Subjt:  ESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEV--------------------------

Query:  ---------VKK---------EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICM
                  +K         +M+DRLAG+++YCFLDGYSGY QI+IAPEDQ KTTFTCP+GTFAFRRMPFGLCNAPATFQRCM++IFSDM+E+ VE+ M
Subjt:  ---------VKK---------EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICM

Query:  GESGQFADS
         +     D+
Subjt:  GESGQFADS

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase2.6e-20742.9Show/hide
Query:  DLEIERTFRNRRREQRRNQMENVPQLPQVHEGP--APANPQQNPLLQQNPLFEQNEQRN----------NQAENPIVITNDRTRAIRAYAVPMFNELNPG
        DLEIERTF+ RRR+      E+V Q  +V E      A+  +N  ++   L    E+++           + E P++     T      +    N     
Subjt:  DLEIERTFRNRRREQRRNQMENVPQLPQVHEGP--APANPQQNPLLQQNPLFEQNEQRN----------NQAENPIVITNDRTRAIRAYAVPMFNELNPG

Query:  IARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGL
          ++ D+   +GV +DALRL LF +SL   A  W  S    SI TW +L E F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +
Subjt:  IARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGL

Query:  PHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGAT-IRADLAMIANALKNVTVISHQQPPAIEP
        P  IQ+ TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD  T + A +  +  ++KN  V   Q  P    
Subjt:  PHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGAT-IRADLAMIANALKNVTVISHQQPPAIEP

Query:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSEN
                   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q  + 
Subjt:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSEN

Query:  SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIK-PSKTQVIDKNGDRNNNVVVEE
        SLE  + +FMA       S  A+ + +E Q+GQLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E +K P+K+        +   V+ EE
Subjt:  SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIK-PSKTQVIDKNGDRNNNVVVEE

Query:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKEL
                    K + A   V  ++      P YV           +   DIL+K +RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    
Subjt:  DLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKEL

Query:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTT
        GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELT 
Subjt:  GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKGELTT

Query:  RVCNEELKFNVFKAMKYPDEMEDCSFIRILE-----NTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKP
        RV ++++ FNVFKAMK+P+E ++C  + + +      +I E  +       L+   E + ED EV  +   +  + F+   V ESL   +R AP   +KP
Subjt:  RVCNEELKFNVFKAMKYPDEMEDCSFIRILE-----NTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKP

Query:  SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKK
        S+ E PTL+LKPLP HL Y +LGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+G   S+E QRRLNP MKEVVKK
Subjt:  SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKK

Query:  E---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQIT
        E                                                                           M+DRLAG+ +YCFLDGYSGY QI 
Subjt:  E---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQIT

Query:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
        I PEDQEKTTFTCPYGTF FR+MPFGLCNAPATFQRCM+AIF+DM+E+ +E+ M +   + DS
Subjt:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

A0A6A2WLX1 Reverse transcriptase6.1e-19344.26Show/hide
Query:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC
        V DSF  +GV  D L+L LFPYSLRD A++WL+    GS+ +W +L ++FL +Y PPN N +LR+EI  FRQ +DE+  E W+R+K LL+KC +HG    
Subjt:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC

Query:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKLKSVLEVDGA-TIRADLAMIANALKNVTVISHQQPPAIEPAA
         Q+  FYNG+N  T+ ++DASA G LL K+  EA+ IL+RI+ N  Q+   R G+ ++     E++   ++ A L++I N LKN+   +  +        
Subjt:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKLKSVLEVDGA-TIRADLAMIANALKNVTVISHQQPPAIEPAA

Query:  VVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQG----NNVQAQQKMNQ-SGFAKAQVMPQQNKQALPQQH
           + T  AC+ C  +H+   CP+N  S+ FVGN     NNPYSN YN GWR HPNFSW  QG    N    QQ  N+  G+  A      NK+AL    
Subjt:  VVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQG----NNVQAQQKMNQ-SGFAKAQVMPQQNKQALPQQH

Query:  SENSLEAMMKEFMART---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERI
        S +SLEA ++EF++ T                      A IQS+ +S+RALE QVGQ+A  L+ R QG+LPSDTE  +  GKE    +TLRSG  +  + 
Subjt:  SENSLEAMMKEFMART---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERI

Query:  KPSK-TQVIDKNGDRNNNVV-----VEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPP--PYVPPLPFPQKKSLRIRMDILTKTKRLGEFETV-SLTE
        K     +V D +     N +             Q     N  V     V  +D  ++  P    V  +P   K       DI TK +++   ETV + TE
Subjt:  KPSK-TQVIDKNGDRNNNVV-----VEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPP--PYVPPLPFPQKKSLRIRMDILTKTKRLGEFETV-SLTE

Query:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDY
         CS+  K  L PK  DPGSF IP SIG   +G+ALCDLG+S+NL+P S++ KLGIG+ARPT+V LQLAD+S    EG++EDV+V+VDKF+F VDF+ILD 
Subjt:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDY

Query:  EADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETA--------IQDSANKHLENHGEVSVEDLEVC
        E D   PIILGRPFLATGR LID +KGELT RV ++ +  NVF+ +KY D+ E+C  I  L + I E          IQ + N++L +  E  VE  +  
Subjt:  EADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETA--------IQDSANKHLENHGEVSVEDLEVC

Query:  SLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPS
         L+ ++     R +  FE L+ D+  +P  KPSL  AP L+LK LP HLKYV+LG  ETLP+I++++L    E++L+ +L Q++KAIGWT+AD++GISP+
Subjt:  SLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPS

Query:  FCMHKITLEEGSLRSIEQQRRLNPAMKEV-----------------------------------VKK---------EMVDRLAGQAYYCFLDGYSGYKQI
         CMHKI LE+    SIE QRRLNP MK+V                                    KK         +M+DRLAG+A+YCFLDGYSGY QI
Subjt:  FCMHKITLEEGSLRSIEQQRRLNPAMKEV-----------------------------------VKK---------EMVDRLAGQAYYCFLDGYSGYKQI

Query:  TIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADSIIL
         IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM AIFS M+E  +EI M +     D   L
Subjt:  TIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADSIIL

A0A6P6XAQ1 Reverse transcriptase1.0e-19241.89Show/hide
Query:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC
        + D+    GV  DA++L LFP+SLRD AK WL S  P +  TWDELA+ FL+K+FPP + AKLR +I  F Q E ET  EAWER++EL R+CPHHGLP  
Subjt:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC

Query:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVV
        + ++TFYNGL   T+  VDA+AGGAL+ KT +EA +++E ++ N+ QW++ RG +++   +LEVD       L M++  + NV  + ++Q  +     VV
Subjt:  IQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVV

Query:  NQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQ--------NKQALPQ
              +C  CG DH+ +F  S+   V ++ N     +NNPYSN YNPGWRNHPNF W  QGN    Q+ +N  GF + Q + +          K A   
Subjt:  NQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQ--------NKQALPQ

Query:  QHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNV
              L +   +   R +  +       R +E+Q+GQ+AN +  R QG LPS TE +PR    E VKA+TLRSGK L E       +  +K      N 
Subjt:  QHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNV

Query:  VVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIG
         + E  E  +   G  K       + D  P  +PPP  +P          +   +I+TK ++L + ET++LTEECSAI++N LPPK KDPGSFT+P +IG
Subjt:  VVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIG

Query:  GKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKG
          E  +ALCDLGAS++L+PL+V R+LG+ E + T ++LQLADRSI +P G +E+VL+KV KFI PVDF++LD E D +VPIILGRPFLAT   +IDV++G
Subjt:  GKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFLATGRALIDVQKG

Query:  ELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLE-NHGEVSVEDLEVCSLDRKNEKEL-FRCEDVFESLDLDQRKAPPIKPS
        +   ++  EE++F++ K  KYP   +    + I +   +E +  +  N  LE     + +++ ++  +    + ++ ++  + +E+L L +   P   PS
Subjt:  ELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLE-NHGEVSVEDLEVCSLDRKNEKEL-FRCEDVFESLDLDQRKAPPIKPS

Query:  LTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE
          +AP L+LKPLP HLKY  LGE ETLP+IV S L  E  + L+++L+++ KAIGWT++DI+GISP+ CMH+I LEE S   +E QRRLNP MKEVV+ E
Subjt:  LTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE

Query:  ---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQITI
                                                                                   MV+RLAG  +YCFLDG+SGY QI I
Subjt:  ---------------------------------------------------------------------------MVDRLAGQAYYCFLDGYSGYKQITI

Query:  APEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
        APEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIFSD  E  +EI M +   F  +
Subjt:  APEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

A0A6P8DD93 uncharacterized protein LOC1162064534.7e-19339.4Show/hide
Query:  LDLEIERTFRNRRREQRRNQMENVPQLP------QVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIARVS
        LD EIERT    RRE RR +   V ++       Q+            P +  + +       NN    P +I   ++     Y     +E   G  +  
Subjt:  LDLEIERTFRNRRREQRRNQMENVPQLP------QVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIARVS

Query:  DSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ
        ++  +  V  D +RL LFP+SLRD A++W NS    SI TW +L+  FL ++FPP R A+LR+EI  F +   E+  EAWERFKE +RKCPHHGLP  + 
Subjt:  DSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ

Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQ
        +E FY  L+   + +VDA+AGGAL+ K +DEA  ++E ++ ++  W + R +  ++ SV ++D          IAN    ++ ++  Q   +  A   N 
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQ

Query:  VTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSE
             C  C   H+   C S NP++      V FV N QR+N  PYSN YNPGWRNHPNFSW  + N ++      + G       P QN    P Q S+
Subjt:  VTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSE

Query:  NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEED
        + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    E  + V A+ LRSGK LE   + ++TQ      D+    V E  
Subjt:  NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEED

Query:  LESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVSL
                   K++G                PYVPP+PFP++                 K L+I +                 D+LTK ++    E V L
Subjt:  LESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVSL

Query:  TEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF
        T ECS IL+     LP K +D GSFT+P +IG       L D GASINLMPLS++RKLG+GE + T VTLQLADRSI YP+G +E+VLVKVDKFIFPVDF
Subjt:  TEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF

Query:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLD
        I+L+ E DR+VP+ILGRPFLATG+ALIDV++G+LT RV NE++ FNV+ A+K  D+ + C  I I++  I E+  + +    +E+     + DL+  S D
Subjt:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLD

Query:  RKNEKELFRCEDVFESLDLDQRKAPPIKP--SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF
         ++E+E        ++   ++      KP  SLT++P L+LKPLP HLKY +LG  +TLPII++S L  + E+ L+ +L+++++AIGWT+ADI+GISP  
Subjt:  RKNEKELFRCEDVFESLDLDQRKAPPIKP--SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF

Query:  CMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE--------------------------------------------------------------------
        C H+I LE      ++ QRRLNP +KEVVKKE                                                                    
Subjt:  CMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE--------------------------------------------------------------------

Query:  -------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
               M+++LAG  YYCFLDGYSGY QI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM++IFSDM+E+ +EI M +   F  S
Subjt:  -------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

A0A6P8DKJ2 uncharacterized protein LOC1162042311.4e-19239.31Show/hide
Query:  LDLEIERTFRNRRREQRRNQMENVPQLP------QVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIARVS
        LD EIERT    RRE RR +   V ++       Q+            P +  + +       NN    P +I   ++     Y     +E   G  +  
Subjt:  LDLEIERTFRNRRREQRRNQMENVPQLP------QVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIARVS

Query:  DSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ
        ++  +  V  D +RL LFP+SLRD A++W NS    SI TW +L+  FL ++FPP R A+LR+EI  F +   E+  EAWERFKE +RKCPHHGLP  + 
Subjt:  DSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQ

Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQ
        +E FY  L+   + +VDA+AGGAL+ K +DEA  ++E ++ ++  W + R +  ++ SV ++D          IAN    ++ ++  Q   +  A   N 
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQ

Query:  VTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSE
             C  C   H+   C S NP++      V FV N QR+N  PYSN YNPGWRNHPNFSW  + N ++      + G       P QN    P Q S+
Subjt:  VTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSE

Query:  NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEED
        + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    E  + V A+ LRSGK LE   + ++TQ      D+    V E  
Subjt:  NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEED

Query:  LESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVSL
                   K++G                PYVPP+PFP +                 K L+I +                 D+LTK ++    E V L
Subjt:  LESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQK-----------------KSLRIRM-----------------DILTKTKRLGEFETVSL

Query:  TEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF
        T ECS IL+     LP K +D GSFT+P +IG       L D GASINLMPLS++RKLG+GE + T +TLQLADRSI YP+G +E+VLVKVDKFIFPVDF
Subjt:  TEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF

Query:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLD
        I+L+ E DR+VP+ILGRPFLATG+ALIDV++G+LT RV NE++ FNV+ A+K  D+ + C  I I++  I E+  + +    +E+     + DL+  S D
Subjt:  IILDYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLD

Query:  RKNEKELFRCEDVFESLDLDQRKAPPIKP--SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF
         ++E+E        ++   ++      KP  SLT++P L+LKPLP HLKY +LG  +TLPII++S L  + E+ L+ +L+++++AIGWT+ADI+GISP  
Subjt:  RKNEKELFRCEDVFESLDLDQRKAPPIKP--SLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF

Query:  CMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE--------------------------------------------------------------------
        C H+I LE      ++ QRRLNP +KEVVKKE                                                                    
Subjt:  CMHKITLEEGSLRSIEQQRRLNPAMKEVVKKE--------------------------------------------------------------------

Query:  -------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS
               M+++LAG  YYCFLDGYSGY QI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM++IFSDM+E+ +EI M +   F  S
Subjt:  -------MVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.8e-0935.96Show/hide
Query:  EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADSI
        E++ +L    Y+  +D   G+ QI + PE   KT F+  +G + + RMPFGL NAPATFQRCM  I   ++     + + +   F+ S+
Subjt:  EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADSI

P09523 Enzymatic polyprotein3.4e-0731.78Show/hide
Query:  QGISPSFCM-HKITLEEGSLRSIEQQRRLNPAM-----KEVVKKEMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP
        Q +SP+F + ++     G  R +   + +N A           +E++  L G++ +   D  SG+ Q+ +  E Q+ T FTCP G F ++ +PFGL  AP
Subjt:  QGISPSFCM-HKITLEEGSLRSIEQQRRLNPAM-----KEVVKKEMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP

Query:  ATFQRCM
        + FQR M
Subjt:  ATFQRCM

P31843 RNA-directed DNA polymerase homolog1.4e-0840.74Show/hide
Query:  EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGE
        ++ DRLA   ++  LD  SGY Q+ IA  D+ KTT    YG+F FR MPFGL NA ATF   M  +  + ++  V + + +
Subjt:  EMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.8e-0927.49Show/hide
Query:  ENTIVETAIQDSANKHLENHGEVS-----VEDLEVCSLDRKNEKELFRCEDVFESLD--LDQRKAPPIKPSLTEAPTLDLKPLPDHLKY-VHLGESETLP
        ++T  +  I + A+  LE  G+ S     ++ +E  + D  N       +D F +L   L Q+    I+  L   P  D+  +P  +K+ + +     LP
Subjt:  ENTIVETAIQDSANKHLENHGEVS-----VEDLEVCSLDRKNEKELFRCEDVFESLD--LDQRKAPPIKPSLTEAPTLDLKPLPDHLKY-VHLGESETLP

Query:  IIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF--CMHKITL---EEGSLRSIEQQRRLNPA-MKEVVKKEMVD----RLAGQAYYCFLDGYS
         +    +  ++E+ + K++Q+        L D + I PS   C   + L   ++G+ R     R LN A + +      +D    R+     +  LD +S
Subjt:  IIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF--CMHKITL---EEGSLRSIEQQRRLNPA-MKEVVKKEMVD----RLAGQAYYCFLDGYS

Query:  GYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDM
        GY QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+
Subjt:  GYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDM

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.8e-0927.49Show/hide
Query:  ENTIVETAIQDSANKHLENHGEVS-----VEDLEVCSLDRKNEKELFRCEDVFESLD--LDQRKAPPIKPSLTEAPTLDLKPLPDHLKY-VHLGESETLP
        ++T  +  I + A+  LE  G+ S     ++ +E  + D  N       +D F +L   L Q+    I+  L   P  D+  +P  +K+ + +     LP
Subjt:  ENTIVETAIQDSANKHLENHGEVS-----VEDLEVCSLDRKNEKELFRCEDVFESLD--LDQRKAPPIKPSLTEAPTLDLKPLPDHLKY-VHLGESETLP

Query:  IIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF--CMHKITL---EEGSLRSIEQQRRLNPA-MKEVVKKEMVD----RLAGQAYYCFLDGYS
         +    +  ++E+ + K++Q+        L D + I PS   C   + L   ++G+ R     R LN A + +      +D    R+     +  LD +S
Subjt:  IIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSF--CMHKITL---EEGSLRSIEQQRRLNPA-MKEVVKKEMVD----RLAGQAYYCFLDGYS

Query:  GYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDM
        GY QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+
Subjt:  GYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDM

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCACCTGGGGTACGGTTTGAGCTTGATCTAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
TCCACAGGTTCATGAAGGTCCAGCACCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATC
CTATCGTGATAACGAACGATAGGACCAGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGAGTTAGTGATTCTTTTGTAATTCAA
GGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGATCAATTAGGACGTGGGATGA
GTTAGCTGAAAACTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTT
GGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTATAATGGTTTAAATGGAGTAACCCAAGGTATGGTC
GATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAA
TAAAAAGCTTAAGAGTGTATTAGAGGTTGATGGTGCAACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCAC
CAGCTATCGAGCCTGCAGCGGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCGTCTGTGTTT
TTCGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAACA
GAAGATGAACCAATCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAACAACATTCAGAGAATTCTCTCGAGGCAATGATGAAAGAAT
TTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAAGGCAAGGCCTCAAGGGAAA
CTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTTACTCTTAGGAGTGGTAAGCCATTAGAAGAAAGAATTAAGCCTAGTAAAACCCA
GGTTATAGATAAAAATGGTGATAGAAATAATAATGTTGTTGTCGAGGAAGATTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGTTGGAGCATCTGGTTCTG
TGCCAGATGTAGATCCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAAGAAAAGCCTAAGAATCAGGATGGATATTTTGACTAAAACGAAA
AGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTC
TATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCA
CAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTCATTATTTTA
GATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAACGAGAGTTTGTAA
TGAGGAACTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGG
ATTCAGCTAATAAGCATTTGGAAAATCATGGAGAGGTTAGTGTAGAGGATTTAGAAGTTTGTTCTTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGTGAGGATGTT
TTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAACCATCCCTGACTGAGGCACCTACTTTAGATTTGAAGCCCTTACCAGATCATCTAAAGTATGTGCA
TCTTGGGGAAAGTGAGACATTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAGTTGCTGCAGCAATACCGCAAGGCTATTGGTTGGA
CATTGGCTGACATTCAGGGGATTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGAGGAGGGGTCCTTGAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATG
AAAGAAGTGGTTAAAAAGGAGATGGTTGATCGATTGGCTGGTCAGGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAAGCAGATTACTATTGCTCCTGAAGATCA
GGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGCGAATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTAGCAATTTTTT
CTGATATGATTGAGTCCACTGTTGAGATCTGTATGGGTGAGAGTGGCCAGTTCGCCGACTCAATAATCCTACCATTTCAGGGGCAAGACCGAATGGGGGCTGGGAACATA
ACAGTACAAGATGGAATTCACTCCTTCTCCTATTGGGATTGTTCATTAGAGGAATCAGTGGTACTTAAGGATCCAGAGGTAACCCAGGGAATGACAGAGAGAGAAACTCT
CTCTCAACCCTCTCACTCTCTCAACCAAAAATCACAGCAAGGTAATTCCCTTCCCAAGAAGATTGAGATCCCACAATCTCTGCCTCATGAATCCAAGAGATATAGCGGGT
GTTCGCTATTGGTGGTGTTCGTTGGAGAGATCGTTGACGAAGTGTTCGTTGAGGAGCATAGTGCTGTGAAGAACGGTCTTCAACAACGTCGCGACCCTGCTCAACGGCGT
CGCAACGCTGAAGAACAAGCCGCTGGAAAAATCAGGCCTGCGTCGGGACGCTTTGCCTACAGCCACTCGACGCTGGTGCGTGACTGGATGGTTTGCGGTCCGGTTCGCCC
TGGTCGGTTCGGTTTGAAGCCATGCTGGTCCGGTTCAAGGTATTGTTGGACCGGTTCTTGGAGTATTTGGTTCGGGTTGAGCAATTTGGGTCCGGTTTGGGCTATTCTGA
CACGGTTCGGGCTGGTTCTGAGCGGTTCGAAGCGGTTCAGACCAATTACAAGGCGGTTTAGGGCTAGATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCACCTGGGGTACGGTTTGAGCTTGATCTAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
TCCACAGGTTCATGAAGGTCCAGCACCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATC
CTATCGTGATAACGAACGATAGGACCAGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGAGTTAGTGATTCTTTTGTAATTCAA
GGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGATCAATTAGGACGTGGGATGA
GTTAGCTGAAAACTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTT
GGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTATAATGGTTTAAATGGAGTAACCCAAGGTATGGTC
GATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAA
TAAAAAGCTTAAGAGTGTATTAGAGGTTGATGGTGCAACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCAC
CAGCTATCGAGCCTGCAGCGGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCGTCTGTGTTT
TTCGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAACA
GAAGATGAACCAATCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAACAACATTCAGAGAATTCTCTCGAGGCAATGATGAAAGAAT
TTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAAGGCAAGGCCTCAAGGGAAA
CTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTTACTCTTAGGAGTGGTAAGCCATTAGAAGAAAGAATTAAGCCTAGTAAAACCCA
GGTTATAGATAAAAATGGTGATAGAAATAATAATGTTGTTGTCGAGGAAGATTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGTTGGAGCATCTGGTTCTG
TGCCAGATGTAGATCCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAAGAAAAGCCTAAGAATCAGGATGGATATTTTGACTAAAACGAAA
AGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTC
TATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCA
CAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTCATTATTTTA
GATTATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAACGAGAGTTTGTAA
TGAGGAACTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGG
ATTCAGCTAATAAGCATTTGGAAAATCATGGAGAGGTTAGTGTAGAGGATTTAGAAGTTTGTTCTTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGTGAGGATGTT
TTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAACCATCCCTGACTGAGGCACCTACTTTAGATTTGAAGCCCTTACCAGATCATCTAAAGTATGTGCA
TCTTGGGGAAAGTGAGACATTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAGTTGCTGCAGCAATACCGCAAGGCTATTGGTTGGA
CATTGGCTGACATTCAGGGGATTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGAGGAGGGGTCCTTGAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATG
AAAGAAGTGGTTAAAAAGGAGATGGTTGATCGATTGGCTGGTCAGGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAAGCAGATTACTATTGCTCCTGAAGATCA
GGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGCGAATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTAGCAATTTTTT
CTGATATGATTGAGTCCACTGTTGAGATCTGTATGGGTGAGAGTGGCCAGTTCGCCGACTCAATAATCCTACCATTTCAGGGGCAAGACCGAATGGGGGCTGGGAACATA
ACAGTACAAGATGGAATTCACTCCTTCTCCTATTGGGATTGTTCATTAGAGGAATCAGTGGTACTTAAGGATCCAGAGGTAACCCAGGGAATGACAGAGAGAGAAACTCT
CTCTCAACCCTCTCACTCTCTCAACCAAAAATCACAGCAAGGTAATTCCCTTCCCAAGAAGATTGAGATCCCACAATCTCTGCCTCATGAATCCAAGAGATATAGCGGGT
GTTCGCTATTGGTGGTGTTCGTTGGAGAGATCGTTGACGAAGTGTTCGTTGAGGAGCATAGTGCTGTGAAGAACGGTCTTCAACAACGTCGCGACCCTGCTCAACGGCGT
CGCAACGCTGAAGAACAAGCCGCTGGAAAAATCAGGCCTGCGTCGGGACGCTTTGCCTACAGCCACTCGACGCTGGTGCGTGACTGGATGGTTTGCGGTCCGGTTCGCCC
TGGTCGGTTCGGTTTGAAGCCATGCTGGTCCGGTTCAAGGTATTGTTGGACCGGTTCTTGGAGTATTTGGTTCGGGTTGAGCAATTTGGGTCCGGTTTGGGCTATTCTGA
CACGGTTCGGGCTGGTTCTGAGCGGTTCGAAGCGGTTCAGACCAATTACAAGGCGGTTTAGGGCTAGATTTTGA
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDLEIERTFRNRRREQRRNQMENVPQLPQVHEGPAPANPQQNPLLQQNPLFEQNEQRNNQAENPIVITNDRTRAIRAYAVPMFNELNPGIARVSDSFVIQ
GVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMV
DASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKLKSVLEVDGATIRADLAMIANALKNVTVISHQQPPAIEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVF
FVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQSGFAKAQVMPQQNKQALPQQHSENSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK
LPSDTEHPRREGKEQVKAVTLRSGKPLEERIKPSKTQVIDKNGDRNNNVVVEEDLESGQGAGGSNKNVGASGSVPDVDPPYVPPPPYVPPLPFPQKKSLRIRMDILTKTK
RLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
DYEADRDVPIILGRPFLATGRALIDVQKGELTTRVCNEELKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSANKHLENHGEVSVEDLEVCSLDRKNEKELFRCEDV
FESLDLDQRKAPPIKPSLTEAPTLDLKPLPDHLKYVHLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSLRSIEQQRRLNPAM
KEVVKKEMVDRLAGQAYYCFLDGYSGYKQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEICMGESGQFADSIILPFQGQDRMGAGNI
TVQDGIHSFSYWDCSLEESVVLKDPEVTQGMTERETLSQPSHSLNQKSQQGNSLPKKIEIPQSLPHESKRYSGCSLLVVFVGEIVDEVFVEEHSAVKNGLQQRRDPAQRR
RNAEEQAAGKIRPASGRFAYSHSTLVRDWMVCGPVRPGRFGLKPCWSGSRYCWTGSWSIWFGLSNLGPVWAILTRFGLVLSGSKRFRPITRRFRARF