; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008482 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008482
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:23099512..23104044
RNA-Seq ExpressionLag0008482
SyntenyLag0008482
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]1.0e-22945.95Show/hide
Query:  IANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKPS-------------------------FLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLN
        +A +  RA+  +  P+       I RP I A NFEIKP+                         FL + D+    GV  DA+RL LFP+SL D AK WL 
Subjt:  IANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKPS-------------------------FLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLN

Query:  SFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDE
        S    SIRTWD+L +K L+K+FPP + AK   +I+ F Q + E   EAWERFK+LLRKCPHH LP  IQ++TFYNGL+  ++ +VDA+A GAL+AKT  E
Subjt:  SFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDE

Query:  AYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS-----
        A+E+LE ++ N+ QW   R  N K   VLEVD ++ + A    I+N  K V  +S            +N  T+  C  C   H    C + NP +     
Subjt:  AYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS-----

Query:  --VFFVGNQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQV
          V  +  QRNNPYSN YNPGW NHPNFSW    +N Q  Q+   PGF      P Q K+ +  +++   L     +F   T    Q  QAS++ L++QV
Subjt:  --VFFVGNQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQV

Query:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPY
        GQLAN +  R QG  PS  E +P+ +  EQ KA+TLR GK +   I+  K  +  +   K    E    F   +  + K+     S+P    P +   PY
Subjt:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPY

Query:  VPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP--------
        VP +PFPQ  +    +GQF KFLE+ ++L INIP  EA+EQMP+YAKF++DIL+K ++ GE E + LTEECSAIL+  LP K KD GSF IP        
Subjt:  VPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP--------

Query:  ---------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTM
                             +GIGE  PTTV+ Q+ADRSITYP+G IEDVLVKVD  IFP DF+++D E D D  +ILGRPFL T R LIDV++G LT+
Subjt:  ---------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTM

Query:  RVCNEEVKFNVFKAMKYPDEMEDCSLIRI-----------------LEITVIETAIQDSADKHSEKHGESLDLDQ-------------RKAPP-IKPSLI
        RV NE+  F VF+A+K+P E EDC  I +                 LE T++  A     +    ++   LD  Q               APP   PS+I
Subjt:  RVCNEEVKFNVFKAMKYPDEMEDCSLIRI-----------------LEITVIETAIQDSADKHSEKHGESLDLDQ-------------RKAPP-IKPSLI

Query:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--
         APTL LKPLP HL+Y YLG SETLP+I+A++L    EE ++++L++++ AI WT+ADI+GISPS CM +I +EE     +E QRRLNP +KEVV+ E  
Subjt:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--

Query:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP
                                 KGG+TVV N++NEL+PTRTVTGWRVC DYR+LN ATRKDHFPLPFIDQML+RL G AYYCFLDGYSGYNQI IAP
Subjt:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        EDQEKTTFTCP+GTFA+RRMPFGLCNAPATFQRCM  I
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]9.8e-23646.44Show/hide
Query:  RTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFAPR
        R   ++ Y++P F+ ++  IARP I A NF +                          +FL + D+F + GVP + +RL LF  SL   A+ WL+S    
Subjt:  RTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFAPR

Query:  SIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEIL
        SI TW++LVEK L+KYF P +  +L  EI  F+Q + E+  EA+ERFK+LLRKCPHHGL    ++ TFYNGL       VD +A G+L  +  ++A+EIL
Subjt:  SIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEIL

Query:  ERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANAL-KNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFV
        E I+ N+C+  D R ++KKV  V EVD +++  A +V    AL K +  +S  +   ++ A  V  +    C  CGE H  + CP      +  +SV +V
Subjt:  ERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANAL-KNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFV

Query:  G---NQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQ---PGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQV
        G   NQ+NNP+SN YNPGW NHPNFSW    NNV+      Q   PGF       QQN +    +   N+ D ++ ++  +TDA IQ   ASMRAL++QV
Subjt:  G---NQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQ---PGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQV

Query:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPY
        GQLA+ +  RP G LPS+TE +P+ + +E  KA+TLRSGK +E       T+ +D   D   V+ +E         SN  A AS           P    
Subjt:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPY

Query:  VPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP--------
         PP PFPQ  + + Q+ QF+KF+++ K+L INIP  EA+EQM +Y KF++DIL++ +RL EFETV+LTEECSAIL+  LP K KDPGSFTIP        
Subjt:  VPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP--------

Query:  ---------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTM
                             LG+GE  PT+V  QLADRS+ YP G +EDVLVKVDKFIFP DFI++D E D D+P++LGRPFLATGR LIDVQKGELTM
Subjt:  ---------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTM

Query:  RVCNEEVKFNVFKAMKYPDEMEDC----------SLIRILE---ITVIETAIQDSADKHSEKHGE------SLDLDQR------------KAPPIKPSLI
        RV +E+V FNVF AMK+ ++ E C           L  +LE      +E +++++ D+ +E+  E      +L   +R            K+   KPS+ 
Subjt:  RVCNEEVKFNVFKAMKYPDEMEDC----------SLIRILE---ITVIETAIQDSADKHSEKHGE------SLDLDQR------------KAPPIKPSLI

Query:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--
        E P L+LK LP HLKY +LGE  TLP+I++S L  EHEE L+++L++Y++AI W +ADI+GISPSFCM KI++E+     IE QRRLNP +KEVVKKE  
Subjt:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--

Query:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP
                                 KGG+TVV+N+ NELIPTRTVTGWRVC DYR+LNKATRKDHFPLPFIDQMLDRL G+ +YCFLDGYSGY+QI IAP
Subjt:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        EDQEKTTFTCP+GTFAFR++ FGLCNAP+TFQRCM  I
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]1.2e-22844.7Show/hide
Query:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA
        N   R +  Y VP  N   P I RP + A NFEIKP                        +FL + D+  + GV  DA+RL LFP+SL D AK+WLNS A
Subjt:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA

Query:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE
        P S  TW+ L +  LSKYFPP + AKL ++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+A G L++K+ +EAYE
Subjt:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE

Query:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN
        +LE ++ N+ QWS+ R   KKV  + +VDG++ + A    LV +   L NV  +S    P +            +C  CG  H    C      V FV N
Subjt:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL
             Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF   Q  P Q +     + +   L     E   R +A +    +S R +++Q+GQL
Subjt:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL

Query:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV
        AN + +R QG LPS TE      KE  KAVTLRSGK L +                   V  E   G  V     N   S  V ++     P  P  PYV
Subjt:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV

Query:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------
        PP+PFPQ  K    + QF+KFL++ +QLHINIP  +A+ Q+P Y KFL++I++K ++L +FET++LTEECSAI++N LP K +DPGSF+IP         
Subjt:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------

Query:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR
                            LG+ E  PTT++ QLADRS+ YP G +E+VL+KV KFI P DFI+++ E D ++PIILGRPFLAT  A+IDV+ G LT++
Subjt:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR

Query:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI
        V  EEV+FN+F+A KYP                          + +E C      S    LEI  +  A++ +  K  ++     D+ + K PP  PS +
Subjt:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI

Query:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--
        +AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAI WT++D++GISPS CM +I +E+    I+E QRRLNP +KEVV+ E  
Subjt:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--

Query:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP
                                 KGG+TVV N++NELIPTRTVTGWRVC DYR+LN  TRKDHFPLPF+DQ+L+RL G AYYCFLDGYSGYNQI+I+P
Subjt:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]1.2e-22844.7Show/hide
Query:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA
        N   R +  Y VP  N   P I RP + A NFEIKP                        +FL + D+  + GV  DA+RL LFP+SL D AK+WLNS A
Subjt:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA

Query:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE
        P S  TW+ L +  LSKYFPP + AKL ++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+A G L++K+ +EAYE
Subjt:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE

Query:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN
        +LE ++ N+ QWS+ R   KKV  + +VDG++ + A    LV +   L NV  +S    P +            +C  CG  H    C      V FV N
Subjt:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL
             Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF   Q  P Q +     + +   L     E   R +A +    +S R +++Q+GQL
Subjt:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL

Query:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV
        AN + +R QG LPS TE      KE  KAVTLRSGK L +                   V  E   G  V     N   S  V ++     P  P  PYV
Subjt:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV

Query:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------
        PP+PFPQ  K    + QF+KFL++ +QLHINIP  +A+ Q+P Y KFL++I++K ++L +FET++LTEECSAI++N LP K +DPGSF+IP         
Subjt:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------

Query:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR
                            LG+ E  PTT++ QLADRS+ YP G +E+VL+KV KFI P DFI+++ E D ++PIILGRPFLAT  A+IDV+ G LT++
Subjt:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR

Query:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI
        V  EEV+FN+F+A KYP                          + +E C      S    LEI  +  A++ +  K  ++     D+ + K PP  PS +
Subjt:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI

Query:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--
        +AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAI WT++D++GISPS CM +I +E+    I+E QRRLNP +KEVV+ E  
Subjt:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--

Query:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP
                                 KGG+TVV N++NELIPTRTVTGWRVC DYR+LN  TRKDHFPLPF+DQ+L+RL G AYYCFLDGYSGYNQI+I+P
Subjt:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]8.8e-22944.7Show/hide
Query:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA
        N   R +  Y VP  N   P I RP + A NFEIKP                        +FL + D+  + GV  DA+RL LFP+SL D AK+WLNS A
Subjt:  NDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFA

Query:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE
        P S  TW+ L +  LSKYFPP + AKL ++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+A G L++K+ +EAYE
Subjt:  PRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYE

Query:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN
        +LE ++ N+ QWS+ R   KKV  + +VDG++ + A    LV + + L NV  +S    P +            +C  CG  H    C      V FV N
Subjt:  ILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL
             Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF   Q  P Q +     + +   L     E   R +A +    +S R +++Q+GQL
Subjt:  -----QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQL

Query:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV
        AN + +R QG LPS TE      KE  KAVTLRSGK L +                   V  E   G  V     N   S  V ++     P  P  PYV
Subjt:  ANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNV---EPPYVPPPPYV

Query:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------
        PP+PFPQ  K    + QF+KFL++ +QLHINIP  +A+ Q+P Y KFL++I++K ++L +FET++LTEECSAI++N LP K +DPGSF+IP         
Subjt:  PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP---------

Query:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR
                            LG+ E  PTT++ QLADRS+ YP G +E+VL+KV KFI P DFI+++ E D ++PIILGRPFLAT  A+IDV+ G LT++
Subjt:  --------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMR

Query:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI
        V  EEV+FN+F+A KYP                          + +E C      S    LEI  +  A++ +  K  ++     D+ + K PP  PS +
Subjt:  VCNEEVKFNVFKAMKYP--------------------------DEMEDC------SLIRILEITVIETAIQDSADKHSEKHGESLDLDQRKAPPIKPSLI

Query:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--
        +AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAI WT++D++GISPS CM +I +E+    I+E QRRLNP +KEVV+ E  
Subjt:  EAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE--

Query:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP
                                 KGG+TVV N++NELIPTRTVTGWRVC DYR+LN  TRKDHFPLPF+DQ+L+RL G AYYCFLDGYSGYNQI+I+P
Subjt:  -------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase5.6e-21342.65Show/hide
Query:  DPPGVRFELDPETEMTFRNRRREQRRNHMENVPQLP--QVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIAR
        +P   R++L  E E TF+ RRR +   H+E   ++   Q++    N +  P+  +NL L +  ++ +    P L A  +       ++    + + G++ 
Subjt:  DPPGVRFELDPETEMTFRNRRREQRRNHMENVPQLP--QVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIAR

Query:  PKIQAANFEIKPSFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWER
           +  N  I  +FL + D+   +GV +DALRL LF +SL+  A  W  S    SI TW +L E+ +SK+F P + A L +EI+ FRQ   ET  EAW R
Subjt:  PKIQAANFEIKPSFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWER

Query:  FKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANALKNV
        F+++LR CP+H +P  IQ+ TFY+GL    +  +D     + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ + A +  +  ++KN 
Subjt:  FKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANALKNV

Query:  TVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQ
         V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGW  HPNFSW    NN Q Q      G A       
Subjt:  TVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQ

Query:  QNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVID
        Q +   P Q    SL+  + +F A T        A+ + ++ Q+GQLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E + EP+K++   
Subjt:  QNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVID

Query:  KNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLEDILT
           +K ++ E+                                                            K   I  PL V+A+EQMP+Y KF++DIL+
Subjt:  KNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLEDILT

Query:  KNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVK
        K +RLG++ETV+LTEECSAI++N LP K KDPGSFTIP                             LG+GEA PT++T QLADRS+TYP G IED+LVK
Subjt:  KNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVK

Query:  VDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE------------ITVIETAIQDS
        VDKFIFP DF+++D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +            +  +E A+ D 
Subjt:  VDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE------------ITVIETAIQDS

Query:  ADKHSEKHGE---SLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLA
         D+ +E+  E   +LD             +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AI WT+A
Subjt:  ADKHSEKHGE---SLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLA

Query:  DIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRL
        DI+GISPSFCM KI LE+G    +E QRRLNP +KEVVKKE                           KGG+TVV N  NELIPTRTVTGWRVC DYR+L
Subjt:  DIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNELIPTRTVTGWRVCKDYRRL

Query:  NKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        NKATRKDHFPLPFIDQMLDRL G+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTF FR+MPFGLCNAPATFQRCM  I
Subjt:  NKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

A0A2G9HYA0 Reverse transcriptase3.6e-21247.6Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D     + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST

Query:  IRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQ
        + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGW  HPNFSW    NN Q Q 
Subjt:  IRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQ

Query:  KMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKP
             G A       Q +   P Q    SL+  + +F A T        A+ + ++ Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ 
Subjt:  KMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKP

Query:  LEERI-EPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIE
        L+E + EP+K++      +K ++ E++ +                    VE P     P     PFPQ  + +    QF KFLE+ K+LHINIP  EA+E
Subjt:  LEERI-EPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIE

Query:  QMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRS
        QMP+Y KF++DIL+K +RLG++ETV+LTEECSAI++N LP K KDPGSFTIP                             LG+GEA PT++T QLADRS
Subjt:  QMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRS

Query:  ITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE---------
        +TYP+G IED+LVKVDKFIFP DF+++D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +         
Subjt:  ITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE---------

Query:  ---ITVIETAIQDSADKHSEKHGE---SLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIK
           +  +E A+ D  D+ +E+  E   +LD             +R  P   +KPS+ + PTL+LKPLP HL Y YLGES+TLP+I++S L     E L++
Subjt:  ---ITVIETAIQDSADKHSEKHGE---SLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIK

Query:  LLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNELIPTR
        +L+ ++ AI WT+ADI+GISPSFCM KI LE+     +E QRRLNP +KEVVKKE                           KGG+TVV N  NELIPTR
Subjt:  LLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNELIPTR

Query:  TVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        TVTGWRVC DYR+LNKATRKDHFPLPFIDQMLDRL G+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  TVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

A0A6A2WLX1 Reverse transcriptase6.2e-22044.77Show/hide
Query:  AENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP-------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDG
        A++ IL A+  +R++     P+ ++L PGI  P+IQAA+FE+KP                         +FL V DSF  +GV  D L+L LFPYSL D 
Subjt:  AENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEIKP-------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLIDG

Query:  AKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALL
        A++WL+     S+ +W +L +  L +Y PPN N +L +EI  FRQ +DE+  E W+R+K LL+KC +HG     Q+  FYNG+N  T+ ++DASA G LL
Subjt:  AKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALL

Query:  AKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPA
         K+  EA+ IL+RI+ N  Q+   R G+ ++     E++   ++ A L +I N LKN+   +  +           + T  AC+ C  +H+   CP+N  
Subjt:  AKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPA

Query:  SVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQG----NNVQAQQKMNQP-GFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTART-----------
        S+ FVGN     NNPYSN YN GW  HPNFSW  QG    N    QQ  N+P G+  A      NK+A+    S +SL+A ++EF + T           
Subjt:  SVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQG----NNVQAQQKMNQP-GFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTART-----------

Query:  ----------DAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQG
                   A IQ + +S+RAL+ QVGQ+A  L+ R QG+LPSDTE  +  GKE    +TLRSG  +  +                   +KE +F + 
Subjt:  ----------DAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQG

Query:  VGGSNKNAGASGSVPNVEPPYVPPPPYV-PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETV-SLTEEC
             K          V+  ++P      PP PFPQ  K  N   QFKKF++IL QLHINIPL+EA+EQMP YAKF++DI TK +++   ETV + TE C
Subjt:  VGGSNKNAGASGSVPNVEPPYVPPPPYV-PPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETV-SLTEEC

Query:  SAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEA
        S+  K  L  K  DPGSF IP                             LGIG+A PT+V  QLAD+S    EG++EDV+V+VDKF+F  DF+I+D E 
Subjt:  SAILKNGLPSKAKDPGSFTIP-----------------------------LGIGEAMPTTVTPQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEA

Query:  DKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRIL----------------------EITVIETAIQDSAD-----
        D   PIILGRPFLATGR LID +KGELTMRV ++ V  NVF+ +KY D+ E+C  I  L                      E  V + ++ +S D     
Subjt:  DKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRIL----------------------EITVIETAIQDSAD-----

Query:  -------KHSEKHGESLDLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCM
                 S+ + E L+ D+  +P  KPSL+ AP L+LK LP HLKYVYLG  ETLP+I++++L    E++L+ +L Q++KAI WT+AD++GISP+ CM
Subjt:  -------KHSEKHGESLDLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCM

Query:  QKITLEEGSFRIIEQQRRLNPAIKEVVKKEKGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQ
         KI LE+     IE QRRLNP +K+V    KGG TVV+N+DNEL+PTRTVTGWR+C DYR+LNKAT+KDHFPLPFIDQMLDRL G+A+YCFLDGYSGYNQ
Subjt:  QKITLEEGSFRIIEQQRRLNPAIKEVVKKEKGGVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQ

Query:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        I IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

A0A6P8DD93 uncharacterized protein LOC1162064531.3e-21441.82Show/hide
Query:  LDPETEMTFRNRRREQRRNHMENVPQLPQVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEI
        LDPE E T    RRE RR     V ++                        ++  N Q +          RA+R Y VP    +   I RP I A NFE+
Subjt:  LDPETEMTFRNRRREQRRNHMENVPQLPQVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEI

Query:  KPSFLGVSDSFVIQGVPR------------------------DALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGF
        KP+ + +  S    G P                         D +RL LFP+SL D A++W NS    SI TW +L  K L ++FPP R A+L +EI  F
Subjt:  KPSFLGVSDSFVIQGVPR------------------------DALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGF

Query:  RQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTI
         +   E+  EAWERFKE +RKCPHHGLP  + +E FY  L+   + +VDA+A GAL+ K +DEA  ++E ++ ++  W + R +  +V SV ++D ++ +
Subjt:  RQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTI

Query:  RADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWHNHPNFSWGGQGN
           +  +   +  +T            +   NQV    C  C   H+   C S NP++      V FV N QR+N  PYSN YNPGW NHPNFSW  + N
Subjt:  RADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWHNHPNFSWGGQGN

Query:  NVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTL
         ++       PGF K     Q   Q  P Q S + ++ +M  +  +TD  +Q  QA++R L+ Q+ Q++ +L  RP G LPS+TE    E  + V A+ L
Subjt:  NVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTL

Query:  RSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLV
        RSGK LE  I   K Q  +++ +K+   +K  E  Q      K+ G                PYVPP+PFP+  K +  + QF KFL++ K+L INIP  
Subjt:  RSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLV

Query:  EAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPS---KAKDPGSFTIP-----------------------------LGIGEAMPTTVT
        EA++QMP+YA+F++D+LTK ++    E V LT ECS IL+  LP+   K +D GSFT+P                             LG+GE   T VT
Subjt:  EAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPS---KAKDPGSFTIP-----------------------------LGIGEAMPTTVT

Query:  PQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE--
         QLADRSI YP+G +E+VLVKVDKFIFP DFI+++ E D++VP+ILGRPFLATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  
Subjt:  PQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE--

Query:  ----------ITVIETAIQD-----SADKHSEKHGESL---------DLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHE
                  +  +E+ ++D       D+H E+  E +         +L      P+  SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E
Subjt:  ----------ITVIETAIQD-----SADKHSEKHGESL---------DLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHE

Query:  ETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNE
        + L+ +L+++++AI WT+ADI+GISP  C  +I LE     I++ QRRLNP +KEVVKKE                           KGG+TVV N+ N+
Subjt:  ETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNE

Query:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        LIPTRTVTGWRVC DYR+LN ATRKDH PLPFIDQML++L G  YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

A0A6P8DKJ2 uncharacterized protein LOC1162042311.7e-21441.73Show/hide
Query:  LDPETEMTFRNRRREQRRNHMENVPQLPQVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEI
        LDPE E T    RRE RR     V ++                        ++  N Q +          RA+R Y VP    +   I RP I A NFE+
Subjt:  LDPETEMTFRNRRREQRRNHMENVPQLPQVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEI

Query:  KPSFLGVSDSFVIQGVPR------------------------DALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGF
        KP+ + +  S    G P                         D +RL LFP+SL D A++W NS    SI TW +L  K L ++FPP R A+L +EI  F
Subjt:  KPSFLGVSDSFVIQGVPR------------------------DALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGF

Query:  RQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTI
         +   E+  EAWERFKE +RKCPHHGLP  + +E FY  L+   + +VDA+A GAL+ K +DEA  ++E ++ ++  W + R +  +V SV ++D ++ +
Subjt:  RQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTI

Query:  RADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWHNHPNFSWGGQGN
           +  +   +  +T            +   NQV    C  C   H+   C S NP++      V FV N QR+N  PYSN YNPGW NHPNFSW  + N
Subjt:  RADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWHNHPNFSWGGQGN

Query:  NVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTL
         ++       PGF K     Q   Q  P Q S + ++ +M  +  +TD  +Q  QA++R L+ Q+ Q++ +L  RP G LPS+TE    E  + V A+ L
Subjt:  NVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTL

Query:  RSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLV
        RSGK LE  I   K Q  +++ +K+   +K  E         K+ G                PYVPP+PFP   K +  + QF KFL++ K+L INIP  
Subjt:  RSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQKPKNQNGQFKKFLEILKQLHINIPLV

Query:  EAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPS---KAKDPGSFTIP-----------------------------LGIGEAMPTTVT
        EA++QMP+YA+F++D+LTK ++    E V LT ECS IL+  LP+   K +D GSFT+P                             LG+GE   T +T
Subjt:  EAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPS---KAKDPGSFTIP-----------------------------LGIGEAMPTTVT

Query:  PQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE--
         QLADRSI YP+G +E+VLVKVDKFIFP DFI+++ E D++VP+ILGRPFLATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  
Subjt:  PQLADRSITYPEGKIEDVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILE--

Query:  ----------ITVIETAIQD-----SADKHSEKHGESL---------DLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHE
                  +  +E+ ++D       D+H E+  E +         +L      P+  SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E
Subjt:  ----------ITVIETAIQD-----SADKHSEKHGESL---------DLDQRKAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHE

Query:  ETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNE
        + L+ +L+++++AI WT+ADI+GISP  C  +I LE     I++ QRRLNP +KEVVKKE                           KGG+TVV N+ N+
Subjt:  ETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKE---------------------------KGGVTVVSNKDNE

Query:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI
        LIPTRTVTGWRVC DYR+LN ATRKDHFPLPFIDQML++L G  YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM  I
Subjt:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.5e-1745.65Show/hide
Query:  WRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV
        +R+  DYR+LN+ T  D  P+P +D++L +L    Y+  +D   G++QI + PE   KT F+  +G + + RMPFGL NAPATFQRCM DI+
Subjt:  WRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-1544.57Show/hide
Query:  WRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV
        +RV  DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++QI +  E   KT F+   G + + RMPFGL NAPATFQRCM +I+
Subjt:  WRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV

P31843 RNA-directed DNA polymerase homolog4.2e-1647.25Show/hide
Query:  RVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV
        R+C DYR L K T K+ +P+P +D + DRL    ++  LD  SGY Q+ IA  D+ KTT    YG+F FR MPFGL NA ATF   M +++
Subjt:  RVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMADIV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.8e-2050.51Show/hide
Query:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMAD
        L+P +  T +R+C DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R MAD
Subjt:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMAD

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.8e-2050.51Show/hide
Query:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMAD
        L+P +  T +R+C DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R MAD
Subjt:  LIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMAD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTACGATTTGAGCTTGATCCAGAAACTGAAATGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCATATGGAGAACGTGCCGCAACT
TCCGCAGGTTCTTGAAGGTCCAGCGAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCTGTTGTTAGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCT
TGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGTTGTCCCGATTTTTAATGAGTTATATCCAGGGATTGCACGTCCCAAAATCCAAGCGGCAAATTTTGAAATT
AAACCGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCTCTTAGATTAACTTTGTTCCCGTATTCACTTATAGATGGAGCAAAGTCATG
GTTAAATTCTTTTGCTCCAAGATCAATTAGGACGTGGGATGAGTTAGTTGAAAAAATTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGCAGTGAAATAG
TAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAA
ACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCCTCGGCTGCAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAAAGAAT
ATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAGAAAGTTAAGAGTGTATTAGAAGTCGATGGTGTGTCCACCATTAGGGCTGATCTTGTTATGATTG
CTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAACCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAA
GACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCACAACCACCC
CAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAACAAAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCAGGCTATGC
CCCAGCAAAATTCGGGAAACTCTCTCGACGCGGTGATGAAAGAATTTACTGCTCGCACAGATGCCGCAATTCAATGTAATCAAGCTTCGATGAGAGCTCTGAAATTGCAA
GTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGACCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTTACTCTTAG
GAGTGGTAAGCCATTAGAAGAAAGAATTGAGCCTAGTAAAACCCAGGTTATAGATAAAAATTGTGATAAAAATATTGTTGTTGAGAAAGAATTGGAGTTTGGTCAGGGTG
TTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAAATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGTCAG
AAGCCTAAGAATCAGAATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAATTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAA
ATTTCTTGAGGATATTTTGACTAAAAATAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCTCCAAGGCTA
AGGATCCAGGGTCATTTACCATACCTTTAGGTATTGGTGAAGCTATGCCTACCACAGTCACACCCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAG
GATGTCTTAGTGAAGGTAGATAAATTCATATTTCCTGATGATTTCATTATTGTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCAAC
TGGTAGGGCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGG
ATTGCTCCTTAATCAGGATTCTGGAAATCACAGTTATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGTCTTTAGATTTAGATCAGAGG
AAAGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATCTGAAGCCTTTGCCGAAGCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCAT
TATTGTTGCATCCGATTTAATGCCAGAGCATGAGGAGACCTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATATGTTGGACATTGGCTGATATTCAGGGAATTAGCC
CATCTTTTTGTATGCAAAAAATCACTCTAGAGGAGGGATCCTTTAGGATTATTGAGCAACAAAGAAGGCTTAACCCTGCAATAAAAGAGGTTGTTAAAAAGGAGAAAGGA
GGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGCAAGGATTACAGAAGGCTTAATAAAGCTACCCGAAAGGA
TCATTTCCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGTTGGTCAGGCCTATTACTGTTTCCTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTG
AGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGAAGAATGCCTTTCGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTGTATGGCCGAT
ATTGTAGCATTAGCGAACGCTCTTAAAAACATAACTGTGGTAAGCAATGTTCAGCAGCCGCCATTGGTGGAGTCTGCTGCAGTTGTAAACCAAGTGGCAACCAAATCTTG
TGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAAAAATCTAGCTTCTGTGTTTTTTGCAGGTAATCAGAGGAATAACCCATATTCAAATTTTTATAATCCAA
GTTGGCATAACCACCCCAACTTTACGTGGGGAGGGCAAGGAATTAATTCTCAAGCTCTCCATGCACAACAAAAGGTGGCGAATCAGCAAAGATTTATTTTGCAGAACCAG
CAAGGTTTGCCTCAGCAAAATAAGCAGGTTGCCCATCAAAATTCGGGTAATTCTCTGGAGACAATGATGAAGGATTATATGGCTCGTAATGATGTCTTAATCCAAAGTCA
GCAGGCTTCATTGAGAGTCCTAGAGTTTCAAGTGGGCCAGCTAGCTAATGAGCTGAAGGCACGACTTCACGGGAACATTCCTTCAGATATTGAACACCCTATAAAGGAAG
AATTGGAGACTGGTCAAGGTGCTGGAGGCAACAATAATGATGCTGGAACATCTAGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACTCTATAACCCACCC
TTACCTTTTCCACAAAGGCATAAACCCAAGAACCAGGATGTTAAATTTAAGAAGTTTTTAGAGATTCTTAAGCAACTTCATATAAACATACCCTTAGTAGAGGCTTTAGA
ACAAATGCCAAATTATGCAAAATTTCTTAAGGACATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTACGATTTGAGCTTGATCCAGAAACTGAAATGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCATATGGAGAACGTGCCGCAACT
TCCGCAGGTTCTTGAAGGTCCAGCGAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCTGTTGTTAGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCT
TGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGTTGTCCCGATTTTTAATGAGTTATATCCAGGGATTGCACGTCCCAAAATCCAAGCGGCAAATTTTGAAATT
AAACCGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCTCTTAGATTAACTTTGTTCCCGTATTCACTTATAGATGGAGCAAAGTCATG
GTTAAATTCTTTTGCTCCAAGATCAATTAGGACGTGGGATGAGTTAGTTGAAAAAATTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGCAGTGAAATAG
TAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAA
ACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCCTCGGCTGCAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAAAGAAT
ATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAGAAAGTTAAGAGTGTATTAGAAGTCGATGGTGTGTCCACCATTAGGGCTGATCTTGTTATGATTG
CTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAACCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAA
GACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCACAACCACCC
CAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAACAAAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCAGGCTATGC
CCCAGCAAAATTCGGGAAACTCTCTCGACGCGGTGATGAAAGAATTTACTGCTCGCACAGATGCCGCAATTCAATGTAATCAAGCTTCGATGAGAGCTCTGAAATTGCAA
GTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGACCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTTACTCTTAG
GAGTGGTAAGCCATTAGAAGAAAGAATTGAGCCTAGTAAAACCCAGGTTATAGATAAAAATTGTGATAAAAATATTGTTGTTGAGAAAGAATTGGAGTTTGGTCAGGGTG
TTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAAATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGTCAG
AAGCCTAAGAATCAGAATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAATTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAA
ATTTCTTGAGGATATTTTGACTAAAAATAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCTCCAAGGCTA
AGGATCCAGGGTCATTTACCATACCTTTAGGTATTGGTGAAGCTATGCCTACCACAGTCACACCCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAG
GATGTCTTAGTGAAGGTAGATAAATTCATATTTCCTGATGATTTCATTATTGTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCAAC
TGGTAGGGCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGG
ATTGCTCCTTAATCAGGATTCTGGAAATCACAGTTATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGTCTTTAGATTTAGATCAGAGG
AAAGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATCTGAAGCCTTTGCCGAAGCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCAT
TATTGTTGCATCCGATTTAATGCCAGAGCATGAGGAGACCTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATATGTTGGACATTGGCTGATATTCAGGGAATTAGCC
CATCTTTTTGTATGCAAAAAATCACTCTAGAGGAGGGATCCTTTAGGATTATTGAGCAACAAAGAAGGCTTAACCCTGCAATAAAAGAGGTTGTTAAAAAGGAGAAAGGA
GGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGCAAGGATTACAGAAGGCTTAATAAAGCTACCCGAAAGGA
TCATTTCCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGTTGGTCAGGCCTATTACTGTTTCCTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTG
AGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGAAGAATGCCTTTCGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTGTATGGCCGAT
ATTGTAGCATTAGCGAACGCTCTTAAAAACATAACTGTGGTAAGCAATGTTCAGCAGCCGCCATTGGTGGAGTCTGCTGCAGTTGTAAACCAAGTGGCAACCAAATCTTG
TGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAAAAATCTAGCTTCTGTGTTTTTTGCAGGTAATCAGAGGAATAACCCATATTCAAATTTTTATAATCCAA
GTTGGCATAACCACCCCAACTTTACGTGGGGAGGGCAAGGAATTAATTCTCAAGCTCTCCATGCACAACAAAAGGTGGCGAATCAGCAAAGATTTATTTTGCAGAACCAG
CAAGGTTTGCCTCAGCAAAATAAGCAGGTTGCCCATCAAAATTCGGGTAATTCTCTGGAGACAATGATGAAGGATTATATGGCTCGTAATGATGTCTTAATCCAAAGTCA
GCAGGCTTCATTGAGAGTCCTAGAGTTTCAAGTGGGCCAGCTAGCTAATGAGCTGAAGGCACGACTTCACGGGAACATTCCTTCAGATATTGAACACCCTATAAAGGAAG
AATTGGAGACTGGTCAAGGTGCTGGAGGCAACAATAATGATGCTGGAACATCTAGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACTCTATAACCCACCC
TTACCTTTTCCACAAAGGCATAAACCCAAGAACCAGGATGTTAAATTTAAGAAGTTTTTAGAGATTCTTAAGCAACTTCATATAAACATACCCTTAGTAGAGGCTTTAGA
ACAAATGCCAAATTATGCAAAATTTCTTAAGGACATTTGA
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPETEMTFRNRRREQRRNHMENVPQLPQVLEGPANPQQNPLLQQNLLLEQNEQQNNQAENPILIANDRTRAIRAYVVPIFNELYPGIARPKIQAANFEI
KPSFLGVSDSFVIQGVPRDALRLTLFPYSLIDGAKSWLNSFAPRSIRTWDELVEKILSKYFPPNRNAKLSSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQME
TFYNGLNGVTQGMVDASAAGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLVMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGE
DHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAQVMPQQNKQAMPQQNSGNSLDAVMKEFTARTDAAIQCNQASMRALKLQ
VGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNCDKNIVVEKELEFGQGVGGSNKNAGASGSVPNVEPPYVPPPPYVPPLPFPQSQ
KPKNQNGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKNKRLGEFETVSLTEECSAILKNGLPSKAKDPGSFTIPLGIGEAMPTTVTPQLADRSITYPEGKIE
DVLVKVDKFIFPDDFIIVDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSLIRILEITVIETAIQDSADKHSEKHGESLDLDQR
KAPPIKPSLIEAPTLDLKPLPKHLKYVYLGESETLPIIVASDLMPEHEETLIKLLQQYRKAICWTLADIQGISPSFCMQKITLEEGSFRIIEQQRRLNPAIKEVVKKEKG
GVTVVSNKDNELIPTRTVTGWRVCKDYRRLNKATRKDHFPLPFIDQMLDRLVGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMAD
IVALANALKNITVVSNVQQPPLVESAAVVNQVATKSCVYCGEEHNYEFCPKNLASVFFAGNQRNNPYSNFYNPSWHNHPNFTWGGQGINSQALHAQQKVANQQRFILQNQ
QGLPQQNKQVAHQNSGNSLETMMKDYMARNDVLIQSQQASLRVLEFQVGQLANELKARLHGNIPSDIEHPIKEELETGQGAGGNNNDAGTSSSVPDVEPPYVPPPLYNPP
LPFPQRHKPKNQDVKFKKFLEILKQLHINIPLVEALEQMPNYAKFLKDI