CuGenDBv2

Lag0039135 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039135
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNA-directed DNA polymerase
Genome location: chr2:36688935..36691357
RNA-Seq Expression: Lag0039135
Synteny: Lag0039135
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | 3.6e-133 | 38.33
Query:  PSDGADQTDQETLSLSPKSLNNRILHIGESMEEMKGHVRDVRKMLEQLILQQPNYAGQENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLP
        P+    Q   ET  LSP++ +  +  +  S+EE       +R++L  ++    +   +EN +L D   +        +R R+  E    P  N +   +P
Subjt:  PSDGADQTDQETLSLSPKSLNNRILHIGESMEEMKGHVRDVRKMLEQLILQQPNYAGQENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLP

Query:  FQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPR--------RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEK
          ED  + P   R   +  +  R +E ++SS  +   N D          +     +EN++YKMK+DLP++ GK N+E FLDW+K  E FF Y GT + K
Subjt:  FQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPR--------RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEK

Query:  KVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGL
        KV LVA KL+GGASAWWDQI  NRQ+ GK PIR+W KM KLMK R++P NYEQ LY QYQ+C QG +  A+Y EE HRL  R NL E E HLI+ +V GL
Subjt:  KVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGL

Query:  RDD--------------------------IQKRRSTTDGKTFQIGSS------------STSNMDKGKEEMGTRQIQGINTKNASTTYNRPNLGKCFRCG
        R D                          I+ R  +T  + ++  +S            +TS     +EE   ++      K     Y RP  G C+RCG
Subjt:  RDD--------------------------IQKRRSTTDGKTFQIGSS------------STSNMDKGKEEMGTRQIQGINTKNASTTYNRPNLGKCFRCG

Query:  QQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQ
        Q GH SN+CPQRKT+A+    ++          E+   IEAD+G+ LSCILQR+ ++P  ++  Q+H LF+TRCT+ GKVCN+I+DSGSSEN +S+K   
Subjt:  QQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQ

Query:  LLQLKTDPHPNPYK--------------------------------------------------------------------------------------
         L LKT PH  PYK                                                                                      
Subjt:  LLQLKTDPHPNPYK--------------------------------------------------------------------------------------

Query:  ----------------------------------DNYAIEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ
                                          D    E+I   + EL + YP+I KEPT LPPLRDI H I+LL  ++ P+LPHY MS  EY+IL + 
Subjt:  ----------------------------------DNYAIEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ

Query:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG
        ++ELL KGHI+PS S   VPALLTPKKD TWR+C DSRAINKITVKY FPIPR+SD+LDQLGGA +FSK+DL+S YHQIRI P DEWKTAFKTN G    
Subjt:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG

Query:  L
        L
Subjt:  L

XP_024440968.1 uncharacterized protein LOC112324119 [Populus trichocarpa] | 1.0e-124 | 40.37
Query:  RPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENND---YKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWW
        RPP +    P Y++  S  EE+        RRG  YQ   D   ++MK+DLP+F+G+  +E FLDW+  VE FF Y   PE+KKVKLVAY+L GGASAWW
Subjt:  RPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENND---YKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWW

Query:  DQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKRRSTTDGKTF-
        +Q+Q+ R R GK  ++TW+KM +L+++R+LP +YEQ+L+ QYQ C QG++TV  + EE HRL +RNNL E+E   IAR+V GLR  IQ R +     T  
Subjt:  DQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKRRSTTDGKTF-

Query:  ---QIGSSSTSNMDKGKEEMGTRQ----------------------------IQGINTKNAST--------------TYNRPNLGKCFRCGQQGHLSNEC
            +   + + +DK K  +G R                             IQG ++  A T               Y RP   KC+RCGQ GH SN+C
Subjt:  ---QIGSSSTSNMDKGKEEMGTRQ----------------------------IQGINTKNAST--------------TYNRPNLGKCFRCGQQGHLSNEC

Query:  PQRKTLAIMDAQNEEDFD--EEDR----SYEDINYIEADQGEQL--SCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQL
        P+R T+ +++ + E  FD  +ED     +YE+      D+GE L  S ++QR+ L P      Q+H +FRTRCTVN +VC+II+DSGSSENIIS+     
Subjt:  PQRKTLAIMDAQNEEDFD--EEDR----SYEDINYIEADQGEQL--SCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQL

Query:  LQLKTDPHPNPYK-----------------------DNYAIE----------------------------------------------------------
        L L+T  HP PYK                        NYA E                                                          
Subjt:  LQLKTDPHPNPYK-----------------------DNYAIE----------------------------------------------------------

Query:  --------------------------------------EINPIVLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ
                                               I P +L LLE + E++    P  LPP+RDIQHQID +P ++LPN PHYRM+ KE Q+LQ Q
Subjt:  --------------------------------------EINPIVLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ

Query:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG
        V+EL++KG +Q SMSP AVPALL PKKD +WR+C DSRAINKITVKY FPIPRL D+LD L GA VFSK+DL+SGYHQIRI P DEWKTAFKT  G    
Subjt:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG

Query:  L
        L
Subjt:  L

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus] | 1.3e-146 | 46.95
Query:  RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVN
        RRG    E +DYKMK+DLP + GK N+EAFLDWIK  E FF Y  TPE KKV LVA KLR GASAWWDQ++ NRQR GK+PIR+W KM KL+K R+LP N
Subjt:  RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVN

Query:  YEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVD-------GLRDDIQKRRS---TTDGKTFQIGSSSTSNMDKGKE----EMGT-
        YEQ LYNQYQ+C QG ++VADY EE HRL AR NL+E+E H +AR+V         +R     RRS   TT  K+      STS   KGKE    E+   
Subjt:  YEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVD-------GLRDDIQKRRS---TTDGKTFQIGSSSTSNMDKGKE----EMGT-

Query:  RQIQGINTKNASTTYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMD--AQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFR
        R+ +     +   +Y+RP+LGKCFRCGQ GHLSN CPQRKT+AI +   Q  ED  E +   E+   IEAD GE++SC +QR+ + P  + + Q+H LF+
Subjt:  RQIQGINTKNASTTYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMD--AQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFR

Query:  TRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYA-----------------------------------------------------
        TRCT+NG+VC++I+DSGSSEN +++K   +L LK + HP PYK  +                                                      
Subjt:  TRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYA-----------------------------------------------------

Query:  ----------------------------------------------------------------IEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQID
                                                                        +E+I P + +LL  +P I +EP  LPPLRDIQH ID
Subjt:  ----------------------------------------------------------------IEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQID

Query:  LLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKS
        L+P ++LPNL HYRMS +EY+IL + ++ELL KGHI+PS+SP AVPALLTPKKD +WR+C DSRAIN+ITVKY FPIPR+SD+LDQLG A +FSK+DLKS
Subjt:  LLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKS

Query:  GYHQIRICPEDEWKTAFKTNAG
        GYHQIR+ P DEWKTAFKTN G
Subjt:  GYHQIRICPEDEWKTAFKTNAG

XP_038989925.1 uncharacterized protein LOC120113183 [Phoenix dactylifera] | 2.0e-123 | 40.73
Query:  RRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPV
        R RG P     D+++KVDLP F+G  ++E FLDW+ EVE FF Y   P++KKVKLVAYKL+GGASAWWDQ+Q NR R GK  I TW KM + ++ R+LP 
Subjt:  RRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPV

Query:  NYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQ---------------------KRRSTTDGKTFQ---IGSSSTSN
        +YEQ LY+QYQ+C QG +TV +Y++E +RL ARNNL+E+EN  +ARY+ GL+  I+                     + ++T     FQ     SSS S 
Subjt:  NYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQ---------------------KRRSTTDGKTFQ---IGSSSTSN

Query:  MDKGKEEMG---TRQIQGINT--------------KNASTTYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSY-------EDINYI
        + K K   G   + Q   I+T              K     Y RP   KC+RC + GH SNECP+R+ + +++A++EE+   E+  +       ED+   
Subjt:  MDKGKEEMG---TRQIQGINT--------------KNASTTYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSY-------EDINYI

Query:  EADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYK-------------------------
          DQG   S ++QRI   P  +   Q+H +F+T CT+N  VCN+I+DSGSSENI+S    + + LKT+ HP+PYK                         
Subjt:  EADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYK-------------------------

Query:  ----------------------------------DNYAI--------------------------------------------EEINPIVLE-----LLE
                                          DN  +                                            EE+  ++++     +L+
Subjt:  ----------------------------------DNYAI--------------------------------------------EEINPIVLE-----LLE

Query:  SYPEIMKE-------------PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSR
          PE  KE             P  LPP+RDIQH IDL+P ++LPNLPHYRMS KE +ILQ+QV++L+ KG I+ SMSP AVPALLTPKKD +WR+C DSR
Subjt:  SYPEIMKE-------------PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSR

Query:  AINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG
        AIN+ITVKY FPIPRL+D+LD L GA VFSK+DL+SGYHQIRI P DEWKTAFKT  G
Subjt:  AINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia] | 2.0e-128 | 38.58
Query:  ENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLPFQEDPRMAPP----PFRPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENNDY
        E+++  +   Q R +G +++  ++IQ +  N H       +    D     P     FRPP+      +    D+SS E+   N            N ++
Subjt:  ENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLPFQEDPRMAPP----PFRPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENNDY

Query:  KMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHC
        K+K+DLP F+G  ++E+FLDW+ EVE FF Y   PE ++VKLVAYKLRGGASAWW+Q Q NR+R GK+P+R W KM +LM+ R+LP +YEQ+LY QYQ+C
Subjt:  KMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHC

Query:  SQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKRRST----TDGKTFQIGSSSTSNMD-------------KGKE--------------
         QG++T+ +Y+EE +RL +RNNLAE+E   +ARY+ GLR  IQ + +     T  +   +     S +              KG E              
Subjt:  SQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKRRST----TDGKTFQIGSSSTSNMD-------------KGKE--------------

Query:  ----EMGTRQIQGINTKNAST-------TYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTP
                 Q+   NT    T        YN+P  GKCFRC Q GH SNECP RK++ ++D  ++   + +D S ED  ++E D+G+ ++C++QR+ L P
Subjt:  ----EMGTRQIQGINTKNAST-------TYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTP

Query:  NTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNY-----------------------------------------
          + H Q+H++F+TRCTVN KVCN+I+DSGS ENI+SR     LQL T+ HP PYK N+                                         
Subjt:  NTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNY-----------------------------------------

Query:  ------------------------------------------------------------------------------AIEEINPIVLELLESYPEIMKE
                                                                                      A+ E +P V +LLE + +I  +
Subjt:  ------------------------------------------------------------------------------AIEEINPIVLELLESYPEIMKE

Query:  --PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDI
          P  LPPLRDIQH IDL+P  +LPNLPHYRMS  E++ILQ+QV++L+ KG I+ SMSP AVPALL PKKD +WR+C DSRAINKITVKY FPIPRL+D+
Subjt:  --PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDI

Query:  LDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNGL
        LD L G+ VFSK+DL+SGYHQIR+ P DEWKTAFKT  G    L
Subjt:  LDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNGL

TrEMBL top hits | e value | % identity | Alignment
A0A5B7BER3 Uncharacterized protein | 1.7e-133 | 40.47
Query:  NKNNPHENPRMAPLPFQEDPRMAPPPFRPPELCLENPRYQEFDSSSEE------DVPYNYDPRRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEV
        N+N   +  R+  +P ++ P M   P R       NP Y     S EE      D  Y  DPR      Q   +Y+MK+DLP+F+G  ++E+FLDWI EV
Subjt:  NKNNPHENPRMAPLPFQEDPRMAPPPFRPPELCLENPRYQEFDSSSEE------DVPYNYDPRRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEV

Query:  EAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAE
        E FF      ++K+VKLVAYKL+GGASAWWDQ+Q NR+R GK+P+RTW KM +L++ R+LPV+YEQ+LY QYQ+C QG ++V++YS+E + L +RNNL E
Subjt:  EAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAE

Query:  SENHLIARYVDGLRDDIQKR----------------------------RSTTDGKTFQIGSSSTSNMDKGKE------EMGTRQIQGINTKNAST-----
        +EN  +ARYV GLR  IQ +                            RS    +++   S +  N DK  E      +  T + Q  ++KN +T     
Subjt:  SENHLIARYVDGLRDDIQKR----------------------------RSTTDGKTFQIGSSSTSNMDKGKE------EMGTRQIQGINTKNAST-----

Query:  -----TYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDA--QNEEDFD-EEDRSYED----INYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRC
              Y RP  GKCFRC Q GH SNECP R+ + ++     N  DF+ EE+  Y+D        E D+GE +SC++QR+ L P  +  PQ+H +FRTRC
Subjt:  -----TYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDA--QNEEDFD-EEDRSYED----INYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRC

Query:  TVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPY-----------------------------------------------------------KD
        T+N KVC++I+DSGSSENI+S+   + LQLKT+ HPNPY                                                           KD
Subjt:  TVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPY-----------------------------------------------------------KD

Query:  NYAI------------------------------------------EEINPIVLELLE--------SYPEIMKE-------------PTSLPPLRDIQHQ
        N  +                                          +E   I++ +++          PEI++              P  LPP+RDIQH 
Subjt:  NYAI------------------------------------------EEINPIVLELLE--------SYPEIMKE-------------PTSLPPLRDIQHQ

Query:  IDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDL
        IDL+P ++LPNLPHYRMS KE +ILQ+QV++L++KG IQ SMSP AVPALLTPKKD +WR+C DSRAINKITVKY FPIPRL+D+LD L G+ +FSK+DL
Subjt:  IDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDL

Query:  KSGYHQIRICPEDEWKTAFKTNAGFLNGL
        +SGYHQIRI P DEWKTAFKT  G    L
Subjt:  KSGYHQIRICPEDEWKTAFKTNAGFLNGL

A0A5D3DGR0 Reverse transcriptase | 1.7e-133 | 38.33
Query:  PSDGADQTDQETLSLSPKSLNNRILHIGESMEEMKGHVRDVRKMLEQLILQQPNYAGQENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLP
        P+    Q   ET  LSP++ +  +  +  S+EE       +R++L  ++    +   +EN +L D   +        +R R+  E    P  N +   +P
Subjt:  PSDGADQTDQETLSLSPKSLNNRILHIGESMEEMKGHVRDVRKMLEQLILQQPNYAGQENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLP

Query:  FQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPR--------RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEK
          ED  + P   R   +  +  R +E ++SS  +   N D          +     +EN++YKMK+DLP++ GK N+E FLDW+K  E FF Y GT + K
Subjt:  FQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPR--------RRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEK

Query:  KVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGL
        KV LVA KL+GGASAWWDQI  NRQ+ GK PIR+W KM KLMK R++P NYEQ LY QYQ+C QG +  A+Y EE HRL  R NL E E HLI+ +V GL
Subjt:  KVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGL

Query:  RDD--------------------------IQKRRSTTDGKTFQIGSS------------STSNMDKGKEEMGTRQIQGINTKNASTTYNRPNLGKCFRCG
        R D                          I+ R  +T  + ++  +S            +TS     +EE   ++      K     Y RP  G C+RCG
Subjt:  RDD--------------------------IQKRRSTTDGKTFQIGSS------------STSNMDKGKEEMGTRQIQGINTKNASTTYNRPNLGKCFRCG

Query:  QQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQ
        Q GH SN+CPQRKT+A+    ++          E+   IEAD+G+ LSCILQR+ ++P  ++  Q+H LF+TRCT+ GKVCN+I+DSGSSEN +S+K   
Subjt:  QQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQ

Query:  LLQLKTDPHPNPYK--------------------------------------------------------------------------------------
         L LKT PH  PYK                                                                                      
Subjt:  LLQLKTDPHPNPYK--------------------------------------------------------------------------------------

Query:  ----------------------------------DNYAIEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ
                                          D    E+I   + EL + YP+I KEPT LPPLRDI H I+LL  ++ P+LPHY MS  EY+IL + 
Subjt:  ----------------------------------DNYAIEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQ

Query:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG
        ++ELL KGHI+PS S   VPALLTPKKD TWR+C DSRAINKITVKY FPIPR+SD+LDQLGGA +FSK+DL+S YHQIRI P DEWKTAFKTN G    
Subjt:  VQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNG

Query:  L
        L
Subjt:  L

A0A6P3Z018 uncharacterized protein LOC107405062 | 1.1e-119 | 39.51
Query:  PRRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLP
        P R    +Q  +DY++KVD+P F G  N+E FLDW++ VE+FF Y   PE+K+V LVAYK RGGASAWW+Q+ +NR++ GK PI++WS++ ++++ R+LP
Subjt:  PRRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLP

Query:  VNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMG
        V++EQ+LY QY HC QG++++++Y+EE +RL AR NL E+E  L+ARYV GL   IQ+R         S      F+I        + T    K   E+ 
Subjt:  VNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMG

Query:  TRQIQGI--------------NTKNASTTYNRPNLG----------KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLS
          +I+ +              + KN S   N+PN            KCF+CGQQGH SNECP RK + I++ Q++   +E     ++   ++ DQGE + 
Subjt:  TRQIQGI--------------NTKNASTTYNRPNLG----------KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLS

Query:  CILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI---
        CI+Q++  +P     PQ+H +F+T+CT+  KVC +I DSGSSENI+S+   + L+L T  HPNPYK                        +YA E +   
Subjt:  CILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI---

Query:  -----------------------------------------------------------------NPI--------------------------------
                                                                          PI                                
Subjt:  -----------------------------------------------------------------NPI--------------------------------

Query:  -----VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSR
             +L+LL+ + EI     P SLPP+RDIQH IDLLP + LPNLPHYRM  KE QILQ+ V++LL K  I+ S+SP AVPALL PKK+  WR+C DSR
Subjt:  -----VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSR

Query:  AINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG
        AINKIT KY FPIPRL D+LD+L GA VFSK+DL+SGYHQIRI P DEWKTAFKT  G
Subjt:  AINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG

A0A6P6GFU0 uncharacterized protein LOC112492819 | 2.7e-118 | 38.24
Query:  PLPFQEDPRM--APPPFRPPE-----LCLENPRYQEFDSSSE---EDVPYNYDPRRRGEPY---QENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFG
        PL    DP +  +PPP  P       L ++       DS SE   E +  N  P+     Y    + +DY++KVD+P F G  N+E FLDW++ VE+FF 
Subjt:  PLPFQEDPRM--APPPFRPPE-----LCLENPRYQEFDSSSE---EDVPYNYDPRRRGEPY---QENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFG

Query:  YAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHL
        Y   PE+K+V+LVAYK RGGASAWW+Q+ +NR++ GK PI++WS++ ++++ R+LPV++EQ+LY QY HC QG++++++ +EE +RL AR NL E+E  L
Subjt:  YAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHL

Query:  IARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMGTRQIQGI--------------NTKNASTTYNRPNLG--------
        +ARYV GL   IQ+R         S      F+I        + T    K   E+   +I+ +              + KN S   N+PN          
Subjt:  IARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMGTRQIQGI--------------NTKNASTTYNRPNLG--------

Query:  --KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSEN
          KCF+CGQQGH SNECP RK + I++ Q++   +E     ++   ++ DQGE + CI+Q++  +P      Q+H +F+T+CT+N KVC +I+DSGSSEN
Subjt:  --KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSEN

Query:  IISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI-----------------------------------------------
        I+S+   + L+L T  HPNPYK                        +YA E +                                               
Subjt:  IISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI-----------------------------------------------

Query:  ---------------------NPI-------------------------------------VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLP
                              PI                                     +L+LL+ + EI     P SLPP+RDIQH IDLLP + LP
Subjt:  ---------------------NPI-------------------------------------VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLP

Query:  NLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRIC
        NLPHYRM  KE QILQ+ V++LL K  I+ S+SP AVPALL PKK+  WR+C DSRAINKIT KY FPIPRL D+LD+L GA VFSK+DL+SGYHQIRI 
Subjt:  NLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRIC

Query:  PEDEWKTAFKTNAG
        P DEWKTAFKT  G
Subjt:  PEDEWKTAFKTNAG

A0A6P6GHI6 uncharacterized protein LOC112492864 | 1.4e-119 | 38.8
Query:  PLPFQEDPRM--APPPFRPPE-----LCLENPRYQEFDSSSE---EDVPYNYDPRRRGEPY---QENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFG
        PL    DP +  +PPP  P       L ++       DS SE   E +  N  P+     Y    + +DY++KVD+P F G  N+E FLDW++ VE+FF 
Subjt:  PLPFQEDPRM--APPPFRPPE-----LCLENPRYQEFDSSSE---EDVPYNYDPRRRGEPY---QENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFG

Query:  YAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHL
        Y   PE+K+V+LVAYK RGGASAWW+Q+ +NR++ GK PI++WS++ ++++ R+LPV++EQ+LY QY HC QG++++++Y+EE +RL AR NL E+E  L
Subjt:  YAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHL

Query:  IARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMGTRQIQGI--------------NTKNASTTYNRPNLG--------
        +ARYV GL   IQ+R         S      F+I        + T    K   E+   +I+ +              + KN S   N+PN          
Subjt:  IARYVDGLRDDIQKR--------RSTTDGKTFQIGS------SSTSNMDKGKEEMGTRQIQGI--------------NTKNASTTYNRPNLG--------

Query:  --KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSEN
          KCF+CGQQGH SNECP RK + I++ Q  +D  EE  +  D      D+GE + CI+Q++  +P     PQ+H +F+T+CT+N KVC +I+DSGSSEN
Subjt:  --KCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVNGKVCNIIVDSGSSEN

Query:  IISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI-----------------------------------------------
        I+S+   + L+L T  HPNPYK                        +YA E +                                               
Subjt:  IISRKATQLLQLKTDPHPNPYK-----------------------DNYAIEEI-----------------------------------------------

Query:  ---------------------NPI-------------------------------------VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLP
                              PI                                     +L+LL+ + EI     P SLPP+RDIQH IDLLP + LP
Subjt:  ---------------------NPI-------------------------------------VLELLESYPEIMKE--PTSLPPLRDIQHQIDLLPSSNLP

Query:  NLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRIC
        NLPHYRM  KE QILQ+ V++LL K  I+ S+SP AVPALL PKK+  WR+C DSRAINKIT KY FPIPRL D+LD+L GA VFSK+DL+SGYHQIRI 
Subjt:  NLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRIC

Query:  PEDEWKTAFKTNAG
        P DEWKTAFKT  G
Subjt:  PEDEWKTAFKTNAG

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 2.0e-17 | 35.47
Query:  DNYAIEEIN----PIVLELLESYPEIM-KEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDD
        D Y +E +N      +  LL+ Y +I   E   L      +H I+     NLP    Y       Q ++ Q+Q++L++G I+ S SPY  P  + PKK D
Subjt:  DNYAIEEIN----PIVLELLESYPEIM-KEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDD

Query:  T-----WRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG
              +RI  D R +N+ITV    PIP + +IL +LG    F+ +DL  G+HQI + PE   KTAF T  G
Subjt:  T-----WRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG

P10394 Retrovirus-related Pol polyprotein from transposon 412 | 3.3e-17 | 41.88
Query:  HYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDD------TWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQI
        +YR    + + +Q QVQ+L+    ++PS+S Y  P LL PKK         WR+  D R INK  +   FP+PR+ DILDQLG A  FS +DL SG+HQI
Subjt:  HYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDD------TWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQI

Query:  RICPEDEWKTAFKTNAG
         +       T+F T+ G
Subjt:  RICPEDEWKTAFKTNAG

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 2.8e-16 | 34.72
Query:  KEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDT-----WRICGDSRAINKITVKYSFPIP
        KE   L     I+H ++   +S + +  +      E ++ + QVQE+L++G I+ S SPY  P  + PKK D      +R+  D R +N+IT+   +PIP
Subjt:  KEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDT-----WRICGDSRAINKITVKYSFPIP

Query:  RLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG
         + +IL +LG    F+ +DL  G+HQI +  E   KTAF T +G
Subjt:  RLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 2.9e-29 | 36.32
Query:  NIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYAIEEINPIVLELLESYPEIMKEPTSLPPLR------DIQHQIDLLPSSNLPNLPHYRMSLKEYQ
        +I+ + G   N++S    Q ++     H N  KD +       + + L + Y EI++    LPP         ++H I++ P + LP L  Y ++ K  Q
Subjt:  NIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYAIEEINPIVLELLESYPEIMKEPTSLPPLR------DIQHQIDLLPSSNLPNLPHYRMSLKEYQ

Query:  ILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNA
         + + VQ+LLD   I PS SP + P +L PKKD T+R+C D R +NK T+   FP+PR+ ++L ++G A +F+ +DL SGYHQI + P+D +KTAF T +
Subjt:  ILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNA

Query:  G
        G
Subjt:  G

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 2.9e-29 | 36.32
Query:  NIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYAIEEINPIVLELLESYPEIMKEPTSLPPLR------DIQHQIDLLPSSNLPNLPHYRMSLKEYQ
        +I+ + G   N++S    Q ++     H N  KD +       + + L + Y EI++    LPP         ++H I++ P + LP L  Y ++ K  Q
Subjt:  NIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYAIEEINPIVLELLESYPEIMKEPTSLPPLR------DIQHQIDLLPSSNLPNLPHYRMSLKEYQ

Query:  ILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNA
         + + VQ+LLD   I PS SP + P +L PKKD T+R+C D R +NK T+   FP+PR+ ++L ++G A +F+ +DL SGYHQI + P+D +KTAF T +
Subjt:  ILQEQVQELLDKGHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNA

Query:  G
        G
Subjt:  G

Arabidopsis top hits | e value | % identity | Alignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein | 7.4e-04 | 25.3
Query:  HENPRMAPLPFQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENNDY-------KMKVDLPTFSGK---------FNMEAFLD
        H++   +   F+E+P    P   PP+      RY     SS    P    P     P  + N Y       K+   L   +G          F+   +L 
Subjt:  HENPRMAPLPFQEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENNDY-------KMKVDLPTFSGK---------FNMEAFLD

Query:  WIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLP
        W   +  +F +  T +E K+ +   +L+G A  WWDQ + NR    + PIRTW ++   M  ++ P
Subjt:  WIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQIQTNRQRYGKRPIRTWSKMLKLMKNRWLP


Sequences
CDS sequence
ATGGTGGGCAAGAAAAACCCCACACTTCCCAGTGACGGCGCTGACCAAACCGATCAAGAGACCTTGAGTCTCTCCCCAAAATCTCTCAACAATCGCATTCTGCACATAGG
GGAATCGATGGAAGAAATGAAGGGCCACGTCCGTGACGTTCGGAAAATGTTGGAGCAATTAATTCTTCAACAACCAAATTATGCGGGACAAGAAAATCTTAGATTAATTG
ACCACCATCAACAAATTAGGCATCAAGGAACGGTTAAAAAACGCCCAAGAAAGATTCAAGAAAATAAAAATAACCCACATGAAAATCCAAGAATGGCTCCCCTACCTTTT
CAAGAAGATCCAAGAATGGCTCCACCACCTTTTCGACCTCCTGAATTATGCCTTGAAAATCCGCGATATCAAGAATTTGATAGCTCAAGTGAAGAAGATGTTCCCTACAA
TTATGATCCAAGAAGGAGAGGAGAACCATATCAAGAAAACAATGATTATAAGATGAAGGTTGATCTCCCAACATTTAGTGGAAAATTCAACATGGAAGCTTTTCTTGATT
GGATAAAAGAGGTAGAAGCTTTCTTTGGTTATGCTGGAACTCCCGAAGAAAAGAAAGTAAAACTAGTAGCTTACAAGTTAAGAGGAGGAGCATCCGCTTGGTGGGATCAA
ATTCAAACCAATAGGCAAAGGTATGGCAAACGTCCTATTAGAACTTGGTCAAAGATGTTAAAACTGATGAAAAACCGTTGGCTACCCGTAAACTATGAACAAATGTTATA
TAACCAGTATCAACATTGTAGCCAAGGAAGTAAGACAGTAGCTGACTATTCTGAGGAATCTCATAGGCTGTGTGCAAGAAACAACTTAGCTGAATCTGAAAATCATTTAA
TTGCACGATATGTTGATGGATTGCGTGATGATATTCAAAAAAGGAGAAGTACCACAGATGGAAAAACTTTCCAGATCGGAAGTTCCTCAACAAGTAATATGGACAAAGGA
AAAGAAGAAATGGGAACACGACAAATACAAGGGATAAACACCAAAAATGCATCCACAACTTATAACCGACCAAATTTGGGAAAATGTTTCCGATGTGGTCAACAGGGCCA
CCTTTCTAATGAATGTCCACAAAGGAAAACTTTAGCAATTATGGATGCACAAAATGAAGAAGACTTTGATGAGGAAGATAGGTCGTATGAGGATATTAATTACATAGAAG
CCGATCAAGGAGAGCAACTCTCTTGCATTTTACAAAGAATTTTCCTAACACCCAATACCGATTCACATCCCCAAAAGCACTTGTTATTTCGAACAAGGTGTACAGTAAAT
GGGAAGGTTTGTAACATCATTGTGGATAGTGGGAGTAGTGAAAACATCATTTCAAGAAAAGCAACACAACTACTTCAACTTAAAACCGACCCTCACCCTAACCCTTACAA
GGACAATTATGCAATTGAGGAAATTAACCCAATTGTACTTGAATTGTTGGAATCTTATCCTGAAATCATGAAAGAACCAACTTCTTTACCACCATTAAGAGATATTCAAC
ATCAAATTGATTTACTGCCTAGTTCAAATCTGCCAAATTTGCCACATTACAGGATGAGTCTTAAAGAATATCAGATTCTTCAAGAACAAGTGCAAGAACTCTTAGACAAG
GGACATATTCAACCTAGCATGAGCCCTTATGCTGTACCAGCTTTATTGACACCTAAAAAAGACGACACTTGGAGAATCTGTGGTGACAGTCGTGCGATCAACAAAATCAC
GGTAAAATATAGTTTTCCTATTCCTAGATTATCTGACATATTAGATCAATTGGGTGGTGCAGTTGTATTTTCAAAGGTGGACCTCAAGAGCGGTTACCACCAAATTAGAA
TATGCCCCGAAGATGAATGGAAAACGGCCTTCAAAACTAATGCGGGCTTTTTGAATGGCTTGTAA
mRNA sequence
ATGGTGGGCAAGAAAAACCCCACACTTCCCAGTGACGGCGCTGACCAAACCGATCAAGAGACCTTGAGTCTCTCCCCAAAATCTCTCAACAATCGCATTCTGCACATAGG
GGAATCGATGGAAGAAATGAAGGGCCACGTCCGTGACGTTCGGAAAATGTTGGAGCAATTAATTCTTCAACAACCAAATTATGCGGGACAAGAAAATCTTAGATTAATTG
ACCACCATCAACAAATTAGGCATCAAGGAACGGTTAAAAAACGCCCAAGAAAGATTCAAGAAAATAAAAATAACCCACATGAAAATCCAAGAATGGCTCCCCTACCTTTT
CAAGAAGATCCAAGAATGGCTCCACCACCTTTTCGACCTCCTGAATTATGCCTTGAAAATCCGCGATATCAAGAATTTGATAGCTCAAGTGAAGAAGATGTTCCCTACAA
TTATGATCCAAGAAGGAGAGGAGAACCATATCAAGAAAACAATGATTATAAGATGAAGGTTGATCTCCCAACATTTAGTGGAAAATTCAACATGGAAGCTTTTCTTGATT
GGATAAAAGAGGTAGAAGCTTTCTTTGGTTATGCTGGAACTCCCGAAGAAAAGAAAGTAAAACTAGTAGCTTACAAGTTAAGAGGAGGAGCATCCGCTTGGTGGGATCAA
ATTCAAACCAATAGGCAAAGGTATGGCAAACGTCCTATTAGAACTTGGTCAAAGATGTTAAAACTGATGAAAAACCGTTGGCTACCCGTAAACTATGAACAAATGTTATA
TAACCAGTATCAACATTGTAGCCAAGGAAGTAAGACAGTAGCTGACTATTCTGAGGAATCTCATAGGCTGTGTGCAAGAAACAACTTAGCTGAATCTGAAAATCATTTAA
TTGCACGATATGTTGATGGATTGCGTGATGATATTCAAAAAAGGAGAAGTACCACAGATGGAAAAACTTTCCAGATCGGAAGTTCCTCAACAAGTAATATGGACAAAGGA
AAAGAAGAAATGGGAACACGACAAATACAAGGGATAAACACCAAAAATGCATCCACAACTTATAACCGACCAAATTTGGGAAAATGTTTCCGATGTGGTCAACAGGGCCA
CCTTTCTAATGAATGTCCACAAAGGAAAACTTTAGCAATTATGGATGCACAAAATGAAGAAGACTTTGATGAGGAAGATAGGTCGTATGAGGATATTAATTACATAGAAG
CCGATCAAGGAGAGCAACTCTCTTGCATTTTACAAAGAATTTTCCTAACACCCAATACCGATTCACATCCCCAAAAGCACTTGTTATTTCGAACAAGGTGTACAGTAAAT
GGGAAGGTTTGTAACATCATTGTGGATAGTGGGAGTAGTGAAAACATCATTTCAAGAAAAGCAACACAACTACTTCAACTTAAAACCGACCCTCACCCTAACCCTTACAA
GGACAATTATGCAATTGAGGAAATTAACCCAATTGTACTTGAATTGTTGGAATCTTATCCTGAAATCATGAAAGAACCAACTTCTTTACCACCATTAAGAGATATTCAAC
ATCAAATTGATTTACTGCCTAGTTCAAATCTGCCAAATTTGCCACATTACAGGATGAGTCTTAAAGAATATCAGATTCTTCAAGAACAAGTGCAAGAACTCTTAGACAAG
GGACATATTCAACCTAGCATGAGCCCTTATGCTGTACCAGCTTTATTGACACCTAAAAAAGACGACACTTGGAGAATCTGTGGTGACAGTCGTGCGATCAACAAAATCAC
GGTAAAATATAGTTTTCCTATTCCTAGATTATCTGACATATTAGATCAATTGGGTGGTGCAGTTGTATTTTCAAAGGTGGACCTCAAGAGCGGTTACCACCAAATTAGAA
TATGCCCCGAAGATGAATGGAAAACGGCCTTCAAAACTAATGCGGGCTTTTTGAATGGCTTGTAA
Protein sequence
MVGKKNPTLPSDGADQTDQETLSLSPKSLNNRILHIGESMEEMKGHVRDVRKMLEQLILQQPNYAGQENLRLIDHHQQIRHQGTVKKRPRKIQENKNNPHENPRMAPLPF
QEDPRMAPPPFRPPELCLENPRYQEFDSSSEEDVPYNYDPRRRGEPYQENNDYKMKVDLPTFSGKFNMEAFLDWIKEVEAFFGYAGTPEEKKVKLVAYKLRGGASAWWDQ
IQTNRQRYGKRPIRTWSKMLKLMKNRWLPVNYEQMLYNQYQHCSQGSKTVADYSEESHRLCARNNLAESENHLIARYVDGLRDDIQKRRSTTDGKTFQIGSSSTSNMDKG
KEEMGTRQIQGINTKNASTTYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMDAQNEEDFDEEDRSYEDINYIEADQGEQLSCILQRIFLTPNTDSHPQKHLLFRTRCTVN
GKVCNIIVDSGSSENIISRKATQLLQLKTDPHPNPYKDNYAIEEINPIVLELLESYPEIMKEPTSLPPLRDIQHQIDLLPSSNLPNLPHYRMSLKEYQILQEQVQELLDK
GHIQPSMSPYAVPALLTPKKDDTWRICGDSRAINKITVKYSFPIPRLSDILDQLGGAVVFSKVDLKSGYHQIRICPEDEWKTAFKTNAGFLNGL
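
The CCHC-type zinc finger annotation (IPR001878) can be spot-checked against the protein sequence with a simple pattern search. The sketch below is illustrative only: it assumes the commonly cited zinc knuckle spacing C-X2-C-X4-H-X4-C, uses a short fragment copied from the protein sequence above, and the names `fragment`, `CCHC` and `find_cchc` are hypothetical helpers, not part of any CuGenDB or InterPro tool. A regular-expression hit is not a substitute for the profile-based InterPro assignment.

```python
import re

# Fragment copied from the protein sequence above (region around the
# predicted CCHC zinc finger). Variable names are illustrative.
fragment = "TYNRPNLGKCFRCGQQGHLSNECPQRKTLAIMD"

# Assumed consensus spacing for a CCHC "zinc knuckle": C-X2-C-X4-H-X4-C.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

def find_cchc(seq):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in CCHC.finditer(seq)]

hits = find_cchc(fragment)
print(hits)
```

The same function can be applied to the full protein sequence after joining its lines into one string; offsets are then relative to the start of the protein.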