CuGenDBv2

Lag0039033 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039033
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr2:34150466..34157515
RNA-Seq Expression: Lag0039033
Synteny: Lag0039033
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR025724 - GAG-pre-integrase domain
                  IPR036397 - Ribonuclease H superfamily
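
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. The names `Locus` and `parse_locus` are hypothetical helpers introduced here, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings such as "chr2:34150466..34157515".
_LOCUS_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a 'chrom:start..end' location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["chrom"], start, end)


locus = parse_locus("chr2:34150466..34157515")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the location listed for Lag0039033 spans 7,050 bp on chr2.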


Homology
GenBank top hits (e value, % identity, alignment)
PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense] | e value: 1.4e-85 | identity: 32.4%
Query:  ITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADV
        ++S  +S+  + L + ++VKLD  NY LW+ MVL I+RG ++DGY+LG K  P EF+     AA S    NP+FE+W   DQ L GWL   MT  +A  +
Subjt:  ITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADV

Query:  VNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEITSWQELHS
        ++ +TS ++W   + + GA ++++I  L+    +T+KG                                    GLD+EY P++  + ++   SW +L +
Subjt:  VNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEITSWQELHS

Query:  ILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLC
         L+TFE  + +            +  TN   N   N +    ++G+ +  N N    N N+           R +N RG    R G  RG + + TCQ+C
Subjt:  ILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLC

Query:  GKFGHSAPACYMRFEEDFN-NPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDG
        G   H A  C+ RF++ ++ + H + N   G  +A++A+   + D +W  DSGA++H+T         ++++G +SL VG+G KL I   G S +     
Subjt:  GKFGHSAPACYMRFEEDFN-NPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDG

Query:  SAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCL
         ++ L+++L+VP I  NL+S+SKL  DNN+ VEF  +CC VK++ + K +LRG L++GLYQL      A+    ES                        
Subjt:  SAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCL

Query:  TVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISF
                          WH++LGH +++VL + L+SCN+ L  ++   FCE+CQ+GK H L F +S +HA   LEL+H D+WGP+PI SS+G++YY+ F
Subjt:  TVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISF

Query:  VDNFSRFT
        +D+F+RFT
Subjt:  VDNFSRFT

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense] | e value: 5.8e-87 | identity: 36.99%
Query:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTP--------
        TL   SSSTA     V    S +S  SS FG+ L+    +KLD +N++LW+ MV  I++G ++DG++  T+  P EFL S       S  TP        
Subjt:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTP--------

Query:  -NPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG-------------------------------
         NP++E+W   DQ L GWL+  MT  VA  V+   T+  +WKALE ++GA SK++ N +R ++Q T+KG                               
Subjt:  -NPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG-------------------------------

Query:  -----GLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQN
             GLD+EY+PI+  I+ +E  +WQE++  L++++  L   +      +L    + + A N+  N  +           N+  N+ N N   N     
Subjt:  -----GLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQN

Query:  YGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKAD
         G R    R RGRG     R NNSRPTCQ+CGKFGHSA  CY R+++++     + N      S ++ATPE V D  W  DSGAT H+T D  NL +K+D
Subjt:  YGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKAD

Query:  YNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAF
        Y G++SL VG+G +L ISHVG  ++ +    +I L  +LHVP I+ NL+S+S+L  DN++F+EFH +CC VK++ +   VLRG L+NGLYQL+IP  K+ 
Subjt:  YNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAF

Query:  TKSLESPFNPKSS--HQANCVLFVSHYPK-KCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLS
              P   +SS  H     L  S+  K + L+V   S ++KS    ++VWH+RLGH S++VL+
Subjt:  TKSLESPFNPKSS--HQANCVLFVSHYPK-KCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLS

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] | e value: 4.4e-87 | identity: 36.99%
Query:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTP--------
        TL   SSSTA     V    S +S  SS FG+ L+    +KLD +N++LW+ MV  I++G ++DG++  T+  P EFL S       S  TP        
Subjt:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTP--------

Query:  -NPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG-------------------------------
         NP++E+W   DQ L GWL+  MT  VA  V+   T+  +WKALE ++GA SK++ N +R ++Q T+KG                               
Subjt:  -NPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG-------------------------------

Query:  -----GLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQN
             GLD+EY+PI+  I+ +E  +WQE++  L++++  L   +      +L    + + A N+  N  +           N+  N+ N N   N     
Subjt:  -----GLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQN

Query:  YGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKAD
         G R    R RGRG     R NNSRPTCQ+CGKFGHSA  CY R+++++     + N      S ++ATPE V D  W  DSGAT+H+T D  NL +K+D
Subjt:  YGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKAD

Query:  YNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAF
        Y G++SL VG+G +L ISHVG  ++ +    +I L  +LHVP I+ NL+S+S+L  DN++F+EFH +CC VK++ +   VLRG L+NGLYQL+IP  K+ 
Subjt:  YNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAF

Query:  TKSLESPFNPKSS--HQANCVLFVSHYPK-KCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLS
              P   +SS  H     L  S+  K + L+V   S ++KS    ++VWH+RLGH S++VL+
Subjt:  TKSLESPFNPKSS--HQANCVLFVSHYPK-KCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLS

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] | e value: 1.6e-89 | identity: 50.37%
Query:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT
        T  SS    V   VAVP+   S   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT ++P +FL S   E  S  L  NP++ EW  
Subjt:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT

Query:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE
        VDQAL GWLFG MTP++A DVV+F++SREVWKALE +YGATSKARINQLR  LQNTKK                                     GL+AE
Subjt:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE

Query:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR
        Y+PI+C I+ K+ TSWQEL + LVTFE TL+R +  S  + +   D + NY +++Q +  +    Q H  Q  Q Q RG         S N      N R
Subjt:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR

Query:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTV
        GRGRGR+   RGNNS+P+CQLCGK+GH A  CY RF+E+FNN   S N  N   SAY+A PE+V +P+WL DSGAT H+T+D++NL+VK+DYNG      
Subjt:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTV

Query:  GDGTK
        G G K
Subjt:  GDGTK

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] | e value: 9.5e-90 | identity: 51.02%
Query:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT
        T  SS    V   VAVP+   S   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT ++P +FL S   E  S  L  NP++ EW  
Subjt:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT

Query:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE
        VDQAL GWLFG MTP++A DVV+F++SREVWKALE +YGATSKARINQLR  LQNTKK                                     GL+AE
Subjt:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE

Query:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR
        Y+PI+C I+ K+ TSWQEL + LVTFE TL+R +  S  + +   D + NY +++Q +  +    Q H  Q  Q Q RG         S N      N R
Subjt:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR

Query:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNG
        GRGRGR+   RGNNS+P+CQLCGK+GH A  CY RF+E+FNN   S N  N   SAY+A PE+V +P+WL DSGAT H+T+D++NL+VK+DYNG
Subjt:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 | e value: 4.6e-90 | identity: 51.02%
Query:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT
        T  SS    V   VAVP+   S   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT ++P +FL S   E  S  L  NP++ EW  
Subjt:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT

Query:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE
        VDQAL GWLFG MTP++A DVV+F++SREVWKALE +YGATSKARINQLR  LQNTKK                                     GL+AE
Subjt:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE

Query:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR
        Y+PI+C I+ K+ TSWQEL + LVTFE TL+R +  S  + +   D + NY +++Q +  +    Q H  Q  Q Q RG         S N      N R
Subjt:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR

Query:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNG
        GRGRGR+   RGNNS+P+CQLCGK+GH A  CY RF+E+FNN   S N  N   SAY+A PE+V +P+WL DSGAT H+T+D++NL+VK+DYNG
Subjt:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNG

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 | e value: 7.9e-90 | identity: 50.37%
Query:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT
        T  SS    V   VAVP+   S   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT ++P +FL S   E  S  L  NP++ EW  
Subjt:  TSPSSSTAAVAAPVAVPSSITS-IISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLES-GGEAASSQLTPNPKFEEWTT

Query:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE
        VDQAL GWLFG MTP++A DVV+F++SREVWKALE +YGATSKARINQLR  LQNTKK                                     GL+AE
Subjt:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKK------------------------------------GGLDAE

Query:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR
        Y+PI+C I+ K+ TSWQEL + LVTFE TL+R +  S  + +   D + NY +++Q +  +    Q H  Q  Q Q RG         S N      N R
Subjt:  YIPIICTIQEKEITSWQELHSILVTFEGTLIRFS-PSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNR

Query:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTV
        GRGRGR+   RGNNS+P+CQLCGK+GH A  CY RF+E+FNN   S N  N   SAY+A PE+V +P+WL DSGAT H+T+D++NL+VK+DYNG      
Subjt:  GRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTV

Query:  GDGTK
        G G K
Subjt:  GDGTK

A0A803P4G6 Uncharacterized protein | e value: 6.6e-105 | identity: 39.23%
Query:  LTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQ-LTPNPKFEEWTT
        ++S  +++ A     A  S + +     F   L+   ++KLD  N+ LW+ MV  I+RG ++DG++ GTK  P EFL +G  A   + ++ NP+FE W  
Subjt:  LTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQ-LTPNPKFEEWTT

Query:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKGGLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPS
         DQ L GWL+  MT  +A +V+   T+ E+W      Y  +           L +    GL AEY+ II  I+ +  T+WQ L  +L++F+  + R    
Subjt:  VDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKGGLDAEYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPS

Query:  PISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFE
             L G +TT      Q N  +    QG          RG   YSQN Q+ N G R +  R RGRGRY     NNSRPTCQ+CGKFGHSA  CY R++
Subjt:  PISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFE

Query:  EDF--NNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHI
        E+F  ++P+    +  G  +A+IATP+++    W  DSGA++HIT+   ++S K++Y G ++LTVGDG+KL ISH+G   +    G  + L  MLHVP I
Subjt:  EDF--NNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHI

Query:  KHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCV-----LFVSHYPKKCLTVSASSNKL
          NLIS+ KLTTDNN+ +EF+   CLVK++ +KKV+L+G L++GLYQ+Q P  +  +  +  P     + +++ V     +F  H  +  ++ S  S   
Subjt:  KHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCV-----LFVSHYPKKCLTVSASSNKL

Query:  KSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT
                VWH+RLGH S +VL   L S N+ +P NE+  FC++CQ+GKSH+L F  SQ  A   L+LIH DLWGP+P+ASS  + YYI FVD FSR+T
Subjt:  KSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT

A0A803PEH4 Uncharacterized protein | e value: 3.1e-102 | identity: 38.6%
Query:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHP-LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWT
        T +SP++S+ A  A  +  ++  S + ++F  P L+   ++KLD  NY LW+ MV  I+RG ++ GY+ GT   P EF+  G     +Q+T NP++E W 
Subjt:  TLTSPSSSTAAVAAPVAVPSSITSIISSSFGHP-LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWT

Query:  TVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDA
          DQ L GWL+  MT  +A +V+   ++  + + LE +YGA SK++++  R  +Q T+KG                                    GLDA
Subjt:  TVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDA

Query:  EYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYG----SRN
        EY+ I+  I+ +  T+WQEL  +L++F+  + R      +  L+ ++ T+         SS Q N      +  N  RG    SQN  + + G    SR 
Subjt:  EYIPIICTIQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYG----SRN

Query:  TNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDF-----NNPHGSTNKG--NGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVK
        T+NR RGRGR     G+ SRPTCQ+ GK+GH+A  CY RF+E +     NNPH     G  N   SA++ATPEV+    W  DSGA++HIT+D ANL+ K
Subjt:  TNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPACYMRFEEDF-----NNPHGSTNKG--NGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVK

Query:  ADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKK
         DYNG +S+ VG+G+KL I+H+G   +  + G+ + L +ML VP I  NL+S+SKL TDNN+ +EF+ + CLVK++ +KKV+L G L++ LYQ       
Subjt:  ADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKK

Query:  AFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKS-----CKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALL
             L+SPF  KSSH      F+S +     T+S  SN  +S           V H+RLGH S +VL+  L S N+S+  N +   C++CQ+GK+HAL 
Subjt:  AFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKS-----CKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALL

Query:  FSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT
        F SS T A   L+LIH DLWGP+PIAS+  + YYI FVD++SR+T
Subjt:  FSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT

A0A803PM38 Uncharacterized protein | e value: 7.4e-88 | identity: 34.93%
Query:  SIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESG--GEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADV
        +I+   FG  L+    +KLD  N+ LWR MV AI+RG ++DGY+ GT  +P EFL S     + SS    NP FE+W   DQ L GWL+G MT  +A +V
Subjt:  SIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESG--GEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADV

Query:  VNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEITSWQELHS
        +   +S  +W ALE+++GA SKA++++ R  +Q  +KG                                    GLD EY+P++  I+ +  T+WQ+L  
Subjt:  VNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEITSWQELHS

Query:  ILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLC
        +L++ +  + R      S+ L+G        N   + ++   + G N   + N NRG  +           +R +NNR RGRG     R +  RPTCQ+C
Subjt:  ILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLC

Query:  GKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGS
        GK+GHSA  CY R                                      GA++HIT+++  +++K +YNG + +TV +G +L I H+G  ++     S
Subjt:  GKFGHSAPACYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGS

Query:  AICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLT
         + L  +LHVP I  NL+SISKLT+DNN+ VEF    C VK++++ +VVL+G L++GLYQ   P       S  S   P S          S+  K    
Subjt:  AICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLT

Query:  VSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFV
            +N+L  C + +  WH+RLGH S RVL   L   N+   +N    FC++CQ GKSH+L F  +   A  PLEL+H D+WGPSPI S+  +RYYI F+
Subjt:  VSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFV

Query:  DNFSRFT
        D+FSR+T
Subjt:  DNFSRFT

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.1e-11 | identity: 20.04%
Query:  KFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMY----------------------GATSKARINQLRGTLQNTKKGGLDAEYIPIICT
        K E+W  +D+  +  +   ++  V  ++++  T+R +W  LE +Y                      G    + +N   G +      G+  E       
Subjt:  KFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMY----------------------GATSKARINQLRGTLQNTKKGGLDAEYIPIICT

Query:  IQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYG
         ++K I     L S       T++    +    D++     N    ++                NQ Q    +   ++YQ      R++NN GR   R  
Subjt:  IQEKEITSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYG

Query:  NQRGNNSR-PTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETS------------------AYIATPEVVC------DPNWLTDSGATSHITADV
        ++  + SR   C  C + GH        F+ D  NP     KG GETS                        E  C      +  W+ D+ A+ H T  V
Subjt:  NQRGNNSR-PTCQLCGKFGHSAPACYMRFEEDFNNPHGSTNKGNGETS------------------AYIATPEVVC------DPNWLTDSGATSHITADV

Query:  ANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQL
         +L  +       ++ +G+ +   I+ +G   I    G  + L ++ HVP ++ NLIS   L  D       +    L K      V+ +G  R  LY+ 
Subjt:  ANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQL

Query:  QIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHAL
           + +    + +                                     ++   +WH+R+GH S++ L +  +   +S         C+ C  GK H +
Subjt:  QIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHAL

Query:  LFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSR
         F +S       L+L++ D+ GP  I S  G +Y+++F+D+ SR
Subjt:  LFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSR

P93293 Uncharacterized mitochondrial protein AtMg00300 | e value: 5.6e-08 | identity: 32.58%
Query:  SASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASS
        +  SN  ++ K    +WH RL H S R + + ++   L         FCE C +GK+H + FS+ Q     PL+ +H DLWG   +  S
Subjt:  SASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 8.5e-49 | identity: 27.73%
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGA
        KL   NYL+W   V A+  G ++ G++ G+ + P   +  G +AA      NP +  W   D+ +   + G ++ +V   V    T+ ++W+ L ++Y  
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGA

Query:  TSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEI-TSWQELHSILVTFEGTLIRFSPSPIS
         S   + QLR  L+   KG                                     L  EY P+I  I  K+   +  E+H  L+  E  ++  S + + 
Subjt:  TSKARINQLRGTLQNTKKG------------------------------------GLDAEYIPIICTIQEKEI-TSWQELHSILVTFEGTLIRFSPSPIS

Query:  TDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRP---TCQLCGKFGHSAPAC--YMR
                T  A + +   ++   N G+   R  N+N  N +      S N+   N                N S+P    CQ+CG  GHSA  C     
Subjt:  TDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRP---TCQLCGKFGHSAPAC--YMR

Query:  FEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHI
        F    N+    +     +  A +A        NWL DSGAT HIT+D  NLS+   Y G D + V DG+ + ISH G +++  K    + L+N+L+VP+I
Subjt:  FEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHI

Query:  KHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKV
          NLIS+ +L   N + VEF P+   VK+ ++   +L+G  ++ LY+  I   +  +                  LF S                 S K 
Subjt:  KHNLISISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKV

Query:  PRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFF-CESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT
          S WH RLGH +  +L+  + + +LS+      F  C  C   KS+ + FS S  ++ RPLE I+ D+W  SPI S   YRYY+ FVD+F+R+T
Subjt:  PRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFF-CESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.9e-48 | identity: 28.87%
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGA
        KL   NYL+W   V A+  G ++ G++ G+   P   +  G +A       NP +  W   D+ +   + G ++ +V   V    T+ ++W+ L ++Y  
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQPSEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGA

Query:  TSKARINQLRGTLQNTKKG-----------------GLDAEYIPIICTIQEKEI-TSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNF
         S   + QLR   +  +                    L  +Y P+I  I  K+   S  E+H  L+  E  L+  + + +        T N   +R  N 
Subjt:  TSKARINQLRGTLQNTKKG-----------------GLDAEYIPIICTIQEKEI-TSWQELHSILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNF

Query:  SSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRG-RGRYGNQRGNNSRPTCQLCGKFGHSAPAC--YMRFEEDFNNPHGSTNKGNGETSA
        +  Q N+G N   N N NR N     ++Q  + GSR+ N + +   GR            CQ+C   GHSA  C    +F+   N    ++     +  A
Subjt:  SSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRG-RGRYGNQRGNNSRPTCQLCGKFGHSAPAC--YMRFEEDFNNPHGSTNKGNGETSA

Query:  YIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFH
         +A        NWL DSGAT HIT+D  NLS    Y G D + + DG+ + I+H G +++     S + LN +L+VP+I  NLIS+ +L   N + VEF 
Subjt:  YIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLISISKLTTDNNLFVEFH

Query:  PSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVAL
        P+   VK+ ++   +L+G  ++ LY+                  P +S QA     VS +   C             K   S WH RLGH S  +L+  +
Subjt:  PSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKVPRSVWHQRLGHASDRVLSVAL

Query:  RSCNLSL--PMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT
         + +L +  P +++   C  C   KSH + FS+S   + +PLE I+ D+W  SPI S   YRYY+ FVD+F+R+T
Subjt:  RSCNLSL--PMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 3.9e-09 | identity: 32.58%
Query:  SASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASS
        +  SN  ++ K    +WH RL H S R + + ++   L         FCE C +GK+H + FS+ Q     PL+ +H DLWG   +  S
Subjt:  SASSNKLKSCKVPRSVWHQRLGHASDRVLSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASS


Sequences
CDS sequence
ATGTCGTCTGGTGCCAAGAGGAAGCAGGAAGAGGAAGTGCATGCAGCTAGACAGAAAGTCAGAGGGTCTCTGCCAGGGACCAGTGAGTGCTTAGTGGGGGAAGTCCAACA
TGAGGTGTTCTTGCCCGGGTCACCGATGAAGAGCTCAAGCCAGAGTATCCTGAGCTTTATGATGACGACGATTCTGATGATAGCTCCTAAGGTGGGGAGTCAGCGTACCC
CTCGCTCAAGTCTCGAGGTTTTTGGACTTGCTCGTGACGATGCTCTTAATGGTGTTGGTTTCTCAAACAGAGAGTTGGAGCATTTCCCTCATGAAAATTATGGGTCTTTG
AGTCGGGGTGGCAGCGAACTAGGCGTGGTGTTGCTACAAGCGGCAGCGCACGACGGGTTGGCAGATTTGCAGCAGAGTGAGCTTGTGCGTAGAGCCATGGCTGATGAAAC
CCTAACCTCTCCTTCAAGCAGCACCGCCGCCGTCGCAGCTCCTGTAGCGGTACCATCATCAATCACGAGTATCATCAGTTCATCGTTTGGTCATCCTCTAAGCACGGTTC
TCACAGTGAAGCTAGATGAGAAAAATTACCTGCTATGGAGGGGAATGGTGCTTGCTATTCTCAGAGGGCAAAAGGTAGATGGATATGTTTTGGGCACAAAATCTCAACCA
TCTGAGTTTCTTGAATCAGGTGGAGAAGCAGCCAGTTCACAGCTTACGCCTAACCCAAAATTTGAAGAGTGGACTACAGTAGATCAAGCTCTCTCCGGATGGCTCTTCGG
ATTAATGACGCCGGCTGTTGCTGCAGATGTTGTTAATTTCAAAACCTCAAGAGAGGTATGGAAGGCGCTTGAACAAATGTATGGAGCGACCAGTAAAGCCAGGATTAACC
AACTTCGAGGGACTCTTCAAAACACCAAAAAGGGCGGTCTTGATGCCGAATATATTCCCATCATCTGTACCATTCAAGAGAAAGAAATTACCTCCTGGCAAGAGTTGCAT
TCGATCTTAGTCACCTTTGAAGGTACGTTGATACGATTCAGTCCTAGTCCTATTTCTACTGACTTAAGTGGTGATCAAACAACAAACTATGCCTACAATAGGCAAGGAAA
TTTCTCGAGTGGACAACAAAATCAGGGTCATAACTATCAGAGGAATCAAAATCAAAATCGTGGAAATCAGAATTATAGCCAGAACTACCAATCTCAAAACTATGGCTCAC
GGAACACCAATAACAGAGGAAGAGGTCGTGGAAGGTATGGCAATCAGAGAGGTAACAATTCCAGACCAACGTGTCAATTATGCGGTAAATTTGGTCATTCTGCCCCAGCC
TGTTATATGCGATTCGAGGAAGATTTTAACAACCCACACGGTTCAACCAACAAAGGAAATGGGGAAACCTCAGCTTATATTGCAACTCCTGAAGTTGTCTGTGACCCAAA
CTGGCTAACTGATAGTGGTGCCACCAGCCACATTACTGCTGATGTTGCGAATCTGAGTGTGAAGGCAGACTACAACGGTAATGACTCCCTTACTGTAGGAGATGGAACAA
AATTGCATATATCTCATGTTGGTAGAAGTAATATTGGTAATAAGGATGGCTCTGCTATTTGTCTGAACAACATGTTACATGTGCCTCATATTAAACATAATCTTATTAGC
ATTTCAAAGCTTACTACTGATAATAACTTGTTTGTGGAATTTCACCCCTCATGTTGTCTTGTGAAGGAAAGGGATTCAAAGAAGGTAGTGCTGCGCGGAACCCTTAGGAA
CGGCCTGTATCAGCTTCAAATCCCTCTTAAAAAGGCGTTTACAAAATCTTTGGAGTCTCCCTTCAATCCCAAAAGTAGTCATCAAGCAAACTGTGTGTTATTTGTTTCTC
ATTACCCCAAAAAGTGTCTTACTGTCTCTGCTAGTTCAAATAAATTAAAGTCATGTAAGGTGCCTAGATCTGTCTGGCATCAGCGTCTTGGTCATGCTTCTGATAGAGTC
TTAAGTGTTGCATTAAGGTCTTGTAATCTCAGTTTGCCAATGAATGAAATCAATTTCTTTTGTGAATCTTGTCAACATGGAAAATCCCATGCTCTACTGTTTTCTTCCTC
TCAAACTCATGCTCATAGACCTCTTGAACTTATTCATTGTGACCTTTGGGGACCTTCTCCTATAGCTTCCTCAGCTGGTTATCGCTATTACATTAGCTTTGTAGATAACT
TTAGTCGCTTCACTTAA
mRNA sequence
ATGTCGTCTGGTGCCAAGAGGAAGCAGGAAGAGGAAGTGCATGCAGCTAGACAGAAAGTCAGAGGGTCTCTGCCAGGGACCAGTGAGTGCTTAGTGGGGGAAGTCCAACA
TGAGGTGTTCTTGCCCGGGTCACCGATGAAGAGCTCAAGCCAGAGTATCCTGAGCTTTATGATGACGACGATTCTGATGATAGCTCCTAAGGTGGGGAGTCAGCGTACCC
CTCGCTCAAGTCTCGAGGTTTTTGGACTTGCTCGTGACGATGCTCTTAATGGTGTTGGTTTCTCAAACAGAGAGTTGGAGCATTTCCCTCATGAAAATTATGGGTCTTTG
AGTCGGGGTGGCAGCGAACTAGGCGTGGTGTTGCTACAAGCGGCAGCGCACGACGGGTTGGCAGATTTGCAGCAGAGTGAGCTTGTGCGTAGAGCCATGGCTGATGAAAC
CCTAACCTCTCCTTCAAGCAGCACCGCCGCCGTCGCAGCTCCTGTAGCGGTACCATCATCAATCACGAGTATCATCAGTTCATCGTTTGGTCATCCTCTAAGCACGGTTC
TCACAGTGAAGCTAGATGAGAAAAATTACCTGCTATGGAGGGGAATGGTGCTTGCTATTCTCAGAGGGCAAAAGGTAGATGGATATGTTTTGGGCACAAAATCTCAACCA
TCTGAGTTTCTTGAATCAGGTGGAGAAGCAGCCAGTTCACAGCTTACGCCTAACCCAAAATTTGAAGAGTGGACTACAGTAGATCAAGCTCTCTCCGGATGGCTCTTCGG
ATTAATGACGCCGGCTGTTGCTGCAGATGTTGTTAATTTCAAAACCTCAAGAGAGGTATGGAAGGCGCTTGAACAAATGTATGGAGCGACCAGTAAAGCCAGGATTAACC
AACTTCGAGGGACTCTTCAAAACACCAAAAAGGGCGGTCTTGATGCCGAATATATTCCCATCATCTGTACCATTCAAGAGAAAGAAATTACCTCCTGGCAAGAGTTGCAT
TCGATCTTAGTCACCTTTGAAGGTACGTTGATACGATTCAGTCCTAGTCCTATTTCTACTGACTTAAGTGGTGATCAAACAACAAACTATGCCTACAATAGGCAAGGAAA
TTTCTCGAGTGGACAACAAAATCAGGGTCATAACTATCAGAGGAATCAAAATCAAAATCGTGGAAATCAGAATTATAGCCAGAACTACCAATCTCAAAACTATGGCTCAC
GGAACACCAATAACAGAGGAAGAGGTCGTGGAAGGTATGGCAATCAGAGAGGTAACAATTCCAGACCAACGTGTCAATTATGCGGTAAATTTGGTCATTCTGCCCCAGCC
TGTTATATGCGATTCGAGGAAGATTTTAACAACCCACACGGTTCAACCAACAAAGGAAATGGGGAAACCTCAGCTTATATTGCAACTCCTGAAGTTGTCTGTGACCCAAA
CTGGCTAACTGATAGTGGTGCCACCAGCCACATTACTGCTGATGTTGCGAATCTGAGTGTGAAGGCAGACTACAACGGTAATGACTCCCTTACTGTAGGAGATGGAACAA
AATTGCATATATCTCATGTTGGTAGAAGTAATATTGGTAATAAGGATGGCTCTGCTATTTGTCTGAACAACATGTTACATGTGCCTCATATTAAACATAATCTTATTAGC
ATTTCAAAGCTTACTACTGATAATAACTTGTTTGTGGAATTTCACCCCTCATGTTGTCTTGTGAAGGAAAGGGATTCAAAGAAGGTAGTGCTGCGCGGAACCCTTAGGAA
CGGCCTGTATCAGCTTCAAATCCCTCTTAAAAAGGCGTTTACAAAATCTTTGGAGTCTCCCTTCAATCCCAAAAGTAGTCATCAAGCAAACTGTGTGTTATTTGTTTCTC
ATTACCCCAAAAAGTGTCTTACTGTCTCTGCTAGTTCAAATAAATTAAAGTCATGTAAGGTGCCTAGATCTGTCTGGCATCAGCGTCTTGGTCATGCTTCTGATAGAGTC
TTAAGTGTTGCATTAAGGTCTTGTAATCTCAGTTTGCCAATGAATGAAATCAATTTCTTTTGTGAATCTTGTCAACATGGAAAATCCCATGCTCTACTGTTTTCTTCCTC
TCAAACTCATGCTCATAGACCTCTTGAACTTATTCATTGTGACCTTTGGGGACCTTCTCCTATAGCTTCCTCAGCTGGTTATCGCTATTACATTAGCTTTGTAGATAACT
TTAGTCGCTTCACTTAA
Protein sequence
MSSGAKRKQEEEVHAARQKVRGSLPGTSECLVGEVQHEVFLPGSPMKSSSQSILSFMMTTILMIAPKVGSQRTPRSSLEVFGLARDDALNGVGFSNRELEHFPHENYGSL
SRGGSELGVVLLQAAAHDGLADLQQSELVRRAMADETLTSPSSSTAAVAAPVAVPSSITSIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKSQP
SEFLESGGEAASSQLTPNPKFEEWTTVDQALSGWLFGLMTPAVAADVVNFKTSREVWKALEQMYGATSKARINQLRGTLQNTKKGGLDAEYIPIICTIQEKEITSWQELH
SILVTFEGTLIRFSPSPISTDLSGDQTTNYAYNRQGNFSSGQQNQGHNYQRNQNQNRGNQNYSQNYQSQNYGSRNTNNRGRGRGRYGNQRGNNSRPTCQLCGKFGHSAPA
CYMRFEEDFNNPHGSTNKGNGETSAYIATPEVVCDPNWLTDSGATSHITADVANLSVKADYNGNDSLTVGDGTKLHISHVGRSNIGNKDGSAICLNNMLHVPHIKHNLIS
ISKLTTDNNLFVEFHPSCCLVKERDSKKVVLRGTLRNGLYQLQIPLKKAFTKSLESPFNPKSSHQANCVLFVSHYPKKCLTVSASSNKLKSCKVPRSVWHQRLGHASDRV
LSVALRSCNLSLPMNEINFFCESCQHGKSHALLFSSSQTHAHRPLELIHCDLWGPSPIASSAGYRYYISFVDNFSRFT
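
A quick sanity check for a record like this is to confirm that the CDS and protein sequences agree in length: the CDS should be a whole number of codons, and the protein should have one residue per codon, not counting a terminal stop codon. The sketch below illustrates that check. The function name `cds_matches_protein_length` is hypothetical, and the check assumes the standard stop codons (TAA, TAG, TGA) and ignores line wrapping in the pasted sequences. It compares lengths only and does not translate the sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def cds_matches_protein_length(cds: str, protein: str) -> bool:
    """Return True if the CDS codon count equals the protein length.

    Whitespace and line breaks are removed first, so sequences can be
    pasted as wrapped blocks. A trailing stop codon in the CDS and a
    trailing '*' in the protein are not counted.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper().rstrip("*")
    if len(cds) % 3 != 0:
        return False  # not a whole number of codons
    codons = len(cds) // 3
    if cds[-3:] in STOP_CODONS:
        codons -= 1  # the stop codon does not encode a residue
    return codons == len(protein)


# Toy example: three coding codons plus a stop codon against three residues.
print(cds_matches_protein_length("ATGTCGTCT\nTAA", "MSS"))
```

To apply it to this record, paste the full CDS and protein blocks above as the two arguments.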