; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015368 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015368
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag-Pol polyprotein
Genome locationchr12:11623315..11627708
RNA-Seq ExpressionLag0015368
SyntenyLag0015368
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015947503.1 uncharacterized protein LOC107472494 [Arachis duranensis]1.2e-9738.51Show/hide
Query:  QSNQASMRALELQVGQLASELKA-RPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSK------------ENAGASGSVPD------------
        ++ +AS++ +E Q+GQL+      RP   LPSDT  +P    KE+ K + LRS   +      ++            E A     VP+            
Subjt:  QSNQASMRALELQVGQLASELKA-RPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSK------------ENAGASGSVPD------------

Query:  -----------VEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNG
                   ++          PPLP+PQR   + +D  F+ FLE  K+L INIPL EA+EQMP YAKFLK+++ KK+   E ET+ L EECSA+L+ G
Subjt:  -----------VEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNG

Query:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGEARPTTVTLKLADRSI-------------------------------------
        +PPK KDPGSF +  +IG   L +ALCDLG+SINLMPLS+ RKL I E +PT ++L +ADRSI                                     
Subjt:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGEARPTTVTLKLADRSI-------------------------------------

Query:  -----------------------------------------------------------------------TYQ------EVDYVSKWVEAIACHQSDAK
                                                                               +Y        V+YVSKWVEAIA    DA+
Subjt:  -----------------------------------------------------------------------TYQ------EVDYVSKWVEAIACHQSDAK

Query:  TVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLG
         V +FL+ HIF R+G P+ LVSD G+HF N  + KLL KYG+ H++ATPYHPQ NGQ E++NRE+K ILEK+V  SRKDW+ +L++ALWAYRT +KTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLG

Query:  MSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGA---ITLQDEKDGKVFKVNGQRVKNYWG
         SP++L+YGK+CHLP+ELEHK FWA K LN D   AG   +LQL EL+EFR       K +   A   I L DE     F  NG R K Y G
Subjt:  MSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGA---ITLQDEKDGKVFKVNGQRVKNYWG

XP_027102262.1 uncharacterized protein LOC113723295 [Coffea arabica]2.1e-9936.94Show/hide
Query:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGF-AKAQVLPQ-QNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS-
        +R +PY+N YNPGW +HPNF++            V  PGF  ++Q  P+ Q  Q  P   SG S E ++K     T    Q  + S+  LE Q+ QLAS 
Subjt:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGF-AKAQVLPQ-QNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS-

Query:  --ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQ
           L+++  GKLPS T  +P    K+    +TLRS  E+ + +    E A        G+V   +P  +  P +         PP PFP R     +  Q
Subjt:  --ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQ

Query:  FKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSV
         +  L+  +++ +NIPL++AI+Q+P YAKFLK++ T KK+L   E V + E  SA+L+  LPPK KDPG FT+P  IG  ++ +A+ DLG+SIN+MP S+
Subjt:  FKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSV

Query:  YRKLGIGE-----ARPTTVTLK--------------------LADRSITYQE------------------------------------------------
        Y  + IG      +R     +K                    +  R ++ +E                                                
Subjt:  YRKLGIGE-----ARPTTVTLK--------------------LADRSITYQE------------------------------------------------

Query:  ---VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRK
           VDYVSKW+EA A    +AK V  F++SHIF R+GT RA++SD GTHF N ++  LL KYG+ HR++T YHPQ +GQAEISN+EIKSILEK V+ +RK
Subjt:  ---VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRK

Query:  DWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR---------------------------
        DWS RLD+ALWAYRT +KTP+GMSPYRLI+GK CHLP+ELEH+ +WA+K  N  +  +G  R LQL ELEE R                           
Subjt:  DWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR---------------------------

Query:  ---------HFLMRM--GK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQ
                 H  +++  GK              FP+GA+ +Q     K+FKVNG R+K ++ E FQ
Subjt:  ---------HFLMRM--GK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQ

XP_027120379.1 uncharacterized protein LOC113737329 [Coffea arabica]3.8e-9636.24Show/hide
Query:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS---
        +R +PY+N YNPGW +HPNFS+  +    Q QQ   QP    AQ      +Q+ P+  SG S E ++K     T    Q  + S+  LE Q+ QLAS   
Subjt:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS---

Query:  ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQFK
         L+++  GKLPS T  +P    K+    +TLRS  E+ + +    E A        G+V   +P  +  P +         P  PFP R     +  Q +
Subjt:  ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQFK

Query:  NFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYR
          L+  +++ +NIPL++AI+Q+P YAKFLK++ T KK+L   E V + E  SA+L+  LPPK KDPG FT+P  I   ++ +A+ DLG SIN+MP S+Y 
Subjt:  NFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYR

Query:  KLGIG-------EARPTTV------------------TLKLADRSITYQE--------------------------------------------------
         + +G       +A+   +                  + ++  R ++ +E                                                  
Subjt:  KLGIG-------EARPTTV------------------TLKLADRSITYQE--------------------------------------------------

Query:  -VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDW
         VDYVSKW+EA A    +AK V  F++SHIF R+GTPRA+++D GTHF N ++  L+ KYG+ HR++T YHPQ +GQAEISN+EIKSILEK V+ +RKDW
Subjt:  -VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR-----------------------------
        S RLD+ALWAYRT +KTP+GMSPYRLI+GK CHLP+ELEH+ +WA+K  N  ++  G  R LQL +LEE R                             
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR-----------------------------

Query:  -------HFLMR---------------MGKCFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQA
               H  ++               +   FP+GA+ +   K  KVFKVNG R+K ++ E FQA
Subjt:  -------HFLMR---------------MGKCFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQA

XP_028949286.1 uncharacterized protein LOC108175062 [Malus domestica]1.3e-10137.31Show/hide
Query:  YKPQGNQRNNPYSNFYNPGWCNHPNFSWGGQGNNVQ-VQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVG
        ++ Q  Q+ +PYSN YN GW +HP+  W  Q N  Q V    N+P                               F  +TD AI++       LE Q+ 
Subjt:  YKPQGNQRNNPYSNFYNPGWCNHPNFSWGGQGNNVQ-VQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVG

Query:  QLASELKARPQ-GKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSV-------PDVEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNF
        QLAS +  + Q G+LPS T        EQ+  VTLRS  E+ +    S E   +             V            P+PFP+R     ++   K  
Subjt:  QLASELKARPQ-GKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSV-------PDVEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNF

Query:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKL
        L+  +++ +N+PL++AI+Q+P YAKFLK++ T K R  + ETV+L+EE S +L+  LPPK KD GSFTIP  IGGKE GRALCDLG+SINLMP SVY  L
Subjt:  LEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKL

Query:  GIGEARPTTV------------------------------TLKLADRSITYQE-----------------------------------------------
         +G+ + T +                              T +L  + +   E                                               
Subjt:  GIGEARPTTV------------------------------TLKLADRSITYQE-----------------------------------------------

Query:  ----VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSR
            VDYVSKWVEA A   +D+K V+ F++S+IFARFG PRA++SD G+HF N     LL KY + H+++TPYHPQ +GQAE+SNRE+K ILEK V  SR
Subjt:  ----VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSR

Query:  KDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR--------------------------
        KDWS RL++ALWAYRT YKTP+GMSP+RLIYGK C LP+ELEHK +WA+K  N D+S AG  R LQL EL+E R                          
Subjt:  KDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR--------------------------

Query:  ------------HFLMRMGK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYW
                    H  +  GK              FP+GA+ +Q  K G VFKVNG R+K Y+
Subjt:  ------------HFLMRMGK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYW

XP_038725352.1 uncharacterized protein LOC120016583, partial [Tripterygium wilfordii]4.2e-9541.54Show/hide
Query:  GQLASELKARPQGKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSVPDVEPPYVPPPPY----------LPPLPFPQRQKPKNQDGQF
        GQLA  L  R  GKLPS  E   ++ ++      LRS  ++       K   G S   P V  P  P  P+          +P +PFP R  P  +    
Subjt:  GQLASELKARPQGKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSVPDVEPPYVPPPPY----------LPPLPFPQRQKPKNQDGQF

Query:  KNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVY
        ++  E+LK++ INIPL+EAI+ +P YAKFLK++ T K+++G+ E V L+EE SAIL+  LPPK KDPGSFTIP  IG KE+ +AL DLG+S+N+MP SVY
Subjt:  KNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVY

Query:  RKLGIGEAR-PTTVTLKLADR---SITY-------------------------QE---------------------------------------------
          L IG+ + P  +T    ++   SI Y                         QE                                             
Subjt:  RKLGIGEAR-PTTVTLKLADR---SITY-------------------------QE---------------------------------------------

Query:  -------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSI
                     VDYVSKWVEA+ C  +D   V  FL++ IFARFGTPRA++SD G+HF N    KL+ K G+ H++ATPYHPQ +GQ EISNREIK I
Subjt:  -------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSI

Query:  LEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGAITL
        LEK V  SRKDWS +L++ALWAYRT +KTP+GMSPYRL++GK CHLP+EL+H+ +WA+K LNFDL  AG +R LQL EL+E R+      K +       
Subjt:  LEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGAITL

Query:  QDEK-DGKVFKVNGQRVKNY
         D+    KVF+  GQ+V  Y
Subjt:  QDEK-DGKVFKVNGQRVKNY

TrEMBL top hitse value%identityAlignment
A0A1U7Z0W9 uncharacterized protein LOC1045898884.0e-9142.27Show/hide
Query:  ARTDVAIQSNQASMRALELQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKTVTLRS-------------------ELEIGQGAGGSKENAGASGSVP
        A+ D   +++ +S+  LE+Q+GQLA+ + AR QG LPS+ E +PR    EQ+K ++LRS                   E+   +    S++N G  G+  
Subjt:  ARTDVAIQSNQASMRALELQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKTVTLRS-------------------ELEIGQGAGGSKENAGASGSVP

Query:  DVEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNY--AKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDP
        + +       PY+  +PFPQR K +  + QF  FL+I K+L INIP  EA+ QMP Y  AKF K+I+T K+R  + +TVSLT++CS+I+ + +P K KDP
Subjt:  DVEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNY--AKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDP

Query:  GSFTIPVSIGGKELGR-----ALCDL-------------------------------GSSINLMPLSVYRKLGI----------GEARPTTVTLKLADRS
        GSFTIP  +G  E  +       CD+                               G +I    +     L I          G    T + +K  D  
Subjt:  GSFTIPVSIGGKELGR-----ALCDL-------------------------------GSSINLMPLSVYRKLGI----------GEARPTTVTLKLADRS

Query:  ITYQE----------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIK
          +            VDYVSKWVEA+A   +DA+ V +FL+  +F+RFG PRA++SD G+ F       LL KY + HR+ATPYHPQ +GQ E+SNRE+K
Subjt:  ITYQE----------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIK

Query:  SILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR
         ILEK V+ SRKDW+ +LD+ALWAYRT YKTP+GMSPYRLIYGKACHLP+ELEH+ FWA+K LNFD+  AG  RMLQL EL+E R
Subjt:  SILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR

A0A438CSP7 Gag-Pol polyprotein3.6e-9234.52Show/hide
Query:  YKPQGNQRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQ
        Y     ++ NP SN++N GW NHPNFSW    NN  VQ+  +  GF       Q+NK  L  +     F     +F+  T    ++  A +  LE QVG+
Subjt:  YKPQGNQRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQ

Query:  LASELKARPQGKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSVP---DVEPPYVP--PPPYLPPLPFPQRQKPKNQDGQFKNFLEIL
        ++  L  R QG LPS TE      KE VK +TLRS  E+ Q     ++      SVP   D   P  P  P P    +PFPQ+ K +N D QF  F++  
Subjt:  LASELKARPQGKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSVP---DVEPPYVP--PPPYLPPLPFPQRQKPKNQDGQFKNFLEIL

Query:  KQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGE
        K LHIN+P V+ +EQMP YAKFLK++L+ K++L + E V LTEEC+AIL+  LP K KDPGSFTIP +IG  +  + LC+LG+S+NLMPL +++KL +GE
Subjt:  KQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGE

Query:  ARPTT------------------------------------VTLKLADRSI-------------------------------------------TYQ---
         +PTT                                    + L + DR +                                           TY+   
Subjt:  ARPTT------------------------------------VTLKLADRSI-------------------------------------------TYQ---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------EVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQA
                              + DY+SKWVEA A   +DA+ V +FL+ +IFARFGTPRA++SD  +HF  +    LL KYG+ H IATPYHPQ +GQ 
Subjt:  ----------------------EVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQA

Query:  EISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDL
        E+SN E+K ILEK V+ SR+DWS +LD+ LWAYRTT+KTP+GMSP +L YGK+CHL +ELEHK+FWA+K LNFDL
Subjt:  EISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDL

A0A6P4BT80 uncharacterized protein LOC1074724945.7e-9838.51Show/hide
Query:  QSNQASMRALELQVGQLASELKA-RPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSK------------ENAGASGSVPD------------
        ++ +AS++ +E Q+GQL+      RP   LPSDT  +P    KE+ K + LRS   +      ++            E A     VP+            
Subjt:  QSNQASMRALELQVGQLASELKA-RPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSK------------ENAGASGSVPD------------

Query:  -----------VEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNG
                   ++          PPLP+PQR   + +D  F+ FLE  K+L INIPL EA+EQMP YAKFLK+++ KK+   E ET+ L EECSA+L+ G
Subjt:  -----------VEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNG

Query:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGEARPTTVTLKLADRSI-------------------------------------
        +PPK KDPGSF +  +IG   L +ALCDLG+SINLMPLS+ RKL I E +PT ++L +ADRSI                                     
Subjt:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGEARPTTVTLKLADRSI-------------------------------------

Query:  -----------------------------------------------------------------------TYQ------EVDYVSKWVEAIACHQSDAK
                                                                               +Y        V+YVSKWVEAIA    DA+
Subjt:  -----------------------------------------------------------------------TYQ------EVDYVSKWVEAIACHQSDAK

Query:  TVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLG
         V +FL+ HIF R+G P+ LVSD G+HF N  + KLL KYG+ H++ATPYHPQ NGQ E++NRE+K ILEK+V  SRKDW+ +L++ALWAYRT +KTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLG

Query:  MSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGA---ITLQDEKDGKVFKVNGQRVKNYWG
         SP++L+YGK+CHLP+ELEHK FWA K LN D   AG   +LQL EL+EFR       K +   A   I L DE     F  NG R K Y G
Subjt:  MSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGA---ITLQDEKDGKVFKVNGQRVKNYWG

A0A6P6VJZ0 uncharacterized protein LOC1137232951.0e-9936.94Show/hide
Query:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGF-AKAQVLPQ-QNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS-
        +R +PY+N YNPGW +HPNF++            V  PGF  ++Q  P+ Q  Q  P   SG S E ++K     T    Q  + S+  LE Q+ QLAS 
Subjt:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGF-AKAQVLPQ-QNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS-

Query:  --ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQ
           L+++  GKLPS T  +P    K+    +TLRS  E+ + +    E A        G+V   +P  +  P +         PP PFP R     +  Q
Subjt:  --ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQ

Query:  FKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSV
         +  L+  +++ +NIPL++AI+Q+P YAKFLK++ T KK+L   E V + E  SA+L+  LPPK KDPG FT+P  IG  ++ +A+ DLG+SIN+MP S+
Subjt:  FKNFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSV

Query:  YRKLGIGE-----ARPTTVTLK--------------------LADRSITYQE------------------------------------------------
        Y  + IG      +R     +K                    +  R ++ +E                                                
Subjt:  YRKLGIGE-----ARPTTVTLK--------------------LADRSITYQE------------------------------------------------

Query:  ---VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRK
           VDYVSKW+EA A    +AK V  F++SHIF R+GT RA++SD GTHF N ++  LL KYG+ HR++T YHPQ +GQAEISN+EIKSILEK V+ +RK
Subjt:  ---VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRK

Query:  DWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR---------------------------
        DWS RLD+ALWAYRT +KTP+GMSPYRLI+GK CHLP+ELEH+ +WA+K  N  +  +G  R LQL ELEE R                           
Subjt:  DWSFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR---------------------------

Query:  ---------HFLMRM--GK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQ
                 H  +++  GK              FP+GA+ +Q     K+FKVNG R+K ++ E FQ
Subjt:  ---------HFLMRM--GK-------------CFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQ

A0A6P6WY65 uncharacterized protein LOC1137373291.8e-9636.24Show/hide
Query:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS---
        +R +PY+N YNPGW +HPNFS+  +    Q QQ   QP    AQ      +Q+ P+  SG S E ++K     T    Q  + S+  LE Q+ QLAS   
Subjt:  QRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLAS---

Query:  ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQFK
         L+++  GKLPS T  +P    K+    +TLRS  E+ + +    E A        G+V   +P  +  P +         P  PFP R     +  Q +
Subjt:  ELKARPQGKLPSDT-EHPRREGKEQVKTVTLRSELEIGQGAGGSKENA-----GASGSVPDVEPPYVPPPPY--------LPPLPFPQRQKPKNQDGQFK

Query:  NFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYR
          L+  +++ +NIPL++AI+Q+P YAKFLK++ T KK+L   E V + E  SA+L+  LPPK KDPG FT+P  I   ++ +A+ DLG SIN+MP S+Y 
Subjt:  NFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYR

Query:  KLGIG-------EARPTTV------------------TLKLADRSITYQE--------------------------------------------------
         + +G       +A+   +                  + ++  R ++ +E                                                  
Subjt:  KLGIG-------EARPTTV------------------TLKLADRSITYQE--------------------------------------------------

Query:  -VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDW
         VDYVSKW+EA A    +AK V  F++SHIF R+GTPRA+++D GTHF N ++  L+ KYG+ HR++T YHPQ +GQAEISN+EIKSILEK V+ +RKDW
Subjt:  -VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR-----------------------------
        S RLD+ALWAYRT +KTP+GMSPYRLI+GK CHLP+ELEH+ +WA+K  N  ++  G  R LQL +LEE R                             
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFR-----------------------------

Query:  -------HFLMR---------------MGKCFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQA
               H  ++               +   FP+GA+ +   K  KVFKVNG R+K ++ E FQA
Subjt:  -------HFLMR---------------MGKCFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQA

SwissProt top hitse value%identityAlignment
A1Z651 Gag-Pol polyprotein6.9e-1638.76Show/hide
Query:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW
        VD  S WVEA    +  AK V++ L   IF RFG P+ L SD+G  F + +   +    GI  ++   Y PQ++GQ E  NR IK  L K+   S  +DW
Subjt:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG
           L  AL+  R T   P G++PY ++YG
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG

P10273 Gag-Pol polyprotein6.9e-1636.43Show/hide
Query:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKV-VHQSRKDW
        +D  S W EA       AK VA+ L   IF R+G P+ L SD+G  FI+ +   +    GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW
Subjt:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKV-VHQSRKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG
           L   L+  R T   P G++P+ ++YG
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG

P31792 Pol polyprotein (Fragment)9.0e-1637.98Show/hide
Query:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKV-VHQSRKDW
        VD  S WVEA    Q  A  VA+ +   IF RFG P+ + SD+G  F++ +   L    GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW
Subjt:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKV-VHQSRKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG
           L  AL   R T     G++PY ++YG
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG

Q2F7J0 Gag-Pol polyprotein6.9e-1638.76Show/hide
Query:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW
        VD  S WVEA    +  AK V++ L   IF RFG P+ L SD+G  F + +   +    GI  ++   Y PQ++GQ E  NR IK  L K+   S  +DW
Subjt:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG
           L  AL+  R T   P G++PY ++YG
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG

Q2F7J3 Gag-Pol polyprotein9.0e-1638.76Show/hide
Query:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW
        VD  S WVEA    +  AK V + L   IF RFG P+ L SD+G  F + +   +    GI  ++   Y PQ++GQ E  NR IK  L K+   S  +DW
Subjt:  VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQS-RKDW

Query:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG
           L  AL+  R T   P G++PY ++YG
Subjt:  SFRLDEALWAYRTTYKTPLGMSPYRLIYG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAAACTCTCTACAAACCACAAGGTAATCAGAGGAATAACCCTTACTCTAATTTTTATAATCCAGGTTGGTGCAACCACCCCAACTTCTCATGGGGAGGTCAAGG
AAATAATGTGCAAGTGCAACAAAAGGTAAACCAGCCAGGATTCGCTAAAGCACAGGTATTGCCCCAGCAAAATAAGCAGGCGTTGCCCCAGCAAAATTCGGGAAACTCTT
TTGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGTTGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAGTGAGCTG
AAGGCAAGGCCTCAAGGGAAACTTCCATCGGATACTGAACACCCTCGACGGGAAGGTAAGGAGCAGGTAAAAACAGTAACTCTTAGGAGTGAGTTGGAGATTGGTCAGGG
TGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAGCCACCTTATGTGCCGCCCCCACCTTATCTACCACCTCTACCTTTTCCACAAAGGC
AAAAGCCTAAGAATCAGGATGGTCAATTTAAAAATTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATCGAGCAAATGCCTAATTATGCT
AAATTTCTTAAGGATATTTTAACTAAAAAGAAAAGATTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGTTACCACCCAAGGC
TAAGGATCCAGGGTCATTTACCATACCCGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTTTGTGATTTAGGCTCAAGCATTAACCTAATGCCTCTTTCGGTCTATC
GAAAGCTTGGTATAGGTGAAGCTAGGCCCACCACAGTCACACTCAAATTAGCTGATAGGTCTATCACATATCAAGAGGTTGATTATGTGTCCAAGTGGGTGGAGGCCATC
GCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGATGGTACACATTTTAT
TAATAATATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCGAATGGTCAAGCTGAAATTAGTAATAGGGAAATTA
AATCGATTCTAGAGAAAGTAGTTCATCAATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAACCTATAAGACTCCTCTAGGTATGTCT
CCCTATAGGTTAATATATGGGAAAGCTTGCCATTTACCCTTGGAGCTAGAGCATAAAACGTTTTGGGCCTTGAAGAAGTTAAATTTTGATTTGAGTCGTGCAGGAGCAAT
TAGAATGCTGCAGCTTTATGAATTAGAGGAGTTTCGCCATTTTCTTATGAGAATGGGAAAATGTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAAAG
TGTTCAAGGTGAATGGACAGCGTGTGAAAAATTATTGGGGTGAGGAATTTCAGGCGAAATATCCTTCCCTAAGGGTTAGGAGGTTGTTGCGGCAAAGTTATGGCTGGAGC
AAAATCATCCGAATAGAAGGGATTTCGTTGGGTAAAAACCAAGAAAAGAAGGACACCGGAGGAGAAAGAAGCAAAGAAAGAAGACGACAACAGAGAGTTGAAGAACAAGA
AAAAGAGAGAGAGGATGAGGTTGTTGCAGAAGAAGAAGGAGACCCAAAAGAATCTGACAAACAGAAACAAGAAGAGGATGGACAGGAAGTGGCGACTATAGAAGAAGCTC
GAGAGGAAATTCAGAAGAAGCAGAGTGAGGATGTGCAGACGGAGGCTGTAGAGGAAGAAGGTAGAGAGCCAGTTCAGGAGGCTCGTGTGGAGGTTATCATACCTGAACCT
CCGAAACGTCGCCGCATTAAGCGGAAGGCTGGGCGCATTCCAGTGATCAGGACTGATACCCCATCGCCTCCATCATCAGATTCCGAGAGAGAGAAGGCAGAGCGAGAGGA
ACTTGAGAAAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAAACTCTCTACAAACCACAAGGTAATCAGAGGAATAACCCTTACTCTAATTTTTATAATCCAGGTTGGTGCAACCACCCCAACTTCTCATGGGGAGGTCAAGG
AAATAATGTGCAAGTGCAACAAAAGGTAAACCAGCCAGGATTCGCTAAAGCACAGGTATTGCCCCAGCAAAATAAGCAGGCGTTGCCCCAGCAAAATTCGGGAAACTCTT
TTGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGTTGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAGTGAGCTG
AAGGCAAGGCCTCAAGGGAAACTTCCATCGGATACTGAACACCCTCGACGGGAAGGTAAGGAGCAGGTAAAAACAGTAACTCTTAGGAGTGAGTTGGAGATTGGTCAGGG
TGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAGCCACCTTATGTGCCGCCCCCACCTTATCTACCACCTCTACCTTTTCCACAAAGGC
AAAAGCCTAAGAATCAGGATGGTCAATTTAAAAATTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATCGAGCAAATGCCTAATTATGCT
AAATTTCTTAAGGATATTTTAACTAAAAAGAAAAGATTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGTTACCACCCAAGGC
TAAGGATCCAGGGTCATTTACCATACCCGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTTTGTGATTTAGGCTCAAGCATTAACCTAATGCCTCTTTCGGTCTATC
GAAAGCTTGGTATAGGTGAAGCTAGGCCCACCACAGTCACACTCAAATTAGCTGATAGGTCTATCACATATCAAGAGGTTGATTATGTGTCCAAGTGGGTGGAGGCCATC
GCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGATGGTACACATTTTAT
TAATAATATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCGAATGGTCAAGCTGAAATTAGTAATAGGGAAATTA
AATCGATTCTAGAGAAAGTAGTTCATCAATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAACCTATAAGACTCCTCTAGGTATGTCT
CCCTATAGGTTAATATATGGGAAAGCTTGCCATTTACCCTTGGAGCTAGAGCATAAAACGTTTTGGGCCTTGAAGAAGTTAAATTTTGATTTGAGTCGTGCAGGAGCAAT
TAGAATGCTGCAGCTTTATGAATTAGAGGAGTTTCGCCATTTTCTTATGAGAATGGGAAAATGTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAAAG
TGTTCAAGGTGAATGGACAGCGTGTGAAAAATTATTGGGGTGAGGAATTTCAGGCGAAATATCCTTCCCTAAGGGTTAGGAGGTTGTTGCGGCAAAGTTATGGCTGGAGC
AAAATCATCCGAATAGAAGGGATTTCGTTGGGTAAAAACCAAGAAAAGAAGGACACCGGAGGAGAAAGAAGCAAAGAAAGAAGACGACAACAGAGAGTTGAAGAACAAGA
AAAAGAGAGAGAGGATGAGGTTGTTGCAGAAGAAGAAGGAGACCCAAAAGAATCTGACAAACAGAAACAAGAAGAGGATGGACAGGAAGTGGCGACTATAGAAGAAGCTC
GAGAGGAAATTCAGAAGAAGCAGAGTGAGGATGTGCAGACGGAGGCTGTAGAGGAAGAAGGTAGAGAGCCAGTTCAGGAGGCTCGTGTGGAGGTTATCATACCTGAACCT
CCGAAACGTCGCCGCATTAAGCGGAAGGCTGGGCGCATTCCAGTGATCAGGACTGATACCCCATCGCCTCCATCATCAGATTCCGAGAGAGAGAAGGCAGAGCGAGAGGA
ACTTGAGAAAAAATAG
Protein sequenceShow/hide protein sequence
MGETLYKPQGNQRNNPYSNFYNPGWCNHPNFSWGGQGNNVQVQQKVNQPGFAKAQVLPQQNKQALPQQNSGNSFEAMMKEFMARTDVAIQSNQASMRALELQVGQLASEL
KARPQGKLPSDTEHPRREGKEQVKTVTLRSELEIGQGAGGSKENAGASGSVPDVEPPYVPPPPYLPPLPFPQRQKPKNQDGQFKNFLEILKQLHINIPLVEAIEQMPNYA
KFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGSSINLMPLSVYRKLGIGEARPTTVTLKLADRSITYQEVDYVSKWVEAI
ACHQSDAKTVARFLQSHIFARFGTPRALVSDDGTHFINNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILEKVVHQSRKDWSFRLDEALWAYRTTYKTPLGMS
PYRLIYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLYELEEFRHFLMRMGKCFPNGAITLQDEKDGKVFKVNGQRVKNYWGEEFQAKYPSLRVRRLLRQSYGWS
KIIRIEGISLGKNQEKKDTGGERSKERRRQQRVEEQEKEREDEVVAEEEGDPKESDKQKQEEDGQEVATIEEAREEIQKKQSEDVQTEAVEEEGREPVQEARVEVIIPEP
PKRRRIKRKAGRIPVIRTDTPSPPSSDSEREKAEREELEKK