; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042148 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042148
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:37420494..37433271
RNA-Seq ExpressionLag0042148
SyntenyLag0042148
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD6269918.1 unnamed protein product [Miscanthus lutarioriparius]2.6e-6331.71Show/hide
Query:  SAEVLVVTEGDVDSK--WILDSWCSFHMKPNRHWFQGFEPMEEGKVL-LGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGY-
        + +VLVV  G V  +  WIL S CSFH+  N+ WF  ++P++ G V+ +G+ +   +  I S+Q+K+ D   R +  VR++P + R+L+ L T D  GY 
Subjt:  SAEVLVVTEGDVDSK--WILDSWCSFHMKPNRHWFQGFEPMEEGKVL-LGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGY-

Query:  ------VCKLENEGAMVKLRGKLANG-LYILEGSTIIG--TAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYG------
              VCK+ ++G+++ + G + +  LY+L GST+ G  TA   S +E   T LWH RLGH+SE G+ EL K+ LL    +G +   +HC++G      
Subjt:  ------VCKLENEGAMVKLRGKLANG-LYILEGSTIIG--TAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYG------

Query:  -----------VPHQQYDL-----------------------KLLWKCGLKNPPDL----------------THLRISRCVAYAHMKEGKLDNRAEKCIL
                   + +   DL                       + +W   LKN  D                   LR+  C+AYAH+  GKL+ RA KC+ 
Subjt:  -----------VPHQQYDL-----------------------KLLWKCGLKNPPDL----------------THLRISRCVAYAHMKEGKLDNRAEKCIL

Query:  LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQRIARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFR-SQRPSEGVVAPRATHRRIIGISRCN
        LGY  G KGY+LW           K    F +  V+ N+  +  +     +  + +D       P    P N   + R ++G   PR    R+  I  C+
Subjt:  LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQRIARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFR-SQRPSEGVVAPRATHRRIIGISRCN

Query:  LGVETVGYVDTCPSPTYYEHFPHSPCTKFN------------------------------CLRNSIPNAPQEIE----------DMRNIPYASTV-GNQA
        +    V Y  +C      E+  H P T                                 CL N    + +EI           +M+++  A  + G Q 
Subjt:  LGVETVGYVDTCPSPTYYEHFPHSPCTKFN------------------------------CLRNSIPNAPQEIE----------DMRNIPYASTV-GNQA

Query:  WHMKAD---MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDI---------------------------LEGYVDADYAADCDRRRSLSGY
          M  D   MS +PY+SAVGSL+Y+MVC+RPDL++AMS+VSR+M N   +                            L  YVD+D+AAD D+RRSL GY
Subjt:  WHMKAD---MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDI---------------------------LEGYVDADYAADCDRRRSLSGY

Query:  VFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
        VFT  G  VSWR TLQ VVALSTTE EYIA  +A K+
Subjt:  VFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

KAG8472304.1 hypothetical protein CXB51_034358 [Gossypium anomalum]1.1e-6126.98Show/hide
Query:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI
        ++K      +GK  E+    +V E Y   E+LV  V +  V  +WILDS C+FHM PNR WF  +E + EG VL+GN   C +  + +I+VKMFD   R 
Subjt:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI

Query:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQG
        +  VRYVP+LKR+L+ L T D  GY    E      ++G++V ++G+     LY+L+GST+ G AA+A  SL++   T LWH RLGH+SE G++EL K+G
Subjt:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQG

Query:  LLGSENLGTLGLCKHCVYG------------------------------VPHQ---QYDLKLL------------------------WK-----------
        LL  + +  L  C+HCV+G                              VP +    Y L  +                        WK           
Subjt:  LLGSENLGTLGLCKHCVYG------------------------------VPHQ---QYDLKLL------------------------WK-----------

Query:  -----------------------------------------------------CGLK---------------------------------------NPPD
                                                             C L                                        NP +
Subjt:  -----------------------------------------------------CGLK---------------------------------------NPPD

Query:  LTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVN
         + L+I  C AYAH+  GKL+ R+ KC+ LGY  G+KGY+LW   P   ++V    +VF    ++ N               + +I     P+V   + N
Subjt:  LTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVN

Query:  DFVNNLTH-------PRKCNPPN------------------FRSQRPSE---------------------------------------------------
           ++  +        R+  PP                     +Q PS                                                    
Subjt:  DFVNNLTH-------PRKCNPPN------------------FRSQRPSE---------------------------------------------------

Query:  ---GVVAPRATH---RRIIGISRC------NLGVETVGYVDTCPSPTYY---EHFPHSPCTKFNCLRN----SIPNAPQEIEDMRNIPYASTVGNQ-AWH
            V +P   H   R ++GI          L V+T           Y    E F  S    + CL       +  +P++     N+  A  V    A H
Subjt:  ---GVVAPRATH---RRIIGISRC------NLGVETVGYVDTCPSPTYY---EHFPHSPCTKFNCLRN----SIPNAPQEIEDMRNIPYASTVGNQ-AWH

Query:  MKAD-------------MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN---------------------------LDSDILEGYVDADYAADCD
         +               MS++PY+SAVGSLMY MVC+RPDL++A+S VSR+M+N                              D + GYVDAD+A D D
Subjt:  MKAD-------------MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN---------------------------LDSDILEGYVDADYAADCD

Query:  RRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
        RRRSL+GYVFT  G  +SW+ TLQ  VALSTTE+EY+A T+A KE
Subjt:  RRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

KAG8474542.1 hypothetical protein CXB51_031315 [Gossypium anomalum]5.8e-6330.38Show/hide
Query:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI
        ++K+     + K  E+    +V E Y   E+LV  + +  V  +WILDS C+FHM  NRHWF  +E + EG VL+GN   C +  +  I+VKMF+   RI
Subjt:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI

Query:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLENE------GAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQG
        +  VR VPELKR+L+ L T D  GY    E+E      G++V ++G+     LY+L+GST+ G AA+A   L++   T LW  RLGH+SE G++EL K+G
Subjt:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLENE------GAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQG

Query:  LLGSENLGTLGLCKHCVYGVPHQ--------------QYDLKLLWKCGLKNPPD-------LTHL-----RISRCVAYAHMKEGKLDNRAEKCILLGYSH
        LL  + +  L  C+HCV+G   +              +Y    LW  G    P        LT +     +I RC+AYAH+  GKL++R+  C+ LGY  
Subjt:  LLGSENLGTLGLCKHCVYGVPHQ--------------QYDLKLLWKCGLKNPPD-------LTHL-----RISRCVAYAHMKEGKLDNRAEKCILLGYSH

Query:  GIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVNDFVNNLTHP-------RKCNPP------------
         +KGY+LW   P   ++V    +VF    ++ N               + +I     P+    + N   ++  +        R+  PP            
Subjt:  GIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVNDFVNNLTHP-------RKCNPP------------

Query:  ---------NFRSQRPSEGVVAP-------------RATHR-RIIGISRCNLGVETVGY---------VDTCPSPTY--------YEHFP-------HSP
                 N      SE +                 + H+ R     +   G + V Y               P Y        Y   P        SP
Subjt:  ---------NFRSQRPSEGVVAP-------------RATHR-RIIGISRCNLGVETVGY---------VDTCPSPTY--------YEHFP-------HSP

Query:  CTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQ-AWHMKAD-------------MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGY
          K++ +R  +        ++  +  A  V    A H +               MS++PY+SAVGSLMY M                     ++ ++ GY
Subjt:  CTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQ-AWHMKAD-------------MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGY

Query:  VDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
        VDAD+A   DRRRSL+ YVFT  G  +SW+ TLQ  VALSTTE+EYIA T+  KE
Subjt:  VDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

KAG8478826.1 hypothetical protein CXB51_028794 [Gossypium anomalum]1.3e-6731.99Show/hide
Query:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI
        ++K+      GK  E+    +V E Y   E+LV  V    V  +WILDS C+FHM  NR WF  +E + E  VL+GN   C +  +  I+VKMFD   + 
Subjt:  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRI

Query:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAI--ASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENL
        +  VR+VPELKR L+ L T D  GY    E+                   GSTI G A +  +SL++   T LWH  LGH+SE  + EL K+GLL  + +
Subjt:  IPRVRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAI--ASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENL

Query:  GTLGLCKHCVYG----VPHQQYDLKLL---------WKCGLK----------------------------------------------------NPPDLT
          L  CKHCV+G    +   + D + L         W+C                                                       NP + +
Subjt:  GTLGLCKHCVYG----VPHQQYDLKLL---------WKCGLK----------------------------------------------------NPPDLT

Query:  HLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN------DQRIARNRAPRVVVCLVNDFVNNLTHPRK
         L+I  C AYAH+  GKL+ R+ KC+ L Y  G+KGY+LW   P   ++V    +VF   + + N         IA+NR  R            +  P+K
Subjt:  HLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN------DQRIARNRAPRVVVCLVNDFVNNLTHPRK

Query:  CNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSN
            N         +VA                 +     +D    P+ Y      P      +  S+           +   +S +  Q+      M +
Subjt:  CNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSN

Query:  IPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIK
        +PY+SAVGSLMY+MVC+RPDL++A+S   R       D + GYVDAD+A D DRRRSL+GYVFT  G  +SW+ TLQ  VALSTTE+EY+  T+A K
Subjt:  IPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIK

KAG8501848.1 hypothetical protein CXB51_004653 [Gossypium anomalum]8.4e-6227.8Show/hide
Query:  KNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR
        K+K +SE  N ED         Y   E+LV  V +  V  +WILDS C+FHM PNR WF  +E + EG VL+GN   C +  + +I+VKMFD   R +  
Subjt:  KNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR

Query:  VRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTL
        VRYVPELKR+L+ L T D  GY    E+                   GST+ G AA+A  SL++   T LWH RLGH+SE G++EL K+GLL  + +  L
Subjt:  VRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTL

Query:  GLCKHCVYGVPHQQ------------YDLKLLWK----------------------------------------------------------------CG
          C+HCV+G   ++            +     WK                                                                C 
Subjt:  GLCKHCVYGVPHQQ------------YDLKLLWK----------------------------------------------------------------CG

Query:  LK---------------------------------------NPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKI
        L                                        NP + + L+I  C AYAH+  GKL+ R+ KC+ LGY  G+KGY+LW   P   ++V   
Subjt:  LK---------------------------------------NPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKI

Query:  LMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVNDFVNNLTH-------PRKCNPPN--------------------------------
         +VF    ++ N               + +I     P+V   + N   ++  +        R+  PP                                 
Subjt:  LMVFYNLFVMRN---------------DQRIARNRAPRVVVCLVNDFVNNLTH-------PRKCNPPN--------------------------------

Query:  -----------------FRSQRPSEGVVAPR----------------------------ATHRRIIGISRCN------LGVETVGYVDTCPSPTYY---E
                         F+ +  + GV  P+                            ++ R ++GI   +      L V+T           Y    E
Subjt:  -----------------FRSQRPSEGVVAPR----------------------------ATHRRIIGISRCN------LGVETVGYVDTCPSPTYY---E

Query:  HFPHSPCTKFNCLR---NSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN--------------
         F  S    + CL    N     P       +   +ST+  Q+      MS++PY+SAVGSLMY MVC+RPDL++A+S VSR+M+N              
Subjt:  HFPHSPCTKFNCLR---NSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN--------------

Query:  -------------LDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
                        D + GYVDAD+A D DRRRSL+GYVFT  G  +SW+ TLQ  VALSTTE+EY+A T+A K+
Subjt:  -------------LDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

TrEMBL top hitse value%identityAlignment
A0A438CQ40 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-6030.92Show/hide
Query:  KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRS
        K    G+   + +GYD+ EVL + E D   +WILDS CSFHM P + WF+ F+  + G VLLGN   C +    ++++K +D+  R++  VRY+PELKR 
Subjt:  KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRS

Query:  LLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHC
        L+ LG  DK+ Y  KLE        G++  ++  +  GLY L G T+    +     +  TT LWH+RLGH+S + L EL KQG+LG+  L  L  C+HC
Subjt:  LLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHC

Query:  VYGVPHQQYDLKLLWKCGLKNPPDLTHLRI---SRCVAYAHMKEGKLDNRAEKCIL---LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQR
        V+G   +    K++ +   +N  +  H  +   SR     +MK+     +    I+    G+  GIK               G++ ++  +L+ ++   R
Subjt:  VYGVPHQQYDLKLLWKCGLKNPPDLTHLRI---SRCVAYAHMKEGKLDNRAEKCIL---LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQR

Query:  IARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVD----TCPSPTYYEHFPHSPCTKFNC--LRNSI
        +   R  +    +++   N  +H               +G V  + T   ++ +         + YVD     C    + E        +F    L ++ 
Subjt:  IARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVD----TCPSPTYYEHFPHSPCTKFNC--LRNSI

Query:  PNAPQEIEDMRN-----IPYASTVGNQAWHMKADM-SNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNL---------------------------
             EIE  R+     +   S +      MK  +     YAS VGS+MY MVC+RPDLA+A+S++ R+MS L                           
Subjt:  PNAPQEIEDMRN-----IPYASTVGNQAWHMKADM-SNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNL---------------------------

Query:  DSDI---LEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
        +S +   L+G+VDADYA + D R+SL+GYVF   G  +SW+  LQ VVALS TE+EY+A T+ +KE
Subjt:  DSDI---LEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

A0A438FFY5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-6129.88Show/hide
Query:  KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRS
        K   +G+   + +GYDSAEVL V E D   +WILDS CSFHM P + WF+ F+  + G VLLGN     +    ++++K +D   R++  VRY+PELKR+
Subjt:  KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRS

Query:  LLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHC
        L+ LG  DK+GY  K E        G++  ++G + NGLY L G T+    +     +  TT LWH+RLGH+S +GL EL KQ +LG+  L  L  C+HC
Subjt:  LLFLGTFDKAGYVCKLE------NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHC

Query:  VYG----------VPHQQYDLKL----LW------------------------KCGLKNP--------PDLTHLRISRCVAYAHMKEGKLDNRAEKCILL
        V+G          +   Q  L      LW                           ++NP         D  HL++  C AY H K  KL+ RA KCI L
Subjt:  VYG----------VPHQQYDLKL----LW------------------------KCGLKNP--------PDLTHLRISRCVAYAHMKEGKLDNRAEKCILL

Query:  GYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMR----------------------------------------NDQRIARNRAPRVVVC------LV
        GY  G+KGY+LW       + +    + F    + +                                        +++++ + +    ++C      + 
Subjt:  GYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMR----------------------------------------NDQRIARNRAPRVVVC------LV

Query:  NDFV------------------NNLTHPRKCNPPNFRSQRPSEGVV------------APRATHRRI------IGISRC--------NLGVETVG----Y
          FV                    L       PP    +   +G V            +PR  ++R       IG +R         NL  +++     Y
Subjt:  NDFV------------------NNLTHPRKCNPPNFRSQRPSEGVV------------APRATHRRI------IGISRC--------NLGVETVG----Y

Query:  VD----TCPSPTYYEHFPHSPCTKFNC--LRNSIPNAPQEIEDMRNIPYASTVGNQAWHM-----------KADMSNIPYASAVGSLMYLMVCTRPDLAH
        VD     C    + E        +F    L ++      EIE  R+    ST   Q + +           K  M  IPYAS VGS+MY MVC+RPDLA+
Subjt:  VD----TCPSPTYYEHFPHSPCTKFNC--LRNSIPNAPQEIEDMRNIPYASTVGNQAWHM-----------KADMSNIPYASAVGSLMYLMVCTRPDLAH

Query:  AMSVVSRFMSNLDSDILEGYVDA-DYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
        A+S++SR+MS       +       Y A     RSL G V+   G  VSW+  LQ VVALSTTE+EY+A T+A+KE
Subjt:  AMSVVSRFMSNLDSDILEGYVDA-DYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

A0A5A7TIS5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-5729.93Show/hide
Query:  GYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYV
        GY+SAEVL+V+  D+   WI+DS C+FHM P R +   F+  + GKVLLG+   CNV+   S+Q+   D   RI+    YVP+LKR+L+ L   D++G  
Subjt:  GYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYV

Query:  CKLEN------EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSEN------------LGTLGLCKHCV
         K EN      +G++VKLR    +GLY+LEG+TI G+ AIAS      + LWH+RL H  +       K   L ++N              + G+ +H  
Subjt:  CKLEN------EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSEN------------LGTLGLCKHCV

Query:  YGVPHQQYDLKLLWKCGL--KNPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQ--RIAR
             QQ  L   +   +  +    L HLR+  C  YAH+K+GKL+ RA KC+ +GY   IKGY+LW            + M      + R+ Q  RI  
Subjt:  YGVPHQQYDLKLLWKCGL--KNPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQ--RIAR

Query:  NRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSI----PNAPQ
        +    +     N+ + N  +   C+        P     A    +   +  +  N+  E + + +   S +  +         F+  +N I    P    
Subjt:  NRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSI----PNAPQ

Query:  EIEDMRNIPY---ASTVGNQAWHMKADMSNIPYASAVG-----------------------SLMYL-------------MVCTR--------------PD
        +      + Y    ST GN     KA +    Y    G                        ++Y+             MVC                PD
Subjt:  EIEDMRNIPY---ASTVGNQAWHMKADMSNIPYASAVG-----------------------SLMYL-------------MVCTR--------------PD

Query:  LAHAMSVVSRFMSNL------------------------------DSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYI
        L +AM ++SRFMSN                                S +L G+ DADY A  D+RRSLSG+++   GN+VSW+  LQ VVALSTTESEYI
Subjt:  LAHAMSVVSRFMSNL------------------------------DSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYI

Query:  AATDAIKE
        +  +AIKE
Subjt:  AATDAIKE

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass1.4e-5727.54Show/hide
Query:  KKNKRKSEGKNSEDGNN-VNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR
        K  +  +   N  DG N   +T+GY+SAEVL+V+  D+   WI+DS C+FHM P+R +   F+  + GKVLLG+   C+V+R  S+Q+   D   RI+  
Subjt:  KKNKRKSEGKNSEDGNN-VNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR

Query:  VRYVPELKRSLLFLGTFDKAGYVCKLEN------EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSEN
        VRYVP+LKR+L+ LG  D++G   K EN      +G++VKLRG L +GLY+LEG+T+ G+ AIAS        LWH RL HVSE+GL  L +QGLL    
Subjt:  VRYVPELKRSLLFLGTFDKAGYVCKLEN------EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSEN

Query:  LGTLGLCKHCVYGV--------------------------PHQQ----------------------YDLK---------LLWKCGLK----NPPDLTHLR
           L  C+HC+ G                           P ++                      Y LK         L WK  ++      P L HL+
Subjt:  LGTLGLCKHCVYGV--------------------------PHQQ----------------------YDLK---------LLWKCGLK----NPPDLTHLR

Query:  ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYR--------------LWAARPNIW--------RIVGKI-----------------LMVFYNLFVMR
        +  C  YAH+K+GKL+ RA KC+ +GY  G+K  R                  RP++          +V KI                 +++    F+  
Subjt:  ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYR--------------LWAARPNIW--------RIVGKI-----------------LMVFYNLFVMR

Query:  N-------DQRIARNRAPR----------------VVVC---------------LVND---------------FVNNLTH---PRKCN------------
        +       + ++ R+RA R                 + C               +V+D                  N T    P+  N            
Subjt:  N-------DQRIARNRAPR----------------VVVC---------------LVND---------------FVNNLTH---PRKCN------------

Query:  --------PPNFRSQRPSEG-----------VVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLR----------------NS
                 P ++++  ++G           V +P   H  I  I   ++ V    +++     T + H           LR                N 
Subjt:  --------PPNFRSQRPSEG-----------VVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLR----------------NS

Query:  IPNAPQEIE-----DMRNI----------PYASTVGNQAWHMK-ADMSNIPYASAVGSLM--YLMVCT---------RPDLAHAMSVVSRFMSNL-----
        +     E E     +++ I             ST+  +++ +K  +  NI  + AV + +  Y  + +         RPDL + MS++SRFMSN      
Subjt:  IPNAPQEIE-----DMRNI----------PYASTVGNQAWHMK-ADMSNIPYASAVGSLM--YLMVCT---------RPDLAHAMSVVSRFMSNL-----

Query:  -------------------------DSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE
                                  S +LEG+ DADY AD D+RRSLSG++F   GN+VSW+  LQ VVALSTTESEYI+  +A+KE
Subjt:  -------------------------DSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKE

A0A7H4LGA5 Genome assembly, chromosome: II4.2e-5928.36Show/hide
Query:  KNKRKSEG-KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEE-GKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR
        +NK K +G K SE+  NV   +  D A V++    + + +W+LD+ C+FHM P+R  F  F+     G VL  +   C +  I S+++KMFD   + +  
Subjt:  KNKRKSEG-KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEE-GKVLLGNPHECNVRRINSIQVKMFDNQTRIIPR

Query:  VRYVPELKRSLLFLGTFDKAGY-------VCKLENEGAMVKLRGKLA--NGLYILEGSTIIG--TAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGL
        VRY+P++KR+L+ + + D  GY       V K+  + +++ ++G L+  NGLY L GST+ G  T  I+  ++     LWH RLGH+SE GL EL+K+GL
Subjt:  VRYVPELKRSLLFLGTFDKAGY-------VCKLENEGAMVKLRGKLA--NGLYILEGSTIIG--TAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGL

Query:  LGSENLGTLGLCKHCVYG-----------------VPHQQYDL-----------------------KLLWKCGLKNPPDL----------------THLR
              G L  C+HC++G                 + +   DL                       + +W   LK+  +                   LR
Subjt:  LGSENLGTLGLCKHCVYG-----------------VPHQQYDL-----------------------KLLWKCGLKNPPDL----------------THLR

Query:  ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVF-----------YNLFVMRNDQRIARNRAPRVVVCL-----------
        +  C  YAH+  GKL+ RA KCI LGY  G+KG++LW   P    +V    ++F            N+ V    Q I +    +V   +           
Subjt:  ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVF-----------YNLFVMRNDQRIARNRAPRVVVCL-----------

Query:  -------VNDFVNNLTHPRKCNPP--NFRSQRPSEGVVAP-RATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRN-----------
               VND     T  +   PP  N    R   G+  P R      I     ++  E  G V+T    +Y E    S   K+    +           
Subjt:  -------VNDFVNNLTHPRKCNPP--NFRSQRPSEGVVAP-RATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRN-----------

Query:  -SIPNAPQEIEDMR---------------------------------------------------NIPYASTVGNQAWHMKAD---MSNIPYASAVGSLM
          +   P+E + +R                                                     P  S +  +   + AD   MS +PY+SAVGSLM
Subjt:  -SIPNAPQEIEDMR---------------------------------------------------NIPYASTVGNQAWHMKAD---MSNIPYASAVGSLM

Query:  YLMVCTRPDLAHAMSVVSRFMSNLDS---------------------------DILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALST
        Y MVC+RPDL++A+SVVSR+M+N +                            D L G+VD+D+A D D RRSL+GYVFT  G  VSWR TLQ +VA ST
Subjt:  YLMVCTRPDLAHAMSVVSRFMSNLDS---------------------------DILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALST

Query:  TESEYIAATDAIKEGGDCYSVVIHMEVKLQSNPNRPSVFDDMES
        T++EY+A + A KE       +  +  +L  + + P++F D +S
Subjt:  TESEYIAATDAIKEGGDCYSVVIHMEVKLQSNPNRPSVFDDMES

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.5e-1331.97Show/hide
Query:  NIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILE------------------------------GYVDADYAADCDRRRSLSGYVFTYLG-N
        N P  S +G LMY+M+CTRPDL  A++++SR+ S  +S++ +                              GYVD+D+A     R+S +GY+F     N
Subjt:  NIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILE------------------------------GYVDADYAADCDRRRSLSGYVFTYLG-N

Query:  LVSWRTTLQLVVALSTTESEYIAATDAIKEGGDCYSVVIHMEVKLQS
        L+ W T  Q  VA S+TE+EY+A  +A++E      ++  + +KL++
Subjt:  LVSWRTTLQLVVALSTTESEYIAATDAIKEGGDCYSVVIHMEVKLQS

P0CV72 Secreted RxLR effector protein 1614.5e-1843.41Show/hide
Query:  MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN----------------------------LDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNL
        M N+PY SAVG++MYLMV TRPDLA A+ V+S+F S+                              +  L GY DAD+A D + RRS SGY+F   G  
Subjt:  MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN----------------------------LDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNL

Query:  VSWRTTLQLVVALSTTESEYIAATDAIKE
        VSWR+  Q  VALS+TE EY+A ++A +E
Subjt:  VSWRTTLQLVVALSTTESEYIAATDAIKE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-2648.85Show/hide
Query:  KADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSD---------------------------ILEGYVDADYAADCDRRRSLSGYVFTYLG
        K +M+ +PY+SAVGSLMY MVCTRPD+AHA+ VVSRF+ N   +                           IL+GY DAD A D D R+S +GY+FT+ G
Subjt:  KADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSD---------------------------ILEGYVDADYAADCDRRRSLSGYVFTYLG

Query:  NLVSWRTTLQLVVALSTTESEYIAATDAIKE
          +SW++ LQ  VALSTTE+EYIAAT+  KE
Subjt:  NLVSWRTTLQLVVALSTTESEYIAATDAIKE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.6e-1727.48Show/hide
Query:  KKNKRKSEGKNSEDG------NNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQT
        +K K ++ G+ ++D       NN NV    +  E  +   G  +S+W++D+  S H  P R  F  +   + G V +GN     +  I  I +K     T
Subjt:  KKNKRKSEGKNSEDG------NNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQT

Query:  RIIPRVRYVPELKRSLLFLGTFDKAGYVCKLENE------GAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGL
         ++  VR+VP+L+ +L+     D+ GY     N+      G++V  +G     LY        G   + +  ++ +  LWH+R+GH+SEKGL  L K+ L
Subjt:  RIIPRVRYVPELKRSLLFLGTFDKAGYVCKLENE------GAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGL

Query:  LGSENLGTLGLCKHCVYGVPHQ
        +      T+  C +C++G  H+
Subjt:  LGSENLGTLGLCKHCVYGVPHQ

P25600 Putative transposon Ty5-1 protein YCL074W4.9e-0428.57Show/hide
Query:  PYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEG----------------------------YVDADYAADCDRRRSLSGYVFTYLGNLVSWR
        PY S VG L++     RPD+++ +S++SRF+    +  LE                             Y DA + A  D   S  GYV    G  V+W 
Subjt:  PYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEG----------------------------YVDADYAADCDRRRSLSGYVFTYLGNLVSWR

Query:  T-TLQLVVALSTTESEYIAATDAIKE
        +  L+ V+ + +TE+EYI A++ + E
Subjt:  T-TLQLVVALSTTESEYIAATDAIKE

P93293 Uncharacterized mitochondrial protein AtMg003007.8e-1034.91Show/hide
Query:  EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKN
        +G    L+G   + LYIL+GS   G + +A   + + T LWH RL H+S++G+  L K+G L S  + +L  C+ C+YG  H     ++ +  G    KN
Subjt:  EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKN

Query:  PPDLTH
        P D  H
Subjt:  PPDLTH

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein5.5e-1134.91Show/hide
Query:  EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKN
        +G    L+G   + LYIL+GS   G + +A   + + T LWH RL H+S++G+  L K+G L S  + +L  C+ C+YG  H     ++ +  G    KN
Subjt:  EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKN

Query:  PPDLTH
        P D  H
Subjt:  PPDLTH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATAGTTGTGAATGCTCTGAGGTCTCGTGATTTAGAAGTGAAAAAGAACAAGAGGAAATCAGAAGGAAAGAATAGTGAAGATGGAAACAATGTCAATGTCACTGA
AGGCTACGATTCTGCAGAGGTACTAGTTGTGACTGAGGGAGATGTAGATTCTAAATGGATCCTTGACTCATGGTGCTCTTTCCATATGAAACCAAACCGTCATTGGTTTC
AGGGCTTTGAACCAATGGAGGAAGGAAAGGTGCTCTTAGGCAACCCCCATGAGTGCAATGTAAGACGGATCAATTCAATTCAGGTGAAGATGTTTGATAATCAAACAAGG
ATCATCCCCAGGGTGAGGTACGTACCAGAACTGAAACGTAGCCTGCTGTTTTTGGGTACTTTTGATAAGGCAGGCTATGTTTGTAAGCTTGAGAATGAAGGAGCAATGGT
AAAGTTGCGAGGAAAGCTTGCAAATGGATTATATATTTTAGAGGGTTCAACCATCATTGGAACAGCAGCAATAGCTTCTCTTAATGAACAACAAACTACTACCTTATGGC
ATAGGAGACTGGGCCATGTTAGTGAAAAAGGACTCATGGAACTCCATAAACAGGGCTTGTTGGGAAGTGAAAACTTGGGAACACTTGGTCTCTGCAAACATTGTGTTTAT
GGAGTCCCTCATCAGCAATATGATTTAAAACTCCTATGGAAATGTGGACTGAAAAATCCACCAGACCTTACACACTTGAGGATCTCCCGGTGTGTTGCCTATGCACATAT
GAAAGAGGGAAAACTGGATAACAGGGCTGAGAAATGCATTCTGTTGGGATATTCTCATGGTATAAAAGGGTACAGATTGTGGGCAGCCAGGCCTAACATATGGAGAATAG
TGGGGAAGATCTTGATGGTTTTCTACAACTTGTTCGTGATGAGAAACGACCAACGGATTGCAAGGAATCGAGCTCCAAGAGTTGTCGTTTGTTTGGTCAATGATTTCGTT
AACAATTTAACTCATCCCCGTAAGTGTAACCCTCCCAATTTCAGATCCCAGAGGCCCTCCGAAGGGGTGGTCGCTCCTAGGGCGACGCACAGGCGGATCATAGGAATCTC
ACGGTGCAACCTAGGGGTGGAGACCGTAGGATATGTTGACACGTGTCCTTCTCCCACTTACTATGAACATTTTCCCCATTCACCTTGTACGAAATTCAATTGTCTAAGAA
ACAGTATTCCAAACGCCCCTCAAGAAATTGAGGACATGAGAAATATTCCCTATGCCTCGACGGTAGGCAACCAAGCCTGGCATATGAAGGCCGACATGTCAAATATACCT
TATGCTAGTGCAGTTGGGAGTCTTATGTACCTTATGGTATGCACCCGTCCAGACTTAGCTCATGCAATGAGTGTAGTGAGTAGATTCATGTCCAATCTGGACTCAGATAT
CCTAGAAGGATATGTAGATGCTGATTATGCAGCAGATTGTGACAGGAGGAGGTCATTGTCTGGGTATGTATTTACTTACCTTGGAAATCTGGTGAGTTGGAGAACTACTC
TACAGTTAGTTGTGGCTCTTTCCACAACAGAATCTGAGTACATAGCAGCAACTGATGCCATAAAAGAAGGTGGAGATTGTTATTCAGTGGTAATACACATGGAGGTCAAG
CTTCAGTCAAATCCCAATAGACCTTCTGTATTTGATGACATGGAATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAATAGTTGTGAATGCTCTGAGGTCTCGTGATTTAGAAGTGAAAAAGAACAAGAGGAAATCAGAAGGAAAGAATAGTGAAGATGGAAACAATGTCAATGTCACTGA
AGGCTACGATTCTGCAGAGGTACTAGTTGTGACTGAGGGAGATGTAGATTCTAAATGGATCCTTGACTCATGGTGCTCTTTCCATATGAAACCAAACCGTCATTGGTTTC
AGGGCTTTGAACCAATGGAGGAAGGAAAGGTGCTCTTAGGCAACCCCCATGAGTGCAATGTAAGACGGATCAATTCAATTCAGGTGAAGATGTTTGATAATCAAACAAGG
ATCATCCCCAGGGTGAGGTACGTACCAGAACTGAAACGTAGCCTGCTGTTTTTGGGTACTTTTGATAAGGCAGGCTATGTTTGTAAGCTTGAGAATGAAGGAGCAATGGT
AAAGTTGCGAGGAAAGCTTGCAAATGGATTATATATTTTAGAGGGTTCAACCATCATTGGAACAGCAGCAATAGCTTCTCTTAATGAACAACAAACTACTACCTTATGGC
ATAGGAGACTGGGCCATGTTAGTGAAAAAGGACTCATGGAACTCCATAAACAGGGCTTGTTGGGAAGTGAAAACTTGGGAACACTTGGTCTCTGCAAACATTGTGTTTAT
GGAGTCCCTCATCAGCAATATGATTTAAAACTCCTATGGAAATGTGGACTGAAAAATCCACCAGACCTTACACACTTGAGGATCTCCCGGTGTGTTGCCTATGCACATAT
GAAAGAGGGAAAACTGGATAACAGGGCTGAGAAATGCATTCTGTTGGGATATTCTCATGGTATAAAAGGGTACAGATTGTGGGCAGCCAGGCCTAACATATGGAGAATAG
TGGGGAAGATCTTGATGGTTTTCTACAACTTGTTCGTGATGAGAAACGACCAACGGATTGCAAGGAATCGAGCTCCAAGAGTTGTCGTTTGTTTGGTCAATGATTTCGTT
AACAATTTAACTCATCCCCGTAAGTGTAACCCTCCCAATTTCAGATCCCAGAGGCCCTCCGAAGGGGTGGTCGCTCCTAGGGCGACGCACAGGCGGATCATAGGAATCTC
ACGGTGCAACCTAGGGGTGGAGACCGTAGGATATGTTGACACGTGTCCTTCTCCCACTTACTATGAACATTTTCCCCATTCACCTTGTACGAAATTCAATTGTCTAAGAA
ACAGTATTCCAAACGCCCCTCAAGAAATTGAGGACATGAGAAATATTCCCTATGCCTCGACGGTAGGCAACCAAGCCTGGCATATGAAGGCCGACATGTCAAATATACCT
TATGCTAGTGCAGTTGGGAGTCTTATGTACCTTATGGTATGCACCCGTCCAGACTTAGCTCATGCAATGAGTGTAGTGAGTAGATTCATGTCCAATCTGGACTCAGATAT
CCTAGAAGGATATGTAGATGCTGATTATGCAGCAGATTGTGACAGGAGGAGGTCATTGTCTGGGTATGTATTTACTTACCTTGGAAATCTGGTGAGTTGGAGAACTACTC
TACAGTTAGTTGTGGCTCTTTCCACAACAGAATCTGAGTACATAGCAGCAACTGATGCCATAAAAGAAGGTGGAGATTGTTATTCAGTGGTAATACACATGGAGGTCAAG
CTTCAGTCAAATCCCAATAGACCTTCTGTATTTGATGACATGGAATCTTAG
Protein sequenceShow/hide protein sequence
MEIVVNALRSRDLEVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTR
IIPRVRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVY
GVPHQQYDLKLLWKCGLKNPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQRIARNRAPRVVVCLVNDFV
NNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSNIP
YASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKEGGDCYSVVIHMEVK
LQSNPNRPSVFDDMES