; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026447 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026447
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr10:36977442..36981567
RNA-Seq ExpressionLag0026447
SyntenyLag0026447
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]1.1e-12742.75Show/hide
Query:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTN-KKVKSVLEVDGVSTIR
        ++DE+  EAWERFKELLRKCPHHG+PHCIQ+ETFYNGLN  T+ +VDASA GA+L+K+++EAYEI+ERI+ NS QW   R T+ ++V  + EVD  +++ 
Subjt:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTN-KKVKSVLEVDGVSTIR

Query:  ADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQS
        + ++ I++ LKN+T        A  P    NQ  + ACVYCGE H +E CPSNP S++++G   +N   N+  P     P FS        Q Q+ V   
Subjt:  ADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQS

Query:  GFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRK
                         Q  S NSLE ++K +MA+       N A++R LE QVGQLA EL+ RPQG LPSDT++ R  GKE  KA+TLRS + +E +  
Subjt:  GFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRK

Query:  TQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------------------------------
                    +IE E E  ++    +     +  +P      VP   Y PP P+PQR + + Q+                                  
Subjt:  TQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------------------------------

Query:  ----------EWAT----------------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIED
                  E+ T                PK KDPG FTIP +IG    G+ALCDLGASINLMP+S+++KLGIGE RPTTVTLQLADRS+ +LEGKI+D
Subjt:  ----------EWAT----------------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIED

Query:  VLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVE--------------
        VLV+VDKFIFP DF+ILD+EAD++VPIILGRPF ATGR LIDVQKGELTM V +++V FNVFK+M++PD ++DCS +  LE+ IVE              
Subjt:  VLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVE--------------

Query:  -------------TAIQDSTNK-----------HLEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLT
                      A+ ++  K            LE+     P K+ + E P L+LK LP HLKYVYLG   T+P+IV+++LT
Subjt:  -------------TAIQDSTNK-----------HLEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLT

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]8.2e-14446.7Show/hide
Query:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP-------------------------------RDALRLTLFPYSLRDGANH-------------------
        I RP+IQ   FE+KPVMFQMLQT+GQFHG+P                               +D +RL+LFPY LRDGA                     
Subjt:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP-------------------------------RDALRLTLFPYSLRDGANH-------------------

Query:  -------------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWS
                            F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+
Subjt:  -------------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWS

Query:  DVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN
        DVR    +K + VLEVD +S+I A LA + N L+N+ +       A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN
Subjt:  DVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN

Query:  FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANE
         YNPGWRNHPNFSW GQ   N Q   K          NQ  ++  QV  Q       Q  S  S+E+++KE+MA+  A IQS QAS+R LE+Q+G   N 
Subjt:  FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANE

Query:  LKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----------SVPDVE-----PPYV
         +     +  +DT+  +R  +  V+    +    +EE  K Q   S+              Q     KE A              ++P VE     P YV
Subjt:  LKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----------SVPDVE-----PPYV

Query:  P--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG
               +    F + +     +E +         K KDPGSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EG
Subjt:  P--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG

Query:  KIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI
        KIED+L++VDKFIF  DFIILDYE D DVPIILGRPF   GR L+DV KG +T+R+  ++V+FN+  +MKYP  +E+CS +
Subjt:  KIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI

XP_023521781.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo]3.6e-12345.65Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAV
        METFYNGLN  T+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +       A +  AAV
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAV

Query:  VNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQGN-NVQAQQKV---------NQSGFAKAQVMPQQNK
        +NQ   E+CV CGE+H ++ C SN  S+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQG+ N Q   K          NQ  +   Q   Q   
Subjt:  VNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQGN-NVQAQQKV---------NQSGFAKAQVMPQQNK

Query:  PALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVV
         +  Q   G SLE+++KE+MA+  A IQS QAS+R LE+QVGQLANEL+ RP            ++    VK +       L   RK ++       +V 
Subjt:  PALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVV

Query:  IEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLG
        + +E            +A     +P                                 K KDPGSFTIP+SIGGK+LGRALCDLG+SINLMPLS+Y+KLG
Subjt:  IEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLG

Query:  IGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMED
        IGEARPTTVTLQLADRS TY EGKIED+L++VDKF FP DFIILDYEAD DVPIILGRPF  TGR L+DV KG +T+R+ +++V+FN+  +MKYP   ++
Subjt:  IGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMED

Query:  CSFIRIL----------------------ENTIVETAIQDSTNKHLE----KHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLT
        CS +  L                       + I + A+    N+  E    +  ++ P++  + EAP LDLKPLP +LKY YLG+ +T+PII+++ L+
Subjt:  CSFIRIL----------------------ENTIVETAIQDSTNKHLE----KHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLT

XP_024042858.1 uncharacterized protein LOC112099671 [Citrus clementina]4.8e-12842.32Show/hide
Query:  MFQMLQTVGQFHGVPRDALRLTLFPYSLRDGAN--------------------------------------HGFRQLEDETFSEAWERFKELLRKCPHHG
        +F  L    +  G  +DALRL LFPYSLRD A                                         F QLEDE+  + WERFKELLR+CPHHG
Subjt:  MFQMLQTVGQFHGVPRDALRLTLFPYSLRDGAN--------------------------------------HGFRQLEDETFSEAWERFKELLRKCPHHG

Query:  LPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAM
        +P CIQ+ETFYNGLN  T+ MVDASA  ALL K++ EAYEILERI+ N+ QW   R T  ++   V  +D ++T+ A +  + N +K +T          
Subjt:  LPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAM

Query:  EPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQN
           A V Q+ + +C+YCGE+H ++ CP N ASV +VGN     +NNPY N YN GW+ HPNFSW  Q  N       N++               L QQN
Subjt:  EPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQN

Query:  SG---------NSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKP----LEESRKTQDLNSN
         G          SLET++KE++A+  A +QS   S+R LE Q+GQLA  + +R QG LPS+T+ PRREG E  K + LRSGK     ++ ++K  +LNS+
Subjt:  SG---------NSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKP----LEESRKTQDLNSN

Query:  SDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVP------PPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASI
               E   +       S ++ G SG    +     P          V      Q + PK        K KDPGSFTIP SIG +  GRALCDLGA+I
Subjt:  SDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVP------PPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASI

Query:  NLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNV
        NLM LSV+++L + E RPTTVTLQLA+RS  Y E KIEDVLVKVDKFIFPVDFI+LD+EAD++VPIILGRPF A  + LIDVQK ELTMR+ +++V FNV
Subjt:  NLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNV

Query:  FKAMKYPDEMEDCSFIRILE------------NTIVETAI------QDSTNKHLEKHGEAP----------------PIKSFL--IEA-PTLDLKPLPDH
         +AMK  DE +DC+F+ +++            N +++ A       +D     +E  GE                   +K+ L  IE+ P L+LK LP H
Subjt:  FKAMKYPDEMEDCSFIRILE------------NTIVETAI------QDSTNKHLEKHGEAP----------------PIKSFL--IEA-PTLDLKPLPDH

Query:  LKYVYLGEGETVPIIV
        LKY YLG+  T+PII+
Subjt:  LKYVYLGEGETVPIIV

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.5e-13242.6Show/hide
Query:  GIARPQIQAANFEMKPVMFQMLQTVGQF------------------------HGVPRDALRLTLFPYSLRDGANH-------------------------
        GI RP+IQA  FE+KPVMFQMLQTVGQF                         GV  +  RL LFP+SLRD A                           
Subjt:  GIARPQIQAANFEMKPVMFQMLQTVGQF------------------------HGVPRDALRLTLFPYSLRDGANH-------------------------

Query:  -------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-
                      F QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+L+K+++EA+EILE I+ N+ QWS+ R   
Subjt:  -------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-

Query:  NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ----RNNPYSNFYNPGWRNH
        ++KV  VLEVD ++ +   +A + N LKN+++ + +    ++PAA + Q  D +CV+C E H +E CPSNP SV ++GNQ     N  +SN YN  W+NH
Subjt:  NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ----RNNPYSNFYNPGWRNH

Query:  PNFSWGGQG--NNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPR
        PN SWG +    +    ++    GF++     Q   P   Q +  +SLE++M+++MA+  A IQS  A +R LELQ+G LANELKARPQG LPSDTE+PR
Subjt:  PNFSWGGQG--NNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPR

Query:  REGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEK-ELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------
        R+GKEQ K++ LRSGK L+ S +    +    +I + EK   ++ Q    ++    A+G   + +          PPLPFPQR + + QD          
Subjt:  REGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEK-ELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------

Query:  -------------------------------------------------EWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEA
                                                             PK KDPGSFTIP+SIGG                      R LGIGEA
Subjt:  -------------------------------------------------EWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEA

Query:  RPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI
        RPTTVTLQLADRS+ + +GKIEDVLV+VDKFIFP DFIILDYE DR+VPIIL RPF ATGR LIDV+KGELTMR  +E+  F VF+ ++ PD + +C  I
Subjt:  RPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI

Query:  RILENTIVE
          ++  +VE
Subjt:  RILENTIVE

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase2.8e-11339.28Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ-
        + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q 
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ-

Query:  --QKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS
           +  Q G  + Q   Q+ KP         SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Subjt:  --QKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS

Query:  GKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQDEW----------------------
        G+ L+E  K +   S    ++  EKE E                    VE P     P     PFPQR QK K + ++                      
Subjt:  GKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQDEW----------------------

Query:  -------------------------------------ATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSI
                                               PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+
Subjt:  -------------------------------------ATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSI

Query:  TYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILEN-----TIVE
        TY +G IED+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +N     +I E
Subjt:  TYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILEN-----TIVE

Query:  TAIQ----------DSTNKH---------------------LEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLTIIHIK
         ++           D  N+                      LE+   +  +K  + E PTL+LKPLP HL Y YLGE +T+P+I++S L+ + ++
Subjt:  TAIQ----------DSTNKH---------------------LEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLTIIHIK

A0A6A2WLX1 Reverse transcriptase6.4e-11836.35Show/hide
Query:  GIARPQIQAANFEMKPVMFQMLQTVGQFHGVP------------------------RDALRLTLFPYSLRDGAN--------------------------
        GI  P+IQAA+FEMKPVMF ML ++GQF G+P                         D L+L LFPYSLRD A                           
Subjt:  GIARPQIQAANFEMKPVMFQMLQTVGQFHGVP------------------------RDALRLTLFPYSLRDGAN--------------------------

Query:  ------------HGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GT
                      FRQ +DE+  E W+R+K LL+KC +HG     Q+  FYNG+N  T+ ++DASA G LL K+  EA+ IL+RI+ N  Q+   R G+
Subjt:  ------------HGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GT

Query:  NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHP
         ++     E++   ++ A L++I N LKN+   +  +           + T  AC+ C  +H+   CP+N  S+ FVGN     NNPYSN YN GWR HP
Subjt:  NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHP

Query:  NFSWGGQG----NNVQAQQKVNQ-SGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTY---------------------AAIQSNQASMRALELQV
        NFSW  QG    N    QQ  N+  G+  A      NK AL    S +SLE  ++EF++ T                      A IQS+ +S+RALE QV
Subjt:  NFSWGGQG----NNVQAQQKVNQ-SGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTY---------------------AAIQSNQASMRALELQV

Query:  GQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYV-PP
        GQ+A  L+ R QG+LPSDTE  +  GKE    +TLRSG  +    K +D     D    +             KEN             ++P      PP
Subjt:  GQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYV-PP

Query:  LPFPQRQKPKNQD--------------------------------------------------------EWATPKAKDPGSFTIPVSIGGKELGRALCDL
         PFPQR K  N +                                                           +PK  DPGSF IP SIG   +G+ALCDL
Subjt:  LPFPQRQKPKNQD--------------------------------------------------------EWATPKAKDPGSFTIPVSIGGKELGRALCDL

Query:  GASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEV
        G+S+NL+P S++ KLGIG+ARPT+V LQLAD+S   LEG++EDV+V+VDKF+F VDF+ILD E D   PIILGRPF ATGR LID +KGELTMRV ++ V
Subjt:  GASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEV

Query:  KFNVFKAMKYPDEMEDC--------------------SFIRILENTIV---ETAIQDSTNKHLEKHGEAPPIKS-----------------FLIEAPTLD
          NVF+ +KY D+ E+C                    +FI++ EN  +   E+ ++      LE+      ++S                  L+ AP L+
Subjt:  KFNVFKAMKYPDEMEDC--------------------SFIRILENTIV---ETAIQDSTNKHLEKHGEAPPIKS-----------------FLIEAPTLD

Query:  LKPLPDHLKYVYLGEGETVPIIVASDLT
        LK LP HLKYVYLG  ET+P+I++++LT
Subjt:  LKPLPDHLKYVYLGEGETVPIIVASDLT

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.2e-11736.52Show/hide
Query:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVPR------------------------DALRLTLFPYSLRDGANH--------------------------
        I RP I A NFE+KP   QM+Q+  QF G+P                         DA+RL LFP+SLRD A                            
Subjt:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVPR------------------------DALRLTLFPYSLRDGANH--------------------------

Query:  ------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNK
                     F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA+AGGAL++K   +AY +LE ++ N+ QW   R  ++
Subjt:  ------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNK

Query:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNF
        K     E+D + T+   +A ++  L  + V            AV N +    C  CG+ H+Y+ CP N  SV FVGN   Q+NNPYSN YNPGWRNHPNF
Subjt:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNF

Query:  SWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGK
        SW          + +   GF       QQ +P +P++ S   LE ++ +++++T A IQS  AS+R LE QVGQLAN +  RPQG LPSDT+ +P+  GK
Subjt:  SWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGK

Query:  EQVKAVTLRSGKPLE--ESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPL-----------------------PF
        EQ +A+TLRSGK +E    +  +    + D   + E E+E  Q      EN G S  +        PPPP+   L                       PF
Subjt:  EQVKAVTLRSGKPLE--ESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPPPYVPPL-----------------------PF

Query:  PQR---------------QKPKNQDEWAT----------------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
         +                 K +   E+ T                PK KDPGSFTIP +IG     +AL DLGASINLMP S++ KLG+GE +PT+VTLQ
Subjt:  PQR---------------QKPKNQDEWAT----------------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ

Query:  LADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIV
        LADRS  Y  G IEDVLVKVDKFIFPVDF+ILD E DR +PIILGRPF AT  A+IDV++G+++ +V  E V+FN+F A K+P     C  + +++    
Subjt:  LADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIV

Query:  ETAIQDSTNKHLEKHGE--APPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLTIIHIKAVKTPLYDDFFDYLDFGNFPPGLSKEQMKEFFH
                    E  GE  +PP      +A T + KP P  ++       E  P +      I     + +  Y   F         P   KE+    F 
Subjt:  ETAIQDSTNKHLEKHGE--APPIKSFLIEAPTLDLKPLPDHLKYVYLGEGETVPIIVASDLTIIHIKAVKTPLYDDFFDYLDFGNFPPGLSKEQMKEFFH

Query:  GVKFYLSNDASMVKQCDEKDGRVFKVNGQRVKHYWGEE
         VK Y       + + D +D   F VNG  ++ Y+ +E
Subjt:  GVKFYLSNDASMVKQCDEKDGRVFKVNGQRVKHYWGEE

A0A6J1DY39 uncharacterized protein LOC1110256533.2e-11738.81Show/hide
Query:  ANFEMKPVMFQMLQTVGQFHGV----PR--------------------DALRLTLFPYSLRDGANH----------------------------------
        A FE KP+M QML  +GQF G+    PR                    DALRLTLFP+S+   A                                    
Subjt:  ANFEMKPVMFQMLQTVGQFHGV----PR--------------------DALRLTLFPYSLRDGANH----------------------------------

Query:  ----GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKS----
             FRQ E+E  + AWERFK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G   +K+F+E  EIL+++S ++ QW   +   +  ++    
Subjt:  ----GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKS----

Query:  VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPA--AVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSW
        VL +D +++++  +  I   LKN+   +     A      + V Q+ +  C YCG+ H  E CPSNP+S+++VG    Q+ NPYSN YNPGW+ HPNFSW
Subjt:  VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPA--AVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSW

Query:  GGQ------GNNVQAQQKVNQSGFAKAQVMP-------QQNKPALPQQNSGNSLETMMKEFMARTYAAIQS---------------------NQASMRAL
         GQ      G+N Q ++     GF  +   P       QQ     P Q + +++E +MKE + +  A ++                      N  ++R L
Subjt:  GGQ------GNNVQAQQKVNQSGFAKAQVMP-------QQNKPALPQQNSGNSLETMMKEFMARTYAAIQS---------------------NQASMRAL

Query:  ELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPD--VEPPY-VPP
        E+Q+GQL NE++ RPQG LPS TE PRR GKE   ++  RSG   E  R                   ES  S    K+    + +VPD  VEP   VP 
Subjt:  ELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPD--VEPPY-VPP

Query:  PPYV----PPLPFPQRQKPKNQD--------------------------------------------EWAT----------------PKAKDPGSFTIPV
         P V    PP PFPQR   KNQD                                            E+ T                PK KDPGSFTIP 
Subjt:  PPYV----PPLPFPQRQKPKNQD--------------------------------------------EWAT----------------PKAKDPGSFTIPV

Query:  SIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDV
         IGGK++GRALCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT  EGKIEDVLVKVDKFIFP DFIILD EAD+DVPIILGRPF ATG  LIDV
Subjt:  SIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDV

Query:  QKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSTNKHLE
        +KGELTMRV +++V FN+  AMKY D+ME+C+ I I +  I    + D  N  +E
Subjt:  QKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSTNKHLE

A0A6J1EQ90 uncharacterized protein LOC1114364114.0e-14446.7Show/hide
Query:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP-------------------------------RDALRLTLFPYSLRDGANH-------------------
        I RP+IQ   FE+KPVMFQMLQT+GQFHG+P                               +D +RL+LFPY LRDGA                     
Subjt:  IARPQIQAANFEMKPVMFQMLQTVGQFHGVP-------------------------------RDALRLTLFPYSLRDGANH-------------------

Query:  -------------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWS
                            F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+
Subjt:  -------------------GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWS

Query:  DVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN
        DVR    +K + VLEVD +S+I A LA + N L+N+ +       A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN
Subjt:  DVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSN

Query:  FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANE
         YNPGWRNHPNFSW GQ   N Q   K          NQ  ++  QV  Q       Q  S  S+E+++KE+MA+  A IQS QAS+R LE+Q+G   N 
Subjt:  FYNPGWRNHPNFSWGGQG-NNVQAQQKV---------NQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYAAIQSNQASMRALELQVGQLANE

Query:  LKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----------SVPDVE-----PPYV
         +     +  +DT+  +R  +  V+    +    +EE  K Q   S+              Q     KE A              ++P VE     P YV
Subjt:  LKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASG----------SVPDVE-----PPYV

Query:  P--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG
               +    F + +     +E +         K KDPGSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EG
Subjt:  P--PPPYVPPLPFPQRQKPKNQDEWAT-------PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEG

Query:  KIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI
        KIED+L++VDKFIF  DFIILDYE D DVPIILGRPF   GR L+DV KG +T+R+  ++V+FN+  +MKYP  +E+CS +
Subjt:  KIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAATCAGATGGAGAACGTGCCGCAACTC
CGCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGAGTGCCGAGAGATGCTC
TTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAATCATGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTG
CGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCT
TTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAG
AGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCA
GTAGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAA
CAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTTCAAGCGCAACAAAAGGTGAACCAGTCGGGAT
TTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGACAATGATGAAAGAATTTATGGCTCGCACATACGCC
GCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCGGATACTGAACA
CCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTG
TTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGAGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCACCCCCA
CCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTAT
TGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAG
TTACGCTCCAATTAGCTGATAGGTCTATCACATATCTAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGAT
TATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTTGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTTTGTAATGA
GGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGGATT
CGACTAATAAGCATTTGGAAAAGCATGGAGAGGCTCCTCCAATTAAGTCATTCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTG
TATCTTGGGGAGGGTGAGACGGTTCCCATTATTGTTGCATCAGATTTAACTATAATCCATATTAAGGCAGTGAAAACACCTTTGTATGATGACTTTTTCGATTACCTTGA
TTTTGGAAATTTTCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTATGGTTAAACAATGTGATGAAA
AAGATGGGAGAGTATTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTTGCATTTTTAATTGCAACGGTA
ATATTAGTTATGACCGGGCAAATCCTTTCGGTTAGGATTCTTGTTTGCTGCAGCCCTATTTATTCGCAATCTCTGACGAGAGCGCGAAAAGAAAGGGAGAGTGAGGAGGA
AGAGATACCGGTCACACCAGAAGTTCAAAAAGGAAAGTCTAAGAAGAAGAGGACGCCAGAGGAAAAGGAAGCAAAGAGAAGAAGAAGGCAACAGAGGGCTACAGAACAGA
AGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCAGCAATACAAAACCCAGATACGGTTCAAGAACAGATTGCTGAGAAA
AATCAAGAAACAGAGAATGAAGGTGAGCATAACAAGGAGAAAACACCGGAGCCGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAATCAGATGGAGAACGTGCCGCAACTC
CGCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGAGTGCCGAGAGATGCTC
TTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAATCATGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTG
CGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCT
TTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAG
AGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCA
GTAGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAA
CAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTTCAAGCGCAACAAAAGGTGAACCAGTCGGGAT
TTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGCCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGACAATGATGAAAGAATTTATGGCTCGCACATACGCC
GCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCGGATACTGAACA
CCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTG
TTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGAGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCACCCCCA
CCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTAT
TGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAG
TTACGCTCCAATTAGCTGATAGGTCTATCACATATCTAGAGGGTAAGATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGAT
TATGAGGCTGATAGAGATGTCCCAATTATTCTAGGACGTCCATTTTTTGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTTTGTAATGA
GGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATTCAGGATT
CGACTAATAAGCATTTGGAAAAGCATGGAGAGGCTCCTCCAATTAAGTCATTCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTG
TATCTTGGGGAGGGTGAGACGGTTCCCATTATTGTTGCATCAGATTTAACTATAATCCATATTAAGGCAGTGAAAACACCTTTGTATGATGACTTTTTCGATTACCTTGA
TTTTGGAAATTTTCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTATGGTTAAACAATGTGATGAAA
AAGATGGGAGAGTATTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTTGCATTTTTAATTGCAACGGTA
ATATTAGTTATGACCGGGCAAATCCTTTCGGTTAGGATTCTTGTTTGCTGCAGCCCTATTTATTCGCAATCTCTGACGAGAGCGCGAAAAGAAAGGGAGAGTGAGGAGGA
AGAGATACCGGTCACACCAGAAGTTCAAAAAGGAAAGTCTAAGAAGAAGAGGACGCCAGAGGAAAAGGAAGCAAAGAGAAGAAGAAGGCAACAGAGGGCTACAGAACAGA
AGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCAGCAATACAAAACCCAGATACGGTTCAAGAACAGATTGCTGAGAAA
AATCAAGAAACAGAGAATGAAGGTGAGCATAACAAGGAGAAAACACCGGAGCCGGTGTAG
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDQKLKGHLGTEGGSSAEIRWRTCRNSAGIARPQIQAANFEMKPVMFQMLQTVGQFHGVPRDALRLTLFPYSLRDGANHGFRQLEDETFSEAWERFKELL
RKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAA
VVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKVNQSGFAKAQVMPQQNKPALPQQNSGNSLETMMKEFMARTYA
AIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIVVIEKELESGQSAGGSKENAGASGSVPDVEPPYVPPP
PYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILD
YEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETAIQDSTNKHLEKHGEAPPIKSFLIEAPTLDLKPLPDHLKYV
YLGEGETVPIIVASDLTIIHIKAVKTPLYDDFFDYLDFGNFPPGLSKEQMKEFFHGVKFYLSNDASMVKQCDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRFAFLIATV
ILVMTGQILSVRILVCCSPIYSQSLTRARKERESEEEEIPVTPEVQKGKSKKKRTPEEKEAKRRRRQQRATEQKEVQEVADVVATTAEEGSTQEPAIQNPDTVQEQIAEK
NQETENEGEHNKEKTPEPV