; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022724 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022724
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposase
Genome locationchr7:36449584..36459969
RNA-Seq ExpressionLag0022724
SyntenyLag0022724
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR004242 - Transposon, En/Spm-like
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]2.7e-10538.84Show/hide
Query:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR
        ++DE+  EAWERFKELLRKCPHHG+PHCIQ+ETFYNGLN  T++V     +G +L                             +R+    EVD  +++ 
Subjt:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR

Query:  ADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGNAQ-----QNKQALPQQNSGNSLEA
        + ++ I++ LKN+T        A  P    NQ  + ACVYCGE H +E CPSNP S++++G   +N   N+  P   Q     Q  Q   Q  S NSLE 
Subjt:  ADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGNAQ-----QNKQALPQQNSGNSLEA

Query:  MMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-------------------RILNTLEGKGAGGSKENAGASGSVPDVEPPYVPPPP--
        ++K +MA+       N A++R LE QVGQLA EL+   +G  P                   R   T+E       KE+  A  S     P  V  PP  
Subjt:  MMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-------------------RILNTLEGKGAGGSKENAGASGSVPDVEPPYVPPPP--

Query:  --------YVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTI
                Y PP P+PQR + + Q+ QFKKFL++LKQLHINIPLVEA+EQMPNY KF+KDIL+KK++LGEFETV+LT+EC+  L++ LPPK KDPG FTI
Subjt:  --------YVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTI

Query:  PVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------------------
        P +IG    G+ALCDLGASINLMP+S+++KLGIGE RPTTVTLQLADRS+ + E                                              
Subjt:  PVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------------------

Query:  ---------------------------DEMKDFSFIRILESTVIE---TAIQD-----------SDDKHSE-----------------------KHGEAP
                                   D + D S +  LE  ++E    +++D           SD++  E                       +  +  
Subjt:  ---------------------------DEMKDFSFIRILESTVIE---TAIQD-----------SDDKHSE-----------------------KHGEAP

Query:  PIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPS
          K S+ E P L+LK LP HLKYVYLG + TLP+IV+++L +E EE LI +L+Q++KA GWT+ADI+GISPS
Subjt:  PIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPS

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.1e-9035.6Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGL--NGVTQVVSG--------------RMLE------------AQIKRLRVYLEVDGVSTI
        F+Q + E+  EA+ERFK+LLRKCPHHGL    ++ TFYNGL     TQV                  +LE            +  K++    EVD +++ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGL--NGVTQVVSG--------------RMLE------------AQIKRLRVYLEVDGVSTI

Query:  RADLAMIANAL--KNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPG-----------NA
         A +     AL  K  ++   +Q P      V N      C  CGE H  + CP      +  +SV +VG   NQ+NNP+SN YNPG           N 
Subjt:  RADLAMIANAL--KNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPG-----------NA

Query:  QQN---KQALP---QQN-------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP---------------RILNTLEGKGA
        + N   KQ +P   QQN          + E ++ ++M +TDA IQS  ASMRALE+QVGQLA+ +     G+ P               + +    GK  
Subjt:  QQN---KQALP---QQN-------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP---------------RILNTLEGKGA

Query:  GGSKENAGASGSVPDV--EPPYV----------PPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEF
         G+ +     G    V  E P V          P     PP PFPQR + + QD QF+KF+++ K+L INIP  EA+EQM +Y KF+KDIL++K+RL EF
Subjt:  GGSKENAGASGSVPDV--EPPYV----------PPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEF

Query:  ETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--
        ETV+LTEECSAIL+  LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADRS+ YP   ++D       F F  
Subjt:  ETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--

Query:  -------------------------------------IRILESTV---IETAIQDSDDKHS-----------------EKHGEAP---------------
                                             +R+ +  V   + +A++ S+D+ S                 E+H   P               
Subjt:  -------------------------------------IRILESTV---IETAIQDSDDKHS-----------------EKHGEAP---------------

Query:  --------------------------PI-----KPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGI
                                  P+     KPS+ E P L+LK LP HLKY +LGE  TLP+I++S L +EHEE L+++L++Y++AIGW +ADI+GI
Subjt:  --------------------------PI-----KPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGI

Query:  SPSFCLENL---EPPKPGL---NRTKPVPAQPLPHE
        SPSFC+  +   +  KP +    R  PV  + +  E
Subjt:  SPSFCLENL---EPPKPGL---NRTKPVPAQPLPHE

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]6.6e-10441.24Show/hide
Query:  DTHVTIKCCESICRSFHGRDDTIPIKLRDTILERIKQFSYIVMTSCSIAGWTRAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQ
        D H+ +K    +  SF    D+   +  D  + R+  F Y++        W   +    +  +N               +       +   +     F+Q
Subjt:  DTHVTIKCCESICRSFHGRDDTIPIKLRDTILERIKQFSYIVMTSCSIAGWTRAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQ

Query:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR
         EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ V     +G +L                             ++ R  LEVD +S+I 
Subjt:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR

Query:  ADLAMIANALKNVTVISHQQPPA-MGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNP-----------GNAQQNKQ
        A LA + N L+N+ +       A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN YNP           G +  N+Q
Subjt:  ADLAMIANALKNVTVISHQQPPA-MGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNP-----------GNAQQNKQ

Query:  ALPQQN-------------------------------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGG
          P+ N                               S  S+E+++KE+MA+ DA IQS QAS+R LE+Q+G    E  A    ++ +     + +    
Subjt:  ALPQQN-------------------------------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGG

Query:  SKENAGASGSVPDVEPPYVPPP--------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT
        + +   +       E P +            Y P  PFPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY KFLKD+L  +++  EF+ VSL 
Subjt:  SKENAGASGSVPDVEPPYVPPP--------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT

Query:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD
        EECSAILKN +P K KDPGSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPE +++D
Subjt:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]2.1e-9436.7Show/hide
Query:  METFYNGLNGVTQVV----SGRMLEAQI--------------------------KRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMGPAAVV
        METFYNGLN  T+ +    + R L  +                           +R+    EVD V+ + A ++ ++N LK++ V           AA  
Subjt:  METFYNGLNGVTQVV----SGRMLEAQI--------------------------KRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMGPAAVV

Query:  NQVTDEA--CVYCGEDHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGNAQ--------------------------------------QNKQALPQQN
        N  T  A  CVYCG +H++E CPSNP SV +V   N+ NNPYSN YN G  Q                                      Q+ Q  P Q 
Subjt:  NQVTDEA--CVYCGEDHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGNAQ--------------------------------------QNKQALPQQN

Query:  SGNSLEAMMKEFMARTD-------AAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGGSKEN-------------------AGASGSV
        S   +EA++KE+MAR D       A +QS  AS+R LE QVGQLAN L    +G+ P         G    KE+                   A    S 
Subjt:  SGNSLEAMMKEFMARTD-------AAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGGSKEN-------------------AGASGSV

Query:  PDVEPPYVPP-------------------PPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT
           +    PP                    P  PP PFPQR + + QD QF++FL++LKQLHINIPLVEA+EQMP+Y KF+KDILTKK+RLGEFETV+LT
Subjt:  PDVEPPYVPP-------------------PPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT

Query:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----------------------
        EECSAILKN LPPK KDPGSFTIP SIG + +G+ALCDLGASINLMP+S++RKLGIGE  PTTVTLQLADRS  +PE                       
Subjt:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-----------------------

Query:  --------------------------------------------------DEMKDFSFIRILESTV------------------IETAI-QDSDDKH---
                                                          DE+++ S + +L+S V                  I++ I +D++DK    
Subjt:  --------------------------------------------------DEMKDFSFIRILESTV------------------IETAI-QDSDDKH---

Query:  ---------SEKHGEAPPI--------KPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCL
                 S +H E+  +        KPS+ E P L+L+PLP HL+Y YLG+S+TLP+I+AS L    E  L+++L+++++AIGWT+ADI+GISPS C+
Subjt:  ---------SEKHGEAPPI--------KPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCL

Query:  ENL
          +
Subjt:  ENL

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]2.5e-9545.47Show/hide
Query:  RAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVVSGRMLEA
        RAIR Y  PMFNE +P           LN   +V           F++LEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +++V    L+A
Subjt:  RAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVVSGRMLEA

Query:  QIKRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGNAQQNKQ
              +    +    I   +A       N    + ++   +     +  +T +         N  +     +S       R       Y PG +QQ + 
Subjt:  QIKRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGNAQQNKQ

Query:  ALPQQNS-GNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP--------------RILNTLEGKGAGGSKENAGASG-------
        +   QN+  +SLE++M+++MA+ DA IQS  AS+R LELQ+G LANELKA  +G+ P              + ++   GK    S+E    SG       
Subjt:  ALPQQNS-GNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP--------------RILNTLEGKGAGGSKENAGASG-------

Query:  ----------SVPDVEP-----------PYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFET
                   + D  P               P    PPLPFPQR + + QDGQFKKFL++LKQLHINIPLVEA+EQMPNY KFLKDILTKK+RLGEFET
Subjt:  ----------SVPDVEP-----------PYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFET

Query:  VSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD
        V+LTE CSA+LK+ +PPK KDPGSFTIP SIGG+++GRALCDLGASINLMP+S+++KLGIGEARPTTVTLQLADRS+ +PE +++D
Subjt:  VSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase5.8e-9034.87Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------LNGVTQVVSGRMLE-------------------AQIKRLRVYLEVDGVS
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G           LNG    +SG   E                   A   +    +EVD V+
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------LNGVTQVVSGRMLE-------------------AQIKRLRVYLEVDGVS

Query:  TIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG---------NAQQNKQALP
         + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG         N  Q + + P
Subjt:  TIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG---------NAQQNKQALP

Query:  QQNSGN-------------SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGA
        +   G              SLE  + +FMA       S  A+ + +E Q+GQLAN + +  +G+ P                       R L  +  +  
Subjt:  QQNSGN-------------SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGA

Query:  GGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI
           ++   +     +VE P     P     PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI
Subjt:  GGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI

Query:  LKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--------------
        ++N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+  ++D       F F              
Subjt:  LKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--------------

Query:  -------------------------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE----------
                                 +R+ +  +                                       +E A+ D  D+ +E+  E          
Subjt:  -------------------------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE----------

Query:  ------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL
                    AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ DI+GISPSFC+  +
Subjt:  ------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL

A0A2G9HYA0 Reverse transcriptase5.8e-9034.58Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------LNGVTQVVSGRMLE-------------------AQIKRLRVYLEVDGVS
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G           LNG    +SG   E                   A   +    +EVD V+
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------LNGVTQVVSGRMLE-------------------AQIKRLRVYLEVDGVS

Query:  TIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGNAQ----------------
         + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG  Q                
Subjt:  TIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGNAQ----------------

Query:  ------QNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGA
              Q +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +  +G+ P                       R L  +  +  
Subjt:  ------QNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGA

Query:  GGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI
           ++   +     +VE P     P     PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI
Subjt:  GGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAI

Query:  LKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--------------
        ++N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+  ++D       F F              
Subjt:  LKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF--------------

Query:  -------------------------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE----------
                                 +R+ +  +                                       +E A+ D  D+ +E+  E          
Subjt:  -------------------------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE----------

Query:  --------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL
                      +  +KPS+ + PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFC+  +
Subjt:  --------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL

A0A2G9HYD8 Reverse transcriptase2.5e-8534.37Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVVSGRMLEAQIKRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQ------P
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL            E    +L     ++G S +    A   N L N+    +++      P
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVVSGRMLEAQIKRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQ------P

Query:  PAMGPAAVVNQVTD--------EACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG---------NAQQNKQALPQQNSGN----------
        P       V+QVT            +   E H  + CP +  S+ FV N R   NNPYSN YNPG         N  Q + + P+   G           
Subjt:  PAMGPAAVVNQVTD--------EACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG---------NAQQNKQALPQQNSGN----------

Query:  ---SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGAGGSKENAGASGSVPDV
           SLE  + +FMA       S  A+ + +E Q+GQLAN + +  +G+ P                       R L  +  K     ++   +     +V
Subjt:  ---SLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYP-----------------------RILNTLEGKGAGGSKENAGASGSVPDV

Query:  EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFT
        E P     P     PFPQ+ + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+AI++N LPPK KDPGSFT
Subjt:  EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFT

Query:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF------------------------------
        IP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+  ++D       F F                              
Subjt:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD-------FSF------------------------------

Query:  ---------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE------------------------AP
                 +R+ +  +                                       +E A+ D  ++ +E+  E                        + 
Subjt:  ---------IRILESTV---------------------------------------IETAIQDSDDKHSEKHGE------------------------AP

Query:  PIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL
         +KPS+ + PTL+LKPLP+HL YVYLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFC+  +
Subjt:  PIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISPSFCLENL

A0A5A7UCB0 Transposase9.6e-8568.22Show/hide
Query:  HGDLSSRYSCWPVMLVTYNLPPWLCMKRKYLMLTALISGPKQPGNDIDIYLAPLVDDLKKLRNDGVECYDVNEGQCFTLRAILLWTINDFPAYGNLCGCT
        HGDLSSRYSCWPVMLVTYNLPPWLCMKRK+LMLTALISGPKQPGN+ID+YLAPLVDDLK L +DGVECYD  + QCF L+AILLWTINDFPAYGNLCGCT
Subjt:  HGDLSSRYSCWPVMLVTYNLPPWLCMKRKYLMLTALISGPKQPGNDIDIYLAPLVDDLKKLRNDGVECYDVNEGQCFTLRAILLWTINDFPAYGNLCGCT

Query:  TKGYHACPICAEKT---------------------------KQKKAFNGEQELETAPEPLSGEEILAKTSRLQ-----CSF---------GKTKDGLKTR
         KGYHACPIC EKT                           KQKK FNG QELE APEPLSGEEI  +T+ +      C+          GKTKDG+K+R
Subjt:  TKGYHACPICAEKT---------------------------KQKKAFNGEQELETAPEPLSGEEILAKTSRLQ-----CSF---------GKTKDGLKTR

Query:  LDLVELNIRAELAPQLGEKRTFLPPACYTLSKIEKL
        LDLVELNIR+ELAPQ+GEK+ FLPPACYTLS+ EKL
Subjt:  LDLVELNIRAELAPQLGEKRTFLPPACYTLSKIEKL

A0A6J1EQ90 uncharacterized protein LOC1114364113.2e-10441.24Show/hide
Query:  DTHVTIKCCESICRSFHGRDDTIPIKLRDTILERIKQFSYIVMTSCSIAGWTRAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQ
        D H+ +K    +  SF    D+   +  D  + R+  F Y++        W   +    +  +N               +       +   +     F+Q
Subjt:  DTHVTIKCCESICRSFHGRDDTIPIKLRDTILERIKQFSYIVMTSCSIAGWTRAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQ

Query:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR
         EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ V     +G +L                             ++ R  LEVD +S+I 
Subjt:  LEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVV-----SGRMLEAQI-------------------------KRLRVYLEVDGVSTIR

Query:  ADLAMIANALKNVTVISHQQPPA-MGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNP-----------GNAQQNKQ
        A LA + N L+N+ +       A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN YNP           G +  N+Q
Subjt:  ADLAMIANALKNVTVISHQQPPA-MGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNP-----------GNAQQNKQ

Query:  ALPQQN-------------------------------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGG
          P+ N                               S  S+E+++KE+MA+ DA IQS QAS+R LE+Q+G    E  A    ++ +     + +    
Subjt:  ALPQQN-------------------------------SGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGG

Query:  SKENAGASGSVPDVEPPYVPPP--------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT
        + +   +       E P +            Y P  PFPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY KFLKD+L  +++  EF+ VSL 
Subjt:  SKENAGASGSVPDVEPPYVPPP--------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLT

Query:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD
        EECSAILKN +P K KDPGSFTIPVSIGGKELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPE +++D
Subjt:  EECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEDEMKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAGTGGCCAGAATTTGGAGCAGAACCTAGAAATCTTCGTCTTGCTCTTTCAACAGATGGAGCATGGTGATCTTAGTAGTCGATACAGTTGTTGGCCAGTTATGTT
GGTGACATACAACCTACCCCCTTGGTTGTGCATGAAGCGAAAATATTTGATGTTAACCGCATTAATATCTGGCCCTAAACAACCAGGAAATGACATAGATATTTATTTGG
CTCCATTAGTTGACGACTTGAAAAAACTTCGGAATGATGGAGTTGAATGCTATGATGTAAATGAAGGACAATGTTTCACACTTAGAGCTATTTTATTGTGGACTATTAAT
GACTTTCCTGCGTATGGCAATTTGTGTGGTTGTACAACAAAAGGATATCATGCATGCCCGATTTGTGCAGAGAAAACAAAGCAGAAGAAAGCTTTTAATGGTGAACAAGA
ATTGGAAACCGCTCCTGAACCATTAAGTGGAGAGGAAATTCTTGCAAAAACAAGTCGACTTCAATGTTCATTTGGTAAAACAAAAGATGGATTGAAAACTCGTTTAGATT
TGGTTGAACTGAACATTAGAGCAGAGTTGGCCCCTCAATTGGGAGAGAAGAGGACATTCTTACCTCCAGCGTGTTACACACTATCGAAGATTGAGAAATTGATCTGTACA
TCGACCAGAAGGATGCATTGGAGAAAATTATATTGTGGAGGAGCTGTAGAGTTTTGTTCTGAATTTCTCTCTGGAGTTGATTCAATTGGATTACGCTCATCGACAAGGAA
AGACAACTCAGAAAGTGATAGACCGCTGTCTGTTGCTTCTTATATCAAACCAAGAATGAATCTCAGGTTGCACTTGCTCTTGAGGTGTCGAACAACACCAATCATCTTCT
TTGAGATGGATAGCTCACGGGCCTCGTCAGATGTAGCCACATATTGTGGTTATGTGGTAACGCATAACAGTGATGTGAAGGCACGTGCTATTCATATGGAGGCCAGACCG
CCTGCTGGACGAGTGTCACCACTATGCATGATTGGCAGGCGTACAAATTCAGGACACGAATTGTCAGATGATCCCAGTAATCGAGCCACTTTATGGAAGGAAGCAAGAAA
AGGAAAAAATAAGGAATTTTTGATGACGAAACTAAAGAACGACGAGCTAGCCGCAACAAATCAAGGTCAAGATATATATACTACTGAAGCATTAGGCACGCCAGAACATA
GAGGGCGTGTTAGAGGAATGGTGGATAACATTGTTGCCATTGGCACAATGTATGAATCTAGTGTCAGATGCCCAACGATTCATGGAGTACCACTAGGAGCCAATAATGTG
CGAGTGGTGGTGGATATGATCACAGGCGACGATGTTCTCATACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGCCAAGCAAAGGGTAGCTTCGTGGCATGGCCTCG
CGAACTTGTTATTTTGAATAACAAGAAAAAAGGTATGTTCTCCTGCAAAAACCAAAGGAATGTGTCCGTTGCACAATCTTCCGAACATACAGATACCCACGTTACCATCA
AGTGCTGTGAATCGATATGCCGCTCTTTCCATGGGAGGGATGACACAATTCCTATCAAATTGCGTGACACCATTTTGGAGAGGATAAAACAATTTTCGTACATCGTGATG
ACATCATGCAGTATTGCGGGATGGACCAGAGCCATTCGAGCATATGTTGTCCCGATGTTTAATGAGTTTGATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTT
GAATGAAACCGGTGATGTTTCAGATGTTGCAAACCGTGGAGCATTCCATGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTT
TACGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGTTGTCAGTGGTCGGATGTTAGAGGCACAA
ATAAAAAGGTTAAGAGTGTATTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCA
GCCACCAGCTATGGGCCCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCTGCTTCTG
TGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTAATGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTC
GAGGCAATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAA
GGCAAGCCTCAAGGGAAACTACCCTCGGATACTAAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAAGGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAAC
CACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAAATTCTTAAGCAA
TTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGT
ATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTA
GAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGAT
AGGTCTATCACATATCCAGAGGACGAAATGAAGGATTTCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGATGATAAACATTCAGA
AAAGCATGGAGAGGCTCCCCCAATTAAACCATCCCTGATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGA
CATTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAG
GGGATTAGTCCATCTTTTTGCCTTGAAAATCTCGAACCGCCCAAACCCGGGTTGAACCGGACCAAACCGGTCCCAGCTCAACCCTTACCGCACGAACCGGACCACCCATT
AGGGTTTCAGCACACGTTCTCACGAAAATTGAACTGCCGCACGAAATCGCGACAGCGTCAAGACGTTGAAGGCACAACGTCCCGACGCTGGCGCGATTTTAACCATTCGC
TGGGTGAAATCGCGACAGCGTCGACACACTGTAGGCCAAGCGTCCCGTCGCTGTTGCGTATGGAGCTCGATTCTGGGCAAAAAATAGCGGCTGAACAATTCCATTTGGCC
TCGATTGATACCGCGTCCTTCATCGATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCAGTGGCCAGAATTTGGAGCAGAACCTAGAAATCTTCGTCTTGCTCTTTCAACAGATGGAGCATGGTGATCTTAGTAGTCGATACAGTTGTTGGCCAGTTATGTT
GGTGACATACAACCTACCCCCTTGGTTGTGCATGAAGCGAAAATATTTGATGTTAACCGCATTAATATCTGGCCCTAAACAACCAGGAAATGACATAGATATTTATTTGG
CTCCATTAGTTGACGACTTGAAAAAACTTCGGAATGATGGAGTTGAATGCTATGATGTAAATGAAGGACAATGTTTCACACTTAGAGCTATTTTATTGTGGACTATTAAT
GACTTTCCTGCGTATGGCAATTTGTGTGGTTGTACAACAAAAGGATATCATGCATGCCCGATTTGTGCAGAGAAAACAAAGCAGAAGAAAGCTTTTAATGGTGAACAAGA
ATTGGAAACCGCTCCTGAACCATTAAGTGGAGAGGAAATTCTTGCAAAAACAAGTCGACTTCAATGTTCATTTGGTAAAACAAAAGATGGATTGAAAACTCGTTTAGATT
TGGTTGAACTGAACATTAGAGCAGAGTTGGCCCCTCAATTGGGAGAGAAGAGGACATTCTTACCTCCAGCGTGTTACACACTATCGAAGATTGAGAAATTGATCTGTACA
TCGACCAGAAGGATGCATTGGAGAAAATTATATTGTGGAGGAGCTGTAGAGTTTTGTTCTGAATTTCTCTCTGGAGTTGATTCAATTGGATTACGCTCATCGACAAGGAA
AGACAACTCAGAAAGTGATAGACCGCTGTCTGTTGCTTCTTATATCAAACCAAGAATGAATCTCAGGTTGCACTTGCTCTTGAGGTGTCGAACAACACCAATCATCTTCT
TTGAGATGGATAGCTCACGGGCCTCGTCAGATGTAGCCACATATTGTGGTTATGTGGTAACGCATAACAGTGATGTGAAGGCACGTGCTATTCATATGGAGGCCAGACCG
CCTGCTGGACGAGTGTCACCACTATGCATGATTGGCAGGCGTACAAATTCAGGACACGAATTGTCAGATGATCCCAGTAATCGAGCCACTTTATGGAAGGAAGCAAGAAA
AGGAAAAAATAAGGAATTTTTGATGACGAAACTAAAGAACGACGAGCTAGCCGCAACAAATCAAGGTCAAGATATATATACTACTGAAGCATTAGGCACGCCAGAACATA
GAGGGCGTGTTAGAGGAATGGTGGATAACATTGTTGCCATTGGCACAATGTATGAATCTAGTGTCAGATGCCCAACGATTCATGGAGTACCACTAGGAGCCAATAATGTG
CGAGTGGTGGTGGATATGATCACAGGCGACGATGTTCTCATACCAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGCCAAGCAAAGGGTAGCTTCGTGGCATGGCCTCG
CGAACTTGTTATTTTGAATAACAAGAAAAAAGGTATGTTCTCCTGCAAAAACCAAAGGAATGTGTCCGTTGCACAATCTTCCGAACATACAGATACCCACGTTACCATCA
AGTGCTGTGAATCGATATGCCGCTCTTTCCATGGGAGGGATGACACAATTCCTATCAAATTGCGTGACACCATTTTGGAGAGGATAAAACAATTTTCGTACATCGTGATG
ACATCATGCAGTATTGCGGGATGGACCAGAGCCATTCGAGCATATGTTGTCCCGATGTTTAATGAGTTTGATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTT
GAATGAAACCGGTGATGTTTCAGATGTTGCAAACCGTGGAGCATTCCATGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTT
TACGAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGTTGTCAGTGGTCGGATGTTAGAGGCACAA
ATAAAAAGGTTAAGAGTGTATTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCA
GCCACCAGCTATGGGCCCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCTGCTTCTG
TGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTAATGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTC
GAGGCAATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAA
GGCAAGCCTCAAGGGAAACTACCCTCGGATACTAAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAAGGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAAC
CACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAAATTCTTAAGCAA
TTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGT
ATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTA
GAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGAT
AGGTCTATCACATATCCAGAGGACGAAATGAAGGATTTCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGATGATAAACATTCAGA
AAAGCATGGAGAGGCTCCCCCAATTAAACCATCCCTGATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGA
CATTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAG
GGGATTAGTCCATCTTTTTGCCTTGAAAATCTCGAACCGCCCAAACCCGGGTTGAACCGGACCAAACCGGTCCCAGCTCAACCCTTACCGCACGAACCGGACCACCCATT
AGGGTTTCAGCACACGTTCTCACGAAAATTGAACTGCCGCACGAAATCGCGACAGCGTCAAGACGTTGAAGGCACAACGTCCCGACGCTGGCGCGATTTTAACCATTCGC
TGGGTGAAATCGCGACAGCGTCGACACACTGTAGGCCAAGCGTCCCGTCGCTGTTGCGTATGGAGCTCGATTCTGGGCAAAAAATAGCGGCTGAACAATTCCATTTGGCC
TCGATTGATACCGCGTCCTTCATCGATCTTTAA
Protein sequenceShow/hide protein sequence
MISGQNLEQNLEIFVLLFQQMEHGDLSSRYSCWPVMLVTYNLPPWLCMKRKYLMLTALISGPKQPGNDIDIYLAPLVDDLKKLRNDGVECYDVNEGQCFTLRAILLWTIN
DFPAYGNLCGCTTKGYHACPICAEKTKQKKAFNGEQELETAPEPLSGEEILAKTSRLQCSFGKTKDGLKTRLDLVELNIRAELAPQLGEKRTFLPPACYTLSKIEKLICT
STRRMHWRKLYCGGAVEFCSEFLSGVDSIGLRSSTRKDNSESDRPLSVASYIKPRMNLRLHLLLRCRTTPIIFFEMDSSRASSDVATYCGYVVTHNSDVKARAIHMEARP
PAGRVSPLCMIGRRTNSGHELSDDPSNRATLWKEARKGKNKEFLMTKLKNDELAATNQGQDIYTTEALGTPEHRGRVRGMVDNIVAIGTMYESSVRCPTIHGVPLGANNV
RVVVDMITGDDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNKKKGMFSCKNQRNVSVAQSSEHTDTHVTIKCCESICRSFHGRDDTIPIKLRDTILERIKQFSYIVM
TSCSIAGWTRAIRAYVVPMFNEFDPGIARPQIQAANLNETGDVSDVANRGAFHGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQVVSGRMLEAQ
IKRLRVYLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMGPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGNAQQNKQALPQQNSGNSL
EAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKASLKGNYPRILNTLEGKGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQ
LHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLAD
RSITYPEDEMKDFSFIRILESTVIETAIQDSDDKHSEKHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQ
GISPSFCLENLEPPKPGLNRTKPVPAQPLPHEPDHPLGFQHTFSRKLNCRTKSRQRQDVEGTTSRRWRDFNHSLGEIATASTHCRPSVPSLLRMELDSGQKIAAEQFHLA
SIDTASFIDL