; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015821 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015821
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr12:26023767..26027261
RNA-Seq ExpressionLag0015821
SyntenyLag0015821
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]3.5e-15345.52Show/hide
Query:  EQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFP
        E  N    N I +  +  RA+R Y VP+   L+  I R                                         + D+F   GV  DA+RL LFP
Subjt:  EQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFP

Query:  YSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS
        +SLRD AKSWLNS   GSI TW++LA+KFL+K+FPP + AK+R++I  F Q + ES  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA+
Subjt:  YSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS

Query:  AGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFC
        AGGAL++K   +AY +LE ++ N+ QW   RS ++K     E+D + T+   +A ++  L  + V            AV N +    C  CG++H+Y+ C
Subjt:  AGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFC

Query:  PSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASM
        P N  SV FVGN   Q+NNPYSN YNPGWRNHPNFSW    +     + +  PGF       QQ +  +P++   S LE ++ +++++ DA IQS  AS+
Subjt:  PSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASM

Query:  RALELQVGQLANELKARPQGKLPSILN-TLEGK-----VRSRCGTTLSAAPTLCTTSTF-----------------STKEKPKNQ---------------
        R LE QVGQLAN +  RPQG LPS      +GK     +  R G  +         S                      +K +NQ               
Subjt:  RALELQVGQLANELKARPQGKLPSILN-TLEGK-----VRSRCGTTLSAAPTLCTTSTF-----------------STKEKPKNQ---------------

Query:  ------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGA
              + QF+KFL + K+LHINIPF EA+EQMP+Y KFLKDIL+KK++LGEFETV LTEECSAIL+N LPPK KDPGSFTIP +IG     +AL DLGA
Subjt:  ------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGA

Query:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMK
        SINLMP S++ KLG+GE +PT+VTLQLADRS  Y  G IEDVLVKVDKFIFPVDF+ILD E D+ +PIILGRPFLAT  A+IDV++G+++ K
Subjt:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]3.1e-15744.28Show/hide
Query:  VENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLFPYSLRDG
        V NPI +A+ R RA+R YA  +  +LN  +                                          +V+++F + G+  DALRLTLFP+S+   
Subjt:  VENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLFPYSLRDG

Query:  AKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALL
        A +WLN+F   +I TW ++ +KFL KYFPP RNA +R EI+ FRQ E+E+ + AWERFK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G   
Subjt:  AKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALL

Query:  AKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPT--AVVNQVAEEACVYCGENHNYEFC
        +K F+E  EIL+++S ++ QW   +S  +  ++    VL +D +++++  +  I   LKN+   +     A   T  + V Q+AE  C YCG+ H  E C
Subjt:  AKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPT--AVVNQVAEEACVYCGENHNYEFC

Query:  PSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMKEFMA
        PSNP+S+++VG    Q+ NPYSN YNPGW+ HPNFSW GQGS      N Q ++    PGF  +   P       QQ     P Q + S++E +MKE + 
Subjt:  PSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMKEFMA

Query:  RIDAAIQS---------------------NQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAA------PTLCTTSTFS-TKEKP
        + DA ++                      N  ++R LE+Q+GQL NE++ RPQG LPS         +  C +  + +      P +   S+ S ++EK 
Subjt:  RIDAAIQS---------------------NQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAA------PTLCTTSTFS-TKEKP

Query:  ------------------------------------KNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK
                                            KNQD  FRKFL+ILKQLHINIPFVEA+EQMP YAKF+KDI+T+KK+LGE+ETV+LTE  S + K
Subjt:  ------------------------------------KNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK

Query:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVP
        + +PPK KDPGSFTIP  IGGK++GRALCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT  EGKIEDVLVKVDKFIFP DFIILD EADKDVP
Subjt:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVP

Query:  IILGRPFLATGRALIDVQKGELTMK
        IILGRPFLATG  LIDV+KGELTM+
Subjt:  IILGRPFLATGRALIDVQKGELTMK

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.1e-19052.49Show/hide
Query:  QLKNPLFEQNEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIARV------------------------------------------------SDS
        + +NP    N+++     NPI +A+DR RAIRAYA P   ELNP I R                                                 SDS
Subjt:  QLKNPLFEQNEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIARV------------------------------------------------SDS

Query:  FVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQME
        F  QGV +D +RL+LFPY LRDGAKSWLN+ APG+I +W+ LAE FL KYFPP RNA+ ++EIV F+Q EDE+ SEA ERFKE+LRKCPHHGLPHCIQME
Subjt:  FVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQME

Query:  TFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPTAVVN
        TFYNGLN VT+ +VDASA GA+L+K ++EAYEILERI+ N+CQW+DVRS   +K + VLEVD +S+I A LA + N L+N+ +       A +   A +N
Subjt:  TFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPTAVVN

Query:  QVAEEACVYCGENHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQA
        Q A E+CVYCGE H ++ CPSNPAS+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q     
Subjt:  QVAEEACVYCGENHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQA

Query:  LPQQNSGSSLEAMMKEFMARIDAAIQSNQASMRALELQVG------------------QLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAAPTLCTT
          Q  S +S+E+++KE+MA+ DA IQS QAS+R LE+Q+G                  Q  NE  A  +          E        ++   + T   +
Subjt:  LPQQNSGSSLEAMMKEFMARIDAAIQSNQASMRALELQVG------------------QLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAAPTLCTT

Query:  STFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA
          F  + K K ++  F KF++ILK++HINIP VEA++QMPNY KFLKD+L  +++  EF+ VSL EECSAILKN +P K KDPGSFTIPVSIGGKELGRA
Subjt:  STFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMKSV
        LCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EGKIED+L++VDKFIF  DFIILDYE D DVPIILGRPFL  GR L+DV KG +T++  
Subjt:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMKSV

Query:  MRK
         +K
Subjt:  MRK

XP_024022668.1 uncharacterized protein LOC112091966 [Morus notabilis]4.9e-14748.12Show/hide
Query:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHC
        + D+F   GV  DA++L LFP+SLRD AK WLNS  PGSI TW++LA+KFL+KYF P + AK+R++I  F Q + ES  EAWERFK+LLR+CPHHGLP  
Subjt:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHC

Query:  IQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAV
        +Q++TFYNGL+G  +  +DA+AGG+L+AK  D AYE+LE ++ N+ QW   RS  KKV  + EVD ++ +   +A ++     +                
Subjt:  IQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAV

Query:  VNQVAEEACVYCGENHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEA
         +      C  CG NH    C  N  SV FVG  N++ N  SN+Y+PG +NHPNFSW G      + + +  PG+   Q  PQ+ K  L  ++  + + +
Subjt:  VNQVAEEACVYCGENHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEA

Query:  MMKEFMARIDAAIQSNQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRS----------RCGTTLSAAPTLCTTS-------------TFSTKE
           + +      +QS  AS+R LE+QVGQLAN +  R QG LPS  NT     R           R G  L+       TS               S ++
Subjt:  MMKEFMARIDAAIQSNQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRS----------RCGTTLSAAPTLCTTS-------------TFSTKE

Query:  KPKNQ--------------------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFT
        KPK +                    D QF KFLE+ K+LHINIPF +A+EQMP+Y KF+KDIL+KK+RL +FETV+LTEECSA+++  LPPK KDPGSFT
Subjt:  KPKNQ--------------------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFT

Query:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL
        IP +IG +   RALCDLGASINLMPLS+Y++LG+ EA+PT VTLQLADRS+TY  G +EDVLVKVD+ IFP DFI+LD E D +V IILGRPFLATGR L
Subjt:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL

Query:  IDVQKGELTMK
        IDVQKGELTM+
Subjt:  IDVQKGELTMK

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]7.5e-17249.65Show/hide
Query:  QVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFPYSLRD
        Q+ +PI++ +DR RAIR YA PMFNELNPGI R                                         +SDSF IQGV  +  RL LFP+SLRD
Subjt:  QVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFPYSLRD

Query:  GAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL
         A+SWLN+ +P S+  W++ AEKFL KYFPP RNAK RSEI+ F QLEDES S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+
Subjt:  GAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL

Query:  LAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFCPSNP
        L+K ++EA+EILE I+ N+ QWS+ R+  ++KV  VLEVD ++ +   +A + N LKN+++ + +    ++P A + Q  + +CV+C E H +E CPSNP
Subjt:  LAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFCPSNP

Query:  ASVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQG--SNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASMR
         SV ++GNQ     N  +SN YN  W+NHPN SWG +    +    ++   PGF++    PQ  + + P     SSLE++M+++MA+ DA IQS  A +R
Subjt:  ASVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQG--SNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASMR

Query:  ALELQVGQLANELKARPQGKLPSILNT--LEGK-----VRSRCGTTL---------SAAPT-------------------------------------LC
         LELQ+G LANELKARPQG LPS       +GK     +  R G  L         S  PT                                     +C
Subjt:  ALELQVGQLANELKARPQGKLPSILNT--LEGK-----VRSRCGTTL---------SAAPT-------------------------------------LC

Query:  TTS----TFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGG
        ++S     F  + + + QDGQF+KFL++LKQLHINIP VEA+EQMPNY KFLKDILTKK+RLGEFE+  LTE   A+LKN +PPK KDPGSFTIP+SIGG
Subjt:  TTS----TFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGG

Query:  KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGE
                              R LGIGEARPTTVTLQLADRS+ + +GKIEDVLV+VDKFIFP DFIILDYE D++VPIIL RPFLATGR LIDV+KGE
Subjt:  KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGE

Query:  LTMKS
        LTM++
Subjt:  LTMKS

TrEMBL top hitse value%identityAlignment
A0A6A2WLX1 Reverse transcriptase5.9e-13045.24Show/hide
Query:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHC
        V DSF  +GV  D L+L LFPYSLRD A++WL+    GS+ +W +L + FL +Y PPN N +LR+EI  FRQ +DES  E W+R+K LL+KC +HG    
Subjt:  VSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHC

Query:  IQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVR-STNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTA
         Q+  FYNG+N  T+ ++DASA G LL K   EA+ IL+RI+ N  Q+   R  + ++     E++   ++ A L++I N LKN+   +  +   ++ T+
Subjt:  IQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVR-STNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTA

Query:  VVNQVAEEACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGSN----VQAQQKVNQP-GFAKAQVLPQQNKQALPQQN
        +       AC+ C  NH+   CP+N  S+ FVGN     NNPYSN YN GWR HPNFSW  QG++       QQ  N+P G+  A      NK+AL    
Subjt:  VVNQVAEEACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGSN----VQAQQKVNQP-GFAKAQVLPQQNKQALPQQN

Query:  SGSSLEAMMKEFMARI---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRC-------GTTLS---
        S SSLEA ++EF++                        A IQS+ +S+RALE QVGQ+A  L+ R QG+LPS     +G  +  C       GT ++   
Subjt:  SGSSLEAMMKEFMARI---------------------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRC-------GTTLS---

Query:  ---------------------AAPTLCTTSTFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETV-SLTEECSA
                             AA        F  + K  N + QF+KF++IL QLHINIP +EA+EQMP YAKF+KDI TKK+++   ETV + TE CS+
Subjt:  ---------------------AAPTLCTTSTFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETV-SLTEECSA

Query:  ILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADK
          K  L PK  DPGSF IP SIG   +G+ALCDLG+S+NL+P S++ KLGIG+ARPT+V LQLAD+S   LEG++EDV+V+VDKF+F VDF+ILD E D 
Subjt:  ILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADK

Query:  DVPIILGRPFLATGRALIDVQKGELTMKSV
          PIILGRPFLATGR LID +KGELTM+ V
Subjt:  DVPIILGRPFLATGRALIDVQKGELTMKSV

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.7e-15345.52Show/hide
Query:  EQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFP
        E  N    N I +  +  RA+R Y VP+   L+  I R                                         + D+F   GV  DA+RL LFP
Subjt:  EQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIAR-----------------------------------------VSDSFVIQGVPRDALRLTLFP

Query:  YSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS
        +SLRD AKSWLNS   GSI TW++LA+KFL+K+FPP + AK+R++I  F Q + ES  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA+
Subjt:  YSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS

Query:  AGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFC
        AGGAL++K   +AY +LE ++ N+ QW   RS ++K     E+D + T+   +A ++  L  + V            AV N +    C  CG++H+Y+ C
Subjt:  AGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFC

Query:  PSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASM
        P N  SV FVGN   Q+NNPYSN YNPGWRNHPNFSW    +     + +  PGF       QQ +  +P++   S LE ++ +++++ DA IQS  AS+
Subjt:  PSNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASM

Query:  RALELQVGQLANELKARPQGKLPSILN-TLEGK-----VRSRCGTTLSAAPTLCTTSTF-----------------STKEKPKNQ---------------
        R LE QVGQLAN +  RPQG LPS      +GK     +  R G  +         S                      +K +NQ               
Subjt:  RALELQVGQLANELKARPQGKLPSILN-TLEGK-----VRSRCGTTLSAAPTLCTTSTF-----------------STKEKPKNQ---------------

Query:  ------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGA
              + QF+KFL + K+LHINIPF EA+EQMP+Y KFLKDIL+KK++LGEFETV LTEECSAIL+N LPPK KDPGSFTIP +IG     +AL DLGA
Subjt:  ------DGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGA

Query:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMK
        SINLMP S++ KLG+GE +PT+VTLQLADRS  Y  G IEDVLVKVDKFIFPVDF+ILD E D+ +PIILGRPFLAT  A+IDV++G+++ K
Subjt:  SINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMK

A0A6J1CPJ3 uncharacterized protein LOC1110129478.5e-14544.11Show/hide
Query:  NEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLF
        N+     V NPI +A+ R RA+R YA  +  +LN  +                                          +V+++F + G+  DALRLTLF
Subjt:  NEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLF

Query:  PYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDA
        P+SL   A +WLN+F   +I T  ++ +KFL KYFPP RNA +R EI+ FRQ E+E+ + AWERFK+L+R CP+ G+P C+Q+E F+   +  T  M++ 
Subjt:  PYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDA

Query:  SAGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNV--TVISHQQPPAMEPTAVVNQVAEEACVYCGE
        +A G   +K F+E  EIL+++S ++ QW   +   +  ++    VL +D +++++  +  I   LKN+     +    PA    + V Q+AE  C     
Subjt:  SAGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNV--TVISHQQPPAMEPTAVVNQVAEEACVYCGE

Query:  NHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMK
                        +  Q+ NPYSN YNPGW+ HPNFSW GQGS      N Q +Q    P F  +   P       QQ     P Q + S++E +MK
Subjt:  NHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMK

Query:  EF-----------MARIDAAIQS----------NQASMRALELQVGQLANELKARPQGKLPS-------ILNTLEGKVRS---------RCGTTLSAAPT
        EF           M R DA I+           N  ++R LE+Q+GQLANE++ RPQG LPS       I++    + +              ++S AP 
Subjt:  EF-----------MARIDAAIQS----------NQASMRALELQVGQLANELKARPQGKLPS-------ILNTLEGKVRS---------RCGTTLSAAPT

Query:  LCTTST---FSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIG
        +    +   F  +   KNQD  FRKFL+ILKQLHINIPFVEA+EQMP YAKFLKDI+T+KK+LGE+ETV+LTE  S + K+  PPK KDPGSFTI   IG
Subjt:  LCTTST---FSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIG

Query:  GKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKG
        GK++GRALCDLGA INLMPLS+++KL IG+A PTTVTL LADRSIT  EGKIEDVLVKVDKFIFP DFIILD EADKDVPIILGRPFLATG  LIDV+KG
Subjt:  GKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKG

Query:  ELTMK
        ELTM+
Subjt:  ELTMK

A0A6J1DY39 uncharacterized protein LOC1110256531.5e-15744.28Show/hide
Query:  VENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLFPYSLRDG
        V NPI +A+ R RA+R YA  +  +LN  +                                          +V+++F + G+  DALRLTLFP+S+   
Subjt:  VENPILIANDRTRAIRAYAVPMFNELNPGI-----------------------------------------ARVSDSFVIQGVPRDALRLTLFPYSLRDG

Query:  AKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALL
        A +WLN+F   +I TW ++ +KFL KYFPP RNA +R EI+ FRQ E+E+ + AWERFK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G   
Subjt:  AKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALL

Query:  AKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPT--AVVNQVAEEACVYCGENHNYEFC
        +K F+E  EIL+++S ++ QW   +S  +  ++    VL +D +++++  +  I   LKN+   +     A   T  + V Q+AE  C YCG+ H  E C
Subjt:  AKPFDEAYEILERISINSCQWSDVRSTNKKVKS----VLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPT--AVVNQVAEEACVYCGENHNYEFC

Query:  PSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMKEFMA
        PSNP+S+++VG    Q+ NPYSN YNPGW+ HPNFSW GQGS      N Q ++    PGF  +   P       QQ     P Q + S++E +MKE + 
Subjt:  PSNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSGSSLEAMMKEFMA

Query:  RIDAAIQS---------------------NQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAA------PTLCTTSTFS-TKEKP
        + DA ++                      N  ++R LE+Q+GQL NE++ RPQG LPS         +  C +  + +      P +   S+ S ++EK 
Subjt:  RIDAAIQS---------------------NQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAA------PTLCTTSTFS-TKEKP

Query:  ------------------------------------KNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK
                                            KNQD  FRKFL+ILKQLHINIPFVEA+EQMP YAKF+KDI+T+KK+LGE+ETV+LTE  S + K
Subjt:  ------------------------------------KNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK

Query:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVP
        + +PPK KDPGSFTIP  IGGK++GRALCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT  EGKIEDVLVKVDKFIFP DFIILD EADKDVP
Subjt:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVP

Query:  IILGRPFLATGRALIDVQKGELTMK
        IILGRPFLATG  LIDV+KGELTM+
Subjt:  IILGRPFLATGRALIDVQKGELTMK

A0A6J1EQ90 uncharacterized protein LOC1114364111.0e-19052.49Show/hide
Query:  QLKNPLFEQNEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIARV------------------------------------------------SDS
        + +NP    N+++     NPI +A+DR RAIRAYA P   ELNP I R                                                 SDS
Subjt:  QLKNPLFEQNEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIARV------------------------------------------------SDS

Query:  FVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQME
        F  QGV +D +RL+LFPY LRDGAKSWLN+ APG+I +W+ LAE FL KYFPP RNA+ ++EIV F+Q EDE+ SEA ERFKE+LRKCPHHGLPHCIQME
Subjt:  FVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQME

Query:  TFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPTAVVN
        TFYNGLN VT+ +VDASA GA+L+K ++EAYEILERI+ N+CQW+DVRS   +K + VLEVD +S+I A LA + N L+N+ +       A +   A +N
Subjt:  TFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRST-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-MEPTAVVN

Query:  QVAEEACVYCGENHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQA
        Q A E+CVYCGE H ++ CPSNPAS+F+VGNQ      +NNP+SN YNPGWRNHPNFSW GQ   N Q   K N P          ++  QV  Q     
Subjt:  QVAEEACVYCGENHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFSWGGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQA

Query:  LPQQNSGSSLEAMMKEFMARIDAAIQSNQASMRALELQVG------------------QLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAAPTLCTT
          Q  S +S+E+++KE+MA+ DA IQS QAS+R LE+Q+G                  Q  NE  A  +          E        ++   + T   +
Subjt:  LPQQNSGSSLEAMMKEFMARIDAAIQSNQASMRALELQVG------------------QLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAAPTLCTT

Query:  STFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA
          F  + K K ++  F KF++ILK++HINIP VEA++QMPNY KFLKD+L  +++  EF+ VSL EECSAILKN +P K KDPGSFTIPVSIGGKELGRA
Subjt:  STFSTKEKPKNQDGQFRKFLEILKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMKSV
        LCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITY EGKIED+L++VDKFIF  DFIILDYE D DVPIILGRPFL  GR L+DV KG +T++  
Subjt:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMKSV

Query:  MRK
         +K
Subjt:  MRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTGGGCTATAGGAGAGGTGCTATGAAGAGCACCAGTGATGGGGCATCTGACATAAGACCACCTCGATGCCTATTCGGTCAACACGTCGAGCTAATGACGTTAAA
CAAGCGTTATTGGGAGGCAACCCAAGTCTTGGTTTCAGTCTCTCCTTTGCTTTCAAGCTTTCAAGCTTTCAGTCAAGAAGTCATCATCATTCCAGGCCAATTTGGATTTG
AATGCTTAGGAGCGAATTATTGGGTTCTGATAACTGCCCAAAAGATTATGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGTAGCAAAACTTGGAACAAAACTGCC
ACATCACAGCTCAAAAACCCGTTGTTTGAACAAAATGAGCAGCAAAACATTCAGGTTGAGAATCCTATCTTGATAGCGAACGATAGGACTAGAGCCATTCGAGCGTATGC
TGTCCCAATGTTTAATGAGTTGAATCCAGGGATTGCACGAGTTAGCGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTC
TTAGAGATGGAGCAAAGTCATGGTTGAACTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGTTAGCTGAAAAATTTTTGAGTAAATATTTCCCACCTAATAGAAAT
GCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGAGTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTT
ACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAG
CCTATGAAATTTTAGAAAGGATATCTATTAATAGTTGTCAGTGGTCAGATGTTAGAAGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATT
AGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAGGA
AGCATGTGTCTATTGTGGTGAAAATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTACTCTAACTTTTATA
ATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCGCAGGTATTGCCC
CAGCAAAATAAGCAAGCTTTGCCCCAGCAAAATTCAGGAAGTTCTCTTGAGGCGATGATGAAAGAATTTATGGCTCGTATAGACGCCGCAATTCAAAGTAATCAAGCTTC
GATGAGGGCTCTGGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCCTCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCA
GATGTGGAACCACCTTAAGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGAAAAGCCTAAGAATCAGGATGGTCAATTTAGGAAATTTTTAGAGATT
CTTAAGCAGTTGCACATAAATATCCCTTTTGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTT
TGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTGAAGAATGGGCTACCACCCAAGGCTAAGGATCCGGGATCATTTACTATACCTGTATCTATAGGTGGAAAAG
AGTTAGGTAGAGCACTCTGTGATTTAGGCGCGAGCATTAACCTAATGCCTCTTTCGGTCTATCGAAAGCTTGGTATAGGTGAAGCTAGGCCTACCACAGTCACACTCCAA
CTAGCTGATAGGTCTATCACATATCTAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGA
TAAAGATGTCCCAATAATTCTAGGTCGTCCATTCTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAGTTAACAATGAAGTCTGTAATGAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTGGGCTATAGGAGAGGTGCTATGAAGAGCACCAGTGATGGGGCATCTGACATAAGACCACCTCGATGCCTATTCGGTCAACACGTCGAGCTAATGACGTTAAA
CAAGCGTTATTGGGAGGCAACCCAAGTCTTGGTTTCAGTCTCTCCTTTGCTTTCAAGCTTTCAAGCTTTCAGTCAAGAAGTCATCATCATTCCAGGCCAATTTGGATTTG
AATGCTTAGGAGCGAATTATTGGGTTCTGATAACTGCCCAAAAGATTATGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGTAGCAAAACTTGGAACAAAACTGCC
ACATCACAGCTCAAAAACCCGTTGTTTGAACAAAATGAGCAGCAAAACATTCAGGTTGAGAATCCTATCTTGATAGCGAACGATAGGACTAGAGCCATTCGAGCGTATGC
TGTCCCAATGTTTAATGAGTTGAATCCAGGGATTGCACGAGTTAGCGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTC
TTAGAGATGGAGCAAAGTCATGGTTGAACTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGTTAGCTGAAAAATTTTTGAGTAAATATTTCCCACCTAATAGAAAT
GCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGAGTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTT
ACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAG
CCTATGAAATTTTAGAAAGGATATCTATTAATAGTTGTCAGTGGTCAGATGTTAGAAGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATT
AGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAGGA
AGCATGTGTCTATTGTGGTGAAAATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTACTCTAACTTTTATA
ATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCGCAGGTATTGCCC
CAGCAAAATAAGCAAGCTTTGCCCCAGCAAAATTCAGGAAGTTCTCTTGAGGCGATGATGAAAGAATTTATGGCTCGTATAGACGCCGCAATTCAAAGTAATCAAGCTTC
GATGAGGGCTCTGGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCCTCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCA
GATGTGGAACCACCTTAAGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGAAAAGCCTAAGAATCAGGATGGTCAATTTAGGAAATTTTTAGAGATT
CTTAAGCAGTTGCACATAAATATCCCTTTTGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTT
TGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTGAAGAATGGGCTACCACCCAAGGCTAAGGATCCGGGATCATTTACTATACCTGTATCTATAGGTGGAAAAG
AGTTAGGTAGAGCACTCTGTGATTTAGGCGCGAGCATTAACCTAATGCCTCTTTCGGTCTATCGAAAGCTTGGTATAGGTGAAGCTAGGCCTACCACAGTCACACTCCAA
CTAGCTGATAGGTCTATCACATATCTAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGA
TAAAGATGTCCCAATAATTCTAGGTCGTCCATTCTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAGTTAACAATGAAGTCTGTAATGAGGAAGTGA
Protein sequenceShow/hide protein sequence
MLVGYRRGAMKSTSDGASDIRPPRCLFGQHVELMTLNKRYWEATQVLVSVSPLLSSFQAFSQEVIIIPGQFGFECLGANYWVLITAQKIMLLGDLREQILCCSKTWNKTA
TSQLKNPLFEQNEQQNIQVENPILIANDRTRAIRAYAVPMFNELNPGIARVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRN
AKLRSEIVGFRQLEDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEILERISINSCQWSDVRSTNKKVKSVLEVDGVSTI
RADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGENHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKVNQPGFAKAQVLP
QQNKQALPQQNSGSSLEAMMKEFMARIDAAIQSNQASMRALELQVGQLANELKARPQGKLPSILNTLEGKVRSRCGTTLSAAPTLCTTSTFSTKEKPKNQDGQFRKFLEI
LKQLHINIPFVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
LADRSITYLEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMKSVMRK