; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026036 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026036
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr10:27508037..27513149
RNA-Seq ExpressionLag0026036
SyntenyLag0026036
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]5.9e-13239.59Show/hide
Query:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE
        DP+IERTFRR RRE  +       V+ L+Q                       E+ NN   N I +  + +RA+R Y  P+   L+  I RP I A NFE
Subjt:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE

Query:  MKPVMFQMLQTVGQFHGVPM------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV
        +KP   QM+Q+  QF G+P                         DA+RL LFP+SLRD AK+WLNS   GSI TW +LA+KFL+K+FPP + AK+R++I 
Subjt:  MKPVMFQMLQTVGQFHGVPM------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV

Query:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQWSDVRGTNKKVKSVLEVDSVS
         F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G+ +                       E +++N+ QW   R  ++K     E+D++ 
Subjt:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQWSDVRGTNKKVKSVLEVDSVS

Query:  TIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSWEGQGSNVQAQ
        T+   +A L+  L   T+  H         AV N +    C  CG+ H+Y+  P N  SV FV                 GWRNHPNFSW    +     
Subjt:  TIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSWEGQGSNVQAQ

Query:  QKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGK
        + +  PGF       QQ +  +P++ S+  LE ++ +Y+++TDA IQS+ AS+R LE QVGQLAN +  RPQG LPSDT+ +P+  GKEQ +A+TLRSGK
Subjt:  QKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGK

Query:  PLEERKEPSKPQDVEKNCDKNIVVEKELETGQ-------GAGGSN-------------------------------------NDAIEQMPDYAKFLKDIL
         +E   + +   ++E + DK  + E E+E  Q         G S                                       +A+EQMP Y KFLKDIL
Subjt:  PLEERKEPSKPQDVEKNCDKNIVVEKELETGQ-------GAGGSN-------------------------------------NDAIEQMPDYAKFLKDIL

Query:  IKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLV
         KK++LGEFE V LTEECSAIL+N LP K KD  SFTI  +IG                       ++LG+GE +PT+VTLQLADRS  YP+G IEDVLV
Subjt:  IKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLV

Query:  KVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILE
        KVD FIFP+DF+ILD E D+ +PIILGRP LAT  A+IDV++G+++ +V  E V+FN+F A K+P     C  + +++
Subjt:  KVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILE

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]2.3e-12838.68Show/hide
Query:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK
        NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QML  +GQF                         G+  DALRLTLFP+S+   A 
Subjt:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK

Query:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ------
        AWLN+F   +I TW+++ +KFL KYFPP RNA +R EI+ FRQ E+E  + AWERFK+L+  CP+ G+P C+Q+E F+ G        LNGA        
Subjt:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ------

Query:  ---------ERISTNSCQWSDVRGTNKKVKS----VLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVN-----QVAEEACVYCGEDHNYEF
                 +++S ++ QW   +   +  ++    VL +D++++++  I  +   LKN+         AA   A  N     Q+AE  C YCG+ H  E 
Subjt:  ---------ERISTNSCQWSDVRGTNKKVKS----VLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVN-----QVAEEACVYCGEDHNYEF

Query:  YPSNPASVFFV-----------------GWRNHPNFSWEGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEYM
         PSNP+S+++V                 GW+ HPNFSW GQGS      N Q ++    PGF  +   P       QQ     P Q + S++E ++KE +
Subjt:  YPSNPASVFFV-----------------GWRNHPNFSWEGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEYM

Query:  ARTDAAIQS---------------------KQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSG-------KPLEERKEPSKPQ
         + DA ++                         ++R LE+Q+GQL NE++ RPQG LPS TE PRR GKE   ++  RSG        P E    PS+ +
Subjt:  ARTDAAIQS---------------------KQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSG-------KPLEERKEPSKPQ

Query:  DVEKNCDKNIVVEKELETGQGAGGSNN------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSA
        D +   DK  +VE  +        SN+                                    +A+EQMP YAKF+KDI+ +KK+LGE+E V+LTE  S 
Subjt:  DVEKNCDKNIVVEKELETGQGAGGSNN------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSA

Query:  ILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADK
        + K+ +P K KD  SFTI   IGGK++G                      IG+A PTTVTLQLADRSIT P+GKIEDVLVKVD FIFP DFIILD EADK
Subjt:  ILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADK

Query:  DVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDEDI
        DVPIILGRP LATG  LIDV+KGELTMRV +++V FN+  AMKY D+ME+C+ I I +  I    + D  N  +E   E +  E I
Subjt:  DVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDEDI

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.1e-16544.51Show/hide
Query:  MNDPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDE---NPILVANDRSRAIRAYAFPMFDELN
        MN P  + F LDPEIERTFRRR +++++   Q  N+ ++  G             Q N+  E      NQ+    NPI +A+DR RAIRAYA P  +ELN
Subjt:  MNDPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDE---NPILVANDRSRAIRAYAFPMFDELN

Query:  PGIERPQIEAANFEMKPVMFQMLQTVGQFHGVPM-------------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELA
        P I RP+I+   FE+KPVMFQMLQT+GQFHG+P+                               D +RL+LFPY LRDGAK+WLN+ APG+I +WN LA
Subjt:  PGIERPQIEAANFEMKPVMFQMLQTVGQFHGVPM-------------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELA

Query:  EKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQ
        E FL KYFPP RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+                       ERI++N+CQ
Subjt:  EKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQ

Query:  WSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPA-AEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV--------------
        W+DVR    +K + VLEVD++S+I   +A + N L+N+ +     + A    AA +NQ A E+CVYCGE+H ++  PSNPAS+F+V              
Subjt:  WSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPA-AEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV--------------

Query:  ------GWRNHPNFSWEGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLA
              GWRNHPNFSW+GQ   N Q   K N P          ++  QV  Q       Q  SE+S+E++IKEYMA+ DA IQS+QAS+R LE+Q+G   
Subjt:  ------GWRNHPNFSWEGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLA

Query:  NELKARPQGKLPSDTEHPRREG-------------KEQVKAVTLRSGKPLEERKEPSKP-------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQM
        N  +     +  +DT+    E              +EQ K  T  S +       PS P       +  E + +K + + KE+           +A++QM
Subjt:  NELKARPQGKLPSDTEHPRREG-------------KEQVKAVTLRSGKPLEERKEPSKP-------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQM

Query:  PDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKE----------------------LGIGEARPTTVTLQLADRSIT
        P+Y KFLKD+LI +++  EF++VSL EECSAILKN +P K KD  SFTI VSIGGKE                      LGIGEARPTTVTLQLADRSIT
Subjt:  PDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKE----------------------LGIGEARPTTVTLQLADRSIT

Query:  YPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDS
        YP+GKIED+L++VD FIF  DFIILDYE D DVPIILGRP L  GR L+DV KG +T+R+  ++V+FN+  +MKYP  +E+CS        + E T Q +
Subjt:  YPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDS

Query:  ANKHLEDHGER
          +  ++  E+
Subjt:  ANKHLEDHGER

XP_024949903.1 uncharacterized protein LOC112496715 [Citrus sinensis]1.3e-13942.23Show/hide
Query:  QPLERNEEQNNQ-------DENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQFHGVP---------------------
        QP+  +E  N Q       + N I +ANDR +AIR YA      ++PGI RP ++A NFE+KPVMFQMLQTVGQF+G+P                     
Subjt:  QPLERNEEQNNQ-------DENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQFHGVP---------------------

Query:  ---MDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYN
            +ALRL LF +SLRD A+AWLNS  P SI TW++LA+KFL KYFPP +NAKLR+EI  F QLEDE+  +AWERFKELLR+CPHHG+P CIQ+ET YN
Subjt:  ---MDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYN

Query:  GLNGATQ-----------------------ERISTNSCQWSDVR-GTNKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEE
        GLN +T+                       ERI+ N+ QW   R    +    V  VD+++ +   +  L   +K +T             A VNQ+++ 
Subjt:  GLNGATQ-----------------------ERISTNSCQWSDVR-GTNKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEE

Query:  ACVYCGEDHNYEFYPSNPASVFFV------------------GWRNHPNFSWEGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSE-SSLEAIIKE
        +CVYCGE H ++  P NP SV +V                  GWR H NFSW  Q     A    N+P  A+     QQN++ +   N + SSLE +IK+
Subjt:  ACVYCGEDHNYEFYPSNPASVFFV------------------GWRNHPNFSWEGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSE-SSLEAIIKE

Query:  YMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL------------------------------------
        Y+ R +A +QS   S+R LE Q+GQLA  L  RPQG LPS+TE+PRREGKE  K + LRSG+ L                                    
Subjt:  YMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL------------------------------------

Query:  -------------EERKEPSKP------------------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKRLGEFEIVSLT
                      + K+P++P                  Q  +K  +K + V K+L           + +EQM +YAKFLKDIL KK+RL EFE V++T
Subjt:  -------------EERKEPSKP------------------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKRLGEFEIVSLT

Query:  EECSAILKNGLPTKAKDLRSFTISVSIGGKELGIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRAL
        +E S +L++ +P K KD                     PTTV+LQLADRS  YP+G+I+DVLVKVD FIFP+DFI+LD+EADK+VPIILGRP LATG+ L
Subjt:  EECSAILKNGLPTKAKDLRSFTISVSIGGKELGIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRAL

Query:  IDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVE
        IDV+KGELTMRV +++V FNV   MK PDE+EDC+FI +++  IVE
Subjt:  IDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVE

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]5.8e-16447.2Show/hide
Query:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK
        +PI++ +DR+RAIR YA PMF+ELNPGI RP+I+A  FE+KPVMFQMLQTVGQF                         GV  +  RL LFP+SLRD A+
Subjt:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK

Query:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ--------------
        +WLN+ +P S+  WN+ AEKFL KYFPP RNAK RSEI+ F QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q              
Subjt:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ--------------

Query:  ---------ERISTNSCQWSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASV
                 E I++N+ QWS+ R   ++KV  VLEVD+++ +   +A + N LKN+++ + + +   +PAA + Q  + +CV+C E H +E  PSNP SV
Subjt:  ---------ERISTNSCQWSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASV

Query:  FFVG------------------WRNHPNFSWEGQG--SNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALE
         ++G                  W+NHPN SW  +    +    ++   PGF++    PQ  + + P     SSLE+++++YMA+ DA IQS+ A +R LE
Subjt:  FFVG------------------WRNHPNFSWEGQG--SNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALE

Query:  LQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL----EERKEPSKPQDVE------KNCDKNIVVEKELETGQGAGGSNN---------
        LQ+G LANELKARPQG LPSDTE+PRR+GKEQ K++ LRSGK L    EE K   +P  ++      K   + I   + ++T  G   ++          
Subjt:  LQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL----EERKEPSKPQDVE------KNCDKNIVVEKELETGQGAGGSNN---------

Query:  ------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKEL
                                            +A+EQMP+Y KFLKDIL KK+RLGEFE   LTE   A+LKN +P K KD  SFTI +SIGG++L
Subjt:  ------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKEL

Query:  GIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEME
        GIGEARPTTVTLQLADRS+ +P GKIEDVLV+VD FIFP DFIILDYE D++VPIIL RP LATGR LIDV+KGELTMR  +E+  F VF+ ++ PD + 
Subjt:  GIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEME

Query:  DCSFIRILENTIVE
        +C  I  ++  +VE
Subjt:  DCSFIRILENTIVE

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase9.8e-10936.18Show/hide
Query:  DPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIER
        +P   R++L  EIERTF+ RRR+   H  Q   V                   ++NQ +             I+  N  +  +R  A P   E    +  
Subjt:  DPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIER

Query:  PQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNR
        P++  A  +++  M +M+Q   QF                         GV  DALRL LF +SL   A  W  S    SI TW +L E+F+SK+F P +
Subjt:  PQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNR

Query:  NAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGATQE------RISTNSCQWSDVRGTNKKVK
         A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G                 L+G T E       +  N  +    R T  K  
Subjt:  NAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGATQE------RISTNSCQWSDVRGTNKKVK

Query:  SVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSW-
         V+EVD V+ +   I  L  ++KN  +   Q  P               C  CGE H  +  P +  S+ FV                 GWR HPNFSW 
Subjt:  SVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSW-

Query:  --EGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGK
          +GQG  ++ QQ               Q +   P Q  + SLE  + ++MA T        A+ + +E Q+GQLAN + +RP+  LPS+TE +PR++ K
Subjt:  --EGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGK

Query:  EQVKAVTLRSGKPLEE-RKEPSKPQDVEK-NCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKA
         Q +AVTLR+G  L+E  KEP+K ++ E  + +K   +E  LE           A+EQMP Y KF+KDIL KK+RLG++E V+LTEECSAI++N LP K 
Subjt:  EQVKAVTLRSGKPLEE-RKEPSKPQDVEK-NCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKA

Query:  KDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPL
        KD  SFTI  +IG                       + LG+GEA+PT++TLQLADRS+TYP G IED+LVKVD FIFP DF++LD E D +VPIILGRP 
Subjt:  KDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPL

Query:  LATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDC----SFIRILENTIVETTMQDSANKHLEDHGERSIDEDIAEIK--------QTMSDNND
        LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C     F ++  N  +     D   + L D  +   +ED   +K        ++    + 
Subjt:  LATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDC----SFIRILENTIVETTMQDSANKHLEDHGERSIDEDIAEIK--------QTMSDNND

Query:  DPTVAASNLRPSVLMLGSLEL
        + T  +  L+PS+    +LEL
Subjt:  DPTVAASNLRPSVLMLGSLEL

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.8e-13239.59Show/hide
Query:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE
        DP+IERTFRR RRE  +       V+ L+Q                       E+ NN   N I +  + +RA+R Y  P+   L+  I RP I A NFE
Subjt:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE

Query:  MKPVMFQMLQTVGQFHGVPM------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV
        +KP   QM+Q+  QF G+P                         DA+RL LFP+SLRD AK+WLNS   GSI TW +LA+KFL+K+FPP + AK+R++I 
Subjt:  MKPVMFQMLQTVGQFHGVPM------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV

Query:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQWSDVRGTNKKVKSVLEVDSVS
         F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G+ +                       E +++N+ QW   R  ++K     E+D++ 
Subjt:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQWSDVRGTNKKVKSVLEVDSVS

Query:  TIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSWEGQGSNVQAQ
        T+   +A L+  L   T+  H         AV N +    C  CG+ H+Y+  P N  SV FV                 GWRNHPNFSW    +     
Subjt:  TIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV-----------------GWRNHPNFSWEGQGSNVQAQ

Query:  QKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGK
        + +  PGF       QQ +  +P++ S+  LE ++ +Y+++TDA IQS+ AS+R LE QVGQLAN +  RPQG LPSDT+ +P+  GKEQ +A+TLRSGK
Subjt:  QKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGK

Query:  PLEERKEPSKPQDVEKNCDKNIVVEKELETGQ-------GAGGSN-------------------------------------NDAIEQMPDYAKFLKDIL
         +E   + +   ++E + DK  + E E+E  Q         G S                                       +A+EQMP Y KFLKDIL
Subjt:  PLEERKEPSKPQDVEKNCDKNIVVEKELETGQ-------GAGGSN-------------------------------------NDAIEQMPDYAKFLKDIL

Query:  IKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLV
         KK++LGEFE V LTEECSAIL+N LP K KD  SFTI  +IG                       ++LG+GE +PT+VTLQLADRS  YP+G IEDVLV
Subjt:  IKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGG----------------------KELGIGEARPTTVTLQLADRSITYPKGKIEDVLV

Query:  KVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILE
        KVD FIFP+DF+ILD E D+ +PIILGRP LAT  A+IDV++G+++ +V  E V+FN+F A K+P     C  + +++
Subjt:  KVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILE

A0A6J1CPJ3 uncharacterized protein LOC1110129471.2e-11938.68Show/hide
Query:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE
        DPEIE+T +R RREQR                      QQ  + ++ +    N+   N   NPI VA+ R RA+R YA  + ++LN  +       A FE
Subjt:  DPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFE

Query:  MKPVMFQMLQTVGQFH------------------------GVPMDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV
         KP+M QML  +GQF                         G+  DALRLTLFP+SL   A AWLN+F   +I T +++ +KFL KYFPP RNA +R EI+
Subjt:  MKPVMFQMLQTVGQFH------------------------GVPMDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIV

Query:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ---------------ERISTNSCQWSDVRGTNKKVKS----VLEV
         FRQ E+E  + AWERFK+L+R CP+ G+P C+Q+E F+          LNGA                 +++S ++ QW   +   +  ++    VL +
Subjt:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ---------------ERISTNSCQWSDVRGTNKKVKS----VLEV

Query:  DSVSTIRVDIAMLANTLKNV--TMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFVGWRNHPNFSWEGQGS------NVQAQQKVNQ
        D++++++  I  +   LKN+     +    PA    + V Q+AE  C    + +  +F P   ++++  GW+ HPNFSW GQGS      N Q +Q    
Subjt:  DSVSTIRVDIAMLANTLKNV--TMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFVGWRNHPNFSWEGQGS------NVQAQQKVNQ

Query:  PGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEY-----------MARTDAAIQS----------KQASMRALELQVGQLANELKARPQGKLPS
        P F  +   P       QQ     P Q + S++E ++KE+           M RTDA I+              ++R LE+Q+GQLANE++ RPQG LPS
Subjt:  PGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEY-----------MARTDAAIQS----------KQASMRALELQVGQLANELKARPQGKLPS

Query:  DTEHPRR---EGKEQVKAVTLRSGKPLEE-------------RKEPSKPQD-VEKNCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKR
         TE PRR       + K   +   K +E              R  P  PQ  V KN D N     ++           +A+EQMP YAKFLKDI+ +KK+
Subjt:  DTEHPRR---EGKEQVKAVTLRSGKPLEE-------------RKEPSKPQD-VEKNCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKR

Query:  LGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTF
        LGE+E V+LTE  S + K+  P K KD  SFTI   IGGK++G                      IG+A PTTVTL LADRSIT P+GKIEDVLVKVD F
Subjt:  LGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTF

Query:  IFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDED
        IFP DFIILD EADKDVPIILGRP LATG  LIDV+KGELTMRV +++V FN+  AMKYPD+ E+C  I I +  I    + D  N  +E   E +  E 
Subjt:  IFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDED

Query:  IAEI
        I  I
Subjt:  IAEI

A0A6J1DY39 uncharacterized protein LOC1110256531.1e-12838.68Show/hide
Query:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK
        NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QML  +GQF                         G+  DALRLTLFP+S+   A 
Subjt:  NPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANFEMKPVMFQMLQTVGQF------------------------HGVPMDALRLTLFPYSLRDGAK

Query:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ------
        AWLN+F   +I TW+++ +KFL KYFPP RNA +R EI+ FRQ E+E  + AWERFK+L+  CP+ G+P C+Q+E F+ G        LNGA        
Subjt:  AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG--------LNGATQ------

Query:  ---------ERISTNSCQWSDVRGTNKKVKS----VLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVN-----QVAEEACVYCGEDHNYEF
                 +++S ++ QW   +   +  ++    VL +D++++++  I  +   LKN+         AA   A  N     Q+AE  C YCG+ H  E 
Subjt:  ---------ERISTNSCQWSDVRGTNKKVKS----VLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVN-----QVAEEACVYCGEDHNYEF

Query:  YPSNPASVFFV-----------------GWRNHPNFSWEGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEYM
         PSNP+S+++V                 GW+ HPNFSW GQGS      N Q ++    PGF  +   P       QQ     P Q + S++E ++KE +
Subjt:  YPSNPASVFFV-----------------GWRNHPNFSWEGQGS------NVQAQQKVNQPGFAKAQVLP-------QQNKQALPQQNSESSLEAIIKEYM

Query:  ARTDAAIQS---------------------KQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSG-------KPLEERKEPSKPQ
         + DA ++                         ++R LE+Q+GQL NE++ RPQG LPS TE PRR GKE   ++  RSG        P E    PS+ +
Subjt:  ARTDAAIQS---------------------KQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSG-------KPLEERKEPSKPQ

Query:  DVEKNCDKNIVVEKELETGQGAGGSNN------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSA
        D +   DK  +VE  +        SN+                                    +A+EQMP YAKF+KDI+ +KK+LGE+E V+LTE  S 
Subjt:  DVEKNCDKNIVVEKELETGQGAGGSNN------------------------------------DAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSA

Query:  ILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADK
        + K+ +P K KD  SFTI   IGGK++G                      IG+A PTTVTLQLADRSIT P+GKIEDVLVKVD FIFP DFIILD EADK
Subjt:  ILKNGLPTKAKDLRSFTISVSIGGKELG----------------------IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADK

Query:  DVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDEDI
        DVPIILGRP LATG  LIDV+KGELTMRV +++V FN+  AMKY D+ME+C+ I I +  I    + D  N  +E   E +  E I
Subjt:  DVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDSANKHLEDHGERSIDEDI

A0A6J1EQ90 uncharacterized protein LOC1114364115.1e-16644.51Show/hide
Query:  MNDPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDE---NPILVANDRSRAIRAYAFPMFDELN
        MN P  + F LDPEIERTFRRR +++++   Q  N+ ++  G             Q N+  E      NQ+    NPI +A+DR RAIRAYA P  +ELN
Subjt:  MNDPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDE---NPILVANDRSRAIRAYAFPMFDELN

Query:  PGIERPQIEAANFEMKPVMFQMLQTVGQFHGVPM-------------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELA
        P I RP+I+   FE+KPVMFQMLQT+GQFHG+P+                               D +RL+LFPY LRDGAK+WLN+ APG+I +WN LA
Subjt:  PGIERPQIEAANFEMKPVMFQMLQTVGQFHGVPM-------------------------------DALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELA

Query:  EKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQ
        E FL KYFPP RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+                       ERI++N+CQ
Subjt:  EKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------------------ERISTNSCQ

Query:  WSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPA-AEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV--------------
        W+DVR    +K + VLEVD++S+I   +A + N L+N+ +     + A    AA +NQ A E+CVYCGE+H ++  PSNPAS+F+V              
Subjt:  WSDVRGT-NKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPA-AEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFV--------------

Query:  ------GWRNHPNFSWEGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLA
              GWRNHPNFSW+GQ   N Q   K N P          ++  QV  Q       Q  SE+S+E++IKEYMA+ DA IQS+QAS+R LE+Q+G   
Subjt:  ------GWRNHPNFSWEGQG-SNVQAQQKVNQP---------GFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLA

Query:  NELKARPQGKLPSDTEHPRREG-------------KEQVKAVTLRSGKPLEERKEPSKP-------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQM
        N  +     +  +DT+    E              +EQ K  T  S +       PS P       +  E + +K + + KE+           +A++QM
Subjt:  NELKARPQGKLPSDTEHPRREG-------------KEQVKAVTLRSGKPLEERKEPSKP-------QDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQM

Query:  PDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKE----------------------LGIGEARPTTVTLQLADRSIT
        P+Y KFLKD+LI +++  EF++VSL EECSAILKN +P K KD  SFTI VSIGGKE                      LGIGEARPTTVTLQLADRSIT
Subjt:  PDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKE----------------------LGIGEARPTTVTLQLADRSIT

Query:  YPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDS
        YP+GKIED+L++VD FIF  DFIILDYE D DVPIILGRP L  GR L+DV KG +T+R+  ++V+FN+  +MKYP  +E+CS        + E T Q +
Subjt:  YPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENTIVETTMQDS

Query:  ANKHLEDHGER
          +  ++  E+
Subjt:  ANKHLEDHGER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATCCGCCTAGGGTAAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGTAGACACCAAAATCAAATGGATAACGTGTC
GCGTCTCTCACAGGGTCCTGAAGATCCAGCAAACCCCCAGCAGAATCATTTGCTGCAGCAAAACCAGCCGCTGGAGCGAAATGAAGAGCAAAATAATCAGGATGAGAATC
CTATCTTGGTAGCAAATGATAGGTCCAGAGCCATTCGAGCGTATGCTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGAACGTCCTCAAATTGAGGCAGCAAATTTT
GAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGGAGTGCCTATGGATGCCCTTAGATTAACTTTGTTTCCGTATTCTCTTAGAGATGGAGC
AAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAAATTTCTTAGTAAATATTTTCCACCAAATAGAAATGCTAAATTGAGGA
GTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATC
CAAATGGAAACATTTTACAATGGGTTAAATGGAGCGACCCAAGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGT
GTTAGAGGTTGATAGTGTGTCCACCATTAGGGTTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAATGTTTAGTCATCAACAGCTGCCAGCTGCGGAGCCTG
CTGCGGTGGTGAACCAAGTTGCAGAAGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTATCCCAGCAATCCAGCTTCTGTTTTTTTTGTAGGTTGGCGC
AACCACCCCAACTTCTCATGGGAAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCACAGGTATTGCCCCAGCAAAATAAGCA
GGCTTTGCCCCAACAAAATTCAGAGAGTTCTCTTGAGGCAATAATAAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAGTAAACAAGCTTCAATGAGAGCCCTAG
AATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAGGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTG
ACTCTTAGGAGTGGTAAGCCACTAGAGGAAAGGAAAGAGCCTAGTAAACCCCAGGATGTAGAAAAGAATTGTGATAAAAATATTGTTGTTGAGAAAGAGTTGGAGACTGG
TCAGGGTGCTGGAGGCAGCAATAATGATGCTATAGAGCAAATGCCTGATTATGCTAAATTTCTTAAGGATATTTTAATTAAAAAGAAGAGGTTAGGTGAGTTTGAAATTG
TATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCAACAAAGGCTAAGGATCTAAGATCATTTACTATATCTGTGTCTATAGGTGGAAAAGAGTTAGGT
ATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAACTAGCTGATAGATCAATCACATATCCTAAGGGTAAAATTGAGGATGTCTTAGTGAAGGTAGATACGTTCATATT
TCCTATTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCACTTTTGGCCACTGGTAGGGCATTGATAGATGTTCAAAAAGGGG
AATTAACAATGAGAGTTTATAATGAGGAAGTAAAATTTAATGTTTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGTTCTTTCATTAGAATTCTGGAGAACACA
ATTGTTGAGACAACAATGCAAGATTCGGCAAACAAACATTTGGAAGATCATGGAGAGAGGTCTATTGATGAAGACATTGCTGAAATTAAACAAACAATGTCCGACAACAA
CGACGATCCGACAGTAGCTGCAAGTAATCTGCGGCCCAGCGTCTTGATGCTAGGCAGCCTGGAGCTTTATATAAGCTGTGTTTTGGCTACGGGAATGGGCATCATGCAAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAACGATCCGCCTAGGGTAAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGTAGACACCAAAATCAAATGGATAACGTGTC
GCGTCTCTCACAGGGTCCTGAAGATCCAGCAAACCCCCAGCAGAATCATTTGCTGCAGCAAAACCAGCCGCTGGAGCGAAATGAAGAGCAAAATAATCAGGATGAGAATC
CTATCTTGGTAGCAAATGATAGGTCCAGAGCCATTCGAGCGTATGCTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGAACGTCCTCAAATTGAGGCAGCAAATTTT
GAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGGAGTGCCTATGGATGCCCTTAGATTAACTTTGTTTCCGTATTCTCTTAGAGATGGAGC
AAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAAATTTCTTAGTAAATATTTTCCACCAAATAGAAATGCTAAATTGAGGA
GTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATC
CAAATGGAAACATTTTACAATGGGTTAAATGGAGCGACCCAAGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGT
GTTAGAGGTTGATAGTGTGTCCACCATTAGGGTTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAATGTTTAGTCATCAACAGCTGCCAGCTGCGGAGCCTG
CTGCGGTGGTGAACCAAGTTGCAGAAGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTATCCCAGCAATCCAGCTTCTGTTTTTTTTGTAGGTTGGCGC
AACCACCCCAACTTCTCATGGGAAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCACAGGTATTGCCCCAGCAAAATAAGCA
GGCTTTGCCCCAACAAAATTCAGAGAGTTCTCTTGAGGCAATAATAAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAGTAAACAAGCTTCAATGAGAGCCCTAG
AATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAGGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTG
ACTCTTAGGAGTGGTAAGCCACTAGAGGAAAGGAAAGAGCCTAGTAAACCCCAGGATGTAGAAAAGAATTGTGATAAAAATATTGTTGTTGAGAAAGAGTTGGAGACTGG
TCAGGGTGCTGGAGGCAGCAATAATGATGCTATAGAGCAAATGCCTGATTATGCTAAATTTCTTAAGGATATTTTAATTAAAAAGAAGAGGTTAGGTGAGTTTGAAATTG
TATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCAACAAAGGCTAAGGATCTAAGATCATTTACTATATCTGTGTCTATAGGTGGAAAAGAGTTAGGT
ATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAACTAGCTGATAGATCAATCACATATCCTAAGGGTAAAATTGAGGATGTCTTAGTGAAGGTAGATACGTTCATATT
TCCTATTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCACTTTTGGCCACTGGTAGGGCATTGATAGATGTTCAAAAAGGGG
AATTAACAATGAGAGTTTATAATGAGGAAGTAAAATTTAATGTTTTTAAGGCCATGAAGTATCCAGACGAGATGGAAGATTGTTCTTTCATTAGAATTCTGGAGAACACA
ATTGTTGAGACAACAATGCAAGATTCGGCAAACAAACATTTGGAAGATCATGGAGAGAGGTCTATTGATGAAGACATTGCTGAAATTAAACAAACAATGTCCGACAACAA
CGACGATCCGACAGTAGCTGCAAGTAATCTGCGGCCCAGCGTCTTGATGCTAGGCAGCCTGGAGCTTTATATAAGCTGTGTTTTGGCTACGGGAATGGGCATCATGCAAT
GA
Protein sequenceShow/hide protein sequence
MNDPPRVRFELDPEIERTFRRRRREQRRHQNQMDNVSRLSQGPEDPANPQQNHLLQQNQPLERNEEQNNQDENPILVANDRSRAIRAYAFPMFDELNPGIERPQIEAANF
EMKPVMFQMLQTVGQFHGVPMDALRLTLFPYSLRDGAKAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCI
QMETFYNGLNGATQERISTNSCQWSDVRGTNKKVKSVLEVDSVSTIRVDIAMLANTLKNVTMFSHQQLPAAEPAAVVNQVAEEACVYCGEDHNYEFYPSNPASVFFVGWR
NHPNFSWEGQGSNVQAQQKVNQPGFAKAQVLPQQNKQALPQQNSESSLEAIIKEYMARTDAAIQSKQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAV
TLRSGKPLEERKEPSKPQDVEKNCDKNIVVEKELETGQGAGGSNNDAIEQMPDYAKFLKDILIKKKRLGEFEIVSLTEECSAILKNGLPTKAKDLRSFTISVSIGGKELG
IGEARPTTVTLQLADRSITYPKGKIEDVLVKVDTFIFPIDFIILDYEADKDVPIILGRPLLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILENT
IVETTMQDSANKHLEDHGERSIDEDIAEIKQTMSDNNDDPTVAASNLRPSVLMLGSLELYISCVLATGMGIMQ