; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032479 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032479
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:33093201..33101245
RNA-Seq ExpressionLag0032479
SyntenyLag0032479
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]1.5e-10839.35Show/hide
Query:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL
        A AWLN+F   +I+TW+++V+KFL KYFPPTR+  +R EI+ FRQ E E  + AWERFK+L+  CP+  +P C+Q+E F+ G +I+T+ M++ +A G   
Subjt:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL

Query:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC
        +K+FNE  EIL+++S ++ QW   +   +  ++    VL +D +++++  I  +   LKN+               P  V Q+ E +C YCG+ H  E C
Subjt:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC

Query:  PNNPASVFFAG---NQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQV--PQAHQKVVNQSRFVKSQELP----------------QQN-------------
        P+NP+S+++ G    Q+ N YSN YNPGW  H NF+W GQGS++     Q +++      F  S   P                QQN             
Subjt:  PNNPASVFFAG---NQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQV--PQAHQKVVNQSRFVKSQELP----------------QQN-------------

Query:  ----------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----------KGAGGNNNDAGA--------
                                    +   ++R LE+Q+GQL  E++ RPQG++PS  E P R GKE           K  G    D  +        
Subjt:  ----------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----------KGAGGNNNDAGA--------

Query:  SGSVPD--VEPPY-VPPPPY----NPPLPFPQRQKPKNQD--------------------------------------------EWATT-----------
        + +VPD  VEP   VP  P      PP PFPQR   KNQD                                            E+ T            
Subjt:  SGSVPD--VEPPY-VPPPPY----NPPLPFPQRQKPKNQD--------------------------------------------EWATT-----------

Query:  -----KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVP
             K KDPGS TIP  IGGK++GRA CDLGASINLMPLS+++K  IG+A PTTVTL+LADRSIT P GKIEDVLV+VD FIFP DFIILD E DK+VP
Subjt:  -----KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVP

Query:  IILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME
        IILGRPFLATG  LIDV+K ELTM+V DQKV FNM DAMKY +D+E+C+ I +        LD+ +    E EL E
Subjt:  IILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]4.3e-12745.63Show/hide
Query:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG
        +DGAK+WLN+ AP +I +WN L E FL KYFPPTR+ + ++EI+ F+Q E+ET SEA ERFKE+LRKCPHH LP+CIQME FYNGLNIVT+ +VDASA G
Subjt:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG

Query:  ALLAKTFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVEVANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYEFCP
        A+L+KT+NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  V N L+N+       + + V     +NQ   ESCVYCGEEH ++ CP
Subjt:  ALLAKTFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVEVANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYEFCP

Query:  NNPASVFFAGNQ------RNNLYSNFYNPGWCNHLNFAWGGQG-SNSQVPQAHQ-----KVVNQSRFVKSQELPQQNK----------------------
        +NPAS+F+ GNQ      +NN +SN YNPGW NH NF+W GQ   N Q+P         ++ NQ  +  SQ++  Q K                      
Subjt:  NNPASVFFAGNQ------RNNLYSNFYNPGWCNHLNFAWGGQG-SNSQVPQAHQ-----KVVNQSRFVKSQELPQQNK----------------------

Query:  --------QQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGA--GGNNNDAGASGSVPDVEPPYVPPP---PYNPPLPFPQRQKPKNQD--
                QQASLR LE+Q+G      K   QG+  S      ++  E+ A    ++ D       P ++           Y P  PFPQR K K ++  
Subjt:  --------QQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGA--GGNNNDAGASGSVPDVEPPYVPPP---PYNPPLPFPQRQKPKNQD--

Query:  ----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSV
                                                                       K KDPGS TIPVSIGGKELGRA CDLGA+INLMPLS+
Subjt:  ----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYP
        Y+KLGIGEARPTTVTL+LADRSITYP GKIED+L+QVD FIF  DFIILDYEVD +VPIILGRPFL  G  L+DV K  +T+++  QKV+FN+ D+MKYP
Subjt:  YRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYP

Query:  NDIEDCSFIQVL------DEFVEDHFEKE
          IE+CS +  L      +E+ ++  E+E
Subjt:  NDIEDCSFIQVL------DEFVEDHFEKE

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]3.7e-0644.74Show/hide
Query:  LGYGMPANHQNPLLEQNGQQNNQAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV
        LG  +    +NP +  N  Q     NPI +A+DR+RAIRAY  P ++ELNP I+ P I+ + FE+K VMF ML+ +
Subjt:  LGYGMPANHQNPLLEQNGQQNNQAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.4e-11444.29Show/hide
Query:  QDSVSTLPSKRAFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLN
        QD++       + +D A+AWLNS    SI+TWNEL +KFL KYFPPT++ KL++EI  F Q E+E+  + WERFKELLR+CPHH +P CIQ+E FYNGLN
Subjt:  QDSVSTLPSKRAFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLN

Query:  IVTQCMVDASAVGALLAKTFNEAYEILERISTYSCQWSDVR-GSNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVNQVTEESCVYCGEE
          T+ MVDASA  ALL K++ EAYEILERI+  + QW   R  + ++   V  +D ++T+ A +  + N +K +T       P  V Q+ E SC+YCGEE
Subjt:  IVTQCMVDASAVGALLAKTFNEAYEILERISTYSCQWSDVR-GSNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVNQVTEESCVYCGEE

Query:  HNYEFCPNNPASVFFAGN----QRNNLYSNFYNPGWCNHLNFAW----------GGQGSNSQVPQAHQKVVNQSR----------------FVKSQELPQ
        H ++ CP N ASV + GN     +NN Y N YN GW  H NF+W           GQ  N+Q    HQ+   Q                    K++ + Q
Subjt:  HNYEFCPNNPASVFFAGN----QRNNLYSNFYNPGWCNHLNFAW----------GGQGSNSQVPQAHQKVVNQSR----------------FVKSQELPQ

Query:  QNKQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----KGAGGNNND-----------AGASGSVPD-----VEPPYVPPPPYNPPLPFP
           Q  SLR LE Q+GQLA  + +R QG++PS+ + P REG E         G N D             +S   P       +P +         +   
Subjt:  QNKQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----KGAGGNNND-----------AGASGSVPD-----VEPPYVPPPPYNPPLPFP

Query:  QRQKPKNQDEWATT--------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLV
        +  +P N  E   T              K KDPGS TIP SIG +  GRA CDLGA+INLM LSV+++L + E RPTTVTL+LA+RS  YP  KIEDVLV
Subjt:  QRQKPKNQDEWATT--------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLV

Query:  QVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQVLDEFVEDHFEK
        +VD FIFPVDFI+LD+E DK VPIILGRPFLA    LIDVQK ELTM+++DQ+V FN+ +AMK  ++ +DC+F+ V+D  V D   K
Subjt:  QVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQVLDEFVEDHFEK

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]6.4e-10740.61Show/hide
Query:  AFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASA
        + +D A++WLN+ +P S++ WN+  EKFL KYFPPTR+ K RSEIM F Q E+E+ S+AWERFKELLRKCPHH +P+CIQME FYNGLN  +Q ++DASA
Subjt:  AFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASA

Query:  VGALLAKTFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQ-QPPVVNQVTEESCVYCGEEHNYEFCPNNP
         GA+L+K++NEA+EILE I++ + QWS+ R   ++KV  VLEVD ++ +   +  + N LKN+++ +    QP    Q  + SCV+C E H +E CP+NP
Subjt:  VGALLAKTFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQ-QPPVVNQVTEESCVYCGEEHNYEFCPNNP

Query:  ASVFFAGNQ----RNNLYSNFYNPGWCNHLNFAWGG--------QGSNSQVPQAHQKVVNQSRFVKSQELPQQN--------------KQQASLRALELQ
         SV + GNQ     N  +SN YN  W NH N +WG         QG  +  P   Q++ +      SQ    ++               Q A LR LELQ
Subjt:  ASVFFAGNQ----RNNLYSNFYNPGWCNHLNFAWGG--------QGSNSQVPQAHQKVVNQSRFVKSQELPQQN--------------KQQASLRALELQ

Query:  VGQLARELKARPQGNIPSDIEHPIREGKE--------------------KGAG--------------------GNNNDAGASGSVPDVEPPYVPPPPYNP
        +G LA ELKARPQG++PSD E+P R+GKE                    KG+G                           A+G   + +          P
Subjt:  VGQLARELKARPQGNIPSDIEHPIREGKE--------------------KGAG--------------------GNNNDAGASGSVPDVEPPYVPPPPYNP

Query:  PLPFPQRQKPKNQD-----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRA
        PLPFPQR + + QD                                                                K KDPGS TIP+SIGG      
Subjt:  PLPFPQRQKPKNQD-----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRA

Query:  HCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVH
                        R LGIGEARPTTVTL+LADRS+ +P GKIEDVLVQVD FIFP DFIILDYE D+ VPIIL RPFLATG  LIDV+K ELTM+  
Subjt:  HCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVH

Query:  DQKVKFNMFDAMKYPNDIEDCSFIQVLD
        D++  F +F  ++ P+ I +C  I  +D
Subjt:  DQKVKFNMFDAMKYPNDIEDCSFIQVLD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.0e-0353.7Show/hide
Query:  QAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV
        Q  +PI++ +DR RAIR Y  P+ +ELNP IV P I+A  FE+K VMF ML+ V
Subjt:  QAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.1e-10241.31Show/hide
Query:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG
        +D A+AWLNS  P SI TWN+L EKFL KYFPPTR+ K+R+EIM FRQ E+E  SEAWERFK+LLRKC HH +P+C+Q+E FY+ L+   + ++DA+A G
Subjt:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG

Query:  ALLAKTFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVEVANALK---NVTVVSHVQQPPVVNQVTEESCVYCGEEHNYEFCPNNP
        A  AKT+NE Y+ILERIS  + +WS+ R   +K    + E+D +S++ A IV + N +K   N+    +  Q  + N    ESCV+CGE H+YEFCP NP
Subjt:  ALLAKTFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVEVANALK---NVTVVSHVQQPPVVNQVTEESCVYCGEEHNYEFCPNNP

Query:  ASVFFAGNQ-RNNLYSNFYNPGW----CNHLNFAWGGQGSNSQVPQAHQKVV-----------NQSRFVKSQELPQQNKQQASLRALELQVGQLARELKA
         SV + GN  RN+ +S  +   +     +  N   G    ++++P+ H K V           N  +   S+ + Q+N   +          +       
Subjt:  ASVFFAGNQ-RNNLYSNFYNPGW----CNHLNFAWGGQGSNSQVPQAHQKVV-----------NQSRFVKSQELPQQNKQQASLRALELQVGQLARELKA

Query:  RPQGNIPSDIEHPIREGKEKGAGGNNNDAGASGSVPDVEPPYVPPPPYN-PPLPFPQRQKPKNQD-----------------------------------
          +  IP  +                + A +   +P  +     P   +   LPFP R K KN D                                   
Subjt:  RPQGNIPSDIEHPIREGKEKGAGGNNNDAGASGSVPDVEPPYVPPPPYN-PPLPFPQRQKPKNQD-----------------------------------

Query:  ---------EWATT----------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDV
                 E  T                 K KDPGS TI  SIGGKE+G A CDLGASINLMPLS++ +LGIG+ R T VTL+LADRS+ YP  KIED+
Subjt:  ---------EWATT----------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDV

Query:  LVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQVLDE
        LV+VD FIFP DF++LDYE +KNVPIILGRPFLATG  LIDVQK ELTM+V+DQ+V FN+F  +K+  +IEDCS I  + E
Subjt:  LVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQVLDE

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129456.1e-9538.49Show/hide
Query:  AFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASA
        + +D AK+WLNS    SI+TW +L +KFL+K+FPP ++ K+R++I  F Q + E+  EAWERFKELLR+CPHH +P  +Q++ FYNGL    + ++DA+A
Subjt:  AFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASA

Query:  VGALLAKTFNEAYEILERISTYSCQWSDVRGSNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYEFCPNNPAS
         GAL++K   +AY +LE +++ + QW   R  ++K     E+D + T+   +  ++  L  + V  H  Q  +V       C  CG+ H+Y+ CP N  S
Subjt:  VGALLAKTFNEAYEILERISTYSCQWSDVRGSNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYEFCPNNPAS

Query:  VFFAGN---QRNNLYSNFYNPGWCNHLNFAWGGQGSNS-----QVPQAHQKVVNQSRFVKSQ--ELPQQ---------NKQQASLRALELQVGQLARELK
        V F GN   Q+NN YSN YNPGW NH NF+W      S       P   Q+   Q    KSQ  EL  Q           Q ASLR LE QVGQLA  + 
Subjt:  VFFAGN---QRNNLYSNFYNPGWCNHLNFAWGGQGSNS-----QVPQAHQKVVNQSRFVKSQ--ELPQQ---------NKQQASLRALELQVGQLARELK

Query:  ARPQGNIPSDIE-HP----------IREGKE----------------KGAGGNNNDAGASGSVPDVEPPYVPPPPYNPPLPFPQR---------------
         RPQG++PSD + +P          +R GKE                   G   N+        D           +PP PFPQR               
Subjt:  ARPQGNIPSDIE-HP----------IREGKE----------------KGAGGNNNDAGASGSVPDVEPPYVPPPPYNPPLPFPQR---------------

Query:  -----------------------------QKPKNQDEWATT----------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGI
                                      K +   E+ T                 K KDPGS TIP +IG     +A  DLGASINLMP S++ KLG+
Subjt:  -----------------------------QKPKNQDEWATT----------------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGI

Query:  GEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDC
        GE +PT+VTL+LADRS  YP G IEDVLV+VD FIFPVDF+ILD E D+ +PIILGRPFLAT   +IDV++ +++ KV ++ V+FN+F+A K+P+    C
Subjt:  GEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDC

Query:  SFIQVLDE
          ++++DE
Subjt:  SFIQVLDE

A0A6J1CPJ3 uncharacterized protein LOC1110129471.0e-9438.27Show/hide
Query:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL
        A AWLN+F   +I T +++V+KFL KYFPPTR+  +R EI+ FRQ E E  + AWERFK+L+R CP+  +P C+Q+E F+   +I T  M++ +A G   
Subjt:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL

Query:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC
        +K+FNE  EIL+++S ++ QW   +   +  ++    VL +D +++++  I  +   LKN+        +      P  V Q+ E +C            
Subjt:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC

Query:  PNNPASVFFAGNQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQVPQAHQ--KVVNQSRFVKSQELP----------------QQN----------------
                    Q+ N YSN YNPGW  H NF+W GQGS+S   Q  Q  +     RF  S   P                QQN                
Subjt:  PNNPASVFFAGNQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQVPQAHQ--KVVNQSRFVKSQELP----------------QQN----------------

Query:  -------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGAGGNNNDAGASGSVPD--VEPPY---VPPPPY
                                 +   ++R LE+Q+GQLA E++ RPQG++PS  E P R      +   +        VPD  VEP     V P   
Subjt:  -------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGAGGNNNDAGASGSVPD--VEPPY---VPPPPY

Query:  N--PPLPFPQRQKPKNQD--------------------------------------------EWATT----------------KAKDPGSSTIPVSIGGK
        N   P PFPQR   KNQD                                            E+ T                 K KDPGS TI   IGGK
Subjt:  N--PPLPFPQRQKPKNQD--------------------------------------------EWATT----------------KAKDPGSSTIPVSIGGK

Query:  ELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEEL
        ++GRA CDLGA INLMPLS+++KL IG+A PTTVTL LADRSIT P GKIEDVLV+VD FIFP DFIILD E DK+VPIILGRPFLATG  LIDV+K EL
Subjt:  ELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEEL

Query:  TMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELMEYHTQKFGEIQIEDLE
        TM+V DQKV FNM DAMKYP+D E+C  I +        LD+ +    E EL E   +K G I I  L+
Subjt:  TMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELMEYHTQKFGEIQIEDLE

A0A6J1DZC3 uncharacterized protein LOC1110244494.5e-9038.97Show/hide
Query:  LVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALLAKTFNEAYEILERISTYS
        +V+KFL+KYFPPT++  +R EI+ FRQ E ET +E WERFKEL+R CP+  +P C+Q+E FY   +I T+ M++ +A G   +KT+NE   IL++++ ++
Subjt:  LVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALLAKTFNEAYEILERISTYS

Query:  CQWSDVRGSNKKV----KSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVN-------QVTEESCVYCGEEHNYEFCPNNPASVFFAGNQRNNLY
          W   R   +        V  +D +S++++ +  +   +KN+   +    P   N        + E  C YCG+ HN E CP+NP  + + G   +N  
Subjt:  CQWSDVRGSNKKV----KSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVN-------QVTEESCVYCGEEHNYEFCPNNPASVFFAGNQRNNLY

Query:  SNFYNPGWCNHLNFAWGGQGSNSQVPQAHQKVVNQSRFVK----------------------------SQELPQQN---------KQQASLRALELQVGQ
            N     +  +   G       PQ H +  N S+ V+                             QE   +N         +  A++R LE Q+GQ
Subjt:  SNFYNPGWCNHLNFAWGGQGSNSQVPQAHQKVVNQSRFVK----------------------------SQELPQQN---------KQQASLRALELQVGQ

Query:  LARELKARPQGNIPSDIEHPIREGKEKGAGGNNNDA-----------GASGSVPDVEPPYVPPPPYNPPLPFPQR------QKPKNQDEWATT-------
        LA ELK RP+G +PS  E P  EG+E                     G+S    + E    P  P    +P   +       + K   E+ T        
Subjt:  LARELKARPQGNIPSDIEHPIREGKEKGAGGNNNDA-----------GASGSVPDVEPPYVPPPPYNPPLPFPQR------QKPKNQDEWATT-------

Query:  ---------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVD
                 K KDPGS TIP SIGGK++GRA CDL ASINLMPLS+++KL IG+A PTTVTL+LADRSIT P GKIEDVLV+VD FIFP DFIIL+ E D
Subjt:  ---------KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVD

Query:  KNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME
        K+VPIILGRPFL+TG  LIDV+K ELTM V DQKV FNM DAMKYP+D+E+C+ I +        LD+ +    E +L E
Subjt:  KNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME

A0A6J1EQ90 uncharacterized protein LOC1114364112.1e-12745.63Show/hide
Query:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG
        +DGAK+WLN+ AP +I +WN L E FL KYFPPTR+ + ++EI+ F+Q E+ET SEA ERFKE+LRKCPHH LP+CIQME FYNGLNIVT+ +VDASA G
Subjt:  QDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVG

Query:  ALLAKTFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVEVANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYEFCP
        A+L+KT+NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  V N L+N+       + + V     +NQ   ESCVYCGEEH ++ CP
Subjt:  ALLAKTFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVEVANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYEFCP

Query:  NNPASVFFAGNQ------RNNLYSNFYNPGWCNHLNFAWGGQG-SNSQVPQAHQ-----KVVNQSRFVKSQELPQQNK----------------------
        +NPAS+F+ GNQ      +NN +SN YNPGW NH NF+W GQ   N Q+P         ++ NQ  +  SQ++  Q K                      
Subjt:  NNPASVFFAGNQ------RNNLYSNFYNPGWCNHLNFAWGGQG-SNSQVPQAHQ-----KVVNQSRFVKSQELPQQNK----------------------

Query:  --------QQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGA--GGNNNDAGASGSVPDVEPPYVPPP---PYNPPLPFPQRQKPKNQD--
                QQASLR LE+Q+G      K   QG+  S      ++  E+ A    ++ D       P ++           Y P  PFPQR K K ++  
Subjt:  --------QQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGA--GGNNNDAGASGSVPDVEPPYVPPP---PYNPPLPFPQRQKPKNQD--

Query:  ----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSV
                                                                       K KDPGS TIPVSIGGKELGRA CDLGA+INLMPLS+
Subjt:  ----------------------------------------------------------EWATTKAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYP
        Y+KLGIGEARPTTVTL+LADRSITYP GKIED+L+QVD FIF  DFIILDYEVD +VPIILGRPFL  G  L+DV K  +T+++  QKV+FN+ D+MKYP
Subjt:  YRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYP

Query:  NDIEDCSFIQVL------DEFVEDHFEKE
          IE+CS +  L      +E+ ++  E+E
Subjt:  NDIEDCSFIQVL------DEFVEDHFEKE

A0A6J1EQ90 uncharacterized protein LOC1114364111.8e-0644.74Show/hide
Query:  LGYGMPANHQNPLLEQNGQQNNQAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV
        LG  +    +NP +  N  Q     NPI +A+DR+RAIRAY  P ++ELNP I+ P I+ + FE+K VMF ML+ +
Subjt:  LGYGMPANHQNPLLEQNGQQNNQAKNPILVANDRDRAIRAYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMV

A0A6J1EQ90 uncharacterized protein LOC1114364117.4e-10939.35Show/hide
Query:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL
        A AWLN+F   +I+TW+++V+KFL KYFPPTR+  +R EI+ FRQ E E  + AWERFK+L+  CP+  +P C+Q+E F+ G +I+T+ M++ +A G   
Subjt:  AKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIVTQCMVDASAVGALL

Query:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC
        +K+FNE  EIL+++S ++ QW   +   +  ++    VL +D +++++  I  +   LKN+               P  V Q+ E +C YCG+ H  E C
Subjt:  AKTFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVEVANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYEFC

Query:  PNNPASVFFAG---NQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQV--PQAHQKVVNQSRFVKSQELP----------------QQN-------------
        P+NP+S+++ G    Q+ N YSN YNPGW  H NF+W GQGS++     Q +++      F  S   P                QQN             
Subjt:  PNNPASVFFAG---NQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQV--PQAHQKVVNQSRFVKSQELP----------------QQN-------------

Query:  ----------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----------KGAGGNNNDAGA--------
                                    +   ++R LE+Q+GQL  E++ RPQG++PS  E P R GKE           K  G    D  +        
Subjt:  ----------------------------KQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKE-----------KGAGGNNNDAGA--------

Query:  SGSVPD--VEPPY-VPPPPY----NPPLPFPQRQKPKNQD--------------------------------------------EWATT-----------
        + +VPD  VEP   VP  P      PP PFPQR   KNQD                                            E+ T            
Subjt:  SGSVPD--VEPPY-VPPPPY----NPPLPFPQRQKPKNQD--------------------------------------------EWATT-----------

Query:  -----KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVP
             K KDPGS TIP  IGGK++GRA CDLGASINLMPLS+++K  IG+A PTTVTL+LADRSIT P GKIEDVLV+VD FIFP DFIILD E DK+VP
Subjt:  -----KAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDVLVQVDTFIFPVDFIILDYEVDKNVP

Query:  IILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME
        IILGRPFLATG  LIDV+K ELTM+V DQKV FNM DAMKY +D+E+C+ I +        LD+ +    E EL E
Subjt:  IILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQV--------LDEFVEDHFEKELME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTAGGTGAGTTTTCAGTGTGGGTACCCATAGCGTTTGGAGGTTTAACCGCCGATTTGGAGCGAAAGTGGAACCCGAACAGCAAGTTTAAAGTTCTGTTTTGCTT
GATTAAAACCTTGGAGTTAAGGAATTTGTCGAAGGTTATCAAGTGTTATTTAGCTAAGGTGGTACTTGATGACATCAGGCGAGTGGCGAGAGCAGTGGCTTGGCTAGGCT
ATGGGATGCCTGCAAATCACCAAAATCCGCTGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGCCATTAGA
GCGTATGTTTTTCCAATTATTGACGAGTTAAATCCAGTGATAGTGTGTCCAGTTATTGAGGCATCAAATTTTGAAATGAAACTGGTCATGTTTCACATGTTAGAGATGGT
TGATGTGGACTTTTCCTACCGCAAGCGCACGGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACCCCTTGTATTTACGGTTATTTA
CAGACGCCGTTTACGTCTGTTGTTTCCACGCAGGGTCAGCGGGGCATTTTTTGTGCGCTGGATGTATCCCGATTTACACGGGATTTGGTGGGCGGTCTGTTTTGATCCTA
TTGTTCAGGCGTAGCATGTCGACAGTTAAACAGGTCACGATATTGTTTGGTGGTGTCTGGAGTGCTGAGGGTATTTATAGTGAATTCAAGTCCGGAAGTGTGGATGTTAG
AAGTGATTCGAATCTCCAAGAGTTTAAAGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGCCGGATGGAGGTCGTTGGCACAGTGGAGATGGTTCGTCGG
CGTGGTATGAGAGCCTTCAATTCCTCTGTTTCTGCCCTGTTTACGGCTCCAAGAAGTCCCAAAAATTCCCTCCCAAAAACCTAGCCGCAAAAGACCTATATTCTGTATTT
TTCGTCGTAGCGTCTCGACTCTGTCAAGACAGCGTCTCGACGCTGCCTTCAAAACGTGCGTTTCAAGATGGAGCAAAGGCGTGGTTGAATTCTTTTGCTCCAGCATCGAT
AAGTACGTGGAATGAGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTACCAAGTTAAGGAGTGAGATAATGGGATTTAGGCAAAATGAGGAAGAAA
CTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCACTGTTTGCCATATTGTATTCAAATGGAAATATTTTACAATGGGTTAAACATAGTA
ACCCAGTGTATGGTTGATGCTTCTGCGGTAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCTTATGAGATTTTAGAAAGAATATCAACCTACAGTTGTCAATGGTCAGA
TGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGCCGATATTGTAGAAGTAGCTAACGCTCTTAAAAATGTAACAGTGG
TTAGTCATGTCCAGCAGCCGCCAGTGGTAAACCAAGTTACAGAAGAATCATGTGTTTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAATAATCCAGCTTCTGTA
TTTTTTGCAGGTAATCAAAGGAATAACCTTTATTCTAACTTTTATAATCCAGGTTGGTGCAACCACCTCAACTTTGCGTGGGGAGGGCAAGGAAGCAATTCGCAAGTCCC
CCAAGCACATCAAAAGGTGGTGAACCAGTCCAGATTTGTTAAATCGCAGGAATTGCCCCAGCAAAATAAGCAGCAGGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTC
AGCTAGCTAGAGAGCTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGGAGAAGGGTGCTGGAGGCAACAATAATGATGCT
GGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACCCTATAACCCACCCTTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATG
GGCTACCACCAAGGCTAAGGATCCAGGATCATCCACTATTCCTGTCTCCATAGGTGGAAAGGAGTTGGGGAGAGCACATTGTGATTTAGGTGCAAGCATAAACCTTATGC
CTCTTTCGGTTTATCGAAAGTTAGGTATAGGTGAAGCTAGGCCTACAACAGTCACACTCCGATTAGCAGATAGGTCTATCACATATCCTGTAGGTAAAATTGAGGATGTT
CTGGTCCAGGTAGATACATTTATTTTTCCTGTCGATTTCATTATTCTAGATTATGAGGTAGATAAGAATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCAACTGGTAG
TTTATTGATAGATGTCCAAAAGGAGGAGCTTACAATGAAGGTGCATGACCAAAAGGTGAAGTTTAATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCT
CGTTCATTCAGGTGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGT
GGATTGGAGCATGAGCATAAATTTCAACCTGAGATTAGGAAATCCTTCCTTGATGAGTGGTTATTTACTGTAGCTCATATTAAGGCGGTGAAAACACCTTGGTATGATGA
CTTTTCCAATTACCTTGATTTTGGAATTTGCCTCTTGGTTTATCAAAAGACAGATGAAAGAATTTTTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTTAGGTGAGTTTTCAGTGTGGGTACCCATAGCGTTTGGAGGTTTAACCGCCGATTTGGAGCGAAAGTGGAACCCGAACAGCAAGTTTAAAGTTCTGTTTTGCTT
GATTAAAACCTTGGAGTTAAGGAATTTGTCGAAGGTTATCAAGTGTTATTTAGCTAAGGTGGTACTTGATGACATCAGGCGAGTGGCGAGAGCAGTGGCTTGGCTAGGCT
ATGGGATGCCTGCAAATCACCAAAATCCGCTGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGCCATTAGA
GCGTATGTTTTTCCAATTATTGACGAGTTAAATCCAGTGATAGTGTGTCCAGTTATTGAGGCATCAAATTTTGAAATGAAACTGGTCATGTTTCACATGTTAGAGATGGT
TGATGTGGACTTTTCCTACCGCAAGCGCACGGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACCCCTTGTATTTACGGTTATTTA
CAGACGCCGTTTACGTCTGTTGTTTCCACGCAGGGTCAGCGGGGCATTTTTTGTGCGCTGGATGTATCCCGATTTACACGGGATTTGGTGGGCGGTCTGTTTTGATCCTA
TTGTTCAGGCGTAGCATGTCGACAGTTAAACAGGTCACGATATTGTTTGGTGGTGTCTGGAGTGCTGAGGGTATTTATAGTGAATTCAAGTCCGGAAGTGTGGATGTTAG
AAGTGATTCGAATCTCCAAGAGTTTAAAGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGCCGGATGGAGGTCGTTGGCACAGTGGAGATGGTTCGTCGG
CGTGGTATGAGAGCCTTCAATTCCTCTGTTTCTGCCCTGTTTACGGCTCCAAGAAGTCCCAAAAATTCCCTCCCAAAAACCTAGCCGCAAAAGACCTATATTCTGTATTT
TTCGTCGTAGCGTCTCGACTCTGTCAAGACAGCGTCTCGACGCTGCCTTCAAAACGTGCGTTTCAAGATGGAGCAAAGGCGTGGTTGAATTCTTTTGCTCCAGCATCGAT
AAGTACGTGGAATGAGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTACCAAGTTAAGGAGTGAGATAATGGGATTTAGGCAAAATGAGGAAGAAA
CTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCACTGTTTGCCATATTGTATTCAAATGGAAATATTTTACAATGGGTTAAACATAGTA
ACCCAGTGTATGGTTGATGCTTCTGCGGTAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCTTATGAGATTTTAGAAAGAATATCAACCTACAGTTGTCAATGGTCAGA
TGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGCCGATATTGTAGAAGTAGCTAACGCTCTTAAAAATGTAACAGTGG
TTAGTCATGTCCAGCAGCCGCCAGTGGTAAACCAAGTTACAGAAGAATCATGTGTTTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAATAATCCAGCTTCTGTA
TTTTTTGCAGGTAATCAAAGGAATAACCTTTATTCTAACTTTTATAATCCAGGTTGGTGCAACCACCTCAACTTTGCGTGGGGAGGGCAAGGAAGCAATTCGCAAGTCCC
CCAAGCACATCAAAAGGTGGTGAACCAGTCCAGATTTGTTAAATCGCAGGAATTGCCCCAGCAAAATAAGCAGCAGGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTC
AGCTAGCTAGAGAGCTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGGAGAAGGGTGCTGGAGGCAACAATAATGATGCT
GGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCGCCCCCACCCTATAACCCACCCTTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATG
GGCTACCACCAAGGCTAAGGATCCAGGATCATCCACTATTCCTGTCTCCATAGGTGGAAAGGAGTTGGGGAGAGCACATTGTGATTTAGGTGCAAGCATAAACCTTATGC
CTCTTTCGGTTTATCGAAAGTTAGGTATAGGTGAAGCTAGGCCTACAACAGTCACACTCCGATTAGCAGATAGGTCTATCACATATCCTGTAGGTAAAATTGAGGATGTT
CTGGTCCAGGTAGATACATTTATTTTTCCTGTCGATTTCATTATTCTAGATTATGAGGTAGATAAGAATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCAACTGGTAG
TTTATTGATAGATGTCCAAAAGGAGGAGCTTACAATGAAGGTGCATGACCAAAAGGTGAAGTTTAATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCT
CGTTCATTCAGGTGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGT
GGATTGGAGCATGAGCATAAATTTCAACCTGAGATTAGGAAATCCTTCCTTGATGAGTGGTTATTTACTGTAGCTCATATTAAGGCGGTGAAAACACCTTGGTATGATGA
CTTTTCCAATTACCTTGATTTTGGAATTTGCCTCTTGGTTTATCAAAAGACAGATGAAAGAATTTTTCCATGA
Protein sequenceShow/hide protein sequence
MFLGEFSVWVPIAFGGLTADLERKWNPNSKFKVLFCLIKTLELRNLSKVIKCYLAKVVLDDIRRVARAVAWLGYGMPANHQNPLLEQNGQQNNQAKNPILVANDRDRAIR
AYVFPIIDELNPVIVCPVIEASNFEMKLVMFHMLEMVDVDFSYRKRTGMATDILGNCLLDYFGRFYPLYLRLFTDAVYVCCFHAGSAGHFLCAGCIPIYTGFGGRSVLIL
LFRRSMSTVKQVTILFGGVWSAEGIYSEFKSGSVDVRSDSNLQEFKGEKRPSMQEMVVDAPDGGRWHSGDGSSAWYESLQFLCFCPVYGSKKSQKFPPKNLAAKDLYSVF
FVVASRLCQDSVSTLPSKRAFQDGAKAWLNSFAPASISTWNELVEKFLSKYFPPTRSTKLRSEIMGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIV
TQCMVDASAVGALLAKTFNEAYEILERISTYSCQWSDVRGSNKKVKSVLEVDGVSTIRADIVEVANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYEFCPNNPASV
FFAGNQRNNLYSNFYNPGWCNHLNFAWGGQGSNSQVPQAHQKVVNQSRFVKSQELPQQNKQQASLRALELQVGQLARELKARPQGNIPSDIEHPIREGKEKGAGGNNNDA
GASGSVPDVEPPYVPPPPYNPPLPFPQRQKPKNQDEWATTKAKDPGSSTIPVSIGGKELGRAHCDLGASINLMPLSVYRKLGIGEARPTTVTLRLADRSITYPVGKIEDV
LVQVDTFIFPVDFIILDYEVDKNVPIILGRPFLATGSLLIDVQKEELTMKVHDQKVKFNMFDAMKYPNDIEDCSFIQVLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIG
GLEHEHKFQPEIRKSFLDEWLFTVAHIKAVKTPWYDDFSNYLDFGICLLVYQKTDERIFP