; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039176 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039176
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:37969468..37978857
RNA-Seq ExpressionLag0039176
SyntenyLag0039176
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]4.9e-17648.57Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        +E  P+ ++D+++PV+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  FLEIC T+K+NGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+++RRCPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ +YQWPTER    K AG++EL+  ++L AQ+A+L++ ++ LT+ ++ +S   +A         +   E+VQY+ NR+Y      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT
        LRNHEN SY NTKNVLQP  PPGF S  + EKK +LE+ +  F++E          R+ N+  +T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+T
Subjt:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT

Query:  EPNPKEQCKMVVLRSGRRLEDSLEKKKEEE-------KRRDEDKGAEAQKASSERFQHPPN---------------------------------SIELKC
        E NPKEQCK + LRSG+ +E +  K+ +         + +D+ +  E    + E    PP                                   I  K 
Subjt:  EPNPKEQCKMVVLRSGRRLEDSLEKKKEEE-------KRRDEDKGAEAQKASSERFQHPPN---------------------------------SIELKC

Query:  DFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD
          +   A+AL  MPNY KF+K+++SKK+ L++ E   LSE  S I+  ++P K KDPGSFT+PCTIG+  FDR LCDLGASINLMP+SV RK+GL  M  
Subjt:  DFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD

Query:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED++VP+ILGRPFLATG+A I V  G+LTL ++ E+V+F+I+   +      TCF V
Subjt:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.1e-17549.28Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        +E  P+ ++D+++PV+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  FLEIC T+K+NGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+++RRCPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ +YQWPTER    K AG+++L+  ++L AQ+A+L++ ++ LT+ ++ +S   LA         +   E+VQY+ NR+Y      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT
        LRNHEN SY NTKNVLQP  PPGF S  + E+K +LE+ +  F++E          R+ N+  +T  +N  AA+KN+EVQIGQ+A+ +NA Q+G FPS+T
Subjt:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT

Query:  EPNPKEQCKMVVLRSGRRLEDS---------------LEKKKEEEKRRDEDKGAEAQKASSERF-QHPP------------------------NSIELKC
        E NPKEQCK + LRSG+ +E S                 K K EE     D   E   A +  F  +PP                          I  K 
Subjt:  EPNPKEQCKMVVLRSGRRLEDS---------------LEKKKEEEKRRDEDKGAEAQKASSERF-QHPP------------------------NSIELKC

Query:  DFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD
          +   A+AL  MPNY KF+K+++SKK+ L++ E   LSE  S I+  ++P K KDPGSFT+PCTIG+  FD+ LCDLGASINLMP SV RK+GL  M  
Subjt:  DFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD

Query:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED+EVP+ILGRPFLATG+A I V  G+LTL ++ E+V+F I+          TCF V
Subjt:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]3.3e-18050.14Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC TIKMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ +YQWPTER    K AG++EL+  ++L AQ+ASL++ ++ LT+ ++ +    +A         +   E+VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRILNV-------SLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ +  F++E +           +++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRILNV-------SLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP

Query:  KEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERF-QHPP------------------------NSIELKCDFSN
        KEQCK + LRSGR +E S  K+ E               EE+   ED   E     S  F  +PP                          I  K   + 
Subjt:  KEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERF-QHPP------------------------NSIELKCDFSN

Query:  SCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVT
          A+AL  MPNY KF+K+++SKK+ L++ E   LSE  S II  ++P K KDPGSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++
Subjt:  SCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVT

Query:  LQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED+EVP+ILGRPFLATG+A + V  G+LTL ++ E+V F+I+   +      TCF V
Subjt:  LQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]7.1e-17550.15Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC T+KMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ +YQWPTER    K AG++EL+  ++L AQ+ASL++ ++ L++ ++ +S   +A         +   E+VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ +  F++E          R+ N+  +T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE 
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKE---------QRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEP

Query:  NPKEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERFQHPPNSIELKCDF--SNSCAEALYN-MPNYGKFMKEML
        NPKEQCK + LRSGR +E S  K+ E               EE+   ED   E     S  F   P  +     +   NS   +    +    KF+K+++
Subjt:  NPKEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERFQHPPNSIELKCDF--SNSCAEALYN-MPNYGKFMKEML

Query:  SKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLV
        SKK+ L++ E   LSE  S II  ++P K KD GSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLADRSI +P G++EDVLV
Subjt:  SKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLV

Query:  KVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        KV+KFIFP +FVVLDM+ED+EVP+ILGRPFLA G+A + V  G+LTL ++ E+V F+I+   +   +  TCF V
Subjt:  KVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber]8.1e-17950.52Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI +  I A NFEL   LI M +   F G P +DP+ HL  FLEIC  +KMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG  DWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+  YQWPTER    K AG++EL+  ++L AQ+ASL++ ++ LT+ ++ +S+  +A         +   E VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRI-------LNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ +  F++E +           +++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRI-------LNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP

Query:  KEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKS
        KEQCK + LRSGR +E S  K+ E               EE+    D   E     S  F   P  +     +      AL  MPNY KF+K+++SKK+ 
Subjt:  KEQCKMVVLRSGRRLEDSLEKKKE---------------EEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKS

Query:  LKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKF
        L++ E   LSE  S I+  ++P K KDPGSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLA+RSI +P G++EDVLVKV+KF
Subjt:  LKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKF

Query:  IFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        IFP DFVVLDM+ED+EVP+ILGRPFLATG+A + V  G+LTL +  E+V+F+I+   + +    TCF V
Subjt:  IFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

TrEMBL top hitse value%identityAlignment
A0A1S3UKD4 uncharacterized protein LOC1067662678.3e-13743.36Show/hide
Query:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWL
        M  ++   K IRD+  P        IV  PIQA NFE+K  L+Q+ + N F G  SEDP+SHL +FL IC T+K NGV  DAI LRLFPFSL+DKAK+WL
Subjt:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWL

Query:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S+  GSISTW+++A  F+TK+FPP+K+ K+R EI +F Q D E LYEAWERYKE++R+CP H  P+WLQVQ FYNGL+P+ K +LD ++GGSF+ KT  
Subjt:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  EAKDLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVV---KSISTLAEG-------HSKKEGQDVEEVQYIG--NRSYT
        EA + LE MA  +     +R    +K G+ E++   ++ AQ   LT  +  LT    +    +++T + G       H   E Q   +   I    +   
Subjt:  EAKDLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVV---KSISTLAEG-------HSKKEGQDVEEVQYIG--NRSYT

Query:  QGVPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNN-----------LEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNA
        Q   NF + + R  +    +N      P P +   + P  + N           L    + F+ +        +T+  N +A+++N+E QIGQ++  ++ 
Subjt:  QGVPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNN-----------LEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNA

Query:  LQKGKFPSDTEPNPKEQCKMVVLRSGRRLEDSLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSL
           G FPSDT PNP+EQCK + LRS R LE     + E EK++  D   E  + S+E+       IE K         +L  MP+Y KFMK++LSKK+ L
Subjt:  LQKGKFPSDTEPNPKEQCKMVVLRSGRRLEDSLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSL

Query:  KK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFI
        ++ E   L+E  S II  ++P K KDP SF +PC IG +S  +ALCDLGASINLMP S+++++G+  +  T +TLQLADRS+T+P G+VEDVLVKV+KFI
Subjt:  KK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFI

Query:  FPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL
        FP DFVVLDM+ED +VPIILGRPFLATG+  I V  G L L + DEKV FSI            CF     E L  DD  VDY++
Subjt:  FPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL

A0A1U7Z951 uncharacterized protein LOC1045905682.8e-14044.48Show/hide
Query:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGS
        + + D+ +P L    S IV   I A NF++K  +IQM ++   F G   EDP++H+ +FLEIC T K NGV  D +RLRLFPFSL+DK K WL S+   S
Subjt:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGS

Query:  ISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLE
        ISTWDE+A  FL+K+FPP+K  K+R +I TF Q D E LYE+WERYKE+LR+ P HG P WLQVQ FYN L  + KT++D +AGGS  +KT   A  L+E
Subjt:  ISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLE

Query:  EMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKL-TSSKVVKSIST--LAEGHSKKE-------GQDVEEVQYIGNRSYTQG--VPNFY
        EM A +YQW +ER  V ++  L+ +D  ++L AQ+ +L+  + ++  +   V++++      GH   E        Q  ++V ++GN    QG   P  +
Subjt:  EMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKL-TSSKVVKSIST--LAEGHSKKE-------GQDVEEVQYIGNRSYTQG--VPNFY

Query:  HPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQC
        +P  RNH N S+ N +N ++P P   + + PE K+NLEE++  FI        S +T   N +A++KN+E Q+GQ+A  +++  +G  PS+TE NP+EQ 
Subjt:  HPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQC

Query:  KMVVLRSGRRLEDSLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSL-KKEVFNLSESSSTIISG
        + + LRSG+ L++ +EKK +EE                                     +AL  MP+Y KF+KE+L+ K  L       ++E  S I+  
Subjt:  KMVVLRSGRRLEDSLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSL-KKEVFNLSESSSTIISG

Query:  RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPI
        ++P K KDPGSFT+PCTIG +  ++ALCDLGA+INLM YSV++K+GL     T V LQL DRSI HP G++EDVLVKV+KFIFPVDF+VLDM+ED +VP+
Subjt:  RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPI

Query:  ILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSI
        ILGRPFLATGKA + V  G+L+L I DE+V+F +
Subjt:  ILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSI

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.5e-15746.08Show/hide
Query:  EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVE
        EA +A+RD++ P++   +  I    I A NFE+K   IQM + +  F G PS+DP+SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+ 
Subjt:  EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVE

Query:  TGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKD
         GSI+TW++LAQ FL KFFPPAKT K+R +I +F Q D E LYEAWER+KE+LRRCP HG PDWLQVQ FYNGL  S KT++D +AGG+ +SK   +A +
Subjt:  TGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKD

Query:  LLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSI---STLAEGHSKKE-GQDVEEVQYIGNRSYTQGVP--NFYHPS
        LLEEMA+ +YQWP+ER    K  G YE+D   +L  Q+A+L+  L+ L    V  S+       + HS  +   + E VQ++GN +  Q  P  N Y+P 
Subjt:  LLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSI---STLAEGHSKKE-GQDVEEVQYIGNRSYTQGVP--NFYHPS

Query:  LRNHENFSYANTKNVLQP----PPGF---ASTSTPEKKNNLEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP
         RNH NFS++N      P    PPGF   A    PEKK+ LEE++  +I +   +       + +  A+L+N+E Q+GQ+A+++N   +G  PSDT+ NP
Subjt:  LRNHENFSYANTKNVLQP----PPGF---ASTSTPEKKNNLEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNP

Query:  --KEQCKMVVLRSGRRLEDSLEKKKEEE----------------KRRDEDKGAEAQKASSERFQHPPN-------------------SIELKCDFSNSCA
          KEQC+ + LRSG+ +E   +K  E E                +++D+DK   A+   + +  HPP                    ++  K   +   A
Subjt:  --KEQCKMVVLRSGRRLEDSLEKKKEEE----------------KRRDEDKGAEAQKASSERFQHPPN-------------------SIELKCDFSNSCA

Query:  EALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQL
        EAL  MP+Y KF+K++LSKK+ L + E   L+E  S I+  ++P K KDPGSFT+PCTIG + F +AL DLGASINLMP+S++ K+GL     T VTLQL
Subjt:  EALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQL

Query:  ADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC
        ADRS  +P G++EDVLVKV+KFIFPVDF++LDM+ED+++PIILGRPFLAT  A I V  GK++  + +E V F+IF   +   S + C
Subjt:  ADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC

A0A6J1DU19 uncharacterized protein LOC1110243611.3e-13444.34Show/hide
Query:  IRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGSIST
        IRD+ QP  P  + GI+  PI A N ELK GLIQM R+N+F+G+ +EDP++HL  FL++CGT+KMNGV  DAIRLRLFP SLQDK               
Subjt:  IRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGSIST

Query:  WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMA
          E+ QAFLT FFPPAKTT+LRTEI +FR+ D EQL+E WERYKE+LR+CPQHG  +WLQ+Q+FYNGLN  T+T+LD +AGG+ LS+T   A  LL++MA
Subjt:  WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMA

Query:  ATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYTQGVPNFYHPSLRNHENFSYANT
          S+QWP+ER    K AG+YE+DE SSLKAQ+ +LTNA++KL+      S   +A                           ++Y P++   +  S+   
Subjt:  ATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYTQGVPNFYHPSLRNHENFSYANT

Query:  KNVLQPPPGFASTSTPEKKNNLEEMVALFIKE-----QRILN--VSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQCKMVVLRSG
                        EKK++LE+++  FI E      RI N    ++  +  +  ++KNMEVQIGQIA  +N +QKGKFPSD E  P+E CK V LRSG
Subjt:  KNVLQPPPGFASTSTPEKKNNLEEMVALFIKE-----QRILN--VSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQCKMVVLRSG

Query:  RRLEDSLEKKKEE-----EKRRDEDK-------GAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSST
        + L++  +KK EE     E+R ++++         +A K +S     PPNS+            AL  MPNY +FMK++++ K+ L+  E  NL+E  S 
Subjt:  RRLEDSLEKKKEE-----EKRRDEDK-------GAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSLKK-EVFNLSESSST

Query:  IISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDK
        I+  ++P K KDPGSFT+PCTI   SF++ALCD+ ASINLM                             P+GV+EDVLVKV++ IFP DFVVL  +ED 
Subjt:  IISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDK

Query:  EVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        E+PIILGR FLATG A I V  G LTL +++E VVF I    +    + TC  +
Subjt:  EVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

A0A6P6XAQ1 Reverse transcriptase2.0e-14343.04Show/hide
Query:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWL
        MA  E   + +RDF  P      + IV   + A NFE+K  LIQM + + + G+ +EDP+SHL +FLEIC TIK NGV  DAI+LRLFPFSL+DKAK WL
Subjt:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWL

Query:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S    + +TWDELA+AFL KFFPP KT KLR +I +F Q + E LYEAWERY+E+ RRCP HG PDWL VQ FYNGL   TKT +D +AGG+ + KT  
Subjt:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  EAKDLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSK-----VVKSISTLAEGHSKKEGQDVEEVQYIGNRS---YTQGVP
        EA+ L+EEMAA +YQW  ERG   + AG+ E+D  + L A++ ++   LN+   S      VV S +     H        E+VQY+ N +         
Subjt:  EAKDLLEEMAATSYQWPTERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSK-----VVKSISTLAEGHSKKEGQDVEEVQYIGNRS---YTQGVP

Query:  NFYHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMV------ALFIKEQRILNVSLQT------SVNNHDAALKNMEVQIGQIASAVNAL
        N Y+P  RNH NF + +  N  +P  PPGF    T  +     E+       A   K +++ + + Q        ++      +N+EVQ+GQIA+AVN  
Subjt:  NFYHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMV------ALFIKEQRILNVSLQT------SVNNHDAALKNMEVQIGQIASAVNAL

Query:  QKGKFPSDTEPNPKEQCKMVVLRSGRRLED------SLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLS
         +G  PS TE NP+E  K + LRSG+ L +        E +K E K+  E K    ++   E+ +   N ++++             +P+Y KF+KE+++
Subjt:  QKGKFPSDTEPNPKEQCKMVVLRSGRRLED------SLEKKKEEEKRRDEDKGAEAQKASSERFQHPPNSIELKCDFSNSCAEALYNMPNYGKFMKEMLS

Query:  KKKSL-KKEVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVK
        KK+ L   E   L+E  S II  ++P K KDPGSFTVPCTIG V F +ALCDLGAS++L+P +V R++GL  +  T+++LQLADRSI HPMG++E+VL+K
Subjt:  KKKSL-KKEVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVK

Query:  VNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQD------ESVCSLHTCFSVGPEY----LTDDDEEVDYNLG
        V KFI PVDFVVLDM+ED  VPIILGRPFLAT    I V  GK    I +E+V F +   +      + V S+  C  +  E     L +D  E+  N G
Subjt:  VNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQD------ESVCSLHTCFSVGPEY----LTDDDEEVDYNLG

Query:  LGLGEMLVDNM
        +G+ E  ++ M
Subjt:  LGLGEMLVDNM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCACCAAGAAGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAAAATTCTGGAATCGTTTACGCCCCCATCCAAGCTACAAATTTTGA
GCTAAAGACAGGGTTGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGATCCTCACTCTCATCTGCGATCATTCCTAGAAATATGTGGGACAATAA
AGATGAACGGAGTTCCGGCCGACGCAATCAGATTGAGGCTATTTCCATTTTCTCTTCAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACT
TGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGCTAAGACTACCAAGCTCCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATA
CGAAGCGTGGGAAAGATATAAAGAAATGCTTAGGCGGTGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGTTATTCTATAATGGATTAAACCCCTCCACAAAGA
CAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAGGACCTGCTTGAGGAAATGGCGGCGACAAGTTATCAATGGCCGACGGAGAGA
GGAACAGTTACAAAAAAGGCTGGATTATATGAATTAGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGACCAATGCTCTAAACAAATTGACGTCATCTAAGGT
GGTTAAGTCCATTTCCACCTTAGCTGAAGGTCATTCGAAGAAAGAAGGTCAAGATGTGGAAGAAGTTCAATACATAGGAAATAGATCATATACTCAAGGAGTACCGAACT
TCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATGCAAATACGAAGAATGTTTTGCAGCCACCCCCAGGTTTTGCATCAACGAGTACTCCTGAGAAGAAAAAT
AATCTGGAGGAGATGGTGGCTTTATTCATCAAAGAACAGAGAATACTGAATGTGAGTCTCCAGACATCAGTGAACAACCACGACGCAGCTCTAAAGAATATGGAAGTGCA
GATAGGTCAGATTGCTTCAGCAGTAAATGCCCTGCAGAAGGGAAAATTTCCAAGTGATACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGTTCTAAGAAGTGGCA
GAAGACTGGAGGACAGTTTAGAGAAGAAAAAGGAAGAAGAAAAGAGAAGGGATGAAGATAAAGGGGCTGAGGCACAAAAAGCCTCCTCTGAAAGGTTCCAACATCCTCCT
AACTCTATTGAATTAAAATGTGATTTTTCTAACAGCTGTGCAGAAGCTTTGTATAATATGCCGAATTATGGAAAATTCATGAAAGAAATGCTTTCTAAAAAGAAAAGTCT
GAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATCATTTCTGGTAGGATACCCTCTAAGCAGAAAGACCCAGGGAGTTTTACTGTTCCCTGCACCATAGGAG
AAGTATCCTTCGATAGGGCCTTGTGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGATTGGTTTATCAGGTATGACAGACACTGACGTCACT
CTCCAGCTTGCCGACAGATCGATTACCCACCCGATGGGTGTCGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCATCTTTCCTGTAGATTTCGTGGTACTGGACATGAA
GGAGGACAAAGAAGTGCCAATTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAGCTTACCTTGAACATTGATGATGAAAAAG
TCGTATTTAGTATTTTTGGCCAAGATGAATCTGTTTGTAGCTTGCATACATGTTTTTCTGTTGGGCCTGAATACTTGACTGATGACGATGAAGAGGTAGACTATAATCTT
GGCCTAGGCTTAGGAGAAATGCTTGTTGATAATATGGATTTTGATCATGATGAATATATGGATAATCCTCTGTTTGAAAATGATTTGGATCTGCCTGACTTCGAAAATGA
ATTAGACTTGCCTGCTTGTGATAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGAATTGCCTGAACTGAATAATTTTAATGATGATATTGAAT
TGCCTGACAGTGAGCATGAACGTAAAAGGCAGAAAAAAGAGTGCTCGATAGATAATTTTGAGTCTGATCGTGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCT
GAATGCATGAATCCTGACAATGTGAATACTTTCTTTTCATGTCCTGATGAGGTGTATAGAATAGAATCTGACCCAGAAGAACTTGAATCTGTGCATAGTATAGAATCTGA
TCCTGAAATTCTTGAATTCTTTAATTCTTCTGATGATGAGTCATATGATAGCTTAGTTGAATCATTTTTTAATTCTGAGTCTGAAGGATCTAATGAGCATGTAGATATTT
TTTCTGATGAATGGACTGGCATGATTGATAGGCCATCCATAGATCCTAGACCAGTAGACATTATAACGCTTGATGACTCCGTTAACAATTGTTTTGAAAATAGAGATTTA
GAAAAGAGATTTGATGCGGCTGATTATTTTCTGAAATGTTATTATGAGAATCTCTTTAGTGAAGATGGGCCAGGGGAAAGTTTTTTTTCTCCCCTGTTTGTAGTATGCAA
TTCGGTGCGTGATTTCTTGCAGGGCCCGGAGGATGATGAAAATTTTTATTTTGAGCCTCCGATCACCTTAGCAAACCCCAAAATTCCACATTTCCCATTTTTCATGGATC
CAAAGGAAGAGGATGAAGCGCTCCTAGCTCAGGGAAGTCTCACGTCTCGACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGC
AGCGTCGCGACGCTTGGCAAGGCAGCGTCGCGACGCTACCCCTTTTCCAGGCCTATAAATAGGCGCCCTTGGAACCCAGCAACTTCCAATCCCTATCACCATAGACTCCC
ACAAGTCGTTCAAAGGCCGGAGGATAGCAGGGAAGACACTAGAGGTGGTCCACACACTGTTCGTGATCAAAGGAATCAAAAGAAGTTGCTGAATCGGTTTCCAAAAGATC
TAAGGATAAGCCCGAATAAACTGGAGTTCATAGTTAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCGCATTGAAATAGACGCCCCCACTCGCATGTCTCTACATGAA
CGATTCAGGATCACATCGTTTGTACCATTTGTACAAAGTGGGCCGCGTCTTATAGTGTTTCCTAGGATTAAGCTATCTATTCGATCTTATCCCCAAAATGGTAGGCTTGT
CGAGTCGGCGATGCTGGCCACTCTCACCCATGCAGATCAAAGGATAATCTCGCATAAACAGGAGTTCATAGATAGCTCAGGATTAAGAACGAGTTACCTAGGCAGAATGT
GTGCAACTGAGGCAGCATCGAGACGCTACCTCCACAGCATCTCGACGTTGCGGCACTTTCCAATTGGGCTAGGGCGTTCAACGCGACAGCGTCGAGACGTTGTCCTTAAT
GTCGCGACGCTGTCGCGGCCGATTGTCCTGTATAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCACCAAGAAGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAAAATTCTGGAATCGTTTACGCCCCCATCCAAGCTACAAATTTTGA
GCTAAAGACAGGGTTGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGATCCTCACTCTCATCTGCGATCATTCCTAGAAATATGTGGGACAATAA
AGATGAACGGAGTTCCGGCCGACGCAATCAGATTGAGGCTATTTCCATTTTCTCTTCAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACT
TGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGCTAAGACTACCAAGCTCCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATA
CGAAGCGTGGGAAAGATATAAAGAAATGCTTAGGCGGTGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGTTATTCTATAATGGATTAAACCCCTCCACAAAGA
CAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAGGACCTGCTTGAGGAAATGGCGGCGACAAGTTATCAATGGCCGACGGAGAGA
GGAACAGTTACAAAAAAGGCTGGATTATATGAATTAGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGACCAATGCTCTAAACAAATTGACGTCATCTAAGGT
GGTTAAGTCCATTTCCACCTTAGCTGAAGGTCATTCGAAGAAAGAAGGTCAAGATGTGGAAGAAGTTCAATACATAGGAAATAGATCATATACTCAAGGAGTACCGAACT
TCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATGCAAATACGAAGAATGTTTTGCAGCCACCCCCAGGTTTTGCATCAACGAGTACTCCTGAGAAGAAAAAT
AATCTGGAGGAGATGGTGGCTTTATTCATCAAAGAACAGAGAATACTGAATGTGAGTCTCCAGACATCAGTGAACAACCACGACGCAGCTCTAAAGAATATGGAAGTGCA
GATAGGTCAGATTGCTTCAGCAGTAAATGCCCTGCAGAAGGGAAAATTTCCAAGTGATACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGTTCTAAGAAGTGGCA
GAAGACTGGAGGACAGTTTAGAGAAGAAAAAGGAAGAAGAAAAGAGAAGGGATGAAGATAAAGGGGCTGAGGCACAAAAAGCCTCCTCTGAAAGGTTCCAACATCCTCCT
AACTCTATTGAATTAAAATGTGATTTTTCTAACAGCTGTGCAGAAGCTTTGTATAATATGCCGAATTATGGAAAATTCATGAAAGAAATGCTTTCTAAAAAGAAAAGTCT
GAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATCATTTCTGGTAGGATACCCTCTAAGCAGAAAGACCCAGGGAGTTTTACTGTTCCCTGCACCATAGGAG
AAGTATCCTTCGATAGGGCCTTGTGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGATTGGTTTATCAGGTATGACAGACACTGACGTCACT
CTCCAGCTTGCCGACAGATCGATTACCCACCCGATGGGTGTCGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCATCTTTCCTGTAGATTTCGTGGTACTGGACATGAA
GGAGGACAAAGAAGTGCCAATTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAGCTTACCTTGAACATTGATGATGAAAAAG
TCGTATTTAGTATTTTTGGCCAAGATGAATCTGTTTGTAGCTTGCATACATGTTTTTCTGTTGGGCCTGAATACTTGACTGATGACGATGAAGAGGTAGACTATAATCTT
GGCCTAGGCTTAGGAGAAATGCTTGTTGATAATATGGATTTTGATCATGATGAATATATGGATAATCCTCTGTTTGAAAATGATTTGGATCTGCCTGACTTCGAAAATGA
ATTAGACTTGCCTGCTTGTGATAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGAATTGCCTGAACTGAATAATTTTAATGATGATATTGAAT
TGCCTGACAGTGAGCATGAACGTAAAAGGCAGAAAAAAGAGTGCTCGATAGATAATTTTGAGTCTGATCGTGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCT
GAATGCATGAATCCTGACAATGTGAATACTTTCTTTTCATGTCCTGATGAGGTGTATAGAATAGAATCTGACCCAGAAGAACTTGAATCTGTGCATAGTATAGAATCTGA
TCCTGAAATTCTTGAATTCTTTAATTCTTCTGATGATGAGTCATATGATAGCTTAGTTGAATCATTTTTTAATTCTGAGTCTGAAGGATCTAATGAGCATGTAGATATTT
TTTCTGATGAATGGACTGGCATGATTGATAGGCCATCCATAGATCCTAGACCAGTAGACATTATAACGCTTGATGACTCCGTTAACAATTGTTTTGAAAATAGAGATTTA
GAAAAGAGATTTGATGCGGCTGATTATTTTCTGAAATGTTATTATGAGAATCTCTTTAGTGAAGATGGGCCAGGGGAAAGTTTTTTTTCTCCCCTGTTTGTAGTATGCAA
TTCGGTGCGTGATTTCTTGCAGGGCCCGGAGGATGATGAAAATTTTTATTTTGAGCCTCCGATCACCTTAGCAAACCCCAAAATTCCACATTTCCCATTTTTCATGGATC
CAAAGGAAGAGGATGAAGCGCTCCTAGCTCAGGGAAGTCTCACGTCTCGACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGC
AGCGTCGCGACGCTTGGCAAGGCAGCGTCGCGACGCTACCCCTTTTCCAGGCCTATAAATAGGCGCCCTTGGAACCCAGCAACTTCCAATCCCTATCACCATAGACTCCC
ACAAGTCGTTCAAAGGCCGGAGGATAGCAGGGAAGACACTAGAGGTGGTCCACACACTGTTCGTGATCAAAGGAATCAAAAGAAGTTGCTGAATCGGTTTCCAAAAGATC
TAAGGATAAGCCCGAATAAACTGGAGTTCATAGTTAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCGCATTGAAATAGACGCCCCCACTCGCATGTCTCTACATGAA
CGATTCAGGATCACATCGTTTGTACCATTTGTACAAAGTGGGCCGCGTCTTATAGTGTTTCCTAGGATTAAGCTATCTATTCGATCTTATCCCCAAAATGGTAGGCTTGT
CGAGTCGGCGATGCTGGCCACTCTCACCCATGCAGATCAAAGGATAATCTCGCATAAACAGGAGTTCATAGATAGCTCAGGATTAAGAACGAGTTACCTAGGCAGAATGT
GTGCAACTGAGGCAGCATCGAGACGCTACCTCCACAGCATCTCGACGTTGCGGCACTTTCCAATTGGGCTAGGGCGTTCAACGCGACAGCGTCGAGACGTTGTCCTTAAT
GTCGCGACGCTGTCGCGGCCGATTGTCCTGTATAAATAG
Protein sequenceShow/hide protein sequence
MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTIKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGSIST
WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTER
GTVTKKAGLYELDESSSLKAQLASLTNALNKLTSSKVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYTQGVPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKN
NLEEMVALFIKEQRILNVSLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQCKMVVLRSGRRLEDSLEKKKEEEKRRDEDKGAEAQKASSERFQHPP
NSIELKCDFSNSCAEALYNMPNYGKFMKEMLSKKKSLKKEVFNLSESSSTIISGRIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVT
LQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVGPEYLTDDDEEVDYNL
GLGLGEMLVDNMDFDHDEYMDNPLFENDLDLPDFENELDLPACDNERSAVDDLPSFENELELPELNNFNDDIELPDSEHERKRQKKECSIDNFESDRDYSESIESDLDIP
ECMNPDNVNTFFSCPDEVYRIESDPEELESVHSIESDPEILEFFNSSDDESYDSLVESFFNSESEGSNEHVDIFSDEWTGMIDRPSIDPRPVDIITLDDSVNNCFENRDL
EKRFDAADYFLKCYYENLFSEDGPGESFFSPLFVVCNSVRDFLQGPEDDENFYFEPPITLANPKIPHFPFFMDPKEEDEALLAQGSLTSRRLARQRRDAALSCCPSRVRS
SVATLGKAASRRYPFSRPINRRPWNPATSNPYHHRLPQVVQRPEDSREDTRGGPHTVRDQRNQKKLLNRFPKDLRISPNKLEFIVSSGLRSSYLGHRIEIDAPTRMSLHE
RFRITSFVPFVQSGPRLIVFPRIKLSIRSYPQNGRLVESAMLATLTHADQRIISHKQEFIDSSGLRTSYLGRMCATEAASRRYLHSISTLRHFPIGLGRSTRQRRDVVLN
VATLSRPIVLYK