; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008487 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008487
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:23271339..23274821
RNA-Seq ExpressionLag0008487
SyntenyLag0008487
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.8e-15045.16Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        +E  P+ ++D+++PV+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  FLEIC TVK+NGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+++RRCPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLT------SSEVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYT---QGVPNF
         LLEEMA+ N                      ++L AQ+A+L++ ++ LT      S+E V S S +   +   +    E+VQY+ NR+Y      +PN+
Subjt:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLT------SSEVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYT---QGVPNF

Query:  YHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMMALFIKE--QRIWNV-----NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPS
        YHP LRNHEN SY NTKNVLQP  PPGF S  + EKK +LE+ M  F++E   R         N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS
Subjt:  YHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMMALFIKE--QRIWNV-----NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPS

Query:  DTEPNPKEQCKMVVLRSGRRLE-----------------NSLEKKKEEE---------------------------------------------------
        +TE NPKEQCK + LRSG+ +E                  S ++ +EEE                                                   
Subjt:  DTEPNPKEQCKMVVLRSGRRLE-----------------NSLEKKKEEE---------------------------------------------------

Query:  ----------------------------KRRDEDEGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLS
                                    KRR E+   E  K S E       ++P K KDPGSFT+PCTIG+  FDR LCDLGASINLMP+SV RK+GL 
Subjt:  ----------------------------KRRDEDEGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLS

Query:  GMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCF
         M  T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED++VP+ILGRPFLATG+A I V  G+LTL ++ E+V+F+I+   +      TCF
Subjt:  GMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCF

Query:  SV
         V
Subjt:  SV

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]8.4e-15045.27Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        +E  P+ ++D+++PV+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  FLEIC TVK+NGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+++RRCPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ N                      ++L AQ+A+L++ ++ LT+  + +S   LA         +   E+VQY+ NR+Y      +PN+YHP 
Subjt:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMMALFIKE--QRIWNV-----NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEP
        LRNHEN SY NTKNVLQP  PPGF S  + E+K +LE+ M  F++E   R         N++T  +N  AA+KN+EVQIGQ+A+ +NA Q+G FPS+TE 
Subjt:  LRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMMALFIKE--QRIWNV-----NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEP

Query:  NPKEQCKMVVLRSGRRLENS---------------LEKKKEEE---------------------------------------------------------
        NPKEQCK + LRSG+ +E S                 K K EE                                                         
Subjt:  NPKEQCKMVVLRSGRRLENS---------------LEKKKEEE---------------------------------------------------------

Query:  ------------------------KRRDEDEGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD
                                KRR E+   E  K S E       ++P K KDPGSFT+PCTIG+  FD+ LCDLGASINLMP SV RK+GL  M  
Subjt:  ------------------------KRRDEDEGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTD

Query:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED+EVP+ILGRPFLATG+A I V  G+LTL ++ E+V+F I+          TCF V
Subjt:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.8e-15345.39Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC T+KMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ N                      ++L AQ+ASL++ ++ LT+  + +    +A         +   E+VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNV-------NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ M  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNV-------NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP

Query:  KEQCKMVVLRSGRRLE---------------NSLEKKKEEEKRRDED-----------------------------------------------------
        KEQCK + LRSGR +E               N   K K EE+   ED                                                     
Subjt:  KEQCKMVVLRSGRRLE---------------NSLEKKKEEEKRRDED-----------------------------------------------------

Query:  --------------------------EGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVT
                                  E  E  K S E       ++P K KDPGSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++
Subjt:  --------------------------EGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVT

Query:  LQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED+EVP+ILGRPFLATG+A + V  G+LTL ++ E+V F+I+   +      TCF V
Subjt:  LQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]2.1e-15346.58Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC TVKMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+ N                      ++L AQ+ASL++ ++ L++  + +S   +A         +   E+VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNV-------NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ M  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNV-------NLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP

Query:  KEQCKMVVLRSGRRLE---------------NSLEKKKEEEKRRDED-----------------------------------------------------
        KEQCK + LRSGR +E               N   K K EE+   ED                                                     
Subjt:  KEQCKMVVLRSGRRLE---------------NSLEKKKEEEKRRDED-----------------------------------------------------

Query:  ----EGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKV
            E  E  K S E       ++P K KD GSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLADRSI +P G++EDVLVKV
Subjt:  ----EGAEAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKV

Query:  NKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        +KFIFP +FVVLDM+ED+EVP+ILGRPFLA G+A + V  G+LTL ++ E+V F+I+   +   +  TCF V
Subjt:  NKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber]1.3e-15346.55Show/hide
Query:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV
        Q   P+ ++D+++P++    SGI +  I A NFEL   LI M +   F G P +DP+ HL  FLEIC  VKMNGV  D IRLRLFPFSL+DKA+ WL+S+
Subjt:  QEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESV

Query:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK
        + GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG  DWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAK

Query:  DLLEEMAAT--------------------NESSSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS
         LLEEMA+                        ++L AQ+ASL++ ++ LT+  + +S+  +A         +   E VQYI NR+Y      +PN+YHP 
Subjt:  DLLEEMAAT--------------------NESSSLKAQLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDV--EEVQYIGNRSYT---QGVPNFYHPS

Query:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRI-------WNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVLQPPPGF S  + EKK +LE+ M  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRI-------WNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP

Query:  KEQCKMVVLRSGRRLENSLEKKKE---------EEKRRDEDEGA--------------------------------------------------------
        KEQCK + LRSGR +E S  K+ E         + K + E+E                                                          
Subjt:  KEQCKMVVLRSGRRLENSLEKKKE---------EEKRRDEDEGA--------------------------------------------------------

Query:  -EAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFP
         E  K S E       ++P K KDPGSFT+PCTIG   FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLA+RSI +P G++EDVLVKV+KFIFP
Subjt:  -EAQKTSSE-------RIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFP

Query:  VDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
         DFVVLDM+ED+EVP+ILGRPFLATG+A + V  G+LTL +  E+V+F+I+   + +    TCF V
Subjt:  VDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

TrEMBL top hitse value%identityAlignment
A0A1S3UKD4 uncharacterized protein LOC1067662671.8e-12141.2Show/hide
Query:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWL
        M  ++   K IRD+  P        IV  PIQA NFE+K  L+Q+ + N F G  SEDP+SHL +FL IC T+K NGV  DAI LRLFPFSL+DKAK+WL
Subjt:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWL

Query:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S+  GSISTW+++A  F+TK+FPP+K+ K+R EI +F Q D E LYEAWERYKE++R+CP H  P+WLQVQ FYNGL+P+ K +LD ++GGSF+ KT  
Subjt:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  EAKDLLEEMA------------------ATNESSSLKAQLASLTNALNKLTSSEVV---KSISTLAEG-------HSKKEGQDVEEVQYIG--NRSYTQG
        EA + LE MA                    N   ++ AQ   LT  +  LT    +    +++T + G       H   E Q   +   I    +   Q 
Subjt:  EAKDLLEEMA------------------ATNESSSLKAQLASLTNALNKLTSSEVV---KSISTLAEG-------HSKKEGQDVEEVQYIG--NRSYTQG

Query:  VPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNN-----------LEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQ
          NF + + R  +    +N      P P +   + P  + N           L    + F+ +   +    +T+  N +A+++N+E QIGQ++  ++   
Subjt:  VPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNN-----------LEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQ

Query:  KGKFPSDTEPNPKEQCKMVVLRSGRRLENSLEKKKEEEKRRDEDE------GAEAQKTSSERIPS-----------------------------------
         G FPSDT PNP+EQCK + LRS R LE+    + E EK++  DE        E ++ S E++PS                                   
Subjt:  KGKFPSDTEPNPKEQCKMVVLRSGRRLENSLEKKKEEEKRRDEDE------GAEAQKTSSERIPS-----------------------------------

Query:  --KQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIIL
          K KDP SF +PC IG +S  +ALCDLGASINLMP S+++++G+  +  T +TLQLADRS+T+P G+VEDVLVKV+KFIFP DFVVLDM+ED +VPIIL
Subjt:  --KQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIIL

Query:  GRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL
        GRPFLATG+  I V  G L L + DEKV FSI            CF     E L  DD  VDY++
Subjt:  GRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL

A0A1U7Z951 uncharacterized protein LOC1045905681.6e-12544.46Show/hide
Query:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGS
        + + D+ +P L    S IV   I A NF++K  +IQM ++   F G   EDP++H+ +FLEIC T K NGV  D +RLRLFPFSL+DK K WL S+   S
Subjt:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGS

Query:  ISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLE
        ISTWDE+A  FL+K+FPP+K  K+R +I TF Q D E LYE+WERYKE+LR+ P HG P WLQVQ FYN L  + KT++D +AGGS  +KT   A  L+E
Subjt:  ISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLE

Query:  EMAATN-ESSSLKAQLASLTNALNKLTSSEVVKSISTLAE----------------------GHSKKE-------GQDVEEVQYIGNRSYTQG--VPNFY
        EM A N +  S +AQ+       N  T + +   I  L++                      GH   E        Q  ++V ++GN    QG   P  +
Subjt:  EMAATN-ESSSLKAQLASLTNALNKLTSSEVVKSISTLAE----------------------GHSKKE-------GQDVEEVQYIGNRSYTQG--VPNFY

Query:  HPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQC
        +P  RNH N S+ N +N ++P P   + + PE K+NLEE+M  FI        + +T   N +A++KN+E Q+GQ+A ++++  +G  PS+TE NP+EQ 
Subjt:  HPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQC

Query:  KMVVLRSGRRLENSLEKKKEEE--------------------KRRDEDEGAEAQKTSS---ERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMP
        + + LRSG+ L+   +K KEE+                    K  D    A  ++ S     ++P K KDPGSFT+PCTIG +  ++ALCDLGA+INLM 
Subjt:  KMVVLRSGRRLENSLEKKKEEE--------------------KRRDEDEGAEAQKTSS---ERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMP

Query:  YSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSI
        YSV++K+GL     T V LQL DRSI HP G++EDVLVKV+KFIFPVDF+VLDM+ED +VP+ILGRPFLATGKA + V  G+L+L I DE+V+F +
Subjt:  YSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSI

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.6e-13042.19Show/hide
Query:  EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVE
        EA +A+RD++ P++   +  I    I A NFE+K   IQM + +  F G PS+DP+SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+ 
Subjt:  EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVE

Query:  TGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKD
         GSI+TW++LAQ FL KFFPPAKT K+R +I +F Q D E LYEAWER+KE+LRRCP HG PDWLQVQ FYNGL  S KT++D +AGG+ +SK   +A +
Subjt:  TGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKD

Query:  LLEEMAATN------ESSSLKAQLASLTNALNKLTS--SEVVKSISTL---------------AEGHSKKE-GQDVEEVQYIGNRSYTQGVP--NFYHPS
        LLEEMA+ N       S S KA  A   +AL  LT+  + + K + TL                + HS  +   + E VQ++GN +  Q  P  N Y+P 
Subjt:  LLEEMAATN------ESSSLKAQLASLTNALNKLTS--SEVVKSISTL---------------AEGHSKKE-GQDVEEVQYIGNRSYTQGVP--NFYHPS

Query:  LRNHENFSYANTKNVLQP----PPGF---ASTSTPEKKNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP
         RNH NFS++N      P    PPGF   A    PEKK+ LEE++  +I +           + +  A+L+N+E Q+GQ+A+ +N   +G  PSDT+ NP
Subjt:  LRNHENFSYANTKNVLQP----PPGF---ASTSTPEKKNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNP

Query:  --KEQCKMVVLRSGRRLENSLEK---------------KKEEEKRRDEDEGAEAQKTS------------------------------------------
          KEQC+ + LRSG+ +E   +K               + E E ++ +D+ AE Q TS                                          
Subjt:  --KEQCKMVVLRSGRRLENSLEK---------------KKEEEKRRDEDEGAEAQKTS------------------------------------------

Query:  ------------------------------------SERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADR
                                              ++P K KDPGSFT+PCTIG + F +AL DLGASINLMP+S++ K+GL     T VTLQLADR
Subjt:  ------------------------------------SERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADR

Query:  SITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC
        S  +P G++EDVLVKV+KFIFPVDF++LDM+ED+++PIILGRPFLAT  A I V  GK++  + +E V F+IF   +   S + C
Subjt:  SITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC

A0A6P6XAQ1 Reverse transcriptase2.6e-12039.91Show/hide
Query:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWL
        MA  E   + +RDF  P      + IV   + A NFE+K  LIQM + + + G+ +EDP+SHL +FLEIC T+K NGV  DAI+LRLFPFSL+DKAK WL
Subjt:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWL

Query:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S    + +TWDELA+AFL KFFPP KT KLR +I +F Q + E LYEAWERY+E+ RRCP HG PDWL VQ FYNGL   TKT +D +AGG+ + KT  
Subjt:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  EAKDLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSE-----VVKSISTLAEGHSKKEGQDVEEVQYIGNRS---YTQGVP
        EA+ L+EEMAA N                      + L A++ ++   LN+   S      VV S +     H        E+VQY+ N +         
Subjt:  EAKDLLEEMAATNES--------------------SSLKAQLASLTNALNKLTSSE-----VVKSISTLAEGHSKKEGQDVEEVQYIGNRS---YTQGVP

Query:  NFYHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMM------ALFIKEQRIWNVNLQT------SVNNHDAALKNMEVQIGQIASVVNAL
        N Y+P  RNH NF + +  N  +P  PPGF    T  +     E+       A   K +++ +   Q        ++      +N+EVQ+GQIA+ VN  
Subjt:  NFYHPSLRNHENFSYANTKNVLQP--PPGFASTSTPEKKNNLEEMM------ALFIKEQRIWNVNLQT------SVNNHDAALKNMEVQIGQIASVVNAL

Query:  QKGKFPSDTEPNPKEQCKMVVLRSGRRL---------------EN----SLEKKKEEEKRRDEDEGAEAQKTSS--------------------------
         +G  PS TE NP+E  K + LRSG+ L               EN     L++  +EEK +++ E  E Q   +                          
Subjt:  QKGKFPSDTEPNPKEQCKMVVLRSGRRL---------------EN----SLEKKKEEEKRRDEDEGAEAQKTSS--------------------------

Query:  ---------------ERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFP
                        ++P K KDPGSFTVPCTIG V F +ALCDLGAS++L+P +V R++GL  +  T+++LQLADRSI HPMG++E+VL+KV KFI P
Subjt:  ---------------ERIPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFP

Query:  VDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQD------ESVCSLHTCFSVGPEY----LTDDDEEVDYNLGLGLGEML
        VDFVVLDM+ED  VPIILGRPFLAT    I V  GK    I +E+V F +   +      + V S+  C  +  E     L +D  E+  N G+G+ E  
Subjt:  VDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQD------ESVCSLHTCFSVGPEY----LTDDDEEVDYNLGLGLGEML

Query:  VDNM
        ++ M
Subjt:  VDNM

A0A6P8DD93 uncharacterized protein LOC1162064533.1e-11337.66Show/hide
Query:  APKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVETG
        A +A+RD+  P +    S I    I A NFELK  LIQM + N F G+P+E P  H+  FL+ C TVKMN V  D IRL+LFPFSL+DKA+ W  S+   
Subjt:  APKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVETG

Query:  SISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLL
        SI+TW +L+  FL +FFPPA+T +LR EI  F + + E LYEAWER+KE +R+CP HG PD L +++FY  L+ + ++++D +AGG+ + K   EA  L+
Subjt:  SISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLL

Query:  EEMAAT-----NESS--------------SLKAQLASLTNALNKLTSSEVVK----SISTLAEG-HSKKE--------GQDVEEVQYIGN--RSYTQGVP
        EEMA++     NE S              +L  Q+++LT  ++KLTS+        +   L  G HS  E          + E+V ++ N  RS      
Subjt:  EEMAAT-----NESS--------------SLKAQLASLTNALNKLTSSEVVK----SISTLAEG-HSKKE--------GQDVEEVQYIGN--RSYTQGVP

Query:  NFYHPSLRNHENFSYANTKNVLQPPPGF-----ASTSTPEK-KNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSD
        N Y+P  RNH NFS+ N  N L+PPPGF     A  + P++ ++ +EE+M  ++++         T + N  A ++N+E QI QI+  ++    G  PS+
Subjt:  NFYHPSLRNHENFSYANTKNVLQPPPGF-----ASTSTPEK-KNNLEEMMALFIKEQRIWNVNLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSD

Query:  TEPNPKEQCKMVVLRSGRRLENSLEKKKEEEKRRDEDEG----------------------------------------------------AEA--QKTS
        TE NPK     ++LRSG+ LE    K + +E+  ++D+G                                                    AEA  Q  S
Subjt:  TEPNPKEQCKMVVLRSGRRLENSLEKKKEEEKRRDEDEG----------------------------------------------------AEA--QKTS

Query:  SER----------------------------------IPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSI
          R                                  +P KQ+D GSFTVPCTIG   F+  L D GASINLMP S++RK+GL     T VTLQLADRSI
Subjt:  SER----------------------------------IPSKQKDPGSFTVPCTIGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSI

Query:  THPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVGPEYLTDD--DEEV
         +P G+VE+VLVKV+KFIFPVDF+VL+M+ED+EVP+ILGRPFLATGKA I V  GKLTL + +E++ F+++   +      +C+++    + D+   E V
Subjt:  THPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVGPEYLTDD--DEEV

Query:  DYNLGLGLGEMLVDNMD
        +   G+   E ++ ++D
Subjt:  DYNLGLGLGEMLVDNMD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCACCAAGAAGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAAAATTCTGGGATTGTTTACGCCCCCATCCAAGCTACAAATTTTGA
GCTAAAAACAGGGTTGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCTCACTCTCATCTGCGATCATTCCTAGAAATTTGTGGGACGGTAA
AGATGAACGGAGTTCCGGCCGATGCGATCAGATTGAGGCTATTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACT
TGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGCTAAGACTACCAAGCTGCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTGTA
CGAAGCATGGGAAAGATATAAGGAAATGCTTAGGCGATGCCCCCAACATGGATATCCTGATTGGCTTCAGGTACAGTTATTCTATAATGGATTAAACCCCTCCACGAAGA
CAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAAGACCTGCTTGAGGAAATGGCGGCGACAAATGAGTCAAGTTCACTGAAAGCG
CAACTAGCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGGTGGTTAAGTCCATTTCCACCTTAGCTGAAGGACATTCGAAGAAAGAAGGTCAAGATGTGGA
AGAAGTTCAATACATAGGAAACAGATCATATACTCAAGGAGTACCGAACTTCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATGCAAATACGAAGAATGTTT
TGCAGCCACCCCCAGGTTTTGCATCAACGAGTACTCCTGAGAAGAAAAATAATCTGGAGGAGATGATGGCTTTATTCATCAAGGAACAAAGAATATGGAATGTAAATCTC
CAGACATCAGTAAACAACCACGACGCAGCTCTAAAGAATATGGAAGTGCAGATAGGTCAGATTGCTTCAGTAGTAAATGCCCTTCAGAAGGGAAAATTTCCAAGCGATAC
TGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGTTTTGAGAAGTGGCAGAAGACTGGAGAACAGTTTAGAGAAGAAAAAGGAAGAAGAGAAGAGAAGGGATGAAGATG
AAGGGGCTGAGGCACAAAAAACCTCATCTGAAAGGATACCCTCTAAGCAAAAAGATCCAGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCT
TTGTGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGATTGGTTTATCAGGTATGACAGACACCGACGTCACTCTCCAGCTTGCCGATAGATC
GATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCATCTTTCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAGAGGTGCCAA
TTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAACTTACCTTGAACATTGATGATGAAAAAGTCGTGTTTAGTATTTTTGGC
CAAGATGAATCTGTTTGTAGTTTGCATACATGTTTTTCTGTTGGGCCTGAATACTTGACTGATGATGATGAAGAGGTAGACTATAATCTTGGCCTAGGCCTAGGAGAAAT
GCTTGTTGATAATATGGACTTTGATCATGATGCATATATGGATAATCCTCTGTTTGAAAATGACTTGGATCTGCCTGACTTCGAAAATGAATTAGACTTGCCTGCTTGTG
AAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTGCCTGAAATGGATAATTTTAATGATGATATTGAATTGCCTGACATTGAACATGAA
CGTAAAAGGCATAAAAAAGATTGCTCGATAGATAATTTTGAGTCTGACCATGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAA
TGGTCTTGAAACGCCTACCTCTATTGATTTAATTTGCCTTTTTAGTGATTTCAAAAGTATTATCAGCGGGATCCTCCCGGAACTCCGGATTTTTTTGGAATATGTTTCAA
AACCAATGAAGCTCTACCGATCAAGCACTAGGCAACTGGCTATCCCTAACTCAGTCACTGGCGTTCACCAATACACAATGATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCACCAAGAAGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAAAATTCTGGGATTGTTTACGCCCCCATCCAAGCTACAAATTTTGA
GCTAAAAACAGGGTTGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCTCACTCTCATCTGCGATCATTCCTAGAAATTTGTGGGACGGTAA
AGATGAACGGAGTTCCGGCCGATGCGATCAGATTGAGGCTATTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACT
TGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGCTAAGACTACCAAGCTGCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTGTA
CGAAGCATGGGAAAGATATAAGGAAATGCTTAGGCGATGCCCCCAACATGGATATCCTGATTGGCTTCAGGTACAGTTATTCTATAATGGATTAAACCCCTCCACGAAGA
CAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAAGACCTGCTTGAGGAAATGGCGGCGACAAATGAGTCAAGTTCACTGAAAGCG
CAACTAGCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGGTGGTTAAGTCCATTTCCACCTTAGCTGAAGGACATTCGAAGAAAGAAGGTCAAGATGTGGA
AGAAGTTCAATACATAGGAAACAGATCATATACTCAAGGAGTACCGAACTTCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATGCAAATACGAAGAATGTTT
TGCAGCCACCCCCAGGTTTTGCATCAACGAGTACTCCTGAGAAGAAAAATAATCTGGAGGAGATGATGGCTTTATTCATCAAGGAACAAAGAATATGGAATGTAAATCTC
CAGACATCAGTAAACAACCACGACGCAGCTCTAAAGAATATGGAAGTGCAGATAGGTCAGATTGCTTCAGTAGTAAATGCCCTTCAGAAGGGAAAATTTCCAAGCGATAC
TGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGTTTTGAGAAGTGGCAGAAGACTGGAGAACAGTTTAGAGAAGAAAAAGGAAGAAGAGAAGAGAAGGGATGAAGATG
AAGGGGCTGAGGCACAAAAAACCTCATCTGAAAGGATACCCTCTAAGCAAAAAGATCCAGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCT
TTGTGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGATTGGTTTATCAGGTATGACAGACACCGACGTCACTCTCCAGCTTGCCGATAGATC
GATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCATCTTTCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAGAGGTGCCAA
TTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAACTTACCTTGAACATTGATGATGAAAAAGTCGTGTTTAGTATTTTTGGC
CAAGATGAATCTGTTTGTAGTTTGCATACATGTTTTTCTGTTGGGCCTGAATACTTGACTGATGATGATGAAGAGGTAGACTATAATCTTGGCCTAGGCCTAGGAGAAAT
GCTTGTTGATAATATGGACTTTGATCATGATGCATATATGGATAATCCTCTGTTTGAAAATGACTTGGATCTGCCTGACTTCGAAAATGAATTAGACTTGCCTGCTTGTG
AAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTGCCTGAAATGGATAATTTTAATGATGATATTGAATTGCCTGACATTGAACATGAA
CGTAAAAGGCATAAAAAAGATTGCTCGATAGATAATTTTGAGTCTGACCATGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAA
TGGTCTTGAAACGCCTACCTCTATTGATTTAATTTGCCTTTTTAGTGATTTCAAAAGTATTATCAGCGGGATCCTCCCGGAACTCCGGATTTTTTTGGAATATGTTTCAA
AACCAATGAAGCTCTACCGATCAAGCACTAGGCAACTGGCTATCCCTAACTCAGTCACTGGCGTTCACCAATACACAATGATCTAG
Protein sequenceShow/hide protein sequence
MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPADAIRLRLFPFSLQDKAKDWLESVETGSIST
WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATNESSSLKA
QLASLTNALNKLTSSEVVKSISTLAEGHSKKEGQDVEEVQYIGNRSYTQGVPNFYHPSLRNHENFSYANTKNVLQPPPGFASTSTPEKKNNLEEMMALFIKEQRIWNVNL
QTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVVLRSGRRLENSLEKKKEEEKRRDEDEGAEAQKTSSERIPSKQKDPGSFTVPCTIGEVSFDRA
LCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFG
QDESVCSLHTCFSVGPEYLTDDDEEVDYNLGLGLGEMLVDNMDFDHDAYMDNPLFENDLDLPDFENELDLPACENERSAVDDLPSFENELDLPEMDNFNDDIELPDIEHE
RKRHKKDCSIDNFESDHDYSESIESDLDIPECMNPDNGLETPTSIDLICLFSDFKSIISGILPELRIFLEYVSKPMKLYRSSTRQLAIPNSVTGVHQYTMI