CuGenDBv2

Lag0008493 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008493
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr9:23521685..23525477
RNA-Seq Expression: Lag0008493
Synteny: Lag0008493
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
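The genome location above uses a chromosome:start..end notation. As a minimal sketch of how such a string could be parsed and the locus span computed, assuming the coordinates are 1-based and inclusive (an assumption, the record does not state the convention) and using the hypothetical helper name parse_location:

```python
import re

def parse_location(loc: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    parse_location is an illustrative helper, not part of any database API.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr9:23521685..23525477")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 3793 bp. With a 0-based half-open convention the span would instead be end - start.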


Homology
GenBank top hits (columns: hit | e value | % identity | alignment)
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] | e value: 1.7e-135 | % identity: 41.18
Query:  MRRNKVVDLFPLDLEIIRTLKFIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLE
        MRR +  D+ P+D EI RTL+ +RR K LA A   +E  P+ ++D+++PV+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  FLE
Subjt:  MRRNKVVDLFPLDLEIIRTLKFIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLE

Query:  ICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDW
        IC TVK+NGV  D IRLRLFPFSL+DKA+ WL+S++ GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+++RRCPQHG PDW
Subjt:  ICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDW

Query:  LQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS-------------
        LQVQ+FYNGLN  T+T++D ++GG+ +SKT   A  LLEEMA+ +YQWPTE+    K AG+++L+  ++L AQ+A+L++ ++ LT+              
Subjt:  LQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS-------------

Query:  -----------------------------------------------------EPPPGFASTSTPEKKNNMEEMVALFIKE--QRILNV-----NLQTSV
                                                             + PPGF S  + E+K ++E+ +  F++E   R         N++T  
Subjt:  -----------------------------------------------------EPPPGFASTSTPEKKNNMEEMVALFIKE--QRILNV-----NLQTSV

Query:  NNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLE-----------------DSSEKKNEEEKRRDEDEGTEAQKASS----
        +N  AA+KN+EVQIGQ+A+ +NA Q+G FPS+TE N KEQCK + LRSG+ +E                  S  K  E+E   D  E T+     S    
Subjt:  NNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLE-----------------DSSEKKNEEEKRRDEDEGTEAQKASS----

Query:  -----------ERFQHPP------------NSIELKCDFSNS-------------FAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEVS------
                   +RFQ                 I +   F+++                +K    + +  KL+EE      + +P    D G  +      
Subjt:  -----------ERFQHPP------------NSIELKCDFSNS-------------FAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEVS------

Query:  ---FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIG
           FD+ LCDLGASINLMP SV RK+GL  M  T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED++VP+ILGRPFLATG+A I V  G
Subjt:  ---FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIG

Query:  KLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        +LTL ++ E+V+F I+          TCF V
Subjt:  KLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] | e value: 4.4e-131 | % identity: 41.54
Query:  MRRNKVVDLFPLDLEIIRTLKFIRR-EKRLAEAMAHQ-------EEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPH
        MRR + +DL  +D E  RT + +R  ++   EAMA Q       +   +AIRD+++PV+    SGI    I A NFELK GLI M + N F G   EDP+
Subjt:  MRRNKVVDLFPLDLEIIRTLKFIRR-EKRLAEAMAHQ-------EEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPH

Query:  SHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRC
        +HL SFLEIC TVKMNGV  DAIRLRLF FSL+DKAK W +S+  GSI+TWD+LAQ FLTK+FPP+K+ +LR EI  F+QLD E  YEAWER+K++LRRC
Subjt:  SHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRC

Query:  PQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS-----
        PQHG+  W+Q+++FYNGLN  T+T++D +AGG  ++KT   A  LL+++A  SYQWP+E+  + K AGL+E+D  ++L AQ+ASLTN +  LT+      
Subjt:  PQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS-----

Query:  -----------------------------------------------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQR-------ILNV
                                                                   +PPPGF +T   + K  +E+++  FI E R       +   
Subjt:  -----------------------------------------------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQR-------ILNV

Query:  NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKK-------------NEEEKRRDEDEGTEAQKASSER
        N++T V+   A +KN+EVQIGQ+A+ + + QKGKFPSDTE N +E C  + LRSG+ +E+S  KK              + E+++ E EGT+  K  S  
Subjt:  NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKK-------------NEEEKRRDEDEGTEAQKASSER

Query:  FQHPPNSIELKCDFSNSFAGRKEDER----------------------------------QNDNKKLTEEEV-------------------------VPC
        F   P  ++    F   F  +K D++                                   ++ KKL E E                          +PC
Subjt:  FQHPPNSIELKCDFSNSFAGRKEDER----------------------------------QNDNKKLTEEEV-------------------------VPC

Query:  NHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAE
        N    G ++FDRALCD GASINLMP SV++K+GL  +  T +TLQLADRSIT+P G++EDVLVKV+KFI PVDFVVLDM+E++K+P+ILGRPFLATG+A 
Subjt:  NHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAE

Query:  ISV
        I V
Subjt:  ISV

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] | e value: 1.0e-132 | % identity: 41.12
Query:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW
        A   Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC T+KMNGV  D IRLRLFPFSL+DKA+ W
Subjt:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW

Query:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV
        L+S++ GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT 
Subjt:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV

Query:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------
          A  LLEEMA+ +YQWPTE+    K AG++EL+  ++L AQ+ASL++ ++ LT+                                             
Subjt:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------

Query:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRILNV-------NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT
                            +PPPGF S  + EKK ++E+ +  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+T
Subjt:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRILNV-------NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT

Query:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERFQHPPNSIELKCDFSNSFAGRKEDER-----------
        E N KEQCK + LRSGR +E S  K+ E               EE+   ED   E     S  F   P  +     +   F  +K D++           
Subjt:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERFQHPPNSIELKCDFSNSFAGRKEDER-----------

Query:  -------------------------------QNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTD
                                       + +  KL+EE      + +P    D G  +         FD+ LCDLGASINLMP SVYRK+GL  M  
Subjt:  -------------------------------QNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTD

Query:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        T ++LQLADRSI +P G++EDVLVKV+KFIFP DFVVLDM+ED++VP+ILGRPFLATG+A + V  G+LTL ++ E+V F+I+   +      TCF V
Subjt:  TDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] | e value: 6.1e-133 | % identity: 42.75
Query:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW
        A   Q   P+ ++D+++P++    SGI    I A NFELK  LI M +   F G P +DP+ HL  FLEIC TVKMNGV  D IRLRLFPFSL+DKA+ W
Subjt:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW

Query:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV
        L+S++ GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT 
Subjt:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV

Query:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------
          A  LLEEMA+ +YQWPTE+    K AG++EL+  ++L AQ+ASL++ ++ L++                                             
Subjt:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------

Query:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRILNV-------NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT
                            +PPPGF S  + EKK ++E+ +  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+T
Subjt:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRILNV-------NLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT

Query:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERF-QHPP-----------NSIE---LKC-----DFSNS
        E N KEQCK + LRSGR +E S  K+ E               EE+   ED   E     S  F  +PP           NS +    KC      F   
Subjt:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERF-QHPP-----------NSIE---LKC-----DFSNS

Query:  FAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDV
           +K    + +  KL+EE      + +P    D G  +         FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLADRSI +P G++EDV
Subjt:  FAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDV

Query:  LVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        LVKV+KFIFP +FVVLDM+ED++VP+ILGRPFLA G+A + V  G+LTL ++ E+V F+I+   +   +  TCF V
Subjt:  LVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber] | e value: 3.9e-132 | % identity: 42.24
Query:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW
        A   Q   P+ ++D+++P++    SGI +  I A NFEL   LI M +   F G P +DP+ HL  FLEIC  VKMNGV  D IRLRLFPFSL+DKA+ W
Subjt:  AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDW

Query:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV
        L+S++ GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG  DWLQVQ+FYNGLN  T+T++D ++GG+ +SKT 
Subjt:  LESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTV

Query:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------
          A  LLEEMA+  YQWPTE+    K AG++EL+  ++L AQ+ASL++ ++ LT+                                             
Subjt:  TEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSS--------------------------------------------

Query:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRI-------LNVNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT
                            +PPPGF S  + EKK ++E+ +  F++E +           N++T  +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+T
Subjt:  --------------------EPPPGFASTSTPEKKNNMEEMVALFIKEQRI-------LNVNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDT

Query:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERF--------------QHPPNSIELKCDFSNSFAGRKE
        E N KEQCK + LRSGR +E S  K+ E               EE+    D   E     S  F              QH    +     F      +K 
Subjt:  EPNSKEQCKMVVLRSGRRLEDSSEKKNE---------------EEKRRDEDEGTEAQKASSERF--------------QHPPNSIELKCDFSNSFAGRKE

Query:  DERQNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNK
           + +  KL+EE      + +P    D G  +         FD+ LCDLGASINLMP SVYRK+GL  M  T ++LQLA+RSI +P G++EDVLVKV+K
Subjt:  DERQNDNKKLTEE------EVVPCNHHDRGEVS---------FDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNK

Query:  FIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
        FIFP DFVVLDM+ED++VP+ILGRPFLATG+A + V  G+LTL +  E+V+F+I+   + +    TCF V
Subjt:  FIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

TrEMBL top hits (columns: hit | e value | % identity | alignment)
A0A1S3UKD4 uncharacterized protein LOC106766267 | e value: 4.7e-115 | % identity: 38.98
Query:  RREKRLAE------AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRL
        R+E+R  E       M  ++   K IRD+  P        IV  PIQA NFE+K  L+Q+ + N F G  SEDP+SHL +FL IC T+K NGV  DAI L
Subjt:  RREKRLAE------AMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRL

Query:  RLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTV
        RLFPFSL+DKAK+WL+S+  GSISTW+++A  F+TK+FPP+K+ K+R EI +F Q D E LYEAWERYKE++R+CP H  P+WLQVQ FYNGL+P+ K +
Subjt:  RLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTV

Query:  LDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLT--------------------------------
        LD ++GGSF+ KT  EA + LE MA  +     ++    +K G+ E++   ++ AQ   LT  +  LT                                
Subjt:  LDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLT--------------------------------

Query:  --------------------------------------SSEPPPGFASTSTPEKKNN-----------MEEMVALFIKEQRILNVNLQTSVNNHDAALKN
                                              S+ P P +   + P  + N           +    + F+ +        +T+  N +A+++N
Subjt:  --------------------------------------SSEPPPGFASTSTPEKKNN-----------MEEMVALFIKEQRILNVNLQTSVNNHDAALKN

Query:  MEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEGTEAQKASSERFQHPPNSIELKCDFSNSFAGRKEDERQN
        +E QIGQ++  ++    G FPSDT PN +EQCK + LRS R LE     + E EK++  DE  E + A  E  +     +     F      +K   +++
Subjt:  MEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEGTEAQKASSERFQHPPNSIELKCDFSNSFAGRKEDERQN

Query:  DNKKLTEE------------------EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFI
        +   LTEE                   V+PC   + G +S  +ALCDLGASINLMP S+++++G+  +  T +TLQLADRS+T+P G+VEDVLVKV+KFI
Subjt:  DNKKLTEE------------------EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFI

Query:  FPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL
        FP DFVVLDM+ED KVPIILGRPFLATG+  I V  G L L + DEKV FSI            CF     E L  DD  VDY++
Subjt:  FPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG-PEYLTDDDEEVDYNL

A0A1U7Z951 uncharacterized protein LOC104590568 | e value: 1.0e-109 | % identity: 38.69
Query:  DLEIIRTLKFIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVP
        D EI RTL    R  +   A + +    + + D+ +P L    S IV   I A NF++K  +IQM ++   F G   EDP++H+ +FLEIC T K NGV 
Subjt:  DLEIIRTLKFIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SFKGHPSEDPHSHLRSFLEICGTVKMNGVP

Query:  VDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLN
         D +RLRLFPFSL+DK K WL S+   SISTWDE+A  FL+K+FPP+K  K+R +I TF Q D E LYE+WERYKE+LR+ P HG P WLQVQ FYN L 
Subjt:  VDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLN

Query:  PSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNA-------------------------------
         + KT++D +AGGS  +KT   A  L+EEM A +YQW +E+  + ++  L+ +D  ++L AQ+ +L+                                 
Subjt:  PSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNA-------------------------------

Query:  ---------------LNKLTSSEPPPGF----------------------ASTSTPEKKNNMEEMVALFIKEQRILNVNLQTSVNNHDAALKNMEVQIGQ
                       L +   ++ P  F                       + + PE K+N+EE++  FI        + +T   N +A++KN+E Q+GQ
Subjt:  ---------------LNKLTSSEPPPGF----------------------ASTSTPEKKNNMEEMVALFIKEQRILNVNLQTSVNNHDAALKNMEVQIGQ

Query:  IASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEGTE-AQKASSERFQHPPNSIELKCDFSNSFAGRKEDERQNDNKKLT
        +A  +++  +G  PS+TE N +EQ + + LRSG+ L++  +K  EE+        T+  ++  + + +    +     +  +     K  ++  D    T
Subjt:  IASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEGTE-AQKASSERFQHPPNSIELKCDFSNSFAGRKEDERQNDNKKLT

Query:  EEEVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPF
            +PC     G +  ++ALCDLGA+INLM YSV++K+GL     T V LQL DRSI HP G++EDVLVKV+KFIFPVDF+VLDM+ED  VP+ILGRPF
Subjt:  EEEVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPF

Query:  LATGKAEISVHIGKLTLNIDDEKVVFSI
        LATGKA + V  G+L+L I DE+V+F +
Subjt:  LATGKAEISVHIGKLTLNIDDEKVVFSI

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | e value: 2.2e-120 | % identity: 37.79
Query:  MRRNKVVDLFPLDLEIIRTLKFIRREK----RLAEAMAHQE------------EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SF
        M+R   ++L P D +I RT +  RRE      L + MA               EA +A+RD++ P++   +  I    I A NFE+K   IQM + +  F
Subjt:  MRRNKVVDLFPLDLEIIRTLKFIRREK----RLAEAMAHQE------------EAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDN-SF

Query:  KGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE
         G PS+DP+SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+  GSI+TW++LAQ FL KFFPPAKT K+R +I +F Q D E LYEAWE
Subjt:  KGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE

Query:  RYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNK
        R+KE+LRRCP HG PDWLQVQ FYNGL  S KT++D +AGG+ +SK   +A +LLEEMA+ +YQWP+E+    K  G YE+D   +L  Q+A+L+  L+ 
Subjt:  RYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNK

Query:  L-------------------------------------------------------------------TSSEP--PPGF---ASTSTPEKKNNMEEMVAL
        L                                                                   ++ +P  PPGF   A    PEKK+ +EE++  
Subjt:  L-------------------------------------------------------------------TSSEP--PPGF---ASTSTPEKKNNMEEMVAL

Query:  FIKEQRILNVNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTE--PNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEG-----------
        +I +   +       + +  A+L+N+E Q+GQ+A+++N   +G  PSDT+  P  KEQC+ + LRSG+ +E  ++K  E E    + EG           
Subjt:  FIKEQRILNVNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTE--PNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEG-----------

Query:  --TEAQKASSERFQHPPNSIELKCD------------------------------------FSNSFAGRKEDERQNDNKKLTEE----------------
           +A+   + +  HPP     +                                      F      +K    + +   LTEE                
Subjt:  --TEAQKASSERFQHPPNSIELKCD------------------------------------FSNSFAGRKEDERQNDNKKLTEE----------------

Query:  --EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPF
            +PC     G + F +AL DLGASINLMP+S++ K+GL     T VTLQLADRS  +P G++EDVLVKV+KFIFPVDF++LDM+ED+++PIILGRPF
Subjt:  --EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPF

Query:  LATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTC
        LAT  A I V  GK++  + +E V F+IF   +   S + C
Subjt:  LATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTC

A0A6J1DU19 uncharacterized protein LOC111024361 | e value: 1.1e-114 | % identity: 40.59
Query:  IRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSIST
        IRD+ QP  P  + GI+  PI A N ELK GLIQM R+N+F+G+ +EDP++HL  FL++CGTVKMNGV  DAIRLRLFP SLQDK               
Subjt:  IRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWLESVETGSIST

Query:  WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMA
          E+ QAFLT FFPPAKTT+LRTEI +FR+ D EQL+E WERYKE+LR+CPQHG  +WLQ+Q+FYNGLN  T+T+LD +AGG+ LS+T   A  LL++MA
Subjt:  WDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEMA

Query:  ATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLT---SSEPPPGFASTST-----------------PEKKNNMEEMVALFIKE-----QR
          S+QWP+E+    K AG+YE+DE SSLKAQ+ +LTNA++KL+   +S      A+T T                  EKK+++E+++  FI E      R
Subjt:  ATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLT---SSEPPPGFASTST-----------------PEKKNNMEEMVALFIKE-----QR

Query:  ILN--VNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEE-----EKRRDEDE-------GTEAQK
        I N    ++  +  +  ++KNMEVQIGQIA  +N +QKGKFPSD E   +E CK V LRSG+ L++  +KK EE     E+R +++E         +A K
Subjt:  ILN--VNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEE-----EKRRDEDE-------GTEAQK

Query:  ASSERFQHPPNSIELK----------CDFSNSFAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEV---------SFDRALCDLGASINLMPYSVY
         +S     PPNS+               F       K      +   LTEE        +P    D G           SF++ALCD+ ASINLM     
Subjt:  ASSERFQHPPNSIELK----------CDFSNSFAGRKEDERQNDNKKLTEE------EVVPCNHHDRGEV---------SFDRALCDLGASINLMPYSVY

Query:  RKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVC
                                P+GV+EDVLVKV++ IFP DFVVL  +ED ++PIILGR FLATG A I V +G LTL +++E VVF I    +   
Subjt:  RKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSIFGQDESVC

Query:  SLHTCFSVG---------PEYLTDDDEEVDYNLGL---GLGEMLMDNVNFDHDAYMDNPMFEN---DLDLPDFENELDLPACENERYAVDDLPSFENELN
         + TC  +            +++  D      LG     +GE L   V+F HDA    P+  +    +D+ +   +LD P  +     + +LP+    + 
Subjt:  SLHTCFSVG---------PEYLTDDDEEVDYNLGL---GLGEMLMDNVNFDHDAYMDNPMFEN---DLDLPDFENELDLPACENERYAVDDLPSFENELN

Query:  LPEMNNF
        L E + F
Subjt:  LPEMNNF

A0A6P6XAQ1 Reverse transcriptase | e value: 1.9e-108 | % identity: 38.9
Query:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWL
        MA  E   + +RDF  P      + IV   + A NFE+K  LIQM + + + G+ +EDP+SHL +FLEIC T+K NGV  DAI+LRLFPFSL+DKAK WL
Subjt:  MAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGVPVDAIRLRLFPFSLQDKAKDWL

Query:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S    + +TWDELA+AFL KFFPP KT KLR +I +F Q + E LYEAWERY+E+ RRCP HG PDWL VQ FYNGL   TKT +D +AGG+ + KT  
Subjt:  ESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  EAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNK--------------------------LTSSEP-----------------
        EA+ L+EEMAA +YQW  E+G   + AG+ E+D  + L A++ ++   LN+                           +SSE                  
Subjt:  EAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNK--------------------------LTSSEP-----------------

Query:  --------------------------PPGFASTSTPEKKNNMEEMV------ALFIKEQRILNVNLQT------SVNNHDAALKNMEVQIGQIASAVNAL
                                  PPGF    T  +     E+       A   K +++ +   Q        ++      +N+EVQ+GQIA+AVN  
Subjt:  --------------------------PPGFASTSTPEKKNNMEEMV------ALFIKEQRILNVNLQT------SVNNHDAALKNMEVQIGQIASAVNAL

Query:  QKGKFPSDTEPNSKEQCKMVVLRSGRRLED-------SSEKKNEEEKRRDEDEGTEAQKASSERFQHP---------PNSIELKCDFSNSFAGRKEDERQ
         +G  PS TE N +E  K + LRSG+ L +          +K E +K  +  EG++ +K   +  ++          P  I     F      +K     
Subjt:  QKGKFPSDTEPNSKEQCKMVVLRSGRRLED-------SSEKKNEEEKRRDEDEGTEAQKASSERFQHP---------PNSIELKCDFSNSFAGRKEDERQ

Query:  NDNKKLTEE------------------EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKF
        ++   LTEE                    VPC     G V F +ALCDLGAS++L+P +V R++GL  +  T+++LQLADRSI HPMG++E+VL+KV KF
Subjt:  NDNKKLTEE------------------EVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKF

Query:  IFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSI
        I PVDFVVLDM+ED  VPIILGRPFLAT    I V  GK    I +E+V F +
Subjt:  IFPVDFVVLDMKEDKKVPIILGRPFLATGKAEISVHIGKLTLNIDDEKVVFSI

SwissProt top hits (columns: hit | e value | % identity | alignment)
No hits found
Arabidopsis top hits (columns: hit | e value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGCGAAGAAACAAGGTGGTTGATTTGTTTCCGCTAGATCTTGAGATTATTAGGACTCTTAAATTCATTAGAAGAGAAAAAAGATTAGCAGAAGCGATGGCCCACCAAGA
AGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAGAATTCTGGAATTGTCTACGCCCCTATCCAAGCTACAAATTTTGAGCTAAAAACAGGGT
TGATTCAGATGGCACGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCCCACTCTCATCTGCGATCATTCCTAGAAATATGCGGGACGGTAAAGATGAACGGAGTT
CCGGTCGACGCAATCAGATTGAGGCTGTTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCGGTCGAGACGGGCAGCATCAGTACTTGGGACGAGCTTGC
CCAGGCTTTTCTGACAAAATTTTTTCCACCGGCTAAGACTACCAAGCTCCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATACGAAGCGTGGGAAA
GATATAAGGAAATGCTTAGGCGATGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGTTATTTTATAATGGATTAAACCCCTCCACAAAGACAGTCTTAGACACA
TCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACTGAAGCCAAGGACCTGCTTGAGGAAATGGCGGCAACAAGTTATCAATGGCCGACGGAGAAAGGAACAATTACAAA
AAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGCCACCCCCAGGTTTTG
CATCAACGAGTACTCCTGAAAAGAAAAATAATATGGAGGAGATGGTGGCTTTATTCATCAAGGAACAAAGAATATTGAATGTGAATCTCCAGACCTCAGTAAACAACCAC
GATGCAGCTCTAAAGAATATGGAAGTACAGATAGGTCAGATTGCTTCAGCAGTAAATGCCCTTCAAAAGGGAAAATTTCCAAGTGATACTGAGCCTAACTCGAAAGAGCA
GTGTAAGATGGTGGTTCTGAGAAGTGGCAGAAGACTGGAGGACAGTTCAGAGAAGAAAAATGAAGAAGAAAAGAGAAGGGATGAAGATGAAGGGACTGAGGCACAAAAAG
CCTCCTCTGAAAGGTTCCAACATCCTCCCAACTCTATTGAATTAAAATGTGATTTTTCTAACAGCTTTGCAGGTAGAAAAGAAGATGAGAGGCAGAATGACAATAAGAAG
CTGACTGAGGAAGAAGTGGTTCCATGCAACCACCATGACAGAGGAGAAGTATCCTTCGATAGGGCTCTATGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGT
ATACAGGAAGATTGGTTTATCAGGTATGACAGATACCGACGTCACTCTCCAGCTTGCTGACAGATCGATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAG
TCAACAAATTCATCTTCCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAAAAGTGCCAATTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATT
AGCGTGCATATAGGTAAACTTACCTTGAACATTGATGATGAAAAAGTCGTGTTCAGTATTTTTGGCCAAGATGAATCTGTTTGTAGTTTGCATACATGTTTTTCTGTTGG
GCCTGAATACTTGACTGATGATGATGAAGAGGTAGACTATAATCTTGGCCTAGGCTTAGGAGAAATGCTTATGGATAATGTGAATTTTGATCATGATGCATATATGGATA
ATCCTATGTTTGAAAATGATTTGGATCTGCCTGACTTTGAAAATGAATTAGACTTGCCTGCTTGTGAAAATGAAAGATATGCAGTTGATGATTTACCTTCCTTTGAAAAT
GAATTAAATTTGCCTGAAATGAATAATTTTGATGATGATATTGAATTGCCTGACATTGAGCATGAACATGAAATGCATAAAAAAGATTGCTTGATAGATAATTTTGAGTC
TGACCATGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAATGTGAATACTTTTTTTTCATGTCCTGATGATGTGTATAGCATAG
AATCTGACCCAGAAGAACTTGAATCTGTGCATAGTACAGAATCTGATCCTGAAATTCTTGAATTCTTTAATTCTTCTGATGATGAGTCATGTGATAATGTCGATTGTGCA
GGTTTTAGCTCCTACAAGCACCCACCAGTAAATGATTTGGCTGTTTGGGGCGATGCACAGTTCGAGGCCTTGGGTAATATGGTCAAGGGTCGAACGCTGAGCACCGTAGA
GAAACGTCGAGGCCTTGGGAAATATATGGTAAGGGGTCGGCGTGACTCGAAGGAGTCAGTCTCAGAGAGCCTTGTGAGGGCTATGCGTGTTGAGAGAGCTCGTGGAGCTT
AA
mRNA sequence
ATGCGAAGAAACAAGGTGGTTGATTTGTTTCCGCTAGATCTTGAGATTATTAGGACTCTTAAATTCATTAGAAGAGAAAAAAGATTAGCAGAAGCGATGGCCCACCAAGA
AGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAGAATTCTGGAATTGTCTACGCCCCTATCCAAGCTACAAATTTTGAGCTAAAAACAGGGT
TGATTCAGATGGCACGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCCCACTCTCATCTGCGATCATTCCTAGAAATATGCGGGACGGTAAAGATGAACGGAGTT
CCGGTCGACGCAATCAGATTGAGGCTGTTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCGGTCGAGACGGGCAGCATCAGTACTTGGGACGAGCTTGC
CCAGGCTTTTCTGACAAAATTTTTTCCACCGGCTAAGACTACCAAGCTCCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATACGAAGCGTGGGAAA
GATATAAGGAAATGCTTAGGCGATGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGTTATTTTATAATGGATTAAACCCCTCCACAAAGACAGTCTTAGACACA
TCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACTGAAGCCAAGGACCTGCTTGAGGAAATGGCGGCAACAAGTTATCAATGGCCGACGGAGAAAGGAACAATTACAAA
AAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGCCACCCCCAGGTTTTG
CATCAACGAGTACTCCTGAAAAGAAAAATAATATGGAGGAGATGGTGGCTTTATTCATCAAGGAACAAAGAATATTGAATGTGAATCTCCAGACCTCAGTAAACAACCAC
GATGCAGCTCTAAAGAATATGGAAGTACAGATAGGTCAGATTGCTTCAGCAGTAAATGCCCTTCAAAAGGGAAAATTTCCAAGTGATACTGAGCCTAACTCGAAAGAGCA
GTGTAAGATGGTGGTTCTGAGAAGTGGCAGAAGACTGGAGGACAGTTCAGAGAAGAAAAATGAAGAAGAAAAGAGAAGGGATGAAGATGAAGGGACTGAGGCACAAAAAG
CCTCCTCTGAAAGGTTCCAACATCCTCCCAACTCTATTGAATTAAAATGTGATTTTTCTAACAGCTTTGCAGGTAGAAAAGAAGATGAGAGGCAGAATGACAATAAGAAG
CTGACTGAGGAAGAAGTGGTTCCATGCAACCACCATGACAGAGGAGAAGTATCCTTCGATAGGGCTCTATGTGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGT
ATACAGGAAGATTGGTTTATCAGGTATGACAGATACCGACGTCACTCTCCAGCTTGCTGACAGATCGATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAG
TCAACAAATTCATCTTCCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAAAAGTGCCAATTATCTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATT
AGCGTGCATATAGGTAAACTTACCTTGAACATTGATGATGAAAAAGTCGTGTTCAGTATTTTTGGCCAAGATGAATCTGTTTGTAGTTTGCATACATGTTTTTCTGTTGG
GCCTGAATACTTGACTGATGATGATGAAGAGGTAGACTATAATCTTGGCCTAGGCTTAGGAGAAATGCTTATGGATAATGTGAATTTTGATCATGATGCATATATGGATA
ATCCTATGTTTGAAAATGATTTGGATCTGCCTGACTTTGAAAATGAATTAGACTTGCCTGCTTGTGAAAATGAAAGATATGCAGTTGATGATTTACCTTCCTTTGAAAAT
GAATTAAATTTGCCTGAAATGAATAATTTTGATGATGATATTGAATTGCCTGACATTGAGCATGAACATGAAATGCATAAAAAAGATTGCTTGATAGATAATTTTGAGTC
TGACCATGATTACAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAATGTGAATACTTTTTTTTCATGTCCTGATGATGTGTATAGCATAG
AATCTGACCCAGAAGAACTTGAATCTGTGCATAGTACAGAATCTGATCCTGAAATTCTTGAATTCTTTAATTCTTCTGATGATGAGTCATGTGATAATGTCGATTGTGCA
GGTTTTAGCTCCTACAAGCACCCACCAGTAAATGATTTGGCTGTTTGGGGCGATGCACAGTTCGAGGCCTTGGGTAATATGGTCAAGGGTCGAACGCTGAGCACCGTAGA
GAAACGTCGAGGCCTTGGGAAATATATGGTAAGGGGTCGGCGTGACTCGAAGGAGTCAGTCTCAGAGAGCCTTGTGAGGGCTATGCGTGTTGAGAGAGCTCGTGGAGCTT
AA
Protein sequence
MRRNKVVDLFPLDLEIIRTLKFIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTVKMNGV
PVDAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDT
SAGGSFLSKTVTEAKDLLEEMAATSYQWPTEKGTITKKAGLYELDESSSLKAQLASLTNALNKLTSSEPPPGFASTSTPEKKNNMEEMVALFIKEQRILNVNLQTSVNNH
DAALKNMEVQIGQIASAVNALQKGKFPSDTEPNSKEQCKMVVLRSGRRLEDSSEKKNEEEKRRDEDEGTEAQKASSERFQHPPNSIELKCDFSNSFAGRKEDERQNDNKK
LTEEEVVPCNHHDRGEVSFDRALCDLGASINLMPYSVYRKIGLSGMTDTDVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKKVPIILGRPFLATGKAEI
SVHIGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVGPEYLTDDDEEVDYNLGLGLGEMLMDNVNFDHDAYMDNPMFENDLDLPDFENELDLPACENERYAVDDLPSFEN
ELNLPEMNNFDDDIELPDIEHEHEMHKKDCLIDNFESDHDYSESIESDLDIPECMNPDNVNTFFSCPDDVYSIESDPEELESVHSTESDPEILEFFNSSDDESCDNVDCA
GFSSYKHPPVNDLAVWGDAQFEALGNMVKGRTLSTVEKRRGLGKYMVRGRRDSKESVSESLVRAMRVERARGA
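
The CDS and protein sequences listed in this record can be checked against each other by conceptual translation. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name translate and its to_stop parameter are illustrative, not taken from the database. It translates the first and last few codons of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Incomplete trailing codons are ignored; unknown codons become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS in this record.
print(translate("ATGCGAAGAAACAAGGTGGTT"))
print(translate("GCTCGTGGAGCTTAA"))
```

Pasting the full CDS into translate (line breaks are stripped by the function) should let the result be compared directly with the protein sequence above.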