; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035694 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035694
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:27448465..27450890
RNA-Seq ExpressionLag0035694
SyntenyLag0035694
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.3e-9837.87Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----
        +NGV  D IRLRLFPFSL+DKA+ WL+S++ GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q + E LYEAWE+YK+++RRCPQ   P+ LQ    
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----

Query:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH
                                                            A ++EL+  ++L AQ+A+L++ ++ LT      S E V S S +V  +
Subjt:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH

Query:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMVALFIKE---------HRILNVSLQTSV
           +    E+VQY+ NR Y      +PN+YHP LRN+ENLSY NTKNVLQP  PPGF  S   EKK + E+ +  F++E          R+ N+  +T  
Subjt:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMVALFIKE---------HRILNVSLQTSV

Query:  NNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRK
        +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NPKEQCK + LRSG+ +E++  +E K       T A    S                        
Subjt:  NNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRK

Query:  EDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRE--------LPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKE
               K ++ EEE+V    +D      LP      PT  F +        LP+PQRF+  KLD+QF KFL++FKK+ +NIP  +AL  MPNY KF+K+
Subjt:  EDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRE--------LPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKE

Query:  MLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-----------------------
        ++S+K+ L++ E   LSE  S I+            SFT+PCTIG+  FDR LCDLGASINLM + V RK+GL   +                       
Subjt:  MLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-----------------------

Query:  --------IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
                  AD+              LGRPFLATG+A I V  G+LTL ++ E+V+F+I+   +      TCF V
Subjt:  --------IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.1e-10038.17Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----
        +NGV  D IRLRLFPFSL+DKA+ WL+S++ GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q + E LYEAWE+YK+++RRCPQ   P+ LQ    
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----

Query:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH
                                                            A ++EL+  ++L AQ+A+L++ ++ LT      S E V S S +V  +
Subjt:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH

Query:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMVALFIKE---------HRILNVSLQTSV
           +    E+VQY+ NR Y      +PN+YHP LRN+ENLSY NTKNVLQP  PPGF  S   EKK + E+ +  F++E          R+ N+  +T  
Subjt:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMVALFIKE---------HRILNVSLQTSV

Query:  NNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRK
        +N  A +KN+EVQIGQ+A+ +NA Q+G FPS+TE NPKEQCK + LRSG+ +E++  +E K       T A    S                        
Subjt:  NNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRK

Query:  EDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRE--------LPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKE
               K ++ EEE+V    +D      LP      PT  F +        LP+PQRF+  KLD+QF KFL++FKK+ +NIP  +AL  MPNY KF+K+
Subjt:  EDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRE--------LPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKE

Query:  MLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-----------------------
        ++S+K+ L++ E   LSE  S I+           GSFT+PCTIG+  FDR LCDLGASINLM +SV RK+GL   +                       
Subjt:  MLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-----------------------

Query:  --------IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
                  AD+              LGRPFLATG+A I V  G+LTL ++ E+V+F+I+   +      TCF V
Subjt:  --------IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.6e-9937.87Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----
        +NGV  D IRLRLFPFSL+DKA+ WL+S++ GSI +W ++A+ FL KFFPPAKT +LR+EIG F+Q + E LYEAWE+YK+++RRCPQ   P+ LQ    
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----

Query:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH
                                                            A +++L+  ++L AQ+A+L++ ++ LT      S E + S S +V  +
Subjt:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLT------SPEVVKSISTLVEGH

Query:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMV-------ALFIKEHRILNVSLQTSVNN
           +    E+VQY+ NR Y      +PN+YHP LRN+ENLSY NTKNVLQP  PPGF S  +  K + ++ MV       A F K    L+ +++T  +N
Subjt:  SKKEGQDVEEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQP--PPGFASSSAPEKKNNQEEMV-------ALFIKEHRILNVSLQTSVNN

Query:  HDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKED
          AA+KN+EVQIGQ+A+ +NA Q+G FPS+TE NPKEQCK + LRSG+ +E+S  +E K                     P ++ +              
Subjt:  HDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKED

Query:  ERQDDKKKLTEEEVV--PCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSL
             K K+ E+E+V       D    IS P    + P      LP+PQRF+  KLD+QF KFL++FKK+ +NIP  +AL  MPNY KF+K+++S+K+ L
Subjt:  ERQDDKKKLTEEEVV--PCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSL

Query:  KK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------
        ++ E   LSE  S I+           GSFT+PCTIG+  FD+ LCDLGASINLM  SV RK+GL   +                               
Subjt:  KK-EVFNLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------

Query:  IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
          AD+              LGRPFLATG+A I V  G+LTL ++ E+V+F I+          TCF V
Subjt:  IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

XP_022860306.1 uncharacterized protein LOC111380876 [Olea europaea var. sylvestris]2.6e-9436.93Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----
        MNGV  DAIRLRLF FSL+DKAK W +S+  GSI+TWD+LAQ FLTK+FPP+K+T+L +EI  F+QL+ E  YEAWE++K++LRRCPQ  + + +Q    
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----

Query:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSISTLVEGHSKKEGQ
                                                            A  +E+D  ++L AQ+ASLTN +  LT+    + + +++   S  +  
Subjt:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSISTLVEGHSKKEGQ

Query:  DV--EEVQYIGNRPYAQG---VPNFYHPSLRNYENLSYSNTKNVLQPPPGFASSSAPEKKNNQEEMVALFIKEHR-------ILNVSLQTSVNNHDAALK
        +V  E+ QY+ +R Y Q      N YHP LRN++NLSY N +N LQPPP F + ++ + K   E+++  FI E R       +   +++T V+N  A +K
Subjt:  DV--EEVQYIGNRPYAQG---VPNFYHPSLRNYENLSYSNTKNVLQPPPGFASSSAPEKKNNQEEMVALFIKEHR-------ILNVSLQTSVNNHDAALK

Query:  NMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDK
        N+EVQIGQ+A+ + + QKGKFP DTE NP+E C  +ILRSG+ +E+S                     ++   P + + +             +ERQ ++
Subjt:  NMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDK

Query:  KKLTEEEVVPCNHHDRGSHISLPKRRG--ECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF
        +K TE E         G+ I  P      + P      LPFPQRF   K D+QF KFLE+FKK+ +NIP  EAL  MPNY KF+KE++S KK L++ E  
Subjt:  KKLTEEEVVPCNHHDRGSHISLPKRRG--ECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF

Query:  NLSESSSTIV---------SGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--
         L+E  S I+         SGSF +PC IG ++FDRALCDLGASINLM  SVF+K+GL+  +                               +  D+  
Subjt:  NLSESSSTIV---------SGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--

Query:  ------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC
                    LGRPFLATG+A I V  GKLTL  ++E V F+I   ++    ++ C
Subjt:  ------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]5.7e-10238.07Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----
        MNGV  D IRLRLFPFSL+DKA+ WL+S++ GSI++W ++A+ FL KFFPPAKT +LR+EIG FRQ + E LYEAWE+YK+++R CPQ   P+ LQ    
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ----

Query:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSISTLVEGHSKKEGQ
                                                            A ++EL+  ++L AQ+ASL++ ++ LT+  + +    +          
Subjt:  ---------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSISTLVEGHSKKEGQ

Query:  DV--EEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQPPPGFASSSAPEKKNNQEEMV-------ALFIKEHRILNVSLQTSVNNHDAALK
        +   E+VQYI NR Y      +PN+YHP LRN+EN SY NTKNVLQPPPGF S  + +K + ++ MV       A F K    L+ +++T  +N  A +K
Subjt:  DV--EEVQYIGNRPY---AQGVPNFYHPSLRNYENLSYSNTKNVLQPPPGFASSSAPEKKNNQEEMV-------ALFIKEHRILNVSLQTSVNNHDAALK

Query:  NMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDK
        N+EVQIGQ+A+ +NA Q+G FPS+TE NPKEQCK + LRSGR +E+S  +E                +E     PN+                      K
Subjt:  NMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDK

Query:  KKLTEEEVV--PCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF
         K+ EEE+V       D    IS P    + P      LP+PQRF+  KLD+QF KFL++FKK+ +NIP  +AL  MPNY KF+K+++S+K+ L++ E  
Subjt:  KKLTEEEVV--PCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF

Query:  NLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY-
         LSE  S I+           GSFT+PCTIG   FD+ LCDLGASINLM  SV+RK+GL   +                                 AD+ 
Subjt:  NLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY-

Query:  -------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV
                     LGRPFLATG+A + V  G+LTL ++ E+V F+I+   +      TCF V
Subjt:  -------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSV

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.8e-7633.84Show/hide
Query:  NGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ-----
        NGV  DAIRLRLFPFSL+DKAK WL S+  GSI+TW++LAQ FL KFFPPAKT K+R +I +F Q + E LYEAWE++KE+LRRCP    P+ LQ     
Subjt:  NGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQ-----

Query:  --------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSI---STLVEGHSKKE
                                                              YE+D   +L  Q+A+L+  L+ L    V  S+       + HS  +
Subjt:  --------------------------------------------------KARLYELDESSSLKAQLASLTNALNKLTSPEVVKSI---STLVEGHSKKE

Query:  -GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQP----PPGF---ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALK
           + E VQ++GN    Q  P  N Y+P  RN+ N S+SN      P    PPGF   A    PEKK+  EE++  +I +           + +  A+L+
Subjt:  -GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQP----PPGF---ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALK

Query:  NMEVQIGQIASVVNALQKGKFPSDTEPNP--KEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQD
        N+E Q+GQ+A+ +N   +G  PSDT+ NP  KEQC+ + LRSG+ +E  +++  +   E+   E             N IE++         +K+D++ +
Subjt:  NMEVQIGQIASVVNALQKGKFPSDTEPNP--KEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQD

Query:  DKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF
        ++   T + + P                           PFPQR +  KL++QFQKFL +FKKL +NIP  EAL  MP+Y KF+K++LS+K+ L + E  
Subjt:  DKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVF

Query:  NLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY-
         L+E  S I+           GSFT+PCTIG + F +AL DLGASINLM +S+F K+GL   +                                  D+ 
Subjt:  NLSESSSTIVS----------GSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY-

Query:  -------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC
                     LGRPFLAT  A I V  GK++  + +E V F+IF   +   S + C
Subjt:  -------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTC

A0A6P6TF62 Reverse transcriptase6.2e-7032.58Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQKARL
        MNGV  +AIRLRLFPFSL+DKAK WL S    + +TWD+L++AFL K+FPP KT KLR +I  F Q+  E LYEAWE+++++LR+CP    PE L     
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQKARL

Query:  Y-----------------------ELDESSSLKAQLASLTNALNK------LTSPEVVKSISTLVEGHSKKEGQDVEEVQYIGN-RPYAQGVP--NFYHP
        Y                       E+D  + L AQ+ ++   L++        S   +   S     H   E  D E+VQ++ N    AQ  P  N Y+P
Subjt:  Y-----------------------ELDESSSLKAQLASLTNALNK------LTSPEVVKSISTLVEGHSKKEGQDVEEVQYIGN-RPYAQGVP--NFYHP

Query:  SLRNYENLSYSNTKNVLQP--PPGFAS-SSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQ
          RN+ N  + +  N  +P  PPGF S     E K   E  V    K        ++  ++      +N+EVQIGQIAS +N   +G+ PS TE NPKE 
Subjt:  SLRNYENLSYSNTKNVLQP--PPGFAS-SSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQ

Query:  CKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTF
         K + LRSG++LE       K+ +  E  E Q+                    N+ A  + + RQ+ +      E  P +     ++I +P         
Subjt:  CKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTF

Query:  DFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVS
            +PFPQR K  K D+ F+KF++MFK+L +NIP  +A+  +P+Y KF+KE+++RK+ L+  E   L+E  S I+           GSF++PCTIG ++
Subjt:  DFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVS

Query:  FDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--------------LGRPFLATGKAEISVHTGKLT
        F +ALCDLGAS++L+  +V R++GL   +                               I  D+              LGRPFLAT    I V  GKL 
Subjt:  FDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--------------LGRPFLATGKAEISVHTGKLT

Query:  LNIDDEKVVFSIFGQDESVCSLHTCFSVGP-EYLTDDDEEVGYNLGLGLGEMLIDNGDFDHDA
          I +E+V F++   ++        +S+G  E LT +  +V ++    L   L+ +G  D D+
Subjt:  LNIDDEKVVFSIFGQDESVCSLHTCFSVGP-EYLTDDDEEVGYNLGLGLGEMLIDNGDFDHDA

A0A6P6X9H2 Reverse transcriptase1.8e-6932.43Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQKARL
        MNGV  +AIRLRLFPFSL+DKAK WL S    + +TWD+L++AFL K+FPP KT KLR +I  F Q+  E LYE WE+++++LR+CP    PE L     
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQKARL

Query:  Y-----------------------ELDESSSLKAQLASLTNALNK------LTSPEVVKSISTLVEGHSKKEGQDVEEVQYIGN-RPYAQGVP--NFYHP
        Y                       E+D  + L AQ+ ++   L++        S   +   S     H   E  D E+VQ++ N    AQ  P  N Y+P
Subjt:  Y-----------------------ELDESSSLKAQLASLTNALNK------LTSPEVVKSISTLVEGHSKKEGQDVEEVQYIGN-RPYAQGVP--NFYHP

Query:  SLRNYENLSYSNTKNVLQP--PPGFAS-SSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQ
          RN+ N  + +  N  +P  PPGF S     E K   E  V    K        ++  ++      +N+EVQIGQIAS +N   +G+ PS TE NPKE 
Subjt:  SLRNYENLSYSNTKNVLQP--PPGFAS-SSAPEKKNNQEEMVALFIKEHRILNVSLQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQ

Query:  CKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTF
         K + LRSG++LE       K+ +  E  E Q+                    N+ A  + + RQ+ +      E  P +     ++I +P         
Subjt:  CKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTF

Query:  DFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVS
            +PFPQR K  K D+ F+KF++MFK+L +NIP  +A+  +P+Y KF+KE+++RK+ L+  E   L+E  S I+           GSF++PCTIG ++
Subjt:  DFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKK-EVFNLSESSSTIVS----------GSFTVPCTIGEVS

Query:  FDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--------------LGRPFLATGKAEISVHTGKLT
        F +ALCDLGAS++L+  +V R++GL   +                               I  D+              LGRPFLAT    I V  GKL 
Subjt:  FDRALCDLGASINLMSYSVFRKIGLSGGQ-------------------------------IGADY--------------LGRPFLATGKAEISVHTGKLT

Query:  LNIDDEKVVFSIFGQDESVCSLHTCFSVGP-EYLTDDDEEVGYNLGLGLGEMLIDNGDFDHDA
          I +E+V F++   ++        +S+G  E LT +  +V ++    L   L+ +G  D D+
Subjt:  LNIDDEKVVFSIFGQDESVCSLHTCFSVGP-EYLTDDDEEVGYNLGLGLGEMLIDNGDFDHDA

A0A6P8DD93 uncharacterized protein LOC1162064534.6e-6529.93Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELL-----
        MN V  D IRL+LFPFSL+DKA+ W  S+   SI+TW +L+  FL +FFPPA+T +LR EI  F + N E LYEAWE++KE +R+CP    P+ L     
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELL-----

Query:  -------------------------------------------------QKARLYELDESSSLKAQLASLTNALNKLTSPEVVK----SISTLVEG-HSK
                                                         + A + ++D  ++L  Q+++LT  ++KLTS         +   L  G HS 
Subjt:  -------------------------------------------------QKARLYELDESSSLKAQLASLTNALNKLTSPEVVK----SISTLVEG-HSK

Query:  KE--------GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQPPPGF-----ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVN
         E          + E+V ++ N   +   P  N Y+P  RN+ N S+ N  N L+PPPGF     A ++ P++  ++ E + L   +         T + 
Subjt:  KE--------GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQPPPGF-----ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVN

Query:  NHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKE
        N  A ++N+E QI QI+  ++    G  PS+TE NPK     ++LRSG+ LE  +++ + + +  E+ + ++                         + E
Subjt:  NHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKE

Query:  DERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKS--
        + RQ   K L  +  VP                          +PFP+R K  +LD QF KFL++FKKL +NIP  EAL  MP+Y +FMK++L++K+   
Subjt:  DERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKS--

Query:  --------------LKKEVFNLSESSSTIVSGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ---------------------------
                      L+K++ NL         GSFTVPCTIG   F+  L D GASINLM  S+FRK+GL   +                           
Subjt:  --------------LKKEVFNLSESSSTIVSGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ---------------------------

Query:  ----IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG--PEYLTDDDEE
               D+              LGRPFLATGKA I V  GKLTL + +E++ F+++   +      +C+++    E +++  EE
Subjt:  ----IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG--PEYLTDDDEE

A0A6P8DKJ2 uncharacterized protein LOC1162042311.9e-6630.22Show/hide
Query:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELL-----
        MN V  D IRL+LFPFSL+DKA+ W  S+   SI+TW +L+  FL +FFPPA+T +LR EI  F + N E LYEAWE++KE +R+CP    P+ L     
Subjt:  MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELL-----

Query:  -------------------------------------------------QKARLYELDESSSLKAQLASLTNALNKLTSPEVVK----SISTLVEG-HSK
                                                         + A + ++D  ++L  Q+++LT  ++KLTS         +   L  G HS 
Subjt:  -------------------------------------------------QKARLYELDESSSLKAQLASLTNALNKLTSPEVVK----SISTLVEG-HSK

Query:  KE--------GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQPPPGF-----ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVN
         E          + E+V ++ N   +   P  N Y+P  RN+ N S+ N  N L+PPPGF     A ++ P++  ++ E + L   +         T + 
Subjt:  KE--------GQDVEEVQYIGNRPYAQGVP--NFYHPSLRNYENLSYSNTKNVLQPPPGF-----ASSSAPEKKNNQEEMVALFIKEHRILNVSLQTSVN

Query:  NHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKE
        N  A ++N+E QI QI+  ++    G  PS+TE NPK     ++LRSG+ LE                                        N+ A  +E
Subjt:  NHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKE

Query:  DERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKS--
        +  + DK K   EE               P+R+          +PFP R K  +LD QF KFL++FKKL +NIP  EAL  MP+Y +FMK++L++K+   
Subjt:  DERQDDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKS--

Query:  --------------LKKEVFNLSESSSTIVSGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ---------------------------
                      L+K++ NL         GSFTVPCTIG   F+  L D GASINLM  S+FRK+GL   +                           
Subjt:  --------------LKKEVFNLSESSSTIVSGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQ---------------------------

Query:  ----IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG--PEYLTDDDEE
               D+              LGRPFLATGKA I V  GKLTL + +E++ F+++   +      +C+++    E +++  EE
Subjt:  ----IGADY--------------LGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVG--PEYLTDDDEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGGAGTTCTAGCAGACGCGATAAGATTAAGGTTGTTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACAGGCAGCATCAGTACTTG
GGACGAGCTTGCCCAAGCTTTTCTGACGAAATTTTTCCCACCTGCTAAGACTACCAAGCTCCGGACTGAAATCGGAACATTTAGGCAGCTTAACGAGGAACAGTTGTACG
AAGCGTGGGAAAAATACAAAGAAATGCTTAGGCGGTGCCCCCAAAGCAGATATCCTGAATTGCTTCAGAAGGCTAGATTATATGAATTGGATGAGTCAAGTTCACTGAAA
GCACAACTGGCATCTCTGACCAATGCACTGAACAAATTGACTTCACCTGAGGTGGTTAAGTCCATTTCCACCTTAGTTGAAGGTCATTCAAAGAAGGAAGGTCAAGATGT
GGAAGAAGTTCAATACATAGGAAATAGACCATATGCTCAAGGAGTACCAAACTTCTACCACCCCAGTCTGCGCAATTACGAGAACTTATCATATTCGAATACGAAGAATG
TTTTGCAGCCACCCCCAGGTTTTGCATCATCTAGTGCTCCTGAAAAGAAAAATAACCAGGAGGAGATGGTGGCTTTATTCATCAAAGAACATAGAATATTGAATGTGAGT
CTCCAGACATCAGTAAACAACCACGACGCAGCTCTAAAGAACATGGAAGTACAGATAGGACAGATCGCTTCAGTAGTAAATGCCCTTCAGAAGGGAAAATTTCCAAGTGA
TACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGATTCTGAGAAGTGGCAGAAGACTGGAGAAGAGTTCAAAGGAAGAAGAAAAGAGAAGGGATGAATATGAAAGGA
CTGAGGCACAAAAAGCCTCCTCTGAAAGGTTCCAACGTCCTCCCAACTCTATTGAATTAAAATGTGATTTTTCTAACAAACTTGCAGATAGAAAAGAAGATGAGAGGCAG
GATGACAAAAAGAAGCTGACTGAGGAAGAAGTGGTTCCATGCAACCACCATGACAGAGGTTCGCATATTAGCCTGCCCAAGCGGAGGGGCGAATGTCCAACCTTTGATTT
CAGGGAGTTACCTTTTCCTCAAAGATTTAAAAATGTCAAATTAGATGAACAGTTTCAAAAGTTTCTAGAAATGTTTAAAAAGTTGTCTGTGAATATTCCATTGGTAGAAG
CCTTGTATAATATGCCAAATTATGGGAAATTCATGAAGGAAATGCTTTCAAGAAAGAAAAGTCTGAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATTGTT
TCTGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCTTTATGTGATTTAGGAGCAAGTATAAATTTGATGTCCTACTCTGTTTTCAGGAAGAT
TGGTTTATCAGGAGGACAAATAGGTGCCGATTATTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAACTTACCTTGAACATTGATG
ATGAAAAAGTTGTGTTTAGTATTTTTGGCCAAGATGAATCTGTTTGTAGTTTGCACACATGTTTTTCTGTTGGTCCTGAATACTTGACTGATGACGATGAAGAGGTAGGC
TATAATCTTGGCCTAGGCTTAGGAGAAATGCTTATAGATAATGGGGATTTTGATCATGATGCATATTTGGATAATCCTCTGTTTGAAAATGATTTGGATCTGCCTGACTT
TGAGAATGAATTAGACTTACCTGCTTGTGAAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTTCCTGAAATGGATAATTTTGATGATG
ATATTGAATTGCCTGACATTGAGGATGAACATGAAATGCATAAAAAAGATTGCTTAATAGATAATTCTGAGTCTGATATGATTACAGTGAATCTATTGAGTCTGATCTTG
ACATTGCTGAGTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGGAGTTCTAGCAGACGCGATAAGATTAAGGTTGTTTCCATTTTCTCTACAGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACAGGCAGCATCAGTACTTG
GGACGAGCTTGCCCAAGCTTTTCTGACGAAATTTTTCCCACCTGCTAAGACTACCAAGCTCCGGACTGAAATCGGAACATTTAGGCAGCTTAACGAGGAACAGTTGTACG
AAGCGTGGGAAAAATACAAAGAAATGCTTAGGCGGTGCCCCCAAAGCAGATATCCTGAATTGCTTCAGAAGGCTAGATTATATGAATTGGATGAGTCAAGTTCACTGAAA
GCACAACTGGCATCTCTGACCAATGCACTGAACAAATTGACTTCACCTGAGGTGGTTAAGTCCATTTCCACCTTAGTTGAAGGTCATTCAAAGAAGGAAGGTCAAGATGT
GGAAGAAGTTCAATACATAGGAAATAGACCATATGCTCAAGGAGTACCAAACTTCTACCACCCCAGTCTGCGCAATTACGAGAACTTATCATATTCGAATACGAAGAATG
TTTTGCAGCCACCCCCAGGTTTTGCATCATCTAGTGCTCCTGAAAAGAAAAATAACCAGGAGGAGATGGTGGCTTTATTCATCAAAGAACATAGAATATTGAATGTGAGT
CTCCAGACATCAGTAAACAACCACGACGCAGCTCTAAAGAACATGGAAGTACAGATAGGACAGATCGCTTCAGTAGTAAATGCCCTTCAGAAGGGAAAATTTCCAAGTGA
TACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGATTCTGAGAAGTGGCAGAAGACTGGAGAAGAGTTCAAAGGAAGAAGAAAAGAGAAGGGATGAATATGAAAGGA
CTGAGGCACAAAAAGCCTCCTCTGAAAGGTTCCAACGTCCTCCCAACTCTATTGAATTAAAATGTGATTTTTCTAACAAACTTGCAGATAGAAAAGAAGATGAGAGGCAG
GATGACAAAAAGAAGCTGACTGAGGAAGAAGTGGTTCCATGCAACCACCATGACAGAGGTTCGCATATTAGCCTGCCCAAGCGGAGGGGCGAATGTCCAACCTTTGATTT
CAGGGAGTTACCTTTTCCTCAAAGATTTAAAAATGTCAAATTAGATGAACAGTTTCAAAAGTTTCTAGAAATGTTTAAAAAGTTGTCTGTGAATATTCCATTGGTAGAAG
CCTTGTATAATATGCCAAATTATGGGAAATTCATGAAGGAAATGCTTTCAAGAAAGAAAAGTCTGAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATTGTT
TCTGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCTTTATGTGATTTAGGAGCAAGTATAAATTTGATGTCCTACTCTGTTTTCAGGAAGAT
TGGTTTATCAGGAGGACAAATAGGTGCCGATTATTTAGGCAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACAGGTAAACTTACCTTGAACATTGATG
ATGAAAAAGTTGTGTTTAGTATTTTTGGCCAAGATGAATCTGTTTGTAGTTTGCACACATGTTTTTCTGTTGGTCCTGAATACTTGACTGATGACGATGAAGAGGTAGGC
TATAATCTTGGCCTAGGCTTAGGAGAAATGCTTATAGATAATGGGGATTTTGATCATGATGCATATTTGGATAATCCTCTGTTTGAAAATGATTTGGATCTGCCTGACTT
TGAGAATGAATTAGACTTACCTGCTTGTGAAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTTCCTGAAATGGATAATTTTGATGATG
ATATTGAATTGCCTGACATTGAGGATGAACATGAAATGCATAAAAAAGATTGCTTAATAGATAATTCTGAGTCTGATATGATTACAGTGAATCTATTGAGTCTGATCTTG
ACATTGCTGAGTGCATGA
Protein sequenceShow/hide protein sequence
MNGVLADAIRLRLFPFSLQDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLNEEQLYEAWEKYKEMLRRCPQSRYPELLQKARLYELDESSSLK
AQLASLTNALNKLTSPEVVKSISTLVEGHSKKEGQDVEEVQYIGNRPYAQGVPNFYHPSLRNYENLSYSNTKNVLQPPPGFASSSAPEKKNNQEEMVALFIKEHRILNVS
LQTSVNNHDAALKNMEVQIGQIASVVNALQKGKFPSDTEPNPKEQCKMVILRSGRRLEKSSKEEEKRRDEYERTEAQKASSERFQRPPNSIELKCDFSNKLADRKEDERQ
DDKKKLTEEEVVPCNHHDRGSHISLPKRRGECPTFDFRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSRKKSLKKEVFNLSESSSTIV
SGSFTVPCTIGEVSFDRALCDLGASINLMSYSVFRKIGLSGGQIGADYLGRPFLATGKAEISVHTGKLTLNIDDEKVVFSIFGQDESVCSLHTCFSVGPEYLTDDDEEVG
YNLGLGLGEMLIDNGDFDHDAYLDNPLFENDLDLPDFENELDLPACENERSAVDDLPSFENELDFPEMDNFDDDIELPDIEDEHEMHKKDCLIDNSESDMITVNLLSLIL
TLLSA