; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039668 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039668
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:47988762..47992645
RNA-Seq ExpressionLag0039668
SyntenyLag0039668
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023929193.1 uncharacterized protein LOC112040541 [Quercus suber]3.0e-8030.86Show/hide
Query:  CRGTKVQAGSSTEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRV
        CRG  +    + EE+   I       VFL ET +++K    ++ + +F+  F V +E   GGL +LW   V V ++S+S  HID IVN  T   WR    
Subjt:  CRGTKVQAGSSTEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRV

Query:  YGNPDPRKHADSWDLLKRL-------------------------------SLLTQGSKVLVSLGGVGQEVSRP-------RGDNIIKERLDKFLLNQDLL
        YG P+     + W++L+ L                               SL+     VL   G V    S P       R   +I ERLD+ + N D L
Subjt:  YGNPDPRKHADSWDLLKRL-------------------------------SLLTQGSKVLVSLGGVGQEVSRP-------RGDNIIKERLDKFLLNQDLL

Query:  NKVEKIIVERRPFHSSYHRPILATIVHKEERSNYGGSMRRP-KFECSWVNFEACRDIIRKCWR-DHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWA
        ++     V+    ++S HRPIL ++    E   +    R+P +FE  W+    C  +I   W  + +GS ++    +V  C + L  WN DR    L   
Subjt:  NKVEKIIVERRPFHSSYHRPILATIVHKEERSNYGGSMRRP-KFECSWVNFEACRDIIRKCWR-DHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWA

Query:  IERQLKVIQDIDSRGLVNDVLLISAEKALKRLLDEEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNK
          +++K  ++          LL  AE+   R  +  E K+RN IKG+ +  G W E+E+ I  + T YF  LFTS  S    LDR+L GV   V++D   
Subjt:  IERQLKVIQDIDSRGLVNDVLLISAEKALKRLLDEEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNK

Query:  NLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALA
        +L +P+ K EVEFA+K M P KA G D    +F+Q YW  VG ++S+  L  LN    ++ IN T I+LIPK++ PE +T FRPISLC+V+YKI++K +A
Subjt:  NLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALA

Query:  NRC-----------------------------------------------------GSEKI---------------MNCLQSMSYSVLINRIPQKEFAPG
        NR                                                      G   I               +N  +++  S+ ++ I  + +   
Subjt:  NRC-----------------------------------------------------GSEKI---------------MNCLQSMSYSVLINRIPQKEFAPG

Query:  RGMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
         G+ S  G+ K   F+++KE++W  +Q WKEK+ S  GKEV+IK++ +AIP YS+S F+LP ++C++I  M    WWG        K HW  W  LC SK
Subjt:  RGMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]2.3e-6427.12Show/hide
Query:  MTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTGR-WRFMRVYGNPDPRKHADSWDLLKRL
        M  P L VFL ET+   +  +++  +L +E+ F V + G  GGL LLWNS+  V I+S+++ HIDA +  + G+  R   VYG+PD R+   +W LL++L
Subjt:  MTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTGR-WRFMRVYGNPDPRKHADSWDLLKRL

Query:  SLLTQGSKVLVSLGGVGQEVSRPR-GDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIVHKEERSNYGGSMRRP---KFECSWVNFEAC
        S         +S  G     S  R G   ++ERLD+F+ N    +             +S H P++  +  +    N+  + RR     +E  W  ++ C
Subjt:  SLLTQGSKVLVSLGGVGQEVSRPR-GDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIVHKEERSNYGGSMRRP---KFECSWVNFEAC

Query:  RDIIRK------CWRDHQGSGLISINNRVI-NCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGL--VNDVLLISAEKALKRLLDEEEL--------
        ++II K      CW        +S+  +V  N + +L  W+ +  +G  K  +E+ +  ++ +    +  V    +   E+ ++ +L ++E+        
Subjt:  RDIIRK------CWRDHQGSGLISINNRVI-NCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGL--VNDVLLISAEKALKRLLDEEEL--------

Query:  --------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSM
                            KK+N I GI N  G W+E  EG+     KYF  LFT+SK +++ +   L G+S RVS + N++L  PF+  EV  AL  M
Subjt:  --------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSM

Query:  NPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRCGS-------------
         PTKA G D   A+FFQ +W  V + V   CL +LN+  DV P N T I LI K   P  +T FRPISLC+V+Y+I+AKA+ANR  +             
Subjt:  NPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRCGS-------------

Query:  -----------------EKIMNC-----------------------------LQSMSYS-----------------------------------------
                          KI +C                             ++S+ +S                                         
Subjt:  -----------------EKIMNC-----------------------------LQSMSYS-----------------------------------------

Query:  ----------------------------VLINRIPQKEFAPGR-------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKM
                                     L N      F  G                          G+ S  G  +S  F+ IK K+   +  W+ K 
Subjt:  ----------------------------VLINRIPQKEFAPGR-------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKM

Query:  FSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
        FS GGKEVLIK+ T+AIP Y++S F++P  IC++I ++ AN WWG+ +       HW+ W KL  +K
Subjt:  FSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]7.0e-6927.41Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQG-
        VFL ET++     + +++ L +E  F V  +G SGGL LLW     V + S++  HIDA +N +    WRF   Y +PDP +   SW LLKR++    G 
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQG-

Query:  ----------SKVLVSLGGVG------------------QEVS---------RPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATI-
                   ++   LGG G                  +E+            R  N+I ERLD+ ++N +         V+      S H P+L T  
Subjt:  ----------SKVLVSLGGVG------------------QEVS---------RPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATI-

Query:  -----VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQGSGLISINNR---VINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGLVND
               K+ER  Y     R  +E +W + E C+ II+  W+  +G+ + S  +    + NC   L +WN  + K +L    + + +V +  +       
Subjt:  -----VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQGSGLISINNR---VINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGLVND

Query:  VLLISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVS
          L   EK L   L +EEL                            +K+N I G+ ++  +W   +E I      +FQ LF+++       + +   V 
Subjt:  VLLISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVS

Query:  IRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVV
        +R S  QN  L+  F+ A++   L  +N  KA   D    +F++N+W+++G +++K+CL +LN N D R +NKT + LIPK++ P+ +  +RPISLC+V 
Subjt:  IRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVV

Query:  YKIIAKALANRCGS----------------------------------------------------------------------------EKIMNCLQSM
        YKIIAK LANR                                                                               +KIMNC++S+
Subjt:  YKIIAKALANRCGS----------------------------------------------------------------------------EKIMNCLQSM

Query:  SYSVLINRIPQKEFAPGRGMS--------------------SQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNN
        S+S+L+N     E   G G+S                    +  G  K  +F  I+ K+    Q WK  +FS  G+E+L+K+I +AIP Y +S FRLP  
Subjt:  SYSVLINRIPQKEFAPGRGMS--------------------SQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNN

Query:  ICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
        + ++I+ M A  WWG  SS +  K HW +W KLC  K
Subjt:  ICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]2.7e-6525.78Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQGS
        +F++ET+      DK + +L F +   VP +G  GGL LLW S V +SI +YS  HID  V S  G    F   YG+P   +   +W LL+R S +   S
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQGS

Query:  KVLV------------SLGGV-------------------------GQE---VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILAT
           V             +GG                          GQ     ++  G   +KERLD   +N    +      ++   F+ S HR + AT
Subjt:  KVLV------------SLGGV-------------------------GQE---VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILAT

Query:  IVHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIE---RQLKVIQDIDSRGLVNDVLLI
        I  K +++       R +FE  W+  EAC DII   W     S L+ + N + +C   L +W+     G+L+  I+   + +  +Q+  S    +   L 
Subjt:  IVHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIE---RQLKVIQDIDSRGLVNDVLLI

Query:  SAEKALKRLLDEEE----------------------------LKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVS
        ++E+ L  LL +EE                                N IK + +  G     E  +  I + YFQ++FTS + D  A++ IL  V   + 
Subjt:  SAEKALKRLLDEEE----------------------------LKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVS

Query:  EDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKII
        E+    +  PF+ +EV  AL SM+  K+ G D    MFF NYW+IVG  V++  L VLN   D    N T I+LIPK++ P+ +T +RPISLC+V+YK++
Subjt:  EDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKII

Query:  AKALANRC----------------------------------------GSE------------------------------------KIMNCLQSMSYSV
        +K++  R                                         GS+                                     I+ CLQS+SYS 
Subjt:  AKALANRC----------------------------------------GSE------------------------------------KIMNCLQSMSYSV

Query:  LINRIPQKEFAPGR--------------------------------------------------------------------------------------
        L+N     +  P R                                                                                      
Subjt:  LINRIPQKEFAPGR--------------------------------------------------------------------------------------

Query:  ----------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEE
                                          G+ S +G +K+R+FS I +K+WK+L  WKE++FS GGKEVL+K++ +AIP Y++S FRLP  +C +
Subjt:  ----------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEE

Query:  INKMCANLWWGFNSSSSNPKAHWASWKKLCVSKV
        I  M AN WWG  S++S    HW +W  LC +KV
Subjt:  INKMCANLWWGFNSSSSNPKAHWASWKKLCVSKV

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]1.5e-6826.32Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVN-SKTGRWRFMRVYGNPDPRKHADSWDLLKRLS------
        +FL+ET+  +   ++L+  LKF++ F VP   L GGL LLWN+ +N+ I ++S  HIDA+VN      WRF   YG P+     DSW +L+ L       
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVN-SKTGRWRFMRVYGNPDPRKHADSWDLLKRLS------

Query:  ---------------------------------LLTQGSKVLVSLGGVGQEVSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATI
                                         L   G K L   G      +R   + ++  RLD+ +   D + K     +   P  SS H+P+   +
Subjt:  ---------------------------------LLTQGSKVLVSLGGVGQEVSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATI

Query:  VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCW-RDHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAI--ERQLKVIQDIDSRGLVNDVLLIS
        V  + ++ +  + +  +FE  W+N E C  ++   W R+  G  +  I ++V  C  +L  W+ + + G+++ A+   R+L VI + DS    N   +  
Subjt:  VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCW-RDHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAI--ERQLKVIQDIDSRGLVNDVLLIS

Query:  AEKALKRLLDEE----------------------------ELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSE
          + + RL+D E                            E  K+N I G+ N  G W+E EE I E+ T Y+  LF  S  +   L+ +L GV  RVSE
Subjt:  AEKALKRLLDEE----------------------------ELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSE

Query:  DQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIA
          N  L++PF   EV  ALK M P  A G D    +F++N+W+ VG EVS+  L VLN       +N T ISLIPK + P  +T FRPISLC+V+YK+IA
Subjt:  DQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIA

Query:  KALANRCGS--------------------------------------------------------------EKIMN--------------CLQSMSYSVL
        K LANR                                                                 EKIM               C++S++YS++
Subjt:  KALANRCGS--------------------------------------------------------------EKIMN--------------CLQSMSYSVL

Query:  INRIPQKEFAPGRGM-------------------------------------------------------------------------------------
        +N  P     P RG+                                                                                     
Subjt:  INRIPQKEFAPGRGM-------------------------------------------------------------------------------------

Query:  -----SSQT------------------------------GNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEI
             SS T                              G +K + FS IKE++WK L+ WKE++ S  G+E+L+K++ +AIPIY++S FRLP  +  +I
Subjt:  -----SSQT------------------------------GNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEI

Query:  NKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
          +    WWG+       K HW SW+KLC  K
Subjt:  NKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

TrEMBL top hitse value%identityAlignment
A0A2N9GN35 Uncharacterized protein5.2e-7028.46Show/hide
Query:  AGSSTEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKT-GRWRFMRVYGNPDPR
        +  +  E+ + +     L +FL ET+   K  + L+    F   F VP+ G SGGL LLW+ +  VSI SYS  HIDA+++    G W F   YG+P   
Subjt:  AGSSTEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKT-GRWRFMRVYGNPDPR

Query:  KHADSWDLLKRLSLLTQGSKVLVSLGGVGQE----------VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIVHKEERSNYG
          A +WDLL+   +L     +    GG   E          V+RP      + ++ +F L  D    ++   V       S HRP+   +  K       
Subjt:  KHADSWDLLKRLSLLTQGSKVLVSLGGVGQE----------VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIVHKEERSNYG

Query:  GSMRRPKFECSWVNFEACRDIIRKCWRDHQ-GSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGLVN-DVLLISAEKALKRLLD-
           RR +FE  W   E C D IRK W   Q G+ +  +  ++    E+L KW+ D+  GS++ AI  + +++Q  +     N +V LI  +K  + L D 
Subjt:  GSMRRPKFECSWVNFEACRDIIRKCWRDHQ-GSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGLVN-DVLLISAEKALKRLLD-

Query:  ---EEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHA
           EE++++RNHI GI ++ GVW  ++E +  I   Y++ LFT+S       D IL GV   V+ + N  L   F++ EVE+ LK M P KA G D    
Subjt:  ---EEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHA

Query:  MFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRC----------------------------
        +F+Q YW IVG++++   L  L +   ++ IN T + LIPK++  ES+  FRPISLC+VVYKIIAK LANR                             
Subjt:  MFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRC----------------------------

Query:  -----------GSEK-------------------------------------IMNCLQSMSYSVLINRIPQKEFAPGR----------------------
                   G+++                                     IM C++++SYSVLIN  P+  F P R                      
Subjt:  -----------GSEK-------------------------------------IMNCLQSMSYSVLINRIPQKEFAPGR----------------------

Query:  --------------------------------------------------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQ
                                                                                  G+ S  G +K   F ++KE+VW  +Q
Subjt:  --------------------------------------------------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQ

Query:  RWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
         WK K+ S  GKEVLIK++ +AIP Y+++ F+LP  +C E+ +M  + WWG    S   K HW +W  LC  K
Subjt:  RWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

A0A2N9GN35 Uncharacterized protein1.9e-1131.11Show/hide
Query:  GNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENPHRIRVE
        GNN     F+   D ER++  GPW FDK L++   ++D+   S +     SFW+QI DL +   +      +GN++ +   ++    G    N  R+RV+
Subjt:  GNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENPHRIRVE

Query:  IDIQDPLCRGTKVQAGSSTEEMWMPIMTNPSLPVF
        +D+  PLCRG K+  G   ++ W+       LP+F
Subjt:  IDIQDPLCRGTKVQAGSSTEEMWMPIMTNPSLPVF

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.8e-7527.37Show/hide
Query:  STEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDA-IVNSKTGR-WRFMRVYGNPDPRKH
        +  E+ + I       +FL ET+  +   ++L++ +KF+ +F VP  G  GGL +LW +K++V + +YS+ HIDA IV+ + G+ +R    YGNP+  K 
Subjt:  STEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDA-IVNSKTGR-WRFMRVYGNPDPRKH

Query:  ADSWDLLKRLSLLTQGSKV------------------------------------LVSLGGVGQEVSRPR---GDNIIKERLDKFLLNQDLLNKVEKIIV
         +SW LLK LS L+    +                                    L  LG VG   +  R   G+ ++  RLD+ + +   L   +  +V
Subjt:  ADSWDLLKRLSLLTQGSKV------------------------------------LVSLGGVGQEVSRPR---GDNIIKERLDKFLLNQDLLNKVEKIIV

Query:  ERRPFHSSYHRPILATIVHKEERSNYGGSMRRPK----FECSWVNFEACRDIIRKCWRD--HQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQ
              +S H PIL  I         G  ++R K    FE  W+  E CR++I   W D   +GS +  +  ++  C   L  W+ +R  GSL  +I+R+
Subjt:  ERRPFHSSYHRPILATIVHKEERSNYGGSMRRPK----FECSWVNFEACRDIIRKCWRD--HQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQ

Query:  LKVIQDIDSR---GLVNDVLLISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLF
         + +Q + +    G    +L +  +  L  LL++EE+                            ++ NHI G+ + +GVW  E+  I EI   YFQ +F
Subjt:  LKVIQDIDSR---GLVNDVLLISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLF

Query:  TSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKL
        TSS    E++  +L G+   V+   N  L   F+K EV  ALK M PTKA G D   A+F+Q YWDIVG EV++  L +L+    +R IN T I+LIPK+
Subjt:  TSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKL

Query:  EVPESMTHFRPISLCSVVYKIIAKALANR-----------------------------------------------------------------------
        + PE++T FRPISLC+V+YKI++K LANR                                                                       
Subjt:  EVPESMTHFRPISLCSVVYKIIAKALANR-----------------------------------------------------------------------

Query:  CGSEK-----IMNCLQSMSYSVLINR--------------------------------------------------------------------------
         G  K     +M CL+S+SYSVLIN                                                                           
Subjt:  CGSEK-----IMNCLQSMSYSVLINR--------------------------------------------------------------------------

Query:  -------IPQKEFAPGR---------------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKS
               + Q E+A G+                                       G+ S  G +KS  F  IK +VW+ +  WKEK  S  G+EVLIK+
Subjt:  -------IPQKEFAPGR---------------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKS

Query:  ITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
        + ++IP YS+S F+LP ++C ++N M +N WWG +  +   KAHW  W KLC SK
Subjt:  ITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.1e-0828.46Show/hide
Query:  IKIKSEGNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENP
        I ++  G+N+F  +F +  +++R++ G PW FD  LL   E   +   S +Q  +  FW+    + + Y ++     +G ++   +DVDV E+G      
Subjt:  IKIKSEGNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENP

Query:  HRIRVEIDIQDPLCRGTKVQAGS
         R+R+ +D   P+ RG  V  GS
Subjt:  HRIRVEIDIQDPLCRGTKVQAGS

A0A2N9IPS8 Reverse transcriptase domain-containing protein7.0e-7531.4Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQ--
        +FL ET+ +    + ++++L +   F V   G SGGL LLW S   V I+S++  HIDA+V +  G  WRF   YG+PDP    +SW LL RL  +    
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQ--

Query:  -------------------GSKVLVSLGGVGQEVS-------RPRG----------DNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIV
                           G K    +    Q +S        P+G           N+I E+LD  L N D +N  +        + +S HRPIL    
Subjt:  -------------------GSKVLVSLGGVGQEVS-------RPRG----------DNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIV

Query:  HKEERSNYGGSMRRPK------FECSWVNFEACRDIIRKCWR-DHQGSGLISINNRVINCLEKLAKWN----------NDRLKGSLKWA---------IE
          E      GS+ R K      +E +W + E C  II + WR    G  + ++ + + NC ++L +WN          +  LKG + W          I 
Subjt:  HKEERSNYGGSMRRPK------FECSWVNFEACRDIIRKCWR-DHQGSGLISINNRVINCLEKLAKWN----------NDRLKGSLKWA---------IE

Query:  RQLKVIQDIDSRGLVNDVLLISAEKALKRLLDEEELK----------KRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSI
        RQ K+ +D++      ++      +AL     +   K          K++ I+G+ N+ G W   E  I E+   YFQ LF+SS    +  + +   +  
Subjt:  RQLKVIQDIDSRGLVNDVLLISAEKALKRLLDEEELK----------KRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSI

Query:  RVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVY
        R+SE  N+ L  PF+  E++ +L  ++P KA G+D    MFF N WD++ ++ +  CL +LN+N D + IN+T I LI K++ P  M+ FRPISLC+VVY
Subjt:  RVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVY

Query:  KIIAKALANRCGSEKIMNCLQSMSYSVLI-NRIPQKEFAPGR----------------------------------GMSSQTGNNKSRIFSRIKEKVWKV
        K+++K LANR   +  MN   S + S  I  RI Q      R                                  GM S  G NK ++F  I++KV   
Subjt:  KIIAKALANRCGSEKIMNCLQSMSYSVLI-NRIPQKEFAPGR----------------------------------GMSSQTGNNKSRIFSRIKEKVWKV

Query:  LQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLC
        L+ WK  +FS  GKEVLIK++ +AIP Y +S FR+   I +E+  M A   WG  S+++  K HW +WKKLC
Subjt:  LQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLC

A0A803QF94 Uncharacterized protein1.2e-7129.23Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDA-IVNSKTGRWRFMRVYGNPDPRKHADSWDLLKRL-------
        +FL+ET+       +L+  L + +    P  GLSGGL LLW+  V+V++ ++     D  +V      + F   YG PD    + SW L KR        
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDA-IVNSKTGRWRFMRVYGNPDPRKHADSWDLLKRL-------

Query:  ---------SLLTQGSKV---------------------LVSLGGVGQE---VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILAT
                  +L+   K                      L  +  +G     +   R  + IKERLD   +N    +      VE   ++ S HR I+A+
Subjt:  ---------SLLTQGSKV---------------------LVSLGGVGQE---VSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILAT

Query:  IVHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQG-SGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDS---RGLVNDVLL
                N   S  R +FE  W+      +II + W D      + ++   + +C +KL  W+ ++  G +K  I+     + D+++   R       L
Subjt:  IVHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRDHQG-SGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDS---RGLVNDVLL

Query:  ISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRV
         SAE+ L+ LL++EE+                            K  N IK + N+ G  V+    I  +   YF  +F++S  D +AL   L  +   V
Subjt:  ISAEKALKRLLDEEEL----------------------------KKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRV

Query:  SEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKI
        ++  N +L+RPF+ AEV  AL++M P K+ G D   AMF+Q  W IVG+ V+K  L +LN+  D   +N+T I+LIPK++ P+ +  FRPISLC+V+ K+
Subjt:  SEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKI

Query:  IAKALANRCGSEKIMNCLQSMSYSVLINRI-------------PQKEFAPGR-----------------------GMSSQTGNNKSRIFSRIKEKVWKVL
        + K L +R   + ++  + S   +  ++R+               K    GR                       G+ S +G +K  +FS IKEK+WK++
Subjt:  IAKALANRCGSEKIMNCLQSMSYSVLINRI-------------PQKEFAPGR-----------------------GMSSQTGNNKSRIFSRIKEKVWKVL

Query:  QRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
          W EK+FS GG+EVL+K++ ++IP Y++S FRLP   C ++  M AN WWG N++ S  K HW SW  LC SK
Subjt:  QRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

A0A803QF94 Uncharacterized protein1.4e-0629.5Show/hide
Query:  EGNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENPHRIRV
        E   +F  KF  + DK R++   PW+F   L+V         +S     +  FW+QI  L     S+    ALGN I +FI+V  D   +      R+RV
Subjt:  EGNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENPHRIRV

Query:  EIDIQDPLCRGTKVQAGSSTEEMWMPIMTNPSLPVFLVE
        +I    PL RG  ++     ++ W+       LP F  E
Subjt:  EIDIQDPLCRGTKVQAGSSTEEMWMPIMTNPSLPVFLVE

A0A803QF94 Uncharacterized protein2.8e-7126.38Show/hide
Query:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQGS
        +FL ET+ S    + ++++L FE  F V  +G SGGL LLW + V V ++S++  HIDA+V +  G  WRF   YG+PDP    +SW LLKRL  + +G+
Subjt:  VFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTG-RWRFMRVYGNPDPRKHADSWDLLKRLSLLTQGS

Query:  KVLVSLGGVGQEVSR-----------------------------------------PRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILA
         V    GG   E+++                                          R DN+I E+LD+ L N D  +  +   V    + +S HRP+  
Subjt:  KVLVSLGGVGQEVSR-----------------------------------------PRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILA

Query:  TI-VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRD-HQGSGLISINNRVINCLEKLAKWNN----------DRLKGSLKWAIERQLKVIQDIDS
        T  +H   + N      R  FE +W   E C++II+K W + + G+ + ++   +  C EKL KWN             LK  + W + +       +  
Subjt:  TI-VHKEERSNYGGSMRRPKFECSWVNFEACRDIIRKCWRD-HQGSGLISINNRVINCLEKLAKWNN----------DRLKGSLKWAIERQLKVIQDIDS

Query:  RGLVNDVLLISAEKAL-----KRLL--------------DEEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRV
        + L ND+  +  ++ +      R L                 + K++N I+G+ ++  +W      I EI   YFQTLF+ S    E  D ++  V  R+
Subjt:  RGLVNDVLLISAEKAL-----KRLL--------------DEEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRV

Query:  SEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKI
        S ++N+ L++PF + EV  A+  ++P KA G D    +FFQ  W++VG+EV+  CL VLN   D   +N+T I LIPK + P  ++ FRPISLC+VVYK+
Subjt:  SEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKI

Query:  IAKALANR---------------------------CGSE-------------------------------------------------KIMNCLQSMSYS
        ++K LANR                            G E                                                 K+MNC++S+S+S
Subjt:  IAKALANR---------------------------CGSE-------------------------------------------------KIMNCLQSMSYS

Query:  VLINRIPQKEFAPGR-------------------------------------------------------------------------------------
        +LIN   Q  F P R                                                                                     
Subjt:  VLINRIPQKEFAPGR-------------------------------------------------------------------------------------

Query:  -----------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICE
                                           GM +  G NK  IF +I+++V   LQ WK  +FS  GKE+LIK++ +A+P Y +S FR+   I  
Subjt:  -----------------------------------GMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICE

Query:  EINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
        +I  M A  WWG N+     K HW SW+K+C  K
Subjt:  EINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.0e-0630.23Show/hide
Query:  FSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK
        F  I E+V   +  W+EK  S  G+  L K++  ++P++S+S   LP +I   ++++     WG  S++   K H   W K+C  K
Subjt:  FSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPIYSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSK

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-1127.57Show/hide
Query:  EEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFF
        E++   R  I  +  E+G  +E+ E I +    ++Q LF+      +A + +  G+ + VSE + + L  P +  E+  AL+ M   K+ G D     FF
Subjt:  EEELKKRNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFF

Query:  QNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRCGSEKIMNCLQSMSYSV
        Q +WD +G +  ++      +        +  +SL+PK      + ++RP+SL S  YKI+AKA++ R  S          SY+V
Subjt:  QNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKIIAKALANRCGSEKIMNCLQSMSYSV

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.2e-0728.66Show/hide
Query:  RNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKS--DKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYW
        +N IK +  ++ V VE    + E+   Y+  L  S       +++ RI      R ++     L    S  E+  A+ +M   KA G D   A FF   W
Subjt:  RNHIKGIVNEEGVWVEEEEGIWEITTKYFQTLFTSSKS--DKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYW

Query:  DIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKII
         +V +               ++  N T I+LIPK+   + ++ FRP+S C+VVYKII
Subjt:  DIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLEVPESMTHFRPISLCSVVYKII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGCAAAATAAAGATTAAAAGTGAAGGGAACAACATTTTCAAATGCAAGTTCAGTAGCAAAAAAGACAAGGAGCGAATAGTGGAAGGAGGCCCCTGGAACTTCGA
CAAAGGTCTGCTAGTTTTCGTGGAACTAAAAGATAACAACAAGATATCAATGCTACAGTCCAGATACGCTTCCTTTTGGATACAAATACCAGATTTGTCTATCGCTTATT
ACTCTAGGAAAAACGTGATGGCTTTAGGCAACTCAATCAGAAAGTTCATTGACGTAGATGTGGATGAAAATGGGAAATGCAAAGAAAACCCGCATAGAATCAGAGTAGAA
ATAGATATTCAAGATCCTTTGTGTCGGGGCACAAAGGTTCAAGCAGGATCATCAACGGAAGAAATGTGGATGCCAATAATGACAAACCCATCCCTACCAGTATTCCTTGT
GGAGACACAAAATAGTAGAAAGATTGAGGACAAGTTGAAAATGGAGCTGAAATTTGAGCACAGTTTCGGGGTTCCGAATGAGGGGCTTAGCGGAGGCTTATGTTTACTGT
GGAATTCGAAGGTCAACGTCAGTATTGAATCCTACTCTCAGGGCCACATAGATGCGATTGTTAATAGCAAAACGGGAAGGTGGAGATTTATGAGGGTTTACGGCAACCCG
GACCCAAGAAAGCATGCAGATTCTTGGGATCTTCTCAAAAGACTCAGCTTATTGACCCAGGGTTCAAAGGTCCTAGTTTCACTTGGAGGAGTCGGCCAAGAGGTGAGTCG
GCCAAGAGGTGATAACATCATAAAAGAAAGATTAGACAAATTCTTACTAAACCAGGATTTGCTTAACAAAGTGGAAAAGATCATTGTGGAGCGTAGGCCTTTCCACTCGT
CATACCATAGGCCCATTCTGGCGACCATTGTTCACAAAGAAGAGAGGAGCAATTATGGCGGTTCCATGAGAAGACCCAAGTTTGAATGTTCTTGGGTGAATTTTGAGGCT
TGCAGGGATATTATTCGCAAATGTTGGAGGGATCATCAAGGGTCTGGGTTGATCAGCATAAACAACAGGGTCATAAATTGCTTGGAAAAGCTGGCAAAGTGGAATAACGA
TAGGTTGAAAGGGTCTTTAAAATGGGCTATTGAAAGGCAGCTCAAAGTTATCCAAGACATTGATTCCAGGGGCCTGGTCAACGATGTGCTACTGATTTCAGCAGAAAAAG
CCCTTAAGAGGCTTCTAGATGAGGAAGAGTTGAAGAAAAGGAATCATATAAAGGGAATTGTCAATGAGGAAGGAGTTTGGGTGGAGGAAGAGGAAGGGATATGGGAGATC
ACTACAAAATACTTTCAAACACTTTTTACCTCCTCTAAGTCGGATAAGGAAGCCTTGGATAGAATCTTGGTTGGAGTCAGCATAAGGGTTTCAGAGGATCAAAACAAAAA
CCTTATGAGGCCTTTCTCAAAAGCTGAGGTTGAATTCGCTCTAAAAAGTATGAACCCAACTAAGGCATCGGGCTCGGATGAAGCACACGCGATGTTCTTTCAGAACTATT
GGGACATCGTTGGAGAAGAAGTTTCAAAGATATGTCTGATAGTGTTAAATGAGAATGTAGATGTGAGACCTATAAATAAGACCCGAATCTCTCTCATTCCAAAATTGGAG
GTCCCAGAGAGTATGACCCACTTTAGACCTATTAGCTTGTGTAGTGTGGTGTACAAAATTATTGCAAAAGCTCTTGCTAACAGGTGTGGATCTGAAAAAATCATGAATTG
TCTTCAGTCCATGAGCTACTCAGTCCTTATTAATAGGATCCCTCAAAAAGAGTTCGCTCCAGGAAGAGGTATGTCGTCTCAGACAGGGAATAACAAGAGCAGGATCTTCT
CAAGAATCAAAGAGAAGGTTTGGAAGGTGTTGCAAAGGTGGAAGGAGAAGATGTTCTCGATAGGAGGAAAAGAAGTTCTTATCAAAAGCATAACTCGGGCCATTCCCATT
TACTCCTTGAGCTACTTCAGGCTCCCAAACAACATTTGTGAGGAGATTAACAAGATGTGTGCTAATTTATGGTGGGGATTCAATAGCTCTAGCTCCAATCCAAAAGCACA
CTGGGCAAGTTGGAAGAAGCTTTGCGTTAGCAAAGTGTTGCTAACCGCAAGCGCACGGGTCGTCACAAGTAATATAAAACGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGCAAAATAAAGATTAAAAGTGAAGGGAACAACATTTTCAAATGCAAGTTCAGTAGCAAAAAAGACAAGGAGCGAATAGTGGAAGGAGGCCCCTGGAACTTCGA
CAAAGGTCTGCTAGTTTTCGTGGAACTAAAAGATAACAACAAGATATCAATGCTACAGTCCAGATACGCTTCCTTTTGGATACAAATACCAGATTTGTCTATCGCTTATT
ACTCTAGGAAAAACGTGATGGCTTTAGGCAACTCAATCAGAAAGTTCATTGACGTAGATGTGGATGAAAATGGGAAATGCAAAGAAAACCCGCATAGAATCAGAGTAGAA
ATAGATATTCAAGATCCTTTGTGTCGGGGCACAAAGGTTCAAGCAGGATCATCAACGGAAGAAATGTGGATGCCAATAATGACAAACCCATCCCTACCAGTATTCCTTGT
GGAGACACAAAATAGTAGAAAGATTGAGGACAAGTTGAAAATGGAGCTGAAATTTGAGCACAGTTTCGGGGTTCCGAATGAGGGGCTTAGCGGAGGCTTATGTTTACTGT
GGAATTCGAAGGTCAACGTCAGTATTGAATCCTACTCTCAGGGCCACATAGATGCGATTGTTAATAGCAAAACGGGAAGGTGGAGATTTATGAGGGTTTACGGCAACCCG
GACCCAAGAAAGCATGCAGATTCTTGGGATCTTCTCAAAAGACTCAGCTTATTGACCCAGGGTTCAAAGGTCCTAGTTTCACTTGGAGGAGTCGGCCAAGAGGTGAGTCG
GCCAAGAGGTGATAACATCATAAAAGAAAGATTAGACAAATTCTTACTAAACCAGGATTTGCTTAACAAAGTGGAAAAGATCATTGTGGAGCGTAGGCCTTTCCACTCGT
CATACCATAGGCCCATTCTGGCGACCATTGTTCACAAAGAAGAGAGGAGCAATTATGGCGGTTCCATGAGAAGACCCAAGTTTGAATGTTCTTGGGTGAATTTTGAGGCT
TGCAGGGATATTATTCGCAAATGTTGGAGGGATCATCAAGGGTCTGGGTTGATCAGCATAAACAACAGGGTCATAAATTGCTTGGAAAAGCTGGCAAAGTGGAATAACGA
TAGGTTGAAAGGGTCTTTAAAATGGGCTATTGAAAGGCAGCTCAAAGTTATCCAAGACATTGATTCCAGGGGCCTGGTCAACGATGTGCTACTGATTTCAGCAGAAAAAG
CCCTTAAGAGGCTTCTAGATGAGGAAGAGTTGAAGAAAAGGAATCATATAAAGGGAATTGTCAATGAGGAAGGAGTTTGGGTGGAGGAAGAGGAAGGGATATGGGAGATC
ACTACAAAATACTTTCAAACACTTTTTACCTCCTCTAAGTCGGATAAGGAAGCCTTGGATAGAATCTTGGTTGGAGTCAGCATAAGGGTTTCAGAGGATCAAAACAAAAA
CCTTATGAGGCCTTTCTCAAAAGCTGAGGTTGAATTCGCTCTAAAAAGTATGAACCCAACTAAGGCATCGGGCTCGGATGAAGCACACGCGATGTTCTTTCAGAACTATT
GGGACATCGTTGGAGAAGAAGTTTCAAAGATATGTCTGATAGTGTTAAATGAGAATGTAGATGTGAGACCTATAAATAAGACCCGAATCTCTCTCATTCCAAAATTGGAG
GTCCCAGAGAGTATGACCCACTTTAGACCTATTAGCTTGTGTAGTGTGGTGTACAAAATTATTGCAAAAGCTCTTGCTAACAGGTGTGGATCTGAAAAAATCATGAATTG
TCTTCAGTCCATGAGCTACTCAGTCCTTATTAATAGGATCCCTCAAAAAGAGTTCGCTCCAGGAAGAGGTATGTCGTCTCAGACAGGGAATAACAAGAGCAGGATCTTCT
CAAGAATCAAAGAGAAGGTTTGGAAGGTGTTGCAAAGGTGGAAGGAGAAGATGTTCTCGATAGGAGGAAAAGAAGTTCTTATCAAAAGCATAACTCGGGCCATTCCCATT
TACTCCTTGAGCTACTTCAGGCTCCCAAACAACATTTGTGAGGAGATTAACAAGATGTGTGCTAATTTATGGTGGGGATTCAATAGCTCTAGCTCCAATCCAAAAGCACA
CTGGGCAAGTTGGAAGAAGCTTTGCGTTAGCAAAGTGTTGCTAACCGCAAGCGCACGGGTCGTCACAAGTAATATAAAACGGTAG
Protein sequenceShow/hide protein sequence
MEGKIKIKSEGNNIFKCKFSSKKDKERIVEGGPWNFDKGLLVFVELKDNNKISMLQSRYASFWIQIPDLSIAYYSRKNVMALGNSIRKFIDVDVDENGKCKENPHRIRVE
IDIQDPLCRGTKVQAGSSTEEMWMPIMTNPSLPVFLVETQNSRKIEDKLKMELKFEHSFGVPNEGLSGGLCLLWNSKVNVSIESYSQGHIDAIVNSKTGRWRFMRVYGNP
DPRKHADSWDLLKRLSLLTQGSKVLVSLGGVGQEVSRPRGDNIIKERLDKFLLNQDLLNKVEKIIVERRPFHSSYHRPILATIVHKEERSNYGGSMRRPKFECSWVNFEA
CRDIIRKCWRDHQGSGLISINNRVINCLEKLAKWNNDRLKGSLKWAIERQLKVIQDIDSRGLVNDVLLISAEKALKRLLDEEELKKRNHIKGIVNEEGVWVEEEEGIWEI
TTKYFQTLFTSSKSDKEALDRILVGVSIRVSEDQNKNLMRPFSKAEVEFALKSMNPTKASGSDEAHAMFFQNYWDIVGEEVSKICLIVLNENVDVRPINKTRISLIPKLE
VPESMTHFRPISLCSVVYKIIAKALANRCGSEKIMNCLQSMSYSVLINRIPQKEFAPGRGMSSQTGNNKSRIFSRIKEKVWKVLQRWKEKMFSIGGKEVLIKSITRAIPI
YSLSYFRLPNNICEEINKMCANLWWGFNSSSSNPKAHWASWKKLCVSKVLLTASARVVTSNIKR