; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011346 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011346
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr1:22316157..22325595
RNA-Seq ExpressionLag0011346
SyntenyLag0011346
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-15751.17Show/hide
Query:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ
        I +RD    N+       S  E   +VV+V M +  T E  M EM+  IN LMK +EE+D +IA LK Q++    +ESSQT V+K  DKGK +V+++QPQ
Subjt:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ

Query:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA
          S SVASLS+QQLQDMI N IRAQYGGP Q S +Y KPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+  D LV+QFVR+LKGNA
Subjt:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA

Query:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA
        F                           TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELA
Subjt:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA

Query:  TRAHDMELSIASRENQDLLLPNMR-------------------------------------KERRN---DEE---TIEESMKKIYPFPDADIPDMLEQLL
        TRAHDM+LSIA+R  +D L+   R                                     K +RN   DE+   T+ E  KK+YPFPD+D+ DMLEQL+
Subjt:  TRAHDMELSIASRENQDLLLPNMR-------------------------------------KERRN---DEE---TIEESMKKIYPFPDADIPDMLEQLL

Query:  ETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSK-------HQRKKDL------------
        E QLI+LP+CKR E++ KVDD  YCKYHRVI H VE+CFVLK+LI KLA E KIE D+DEV Q+N   +   S        + ++K L            
Subjt:  ETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSK-------HQRKKDL------------

Query:  --KKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEE
          +K      ++K+     +     E   ++F +   E         T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +E
Subjt:  --KKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEE

Query:  ENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        ENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+ Q +R+M  L+ K F E N+D K+ S +PS MKRK SV INT+GSL
Subjt:  ENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.8e-15663.02Show/hide
Query:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS
        ++NSL E      P  ++++V +T+  T E+RM E+++ +N LMKA+EE+D +IA LK  IE++  AESS T  IKN +KGK I+Q+ QPQ S+S+ASLS
Subjt:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS

Query:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE
        +QQLQ+MI N I+ QYGGP Q   LY KPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGT  DLLVKQFVRTLKGNAF WYTDLEPE
Subjt:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE

Query:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+DSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDLLLPNMRK------------------------------------ERRNDE-----ETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKR
        A+R N DLL+P +RK                                    E+R DE      T++E  +K+YPFPD+D+PDML+QLLE QLI+LP+CKR
Subjt:  ASRENQDLLLPNMRK------------------------------------ERRNDE-----ETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKR

Query:  LEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATI
          EM +V+D  YCKYHRVI HPVE+CFVLK+LILKLA++ KIE +LD+V Q+N A +
Subjt:  LEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.3e-1040.67Show/hide
Query:  KEKENFGTSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK----KSRPST--SVFDRLK
        K K     +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+S +  ++ NQ S  +STR SAFQRL+ S  K       P+T  S F RL 
Subjt:  KEKENFGTSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK----KSRPST--SVFDRLK

Query:  VTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSLK
        V+  + ++K           V  D++++S+ PS MKRK  V +NT+GSLK
Subjt:  VTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSLK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]3.1e-15646.93Show/hide
Query:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ
        I +RD    N+       S  E   +VV+V M +  T E  M EM+  IN LMK  EE+D +IA LK Q++     ESSQT V+K  DKGK +VQ++QPQ
Subjt:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ

Query:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA
          S SVASLS+QQLQDMI N IRAQYGGP Q S +Y K YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+  D LV+QFVR+LKGNA
Subjt:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA

Query:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA
        F WYTDLEPE                                   GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+
Subjt:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA

Query:  TRAHDMELSIASRENQDLLLPNMRKERRND--------------------------------------------EETIEESMKKIYPFPDADIPDMLEQL
        TRAHDMELSIA+   +D L+   ++  +N+                                              T+ E  KK+YPFPD+D+ DMLEQL
Subjt:  TRAHDMELSIASRENQDLLLPNMRKERRND--------------------------------------------EETIEESMKKIYPFPDADIPDMLEQL

Query:  LETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN----------------------------------
        +E QLI+LP+CKR E+  KVDD  YCKYHRVI HPVE+CFVLK+LILKLA E KIE D+DEV Q+N                                  
Subjt:  LETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN----------------------------------

Query:  --------------------------------------------------IATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKE
                                                             I  K K +R K + K +P + + K F QPR+ +   E   ++F +  
Subjt:  --------------------------------------------------IATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKE

Query:  KENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRL
         E         T+  ++V       EEVDNS + +QRT VF RIKP T R SVFQR+SMA  EEENQC  ST  R SAF+RLS+ST KK RPSTS FDRL
Subjt:  KENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRL

Query:  KVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        K+ + Q +R+M +L+ K F E N+D K+ S +PS MKRK SV INT+GSL
Subjt:  KVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.0e-18556.11Show/hide
Query:  PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLSIQQLQDMITNCIR
        P  ++++V +T   T E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGK ++Q+ QPQ S+S+ASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPESVDSWEELEREFL
         QYGGP Q   LY+KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGT  DLLVKQFVRTLKGNA  WY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKRLEEMEKVDDLK
        R                                       ER++D     + T++E  +K+YPFPD+D+ DMLEQLLE QLI+LP+CKR E+  KVDD  
Subjt:  RK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKRLEEMEKVDDLK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKE------
        YCKYHRVI HPVE+CFVLK+LILKLA E KIE D+DEV Q+N A I+  S            P + + + F Q R+ +T  E   ++F + + E      
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKE------

Query:  ------------NFGTSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVT
                    N+G+S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+T
Subjt:  ------------NFGTSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVT

Query:  SGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        + Q +R+M + + K F E N+D K+ S +PS MKRK  V INT+GSL
Subjt:  SGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.7e-17851.3Show/hide
Query:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS
        ++NSL E      P  ++++V +T   T E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGK ++Q+ QPQ S+S+ASLS
Subjt:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS

Query:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE
        +QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGT  DLLVKQFVRTLKGNAF  Y DLEPE
Subjt:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE

Query:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDLLLPNMRK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKC
         +R  +D L+P  R                                       ER++D     + T++E  +K+YPF D+D+ DMLEQLLE QLI+LPKC
Subjt:  ASRENQDLLLPNMRK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKC

Query:  KRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN-----------------------------------IATIKEKSKH
        KR ++ EKVDD  YCKYHRVI HPVE+CFVLK+LILKLA E KIE ++DEV Q+N                                   I TI  ++K 
Subjt:  KRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN-----------------------------------IATIKEKSKH

Query:  QRKKD----------LKKLQP----------------------KRQRSKK-------------FSQPRQQVTAKEIFSKTF---HKKEKENFGTSYCIDV
           KD           K  QP                      K +R+KK             F Q R+ +T  E   ++F   H +E     T +   +
Subjt:  QRKKD----------LKKLQP----------------------KRQRSKK-------------FSQPRQQVTAKEIFSKTF---HKKEKENFGTSYCIDV

Query:  EEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLE
         EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+RPSTS FDRLK+T+ Q +R+M +L+
Subjt:  EEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLE

Query:  MKLFDEVNNDKKLQSSIPSHMKRKFSVLINT
         K F E N+D K+ S +PS MKRK  V INT
Subjt:  MKLFDEVNNDKKLQSSIPSHMKRKFSVLINT

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.3e-15751.17Show/hide
Query:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ
        I +RD    N+       S  E   +VV+V M +  T E  M EM+  IN LMK +EE+D +IA LK Q++    +ESSQT V+K  DKGK +V+++QPQ
Subjt:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ

Query:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA
          S SVASLS+QQLQDMI N IRAQYGGP Q S +Y KPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+  D LV+QFVR+LKGNA
Subjt:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA

Query:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA
        F                           TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELA
Subjt:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA

Query:  TRAHDMELSIASRENQDLLLPNMR-------------------------------------KERRN---DEE---TIEESMKKIYPFPDADIPDMLEQLL
        TRAHDM+LSIA+R  +D L+   R                                     K +RN   DE+   T+ E  KK+YPFPD+D+ DMLEQL+
Subjt:  TRAHDMELSIASRENQDLLLPNMR-------------------------------------KERRN---DEE---TIEESMKKIYPFPDADIPDMLEQLL

Query:  ETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSK-------HQRKKDL------------
        E QLI+LP+CKR E++ KVDD  YCKYHRVI H VE+CFVLK+LI KLA E KIE D+DEV Q+N   +   S        + ++K L            
Subjt:  ETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSK-------HQRKKDL------------

Query:  --KKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEE
          +K      ++K+     +     E   ++F +   E         T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +E
Subjt:  --KKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEE

Query:  ENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        ENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+ Q +R+M  L+ K F E N+D K+ S +PS MKRK SV INT+GSL
Subjt:  ENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

A0A5A7TZU9 Ribonuclease H8.7e-15763.02Show/hide
Query:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS
        ++NSL E      P  ++++V +T+  T E+RM E+++ +N LMKA+EE+D +IA LK  IE++  AESS T  IKN +KGK I+Q+ QPQ S+S+ASLS
Subjt:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS

Query:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE
        +QQLQ+MI N I+ QYGGP Q   LY KPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGT  DLLVKQFVRTLKGNAF WYTDLEPE
Subjt:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE

Query:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+DSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDLLLPNMRK------------------------------------ERRNDE-----ETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKR
        A+R N DLL+P +RK                                    E+R DE      T++E  +K+YPFPD+D+PDML+QLLE QLI+LP+CKR
Subjt:  ASRENQDLLLPNMRK------------------------------------ERRNDE-----ETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKR

Query:  LEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATI
          EM +V+D  YCKYHRVI HPVE+CFVLK+LILKLA++ KIE +LD+V Q+N A +
Subjt:  LEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATI

A0A5A7TZU9 Ribonuclease H1.1e-1040.67Show/hide
Query:  KEKENFGTSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK----KSRPST--SVFDRLK
        K K     +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+S +  ++ NQ S  +STR SAFQRL+ S  K       P+T  S F RL 
Subjt:  KEKENFGTSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK----KSRPST--SVFDRLK

Query:  VTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSLK
        V+  + ++K           V  D++++S+ PS MKRK  V +NT+GSLK
Subjt:  VTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSLK

A0A5A7TZU9 Ribonuclease H1.5e-15646.93Show/hide
Query:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ
        I +RD    N+       S  E   +VV+V M +  T E  M EM+  IN LMK  EE+D +IA LK Q++     ESSQT V+K  DKGK +VQ++QPQ
Subjt:  IKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQ

Query:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA
          S SVASLS+QQLQDMI N IRAQYGGP Q S +Y K YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+  D LV+QFVR+LKGNA
Subjt:  CSS-SVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNA

Query:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA
        F WYTDLEPE                                   GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+
Subjt:  FAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELA

Query:  TRAHDMELSIASRENQDLLLPNMRKERRND--------------------------------------------EETIEESMKKIYPFPDADIPDMLEQL
        TRAHDMELSIA+   +D L+   ++  +N+                                              T+ E  KK+YPFPD+D+ DMLEQL
Subjt:  TRAHDMELSIASRENQDLLLPNMRKERRND--------------------------------------------EETIEESMKKIYPFPDADIPDMLEQL

Query:  LETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN----------------------------------
        +E QLI+LP+CKR E+  KVDD  YCKYHRVI HPVE+CFVLK+LILKLA E KIE D+DEV Q+N                                  
Subjt:  LETQLIELPKCKRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN----------------------------------

Query:  --------------------------------------------------IATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKE
                                                             I  K K +R K + K +P + + K F QPR+ +   E   ++F +  
Subjt:  --------------------------------------------------IATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKE

Query:  KENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRL
         E         T+  ++V       EEVDNS + +QRT VF RIKP T R SVFQR+SMA  EEENQC  ST  R SAF+RLS+ST KK RPSTS FDRL
Subjt:  KENF------GTSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRL

Query:  KVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        K+ + Q +R+M +L+ K F E N+D K+ S +PS MKRK SV INT+GSL
Subjt:  KVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein3.4e-18556.11Show/hide
Query:  PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLSIQQLQDMITNCIR
        P  ++++V +T   T E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGK ++Q+ QPQ S+S+ASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPESVDSWEELEREFL
         QYGGP Q   LY+KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGT  DLLVKQFVRTLKGNA  WY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKRLEEMEKVDDLK
        R                                       ER++D     + T++E  +K+YPFPD+D+ DMLEQLLE QLI+LP+CKR E+  KVDD  
Subjt:  RK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKRLEEMEKVDDLK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKE------
        YCKYHRVI HPVE+CFVLK+LILKLA E KIE D+DEV Q+N A I+  S            P + + + F Q R+ +T  E   ++F + + E      
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKE------

Query:  ------------NFGTSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVT
                    N+G+S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+T
Subjt:  ------------NFGTSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVT

Query:  SGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL
        + Q +R+M + + K F E N+D K+ S +PS MKRK  V INT+GSL
Subjt:  SGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSHMKRKFSVLINTDGSL

A0A5D3BX77 Retrotransposon gag protein1.8e-17851.3Show/hide
Query:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS
        ++NSL E      P  ++++V +T   T E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGK ++Q+ QPQ S+S+ASLS
Subjt:  KYNSLFE------PDSDVVTVKMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLS

Query:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE
        +QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGT  DLLVKQFVRTLKGNAF  Y DLEPE
Subjt:  IQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPE

Query:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDLLLPNMRK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKC
         +R  +D L+P  R                                       ER++D     + T++E  +K+YPF D+D+ DMLEQLLE QLI+LPKC
Subjt:  ASRENQDLLLPNMRK--------------------------------------ERRND-----EETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKC

Query:  KRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN-----------------------------------IATIKEKSKH
        KR ++ EKVDD  YCKYHRVI HPVE+CFVLK+LILKLA E KIE ++DEV Q+N                                   I TI  ++K 
Subjt:  KRLEEMEKVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSN-----------------------------------IATIKEKSKH

Query:  QRKKD----------LKKLQP----------------------KRQRSKK-------------FSQPRQQVTAKEIFSKTF---HKKEKENFGTSYCIDV
           KD           K  QP                      K +R+KK             F Q R+ +T  E   ++F   H +E     T +   +
Subjt:  QRKKD----------LKKLQP----------------------KRQRSKK-------------FSQPRQQVTAKEIFSKTF---HKKEKENFGTSYCIDV

Query:  EEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLE
         EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+RPSTS FDRLK+T+ Q +R+M +L+
Subjt:  EEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLE

Query:  MKLFDEVNNDKKLQSSIPSHMKRKFSVLINT
         K F E N+D K+ S +PS MKRK  V INT
Subjt:  MKLFDEVNNDKKLQSSIPSHMKRKFSVLINT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATTGGTTTATGGTCCAACCAATAAACCGAGTCCCTCTCGGGCCAATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACGACCTC
TCTACTATCCCTTATTCGGGACACCCCCACTCGCATGTCTCTACATGGACGATTTAGGATCACATCGTCTATACCATCTATAAAGTGGGCCGCGTCTCATAGTGTCTCCA
GAATAAGAAAAAATTTCCAGCGAATAAAAGACAAGAGGGCTGCTGCGTTTCTGTTCGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATCAACAAGTC
AGTAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAA
CAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTTAACAGACTGATTATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAG
GTCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGC
CAAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTACA
GACCGATCATCCAAGAAGCTCAACAAACCAGTCCAAGAAGATCAACAAGCTAGAGACCAAAGTTTATATATTTAAGAGAGAGAATCAGAGGATTAAAGTTAGAGATTGGG
CATATACCAATGACAAGTTTCTTGTTAAGTATAACTCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCAAGATGACTGAGACAAGAACTATGGAAGAAAGAATGACT
GAGATGCAAGAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCA
AACACAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTTCTTCAGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGA
TCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTACAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAA
CCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCTAGAGGACCTACTAGTCAAGCA
GTTCGTTCGAACACTTAAAGGAAATGCTTTTGCCTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACA
GCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCCATGAGTCTAGATTGCAAA
GATCGTCTCACTGAACTCTCTTCTGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAAC
TCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAATCAAGACCTTCTACTCCCTAATATGAGGAAAGAAAGAAGGAACGATGAAGAGACTATAGAAGAATCTA
TGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAACGCAACTGATAGAGCTTCCTAAGTGTAAACGACTAGAAGAGATGGAG
AAAGTCGATGATCTCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTTGTCTTAAAAGACTTAATTCTAAAGCTAGCTATGGAAGGAAAAAT
TGAGTTTGACCTTGATGAAGTAACTCAATCAAATATTGCTACAATCAAAGAAAAGAGCAAGCATCAAAGAAAGAAAGATCTTAAGAAACTTCAACCCAAGAGGCAGAGAA
GTAAAAAGTTTTCTCAACCTCGACAACAGGTGACTGCGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAGAACTTTGGAACATCCTACTGCATCGACGTA
GAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGAC
AGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTGTTTTTGATC
GCCTCAAAGTAACAAGTGGTCAATCTAAAAGAAAGATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATCCCGTCACAT
ATGAAGAGAAAGTTCTCTGTTCTCATAAATACAGATGGTTCCTTGAAGTTTCCTTCCACCAAGTCTGAAGGCTCCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTTCGT
TGTAGTTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGCGCTTTGCTA
CAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGGCGAATCTGGTGACTACCACTGCAGGCTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAAGAGTG
CATCACTGTAGGCAAATCTGGCGATTACTCCTAAAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACT
GCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGTGAAATCACTGCAAGTGGTGAAATCACTGCAA
GTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTG
GTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAA
CTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACT
GGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAA
TCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGAT
GACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCC
CTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCAT
CAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACA
GGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAG
TGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTG
GTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGG
AAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTG
ACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGT
GGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAA
GCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGA
CCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTAC
AGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTC
TAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCAC
TGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACG
ACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGC
AGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAA
GTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTG
GTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAA
AGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATTGGTTTATGGTCCAACCAATAAACCGAGTCCCTCTCGGGCCAATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACGACCTC
TCTACTATCCCTTATTCGGGACACCCCCACTCGCATGTCTCTACATGGACGATTTAGGATCACATCGTCTATACCATCTATAAAGTGGGCCGCGTCTCATAGTGTCTCCA
GAATAAGAAAAAATTTCCAGCGAATAAAAGACAAGAGGGCTGCTGCGTTTCTGTTCGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATCAACAAGTC
AGTAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAA
CAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTTAACAGACTGATTATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAG
GTCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGC
CAAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTACA
GACCGATCATCCAAGAAGCTCAACAAACCAGTCCAAGAAGATCAACAAGCTAGAGACCAAAGTTTATATATTTAAGAGAGAGAATCAGAGGATTAAAGTTAGAGATTGGG
CATATACCAATGACAAGTTTCTTGTTAAGTATAACTCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCAAGATGACTGAGACAAGAACTATGGAAGAAAGAATGACT
GAGATGCAAGAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCA
AACACAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTTCTTCAGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGA
TCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTACAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAA
CCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCTAGAGGACCTACTAGTCAAGCA
GTTCGTTCGAACACTTAAAGGAAATGCTTTTGCCTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACA
GCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCCATGAGTCTAGATTGCAAA
GATCGTCTCACTGAACTCTCTTCTGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAAC
TCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAATCAAGACCTTCTACTCCCTAATATGAGGAAAGAAAGAAGGAACGATGAAGAGACTATAGAAGAATCTA
TGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAACGCAACTGATAGAGCTTCCTAAGTGTAAACGACTAGAAGAGATGGAG
AAAGTCGATGATCTCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTTGTCTTAAAAGACTTAATTCTAAAGCTAGCTATGGAAGGAAAAAT
TGAGTTTGACCTTGATGAAGTAACTCAATCAAATATTGCTACAATCAAAGAAAAGAGCAAGCATCAAAGAAAGAAAGATCTTAAGAAACTTCAACCCAAGAGGCAGAGAA
GTAAAAAGTTTTCTCAACCTCGACAACAGGTGACTGCGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAGAACTTTGGAACATCCTACTGCATCGACGTA
GAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGAC
AGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTGTTTTTGATC
GCCTCAAAGTAACAAGTGGTCAATCTAAAAGAAAGATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATCCCGTCACAT
ATGAAGAGAAAGTTCTCTGTTCTCATAAATACAGATGGTTCCTTGAAGTTTCCTTCCACCAAGTCTGAAGGCTCCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTTCGT
TGTAGTTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGCGCTTTGCTA
CAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGGCGAATCTGGTGACTACCACTGCAGGCTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAAGAGTG
CATCACTGTAGGCAAATCTGGCGATTACTCCTAAAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACT
GCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGTGAAATCACTGCAAGTGGTGAAATCACTGCAA
GTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTG
GTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAA
CTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACT
GGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAA
TCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGAT
GACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCC
CTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCAT
CAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACA
GGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAG
TGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTG
GTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGG
AAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTG
ACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGT
GGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAA
GCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGA
CCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTAC
AGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTC
TAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCAC
TGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACG
ACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGC
AGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAA
GTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTG
GTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAA
AGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGA
Protein sequenceShow/hide protein sequence
MNNWFMVQPINRVPLGPMRGRDPFVQAPESALKGTTSLLSLIRDTPTRMSLHGRFRITSSIPSIKWAASHSVSRIRKNFQRIKDKRAAAFLFVEASLAKNGQVYNEDQQV
SRPIIQEDQQANRSIKRINKSAGRSSKRINKLTSRSNRSSSQQADPRDHQVNRLIIQEDQQANKPIQQIIKPTGRSKRSTSQPTDQEDQQVSRPIIQEDQQANRSNRSSS
QRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLQTDHPRSSTNQSKKINKLETKVYIFKRENQRIKVRDWAYTNDKFLVKYNSLFEPDSDVVTVKMTETRTMEERMT
EMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKTIVQDDQPQCSSSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYYKPYTKRIDNLRTPIGYQ
PPKFQQFDGKGNPKQHIAHFVETCENAGTLEDLLVKQFVRTLKGNAFAWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCK
DRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKERRNDEETIEESMKKIYPFPDADIPDMLEQLLETQLIELPKCKRLEEME
KVDDLKYCKYHRVIGHPVERCFVLKDLILKLAMEGKIEFDLDEVTQSNIATIKEKSKHQRKKDLKKLQPKRQRSKKFSQPRQQVTAKEIFSKTFHKKEKENFGTSYCIDV
EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSH
MKRKFSVLINTDGSLKFPSTKSEGSPKSSRRLTRCFVVVSFLQVRRFSRCIAAVPSLKFESSHTASKVLTRFATVPSGPYIECITEGESGDYHCRLLRSPNKMGTGLARV
HHCRQIWRLLLKETTVIKVTGLDSHQSDWSRQVVKSLQVKLMTTVVTTPEETTVIKVTGLDRCEITASGEITASEADDDRGDHPAGNYSHQSDWSRQVVKSLQVKLMTTV
VTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVK
SLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSH
QSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTV
VTTPAGNYSHQSDWSRQTGGEITASEADDDRGDHPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQTG
GEITASEADDDRGDHPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNY
SHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMT
TVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQV
VKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKADDDRGDHPCRKLQSSK