CuGenDBv2

Lag0004960 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0004960
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:9016929..9018675
RNA-Seq Expression: Lag0004960
Synteny: Lag0004960
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
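
The genome location in this record is written as chromosome, colon, start, two dots, end. As a minimal sketch of how such a string could be split and the locus span computed, the snippet below uses a hypothetical helper, parse_location, which is not part of CuGenDB. It assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr6:9016929..9018675")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1747 bp, which can be compared with the length of the CDS listed under Sequences.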


Homology
GenBank top hits | e value | % identity | Alignment
BBN69274.1 TatD related DNase [Prunus dulcis] | 6.6e-91 | 37.19
Query:  GPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIE
        G   ++++HL FADDTI F   K     NL  +++LF   SG+  N  KS  LG+    +    +A  +GC+ G WP  YLGLPL  NPR  NFW PV++
Subjt:  GPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIE

Query:  KVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWA
        KV+KRLQ W    +SKGGR TLIQA L+++P YY+S+F+ P  VT  +E+L R+FLW+   E K CHL++W  +   KEEGGLGI  + ++N +L AKW 
Subjt:  KVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWA

Query:  WRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWE
        WRF  E N+L  +II SK+G  SN     +    S R PW  I K            VG G+   FWED WL    L+  FP L++LS +K+  IA  W 
Subjt:  WRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWE

Query:  PTN----AAWNLHLRRHLCDSEILEWSLLSHQLSSFS-FNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKC
          N      W+   RR+L ++EI E  LL   L +   F +  D   W +E+   FS  S    L S         Y  +WK     K++F +W  ++  
Subjt:  PTN----AAWNLHLRRHLCDSEILEWSLLSHQLSSFS-FNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKC

Query:  INTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNA
        INT + IQRR P   LSPS C    +  E   H+F  C Y+   W  +  A G ++        LL   L        + +L    ++A FWN+W++RN 
Subjt:  INTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNA

Query:  RIF-NNKQQNVYAFIESTTYLAIFWSS
        RIF  +    V    +   + A  W+S
Subjt:  RIF-NNKQQNVYAFIESTTYLAIFWSS

CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] | 9.5e-90 | 36.17
Query:  KAEQDSSIEGFWVGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPL
        KAE+ + +EGF VG      +++HL FADDTI FSS +   +  L NV+ +F   SGL  N +KS   G+ +E    + LA M  CK   WP  YLGLPL
Subjt:  KAEQDSSIEGFWVGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPL

Query:  HDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGI
          NP+   FW PVIE++ +RL  W    +S GGR TLIQ+ LT++P Y+LS+F+ P  V   IE++ R FLW    E K  HL+ W  +  PK  GGLG 
Subjt:  HDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGI

Query:  YDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID----IRVGKGDNTLFWEDNWLGSSPLQSKF
          I  +NV+LL KW WR+ +E +AL  ++I S +G+ SN   +      S R PW AI     L+Y          VG GD   FW+D W G  PL  ++
Subjt:  YDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID----IRVGKGDNTLFWEDNWLGSSPLQSKF

Query:  PSLFNLSLKKDALIAELWEPTNA-AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNT-EDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQ--LWK
        P L  +   K+A I+ +   T   +WN   RR+L DSEI +   L   L     +++  D   W L  +  F+  S    LA   Y  S  ++    +W 
Subjt:  PSLFNLSLKKDALIAELWEPTNA-AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNT-EDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQ--LWK

Query:  GLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVL
          +  KVK  +W ++HK +NT +++Q R P  +LSP  C L  K  ET  H+F  C      W  +  +    +     +  +L     G  F     VL
Subjt:  GLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVL

Query:  WRNFLYAFFWNLWLDRNARIFNNKQQNVYAFIESTTYLAIFWSSHIPLFVTIP
        W+N   A  W +W +RNARIF +K +N     +S  +L  FW+    +F  IP
Subjt:  WRNFLYAFFWNLWLDRNARIFNNKQQNVYAFIESTTYLAIFWSSHIPLFVTIP

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.9e-98 | 36
Query:  IKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQK
        + +TH+LFADD ++F   +   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +A  +G   G  PT+YLG+PL   P   NFW  V++K+QK
Subjt:  IKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQK

Query:  RLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFY
        +L NW  + +SKGGR TLI +TL +LPIY +S+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ ++  N +LL KW W+F 
Subjt:  RLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFY

Query:  QEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWEPTNA
         E + L +++I SK+              +S+  PW A+ +     Y +I  +V  G++  FW DNW G++PL    P LF LS  K   + E W P++ 
Subjt:  QEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWEPTNA

Query:  AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHKCINTAEV
         W+LH+ R L D E   W  +   L +   N      +WNL  N+ F T S+ + +A      +N   +LY  LWK    KK KF +W L H CINTA+ 
Subjt:  AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHKCINTAEV

Query:  IQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNARIFNNK
        +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   +    W  +   DV SL+Q  +     +N   ++  N      W +WL+RN RIF  +
Subjt:  IQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNARIFNNK

Query:  QQNVYAFIESTTYLAIFWSSHIPLF
        ++      E T      WS    LF
Subjt:  QQNVYAFIESTTYLAIFWSSHIPLF

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 2.1e-97 | 36.09
Query:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP
        V  GP  + +TH+LFADD ++F   K   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +   +G   G  PTTYLG+PL   P   NFW  
Subjt:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP

Query:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA
        +++K+QK+L +W  + +SKGGR TLI +TL +LPIY LS+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ +H  N +LL 
Subjt:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA

Query:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE
        KW W+F  E   L +++I SK+              +S+  PW A+       Y +I  +V  G++  FW DNW G+SPL    P LF LS  K   + +
Subjt:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE

Query:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK
        LW P+   WN+H+ R L D E   W  +   L +   +      +W L  N+ F T S+ + L+  +   +N    LY  LWK    KK KF +W L H 
Subjt:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK

Query:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN
        CINTA+ +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   Q    W  +   DV SL Q  +     K    ++  N +    W +WL+RN
Subjt:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN

Query:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF
         RIF  +++      E        WS    LF
Subjt:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 3.6e-97 | 36.09
Query:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP
        V  GP  + +TH+LFADD ++F   K   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +   +G   G  PTTYLG+PL   P   NFW  
Subjt:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP

Query:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA
        +++K+QK+L +W  + +SKGGR TLI +TL +LPIY LS+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ +H  N +LL 
Subjt:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA

Query:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE
        KW W+F  E   L +++I SK+              +S+  PW A+       Y +I  +V  G++  FW DNW G+SPL    P LF LS  K   + +
Subjt:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE

Query:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK
        LW P+   WN+H+ R L D E   W  +   L +   +      +W L  N+ F T S+ + L+  +   +N    LY  LWK    KK KF +W L H 
Subjt:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK

Query:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN
        CINTA+ +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   Q    W  +   DV SL Q  +     K    ++  N +    W +WL+RN
Subjt:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN

Query:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF
         RIF  +++      E        WS    LF
Subjt:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF

TrEMBL top hits | e value | % identity | Alignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein | 4.6e-90 | 35.99
Query:  KAEQDSSIEGFWVGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPL
        KAE+ + +EGF VG      +++HL FADDTI FSS +   +  L NV+ +F   SGL  N +KS   G+ +E    + LA M  CK   WP  YLGLPL
Subjt:  KAEQDSSIEGFWVGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPL

Query:  HDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGI
          NP+   FW PVIE++ +RL  W    +S GGR TLIQ+ LT++P Y+LS+F+ P  V   IE++ R FLW    E K  HL+ W  +  PK  GGLG 
Subjt:  HDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGI

Query:  YDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID----IRVGKGDNTLFWEDNWLGSSPLQSKF
          I  +NV+LL KW WR+ +E +AL  ++I S +G+ SN   +      S R PW AI     L+Y          VG GD   FW+D W G  PL  ++
Subjt:  YDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID----IRVGKGDNTLFWEDNWLGSSPLQSKF

Query:  PSLFNLSLKKDALIAELWEPTNA-AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNT-EDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQ--LWK
        P L  +   K+A I+ +   T   +WN   RR+L DSEI +   L         +++  D   W+L  +  F+  S    LA   Y  S  ++    +W 
Subjt:  PSLFNLSLKKDALIAELWEPTNA-AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNT-EDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQ--LWK

Query:  GLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVL
          +  KVK  +W ++HK +NT +++Q R P  +LSP  C L  K  ET  H+F  C      W  +  +    +     +  +L     G  F     VL
Subjt:  GLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVL

Query:  WRNFLYAFFWNLWLDRNARIFNNKQQNVYAFIESTTYLAIFWSSHIPLFVTIP
        W+N   A  W +W +RNARIF +K +N     +S  +L  FW+    +F  IP
Subjt:  WRNFLYAFFWNLWLDRNARIFNNKQQNVYAFIESTTYLAIFWSSHIPLFVTIP

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein | 9.3e-99 | 36
Query:  IKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQK
        + +TH+LFADD ++F   +   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +A  +G   G  PT+YLG+PL   P   NFW  V++K+QK
Subjt:  IKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQK

Query:  RLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFY
        +L NW  + +SKGGR TLI +TL +LPIY +S+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ ++  N +LL KW W+F 
Subjt:  RLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFY

Query:  QEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWEPTNA
         E + L +++I SK+              +S+  PW A+ +     Y +I  +V  G++  FW DNW G++PL    P LF LS  K   + E W P++ 
Subjt:  QEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWEPTNA

Query:  AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHKCINTAEV
         W+LH+ R L D E   W  +   L +   N      +WNL  N+ F T S+ + +A      +N   +LY  LWK    KK KF +W L H CINTA+ 
Subjt:  AWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHKCINTAEV

Query:  IQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNARIFNNK
        +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   +    W  +   DV SL+Q  +     +N   ++  N      W +WL+RN RIF  +
Subjt:  IQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNARIFNNK

Query:  QQNVYAFIESTTYLAIFWSSHIPLF
        ++      E T      WS    LF
Subjt:  QQNVYAFIESTTYLAIFWSSHIPLF

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein | 1.0e-97 | 36.09
Query:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP
        V  GP  + +TH+LFADD ++F   K   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +   +G   G  PTTYLG+PL   P   NFW  
Subjt:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP

Query:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA
        +++K+QK+L +W  + +SKGGR TLI +TL +LPIY LS+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ +H  N +LL 
Subjt:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA

Query:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE
        KW W+F  E   L +++I SK+              +S+  PW A+       Y +I  +V  G++  FW DNW G+SPL    P LF LS  K   + +
Subjt:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE

Query:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK
        LW P+   WN+H+ R L D E   W  +   L +   +      +W L  N+ F T S+ + L+  +   +N    LY  LWK    KK KF +W L H 
Subjt:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK

Query:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN
        CINTA+ +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   Q    W  +   DV SL Q  +     K    ++  N +    W +WL+RN
Subjt:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN

Query:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF
         RIF  +++      E        WS    LF
Subjt:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein | 1.7e-97 | 36.09
Query:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP
        V  GP  + +TH+LFADD ++F   K   ++NL  ++ LFE+ASGL+ N +KS    + + T +A  +   +G   G  PTTYLG+PL   P   NFW  
Subjt:  VGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTP

Query:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA
        +++K+QK+L +W  + +SKGGR TLI +TL +LPIY LS+F+ PK +   IE  +R+FLW   S      L++W+ +  PKE+GGLGI+ +H  N +LL 
Subjt:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA

Query:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE
        KW W+F  E   L +++I SK+              +S+  PW A+       Y +I  +V  G++  FW DNW G+SPL    P LF LS  K   + +
Subjt:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAE

Query:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK
        LW P+   WN+H+ R L D E   W  +   L +   +      +W L  N+ F T S+ + L+  +   +N    LY  LWK    KK KF +W L H 
Subjt:  LWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN---DLYGQLWKGLMLKKVKFSMWELSHK

Query:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN
        CINTA+ +Q+R PN +LSP+ C + NK+ E   H+F  C Y+   W   Q    W  +   DV SL Q  +     K    ++  N +    W +WL+RN
Subjt:  CINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRN

Query:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF
         RIF  +++      E        WS    LF
Subjt:  ARIFNNKQQNVYAFIESTTYLAIFWSSHIPLF

A0A5H2XQW2 TatD related DNase | 3.2e-91 | 37.19
Query:  GPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIE
        G   ++++HL FADDTI F   K     NL  +++LF   SG+  N  KS  LG+    +    +A  +GC+ G WP  YLGLPL  NPR  NFW PV++
Subjt:  GPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIE

Query:  KVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWA
        KV+KRLQ W    +SKGGR TLIQA L+++P YY+S+F+ P  VT  +E+L R+FLW+   E K CHL++W  +   KEEGGLGI  + ++N +L AKW 
Subjt:  KVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWA

Query:  WRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWE
        WRF  E N+L  +II SK+G  SN     +    S R PW  I K            VG G+   FWED WL    L+  FP L++LS +K+  IA  W 
Subjt:  WRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALIAELWE

Query:  PTN----AAWNLHLRRHLCDSEILEWSLLSHQLSSFS-FNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKC
          N      W+   RR+L ++EI E  LL   L +   F +  D   W +E+   FS  S    L S         Y  +WK     K++F +W  ++  
Subjt:  PTN----AAWNLHLRRHLCDSEILEWSLLSHQLSSFS-FNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKC

Query:  INTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNA
        INT + IQRR P   LSPS C    +  E   H+F  C Y+   W  +  A G ++        LL   L        + +L    ++A FWN+W++RN 
Subjt:  INTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNA

Query:  RIF-NNKQQNVYAFIESTTYLAIFWSS
        RIF  +    V    +   + A  W+S
Subjt:  RIF-NNKQQNVYAFIESTTYLAIFWSS

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 1.8e-30 | 24.41
Query:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA
        ++E+V  R+  W    +S  GR TL +A L+++P++ +S    P+ +   +++L R+FLW   +EKK  HL+KWS +  PK+EGGLG+      N +L++
Subjt:  VIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLA

Query:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKS-----KYLIYDHIDIRVGKGDNTLFWEDNWLGSSPL----QSKFPSLFNLS
        K  WR  QE N+L   ++  K+       +I +      +G W +  +S     + ++   +    G G    FW D W+   PL      + P+  +  
Subjt:  KWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKS-----KYLIYDHIDIRVGKGDNTLFWEDNWLGSSPL----QSKFPSLFNLS

Query:  LKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN--DLYGQLWKGLMLKKVKF
        + KD     LW P        +  +  ++  LE   +   L         D   W   Q+  FS  S  + L        N    +  LWK  + ++VK 
Subjt:  LKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSN--DLYGQLWKGLMLKKVKF

Query:  SMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFF
         +W + ++ + T E   RR  ++S   + C +    VE+ +H+   C      W  +      Q   S  +   L   L       D  + W        
Subjt:  SMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFF

Query:  WNLWLDRNARIF--NNKQQNVYAFIE
        W  W  R   IF  N K ++   F++
Subjt:  WNLWLDRNARIF--NNKQQNVYAFIE

P93295 Uncharacterized mitochondrial protein AtMg00310 | 2.5e-08 | 26.39
Query:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKE-EGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKI
        LP+Y +S F+  K +   +      F W     K+    + W  L   KE +GGLG  D+   N +LLAK ++R   +P+ L  +++ S++   S+ ++ 
Subjt:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKE-EGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKI

Query:  GEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWL
           +  S    W +I   + L+   +   +G G +T  W D W+
Subjt:  GEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWL

Arabidopsis top hits | e value | % identity | Alignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 3.8e-20 | 23.83
Query:  GMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSF
        G++    A +   +    G+ P  YLGLPL       + + P++EK++ R+  W + ++S  GR  LI + + +L  +++S F+ P      I+ +  SF
Subjt:  GMETQQAALLASMYGCKHGSWPTTYLGLPLHDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSF

Query:  LWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID
        LW           + WS +  PK+EGGLGI  + + N        W      +  G   +GS                      W  I K + L    + 
Subjt:  LWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHID

Query:  IRVGKGDNTLFWEDNW--LGSSPLQSKFPSLFNLSLKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFST
          +  G NT FW DNW  +G     +      ++ +   A +AE      A  N   RRH  D+ +L    +  ++      + EDT  W     D F  
Subjt:  IRVGKGDNTLFWEDNW--LGSSPLQSKFPSLFNLSLKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFST

Query:  GSLTQK--LASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAA
           T++   A++  +   + Y  +W      K     W      + T +   R    ++ + S C L +  VET+ H+F  C Y+A
Subjt:  GSLTQK--LASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLSPSCCCLWNKAVETQIHIFSRCEYAA

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.7e-04 | 28.7
Query:  LPKEEGGLGIYDIHKKNVSLLAKWAWRF-------------YQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGD
        LPK EGGLG+    + N +L  K  WR              Y     LG   + SKF T+        + L S    W  + + + L    +   +G G 
Subjt:  LPKEEGGLGIYDIHKKNVSLLAKWAWRF-------------YQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGD

Query:  NTLFWEDNWLGSSPL
           FW DNW    PL
Subjt:  NTLFWEDNWLGSSPL

AT4G29090.1 Ribonuclease H-like superfamily protein | 2.2e-28 | 25.76
Query:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIG
        LP Y ++ F  PK V   I  +   F W+   E KG H   W HL   K EGG+G  DI   N++LL K  WR    P +L  K+  S++   S+PL   
Subjt:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIG

Query:  EKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQS-----KFPSLFNLSLKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSL
          S  S    W +IH S+ ++       VG G++ + W   WL S P  +     + P     S+     +++L + +   W   +   L     +E  L
Subjt:  EKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQS-----KFPSLFNLSLKKDALIAELWEPTNAAWNLHLRRHLCDSEILEWSL

Query:  LSHQLSSFSFNNTEDTWVWNLEQNDHFSTGS----LTQKLASQ------AYQSSNDLYGQLWKGLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLS-P
        +             D++ W+   +  ++  S    LTQ +  +      +  S N +Y ++WK     K++  +W    KC++ +  +        LS  
Subjt:  LSHQLSSFSFNNTEDTWVWNLEQNDHFSTGS----LTQKLASQ------AYQSSNDLYGQLWKGLMLKKVKFSMWELSHKCINTAEVIQRRFPNSSLS-P

Query:  SCCCLWNKAVETQIHIFSRCEYAAAFW--DHIQVAFGWQFARSGDVLSLLQFTL-LGHP--FKNDSKVLWRNFLYAFFWNLWLDRNARIFNNKQQN
        S C       ET  H+  +C +A   W    I +  G ++A S  V     F L  G+P   K    V W        W LW +RN  +F  ++ N
Subjt:  SCCCLWNKAVETQIHIFSRCEYAAAFW--DHIQVAFGWQFARSGDVLSLLQFTL-LGHP--FKNDSKVLWRNFLYAFFWNLWLDRNARIFNNKQQN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.8e-09 | 26.39
Query:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKE-EGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKI
        LP+Y +S F+  K +   +      F W     K+    + W  L   KE +GGLG  D+   N +LLAK ++R   +P+ L  +++ S++   S+ ++ 
Subjt:  LPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKE-EGGLGIYDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKI

Query:  GEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWL
           +  S    W +I   + L+   +   +G G +T  W D W+
Subjt:  GEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWL


Sequences
CDS sequence
ATGGTAAACCGAGAGGAAAAATCAGAGCTTCGAGGGGCCTTAGACAAGTCGAATGCTTCAAAGGCTGAACAGGACTCCTCCATCGAAGGCTTTTGGGTTGGTGAAGGCCC
ACAAGCCATTAAAATCACTCATCTCCTGTTTGCGGACGACACCATCCTATTTTCATCTCCAAAGACATCGTGCATCACCAACCTGTTCAATGTTATTAGACTCTTTGAAG
CTGCATCTGGGCTGAGTACTAATTGCAACAAGTCTGAATTTTTGGGCCTTGGGATGGAAACACAGCAAGCCGCTCTCTTAGCATCTATGTATGGATGTAAACACGGTTCT
TGGCCGACCACATATTTGGGTCTTCCCCTACATGATAACCCGAGAAGGTATAACTTCTGGACCCCTGTGATTGAAAAGGTCCAAAAAAGATTGCAAAATTGGGGCTCTAC
CAACATCTCCAAAGGAGGAAGGCACACCCTTATCCAAGCAACCCTCACAAACCTTCCTATATACTACCTCTCTATATTCCAAGCCCCCAAAAAGGTCACTACAGCAATAG
AGAAACTATATCGATCCTTTCTTTGGAAACGAGGCAGTGAGAAAAAAGGTTGTCACCTCTTAAAGTGGTCTCATCTTCAACTACCTAAGGAAGAAGGCGGACTGGGCATT
TATGATATACATAAGAAAAATGTATCCCTTTTAGCTAAATGGGCTTGGAGATTCTATCAGGAACCAAATGCTCTAGGGAGGAAAATCATTGGCTCTAAATTTGGCACAAC
TAGTAACCCTCTCAAAATTGGTGAAAAATCCCTGAATTCTTCCAGGGGCCCGTGGCTTGCTATCCACAAATCAAAATACCTTATATATGATCACATTGACATTAGAGTTG
GGAAAGGTGACAACACGCTCTTTTGGGAGGACAATTGGCTGGGGTCTTCTCCTTTACAGTCCAAGTTCCCCTCGCTATTCAATCTCTCGCTAAAAAAGGATGCTCTTATA
GCAGAATTATGGGAACCAACCAATGCGGCATGGAACCTTCATTTAAGAAGACACCTCTGTGATTCTGAAATTCTGGAATGGTCTTTATTATCTCATCAATTGTCCTCTTT
CTCCTTCAACAATACTGAAGACACTTGGGTTTGGAATCTGGAACAAAACGACCATTTTTCTACTGGATCCCTCACCCAAAAATTGGCTTCACAAGCTTACCAATCCAGTA
ATGATCTTTATGGCCAGCTTTGGAAGGGTCTTATGCTGAAAAAAGTTAAGTTCTCCATGTGGGAGCTCAGCCACAAGTGTATTAATACTGCAGAGGTCATTCAGAGGCGA
TTCCCCAACTCCTCATTATCTCCTAGCTGCTGCTGCTTGTGGAACAAGGCTGTCGAAACACAAATTCATATTTTTAGTCGCTGTGAATATGCTGCAGCTTTCTGGGACCA
CATTCAAGTCGCTTTTGGTTGGCAATTTGCTCGTTCGGGTGATGTCCTTTCCCTTCTTCAATTCACTCTTCTTGGGCATCCTTTTAAAAATGATTCTAAGGTTCTATGGC
GGAATTTTTTGTACGCATTCTTCTGGAACTTATGGCTTGATAGAAATGCTAGAATCTTCAACAATAAGCAACAGAATGTCTATGCCTTTATTGAATCTACTACATATCTT
GCCATATTTTGGAGTAGTCACATCCCCCTTTTTGTAACTATCCCTTAG
mRNA sequence
ATGGTAAACCGAGAGGAAAAATCAGAGCTTCGAGGGGCCTTAGACAAGTCGAATGCTTCAAAGGCTGAACAGGACTCCTCCATCGAAGGCTTTTGGGTTGGTGAAGGCCC
ACAAGCCATTAAAATCACTCATCTCCTGTTTGCGGACGACACCATCCTATTTTCATCTCCAAAGACATCGTGCATCACCAACCTGTTCAATGTTATTAGACTCTTTGAAG
CTGCATCTGGGCTGAGTACTAATTGCAACAAGTCTGAATTTTTGGGCCTTGGGATGGAAACACAGCAAGCCGCTCTCTTAGCATCTATGTATGGATGTAAACACGGTTCT
TGGCCGACCACATATTTGGGTCTTCCCCTACATGATAACCCGAGAAGGTATAACTTCTGGACCCCTGTGATTGAAAAGGTCCAAAAAAGATTGCAAAATTGGGGCTCTAC
CAACATCTCCAAAGGAGGAAGGCACACCCTTATCCAAGCAACCCTCACAAACCTTCCTATATACTACCTCTCTATATTCCAAGCCCCCAAAAAGGTCACTACAGCAATAG
AGAAACTATATCGATCCTTTCTTTGGAAACGAGGCAGTGAGAAAAAAGGTTGTCACCTCTTAAAGTGGTCTCATCTTCAACTACCTAAGGAAGAAGGCGGACTGGGCATT
TATGATATACATAAGAAAAATGTATCCCTTTTAGCTAAATGGGCTTGGAGATTCTATCAGGAACCAAATGCTCTAGGGAGGAAAATCATTGGCTCTAAATTTGGCACAAC
TAGTAACCCTCTCAAAATTGGTGAAAAATCCCTGAATTCTTCCAGGGGCCCGTGGCTTGCTATCCACAAATCAAAATACCTTATATATGATCACATTGACATTAGAGTTG
GGAAAGGTGACAACACGCTCTTTTGGGAGGACAATTGGCTGGGGTCTTCTCCTTTACAGTCCAAGTTCCCCTCGCTATTCAATCTCTCGCTAAAAAAGGATGCTCTTATA
GCAGAATTATGGGAACCAACCAATGCGGCATGGAACCTTCATTTAAGAAGACACCTCTGTGATTCTGAAATTCTGGAATGGTCTTTATTATCTCATCAATTGTCCTCTTT
CTCCTTCAACAATACTGAAGACACTTGGGTTTGGAATCTGGAACAAAACGACCATTTTTCTACTGGATCCCTCACCCAAAAATTGGCTTCACAAGCTTACCAATCCAGTA
ATGATCTTTATGGCCAGCTTTGGAAGGGTCTTATGCTGAAAAAAGTTAAGTTCTCCATGTGGGAGCTCAGCCACAAGTGTATTAATACTGCAGAGGTCATTCAGAGGCGA
TTCCCCAACTCCTCATTATCTCCTAGCTGCTGCTGCTTGTGGAACAAGGCTGTCGAAACACAAATTCATATTTTTAGTCGCTGTGAATATGCTGCAGCTTTCTGGGACCA
CATTCAAGTCGCTTTTGGTTGGCAATTTGCTCGTTCGGGTGATGTCCTTTCCCTTCTTCAATTCACTCTTCTTGGGCATCCTTTTAAAAATGATTCTAAGGTTCTATGGC
GGAATTTTTTGTACGCATTCTTCTGGAACTTATGGCTTGATAGAAATGCTAGAATCTTCAACAATAAGCAACAGAATGTCTATGCCTTTATTGAATCTACTACATATCTT
GCCATATTTTGGAGTAGTCACATCCCCCTTTTTGTAACTATCCCTTAG
Protein sequence
MVNREEKSELRGALDKSNASKAEQDSSIEGFWVGEGPQAIKITHLLFADDTILFSSPKTSCITNLFNVIRLFEAASGLSTNCNKSEFLGLGMETQQAALLASMYGCKHGS
WPTTYLGLPLHDNPRRYNFWTPVIEKVQKRLQNWGSTNISKGGRHTLIQATLTNLPIYYLSIFQAPKKVTTAIEKLYRSFLWKRGSEKKGCHLLKWSHLQLPKEEGGLGI
YDIHKKNVSLLAKWAWRFYQEPNALGRKIIGSKFGTTSNPLKIGEKSLNSSRGPWLAIHKSKYLIYDHIDIRVGKGDNTLFWEDNWLGSSPLQSKFPSLFNLSLKKDALI
AELWEPTNAAWNLHLRRHLCDSEILEWSLLSHQLSSFSFNNTEDTWVWNLEQNDHFSTGSLTQKLASQAYQSSNDLYGQLWKGLMLKKVKFSMWELSHKCINTAEVIQRR
FPNSSLSPSCCCLWNKAVETQIHIFSRCEYAAAFWDHIQVAFGWQFARSGDVLSLLQFTLLGHPFKNDSKVLWRNFLYAFFWNLWLDRNARIFNNKQQNVYAFIESTTYL
AIFWSSHIPLFVTIP