; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy01g011380 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy01g011380
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr01:31975499..31978361
RNA-Seq ExpressionLcy01g011380
SyntenyLcy01g011380
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]1.1e-10541.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.1e-10842.77Show/hide
Query:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP
        ++LCLP W+I GDFNI RW  E +  S    R    FN FI+   L D P  N  +TWS+ R NP+ S +DRFL++      FG    R L+R  SDHFP
Subjt:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP

Query:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR
        I L     +WGP PFR  N+ L    F     +WW ++  +G+PG++FIQ L  L K +K+W          NK  L  E+  IDK+E  G ++     +
Subjt:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR

Query:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL
        R+ +K++L+SI  N+  +WHQR + +W   GD ++S+FHR+  IN+R+N I  I    G SL    DI + FI  +Q +++K+++   L +   W+PIS 
Subjt:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL

Query:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA
          Q+ L   F E EI   +    + K PGPDG+T  F+KK W  LKDD+  VF DF K+ I+N ++N T+I LI KK       +YRPISLTT LYK++A
Subjt:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA

Query:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        + L+ RLK  LP   I   Q AF+  RQI DA LIANE ID + +RK +G V+KLDIEKAFD + W+F+D +L  K F H WRK
Subjt:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-10541.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-10541.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.3e-10842.77Show/hide
Query:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP
        ++LCLP W+I GDFNI RW  E +  S    R    FN FI+   L D P  N  +TWS+ R NP+ S +DRFL++      FG    R L+R  SDHFP
Subjt:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP

Query:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR
        I L     +WGP PFR  N+ L    F     +WW ++  +G+PG++FIQ L  L K +K+W          NK  L  E+  IDK+E  G ++     +
Subjt:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR

Query:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL
        R+ +K++L+SI  N+  +WHQR + +W   GD ++S+FHR+  IN+R+N I  I    G SL    DI + FI  +Q +++K+++   L +   W+PIS 
Subjt:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL

Query:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA
          Q+ L   F E EI   +    + K PGPDG+T  F+KK W  LKDD+  VF DF K+ I+N ++N T+I LI KK       +YRPISLTT LYK++A
Subjt:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA

Query:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        + L+ RLK  LP   I   Q AF+  RQI DA LIANE+ID + +RK +G V+KLDIEKAFD + W+F+D +L  K F H WRK
Subjt:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

TrEMBL top hitse value%identityAlignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein5.4e-10641.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

A0A438JX47 LINE-1 retrotransposable element ORF2 protein5.4e-10641.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein4.4e-10842.77Show/hide
Query:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP
        ++LCLP W+I GDFNI RW  E +  S    R    FN FI+   L D P  N  +TWS+ R NP+ S +DRFL++      FG    R L+R  SDHFP
Subjt:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP

Query:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR
        I L     +WGP PFR  N+ L    F     +WW ++  +G+PG++FIQ L  L K +K+W          NK  L  E+  IDK+E  G ++     +
Subjt:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR

Query:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL
        R+ +K++L+SI  N+  +WHQR + +W   GD ++S+FHR+  IN+R+N I  I    G SL    DI + FI  +Q +++K+++   L +   W+PIS 
Subjt:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL

Query:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA
          Q+ L   F E EI   +    + K PGPDG+T  F+KK W  LKDD+  VF DF K+ I+N ++N T+I LI KK       +YRPISLTT LYK++A
Subjt:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA

Query:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        + L+ RLK  LP   I   Q AF+  RQI DA LIANE ID + +RK +G V+KLDIEKAFD + W+F+D +L  K F H WRK
Subjt:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein2.6e-10842.77Show/hide
Query:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP
        ++LCLP W+I GDFNI RW  E +  S    R    FN FI+   L D P  N  +TWS+ R NP+ S +DRFL++      FG    R L+R  SDHFP
Subjt:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFP

Query:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR
        I L     +WGP PFR  N+ L    F     +WW ++  +G+PG++FIQ L  L K +K+W          NK  L  E+  IDK+E  G ++     +
Subjt:  ICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINR

Query:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL
        R+ +K++L+SI  N+  +WHQR + +W   GD ++S+FHR+  IN+R+N I  I    G SL    DI + FI  +Q +++K+++   L +   W+PIS 
Subjt:  RVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISL

Query:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA
          Q+ L   F E EI   +    + K PGPDG+T  F+KK W  LKDD+  VF DF K+ I+N ++N T+I LI KK       +YRPISLTT LYK++A
Subjt:  DQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIA

Query:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        + L+ RLK  LP   I   Q AF+  RQI DA LIANE+ID + +RK +G V+KLDIEKAFD + W+F+D +L  K F H WRK
Subjt:  RVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

A5CAA2 Reverse transcriptase domain-containing protein5.4e-10641.49Show/hide
Query:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC
        L  P W +GGDFN+ R S E+   S   T + + F+ FI+   L DLPL +  +TWS+ + NP    +DRFL ++E    F   +   L R TSDH+PI 
Subjt:  LCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPIC

Query:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
        L     +WGP PFRF N WL H SF      WW+    +GW GH F++KL+ +K +LK WN+  FG+ S  K  +   L N D +E+ G ++   + +R 
Subjt:  LTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ
          K EL  ++  EE+ W Q+ ++KW KEGD +S FFH++    R R  I E+   +GQ +   + I++E + +++KL++  +         DWSPIS + 
Subjt:  QIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQ

Query:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV
           LES F+E EI KA+  +  +K PGPDGFT   F+  W  +K+D+  VF +F +S IIN S N ++I L+PKK  ++ + ++RPISL T LYK+IA+V
Subjt:  QAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCLYKVIARV

Query:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK
        L+ R++ +L +  I   Q AFV  RQI+DA LIANE++DE  R   +GVV K+D EKA+D V W+FLD V+++K FG  WRK
Subjt:  LSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLWRK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.4e-1527.71Show/hide
Query:  QQSVNKNRLGLELANIDKVEESGSITKADINRRV-QIKAELISIVANEEVLWHQRCKLKWFKE--GDIDSSFFHRLMAINRRRNTISEILAAHGQSLIID
        Q+    + L  +L  ++K E++ S  KA   + + +I+AEL  I   ++ L        WF E    ID     RL+   R +N I  I    G      
Subjt:  QQSVNKNRLGLELANIDKVEESGSITKADINRRV-QIKAELISIVANEEVLWHQRCKLKWFKE--GDIDSSFFHRLMAINRRRNTISEILAAHGQSLIID

Query:  KDIEKEFIDFYQKLF-SKQNHRPSLPNISDWSPISLDQQAALESL---FSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSI
         +I+    ++Y+ L+ +K  +   +    D   +    Q  +ESL    +  EI   +  L + K+PGPDGFTAEF+++    L   +  +F    K  I
Subjt:  KDIEKEFIDFYQKLF-SKQNHRPSLPNISDWSPISLDQQAALESL---FSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSI

Query:  INASLNETYICLIPKKIGAKAVGE-YRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEK
        +  S  E  I LIPK        E +RPISL     K++ ++L+ R+++ +  +I    Q  F+   Q       +  +I   NR K +  V+I +D EK
Subjt:  INASLNETYICLIPKKIGAKAVGE-YRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEK

Query:  AFDTVDWNFLDEVL
        AFD +   F+ + L
Subjt:  AFDTVDWNFLDEVL

P08548 LINE-1 reverse transcriptase homolog9.1e-1824.56Show/hide
Query:  IIGGDFNITRWSWERSPLSFTPTRATRKFNRFI--ASAALQDLPLSNGKYTWSSFRPNP-----------SMSLIDRFLITDELSTKFGTIVVRKLDRAT
        I+ GDFN        +PL+     + +K ++ I   ++ +Q L L++    + +F PN            + S ID  L      +KF  I +  +    
Subjt:  IIGGDFNITRWSWERSPLSFTPTRATRKFNRFI--ASAALQDLPLSNGKYTWSSFRPNP-----------SMSLIDRFLITDELSTKFGTIVVRKLDRAT

Query:  SDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSV-----------------------
        SDH  I + L N+R                  LHT    WK N L         +  K + K L+Q N Q    Q++                       
Subjt:  SDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSV-----------------------

Query:  --NKNRLGLELANIDKVEESGSITKADINRRV-QIKAELISIVANEEVLWHQRCKLKWF--KEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKD
            N L   L  ++K E S    K    + + +I+AEL + + N+ ++        WF  K   ID    + L    R ++ IS I   + +      +
Subjt:  --NKNRLGLELANIDKVEESGSITKADINRRV-QIKAELISIVANEEVLWHQRCKLKWF--KEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKD

Query:  IEKEFIDFYQKLFS-KQNHRPSLPNISDWSPISLDQQAALESL---FSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIIN
        I+K   ++Y+KL+S K  +   +    +   +    Q  +E L    S  EI   +Q+L   K+PGPDGFT+EF++     L   +  +F +  K  I+ 
Subjt:  IEKEFIDFYQKLFS-KQNHRPSLPNISDWSPISLDQQAALESL---FSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIIN

Query:  ASLNETYICLIPKKIGAKAVGE-YRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEKAF
         +  E  I LIPK        E YRPISL     K++ ++L+ R+++ +  II         G++   +     N +I   N+ K +  +++ +D EKAF
Subjt:  ASLNETYICLIPKKIGAKAVGE-YRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEKAF

Query:  DTVDWNFLDEVLK
        D +   F+   LK
Subjt:  DTVDWNFLDEVLK

P11369 LINE-1 retrotransposable element ORF2 protein7.7e-1723.54Show/hide
Query:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDL-----PLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRAT
        KA   P  II GDFN    S +RS       R T K    +    L D+     P + G YT+ S  P+ + S ID  +       ++  I +  +    
Subjt:  KALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDL-----PLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRAT

Query:  SDHFPICLTLGND-RWGPPPFRF------VNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKG-LKKELKQWNQQVFGQQSVNKNRLGLELANIDKV
        SDH  + L   N+   G P F +      +N  L        ++ + + N        +    +K  L+ +L   +     +++ + + L   L  ++K 
Subjt:  SDHFPICLTLGND-RWGPPPFRF------VNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKG-LKKELKQWNQQVFGQQSVNKNRLGLELANIDKV

Query:  EESGSITKADINRRVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFS-KQNHR
        +E+ S  ++     ++++ E+  +     +    + +  +F++ +       RL   +R +  I++I    G      ++I+     FY++L+S K  + 
Subjt:  EESGSITKADINRRVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFS-KQNHR

Query:  PSLPNISD---WSPISLDQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETY----ICLIPK-KI
          +    D      ++ DQ   L S  S  EI   +  L + K+PGPDGF+AEF++    T K+D+  + +  F    +  +L  ++    I LIPK + 
Subjt:  PSLPNISD---WSPISLDQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETY----ICLIPK-KI

Query:  GAKAVGEYRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEKAFDTVDWNFLDEVLK
            +  +RPISL     K++ ++L+ R++  +   II P Q  F+   Q       +  +I   N+ K +  ++I LD EKAFD +   F+ +VL+
Subjt:  GAKAVGEYRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK-RQGVVIKLDIEKAFDTVDWNFLDEVLK

P14381 Transposon TX1 uncharacterized 149 kDa protein1.4e-2927.05Show/hide
Query:  IIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNG----KYTWSSFRP-NPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICL
        IIGGDFN T  + +R+ +      +       IA  +L D+          +T+   R  + S S IDR  I+  L ++  +  +R      SDH  + L
Subjt:  IIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNG----KYTWSSFRP-NPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICL

Query:  TLGNDRWGPPP--FRFVNAWLSHASFLHTVESWWKA--------NPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSI
         +      P    + F N+ L    F  +V   W+           L+ W     +     LK   +++ + V GQ++     L  E+ ++++   SGS 
Subjt:  TLGNDRWGPPP--FRFVNAWLSHASFLHTVESWWKA--------NPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSI

Query:  TKADINRRVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRP-SLPNI
         +A     ++ K  L ++   +      R +++   + D  S FF+ L      R  I+ + A  G  L   + I      FYQ LFS     P +   +
Subjt:  TKADINRRVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNHRP-SLPNI

Query:  SDWSP-ISLDQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISL
         D  P +S  ++  LE+  +  E+ +A++ +  NK+PG DG T EFF+  W+TL  D   V  + FK   +  S     + L+PKK   + +  +RP+SL
Subjt:  SDWSP-ISLDQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISL

Query:  TTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFG
         +  YK++A+ +S RLK +L   +I P QS  V  R I D   +  +L+    R       + LD EKAFD VD  +L   L+   FG
Subjt:  TTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFG

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.2e-3427.48Show/hide
Query:  IWIIGGDFN-ITRWSWERSPLSFT-PTRATRKFNRFIASAALQDLPLSNGKYTWSSFR-PNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICL
        + I+ GDF+ I   S   S L  + P R   +F   +  + L D+P     YTWS+ +  NP +  +DR +   +  + F + +        SDH P  +
Subjt:  IWIIGGDFN-ITRWSWERSPLSFT-PTRATRKFNRFIASAALQDLPLSNGKYTWSSFR-PNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICL

Query:  TLGN-DRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV
         L N  +     FR+ +   +H +FL ++   W+     G    S  + LK  KK  K  N+Q FG    +K +  L+  +++ ++       +D   RV
Subjt:  TLGN-DRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRV

Query:  Q--IKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNH---RPSLPNISDWSP
        +   + +     A  E  + Q+ ++KW ++GD ++ FFH+++  N+ +N I  +       +     +++  + +Y  L    +      S+  I D  P
Subjt:  Q--IKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHGQSLIIDKDIEKEFIDFYQKLFSKQNH---RPSLPNISDWSP

Query:  ISLDQQAA--LESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCL
           +   A  L +L S+ EI  AV  +  NK PGPD FTAEFF +SW  +KD       +FF++  +    N T I LIPK  G   +  +RP+S  T +
Subjt:  ISLDQQAA--LESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNETYICLIPKKIGAKAVGEYRPISLTTCL

Query:  YKVI
        YK+I
Subjt:  YKVI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.1e-0530Show/hide
Query:  KRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK--RQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLW
        +RLK ++   +I P Q++F+  R   D  +   E +    R+K  +  +++KLD+EKA+D + W++L++ L    F  +W
Subjt:  KRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRK--RQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFGHLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATACAACATAGGTTATGTCGTCGGTATTCATGGCAAGATACCATCATCTTCGATGGCTTCCAGTGCTACACGCGCCGACGAGAAAGTATTTGTTGCGGATAA
CGAACGGGTTCATAACCCACGCGCCACCACCCTGAATGATGAAACAAAAGGGGAAAAGAAACAGTCCATATACAGAAACGACTTTCCACAGGCCCCAAATGATGTATTGG
ACACATCCACTGCTCTGATGTCTGCGTCTTTATCTGACAGGGACCCTCTGGCGCCATCTATCACCACAGAGCCCCAATCCCCAAAATCCTCTCTTGATAAGCACTCGGTA
AAAGTACCCAATAAAACCCTACCTTTTGACGAAGAGCCTCAATCCGATCAGCAGGGAATAGGTTTACAATACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACATCTT
GGAAGAGTTGTATACAGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATCTTCCCCTGGCTGACTGAGCACGGAATGTGCATTATGCCCATGCCTAAGGCCC
TATGCTTACCTATATGGATTATCGGTGGTGATTTCAACATCACTCGCTGGTCTTGGGAACGGTCCCCTCTCTCCTTCACCCCAACCCGTGCCACAAGGAAATTCAATCGC
TTCATTGCCTCTGCCGCCCTACAAGACCTTCCCCTCTCAAACGGCAAGTACACTTGGTCTAGTTTCAGGCCAAATCCATCGATGTCGCTTATTGATAGGTTCTTGATCAC
AGACGAATTATCTACAAAATTTGGAACAATTGTGGTTCGCAAGTTAGATAGAGCTACATCTGATCATTTCCCGATTTGCCTCACTCTGGGGAATGATCGTTGGGGTCCTC
CTCCATTCAGATTTGTCAATGCTTGGCTATCTCATGCCTCCTTCCTTCATACTGTTGAGTCATGGTGGAAGGCAAACCCATTGTCTGGATGGCCTGGCCATAGTTTTATT
CAAAAGCTAAAAGGTCTTAAAAAGGAGTTGAAGCAATGGAACCAACAAGTTTTTGGCCAGCAATCGGTTAATAAAAACAGGCTGGGGCTAGAACTTGCGAATATCGATAA
GGTAGAAGAAAGTGGTTCCATTACCAAAGCCGATATTAACAGAAGGGTACAGATTAAGGCTGAATTGATCTCTATTGTGGCAAATGAAGAAGTTTTATGGCATCAAAGAT
GTAAGTTAAAATGGTTTAAAGAAGGAGATATTGACTCATCTTTCTTTCACAGACTTATGGCTATCAACAGAAGGAGGAATACCATTTCTGAAATTCTTGCAGCCCATGGA
CAAAGCCTTATAATCGACAAAGATATAGAGAAGGAATTTATTGATTTTTATCAGAAGCTCTTTAGTAAACAGAACCATAGACCCTCTCTGCCTAACATCAGTGACTGGAG
TCCCATTTCCCTTGACCAGCAAGCTGCTTTAGAATCTCTTTTTTCAGAACCTGAGATCTATAAGGCAGTACAAGACTTGGGCTCGAATAAGACCCCCGGACCGGACGGCT
TCACGGCAGAATTCTTTAAAAAATCTTGGAACACCCTCAAGGATGACATAAAGGGAGTGTTCAATGATTTTTTTAAGAGCAGTATTATTAATGCAAGCCTCAACGAAACC
TACATCTGCCTCATCCCTAAGAAGATTGGTGCTAAAGCTGTGGGGGAGTATAGACCTATCAGTTTGACAACATGTCTTTACAAGGTTATTGCCCGAGTTCTTTCTAAACG
TCTTAAAAGAATCCTCCCATATATTATTATTACTCCGTATCAATCAGCTTTTGTGGGGAATAGGCAAATCATGGATGCCTCCCTGATTGCCAATGAGCTTATTGATGAAT
TCAACAGAAGAAAGAGACAAGGGGTGGTTATCAAGCTCGACATAGAGAAAGCTTTCGACACCGTTGACTGGAATTTTCTTGATGAGGTCTTAAAAGTCAAACGCTTTGGT
CATTTATGGAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAATACAACATAGGTTATGTCGTCGGTATTCATGGCAAGATACCATCATCTTCGATGGCTTCCAGTGCTACACGCGCCGACGAGAAAGTATTTGTTGCGGATAA
CGAACGGGTTCATAACCCACGCGCCACCACCCTGAATGATGAAACAAAAGGGGAAAAGAAACAGTCCATATACAGAAACGACTTTCCACAGGCCCCAAATGATGTATTGG
ACACATCCACTGCTCTGATGTCTGCGTCTTTATCTGACAGGGACCCTCTGGCGCCATCTATCACCACAGAGCCCCAATCCCCAAAATCCTCTCTTGATAAGCACTCGGTA
AAAGTACCCAATAAAACCCTACCTTTTGACGAAGAGCCTCAATCCGATCAGCAGGGAATAGGTTTACAATACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACATCTT
GGAAGAGTTGTATACAGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATCTTCCCCTGGCTGACTGAGCACGGAATGTGCATTATGCCCATGCCTAAGGCCC
TATGCTTACCTATATGGATTATCGGTGGTGATTTCAACATCACTCGCTGGTCTTGGGAACGGTCCCCTCTCTCCTTCACCCCAACCCGTGCCACAAGGAAATTCAATCGC
TTCATTGCCTCTGCCGCCCTACAAGACCTTCCCCTCTCAAACGGCAAGTACACTTGGTCTAGTTTCAGGCCAAATCCATCGATGTCGCTTATTGATAGGTTCTTGATCAC
AGACGAATTATCTACAAAATTTGGAACAATTGTGGTTCGCAAGTTAGATAGAGCTACATCTGATCATTTCCCGATTTGCCTCACTCTGGGGAATGATCGTTGGGGTCCTC
CTCCATTCAGATTTGTCAATGCTTGGCTATCTCATGCCTCCTTCCTTCATACTGTTGAGTCATGGTGGAAGGCAAACCCATTGTCTGGATGGCCTGGCCATAGTTTTATT
CAAAAGCTAAAAGGTCTTAAAAAGGAGTTGAAGCAATGGAACCAACAAGTTTTTGGCCAGCAATCGGTTAATAAAAACAGGCTGGGGCTAGAACTTGCGAATATCGATAA
GGTAGAAGAAAGTGGTTCCATTACCAAAGCCGATATTAACAGAAGGGTACAGATTAAGGCTGAATTGATCTCTATTGTGGCAAATGAAGAAGTTTTATGGCATCAAAGAT
GTAAGTTAAAATGGTTTAAAGAAGGAGATATTGACTCATCTTTCTTTCACAGACTTATGGCTATCAACAGAAGGAGGAATACCATTTCTGAAATTCTTGCAGCCCATGGA
CAAAGCCTTATAATCGACAAAGATATAGAGAAGGAATTTATTGATTTTTATCAGAAGCTCTTTAGTAAACAGAACCATAGACCCTCTCTGCCTAACATCAGTGACTGGAG
TCCCATTTCCCTTGACCAGCAAGCTGCTTTAGAATCTCTTTTTTCAGAACCTGAGATCTATAAGGCAGTACAAGACTTGGGCTCGAATAAGACCCCCGGACCGGACGGCT
TCACGGCAGAATTCTTTAAAAAATCTTGGAACACCCTCAAGGATGACATAAAGGGAGTGTTCAATGATTTTTTTAAGAGCAGTATTATTAATGCAAGCCTCAACGAAACC
TACATCTGCCTCATCCCTAAGAAGATTGGTGCTAAAGCTGTGGGGGAGTATAGACCTATCAGTTTGACAACATGTCTTTACAAGGTTATTGCCCGAGTTCTTTCTAAACG
TCTTAAAAGAATCCTCCCATATATTATTATTACTCCGTATCAATCAGCTTTTGTGGGGAATAGGCAAATCATGGATGCCTCCCTGATTGCCAATGAGCTTATTGATGAAT
TCAACAGAAGAAAGAGACAAGGGGTGGTTATCAAGCTCGACATAGAGAAAGCTTTCGACACCGTTGACTGGAATTTTCTTGATGAGGTCTTAAAAGTCAAACGCTTTGGT
CATTTATGGAGGAAGTGA
Protein sequenceShow/hide protein sequence
MEEYNIGYVVGIHGKIPSSSMASSATRADEKVFVADNERVHNPRATTLNDETKGEKKQSIYRNDFPQAPNDVLDTSTALMSASLSDRDPLAPSITTEPQSPKSSLDKHSV
KVPNKTLPFDEEPQSDQQGIGLQYTDLIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGMCIMPMPKALCLPIWIIGGDFNITRWSWERSPLSFTPTRATRKFNR
FIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFI
QKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQRCKLKWFKEGDIDSSFFHRLMAINRRRNTISEILAAHG
QSLIIDKDIEKEFIDFYQKLFSKQNHRPSLPNISDWSPISLDQQAALESLFSEPEIYKAVQDLGSNKTPGPDGFTAEFFKKSWNTLKDDIKGVFNDFFKSSIINASLNET
YICLIPKKIGAKAVGEYRPISLTTCLYKVIARVLSKRLKRILPYIIITPYQSAFVGNRQIMDASLIANELIDEFNRRKRQGVVIKLDIEKAFDTVDWNFLDEVLKVKRFG
HLWRK