CuGenDBv2

Lag0005921 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005921
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:33722514..33726300
RNA-Seq Expression: Lag0005921
Synteny: Lag0005921
Gene Ontology terms:
    GO:0003824 - catalytic activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily
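
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below parses that string and derives the locus length. It assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state; the names `Locus` and `parse_location` are hypothetical and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as 'chr6:33722514..33726300'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(seqid, start, end)


if __name__ == "__main__":
    locus = parse_location("chr6:33722514..33726300")
    print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus of Lag0005921 spans 3787 bp on chr6.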


Homology
GenBank top hits    e value    % identity
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]    2.5e-172    38.99
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L   FW EL D+AGLA  RW +GGDFNV R S EK      T SM+ F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR L + ++E++  R+ ++WR + ++ W+KEGD N++FF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G+ +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R++ V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD VL++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS

CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]    2.5e-172    38.75
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L    W EL D+AGLA  RW +GGDFNV R S EK     +T SM++F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR + + ++E++  R+ ++WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G  +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R+++V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD V+++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]    1.0e-173    39.81
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGI++LW        E + G +S+++     +   FWL+++YGP        FW EL DL GL   RW +GGDFNV R   EK  +  +T +MR F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID PL+N +FTWS   +      LDRFL S++    F  +  + LPR TSDH PI L    + WGP+PFRFEN WLLH +FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWN-KSHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KLK +K++L  WN  +   + E   ++++ L  +D IE    L+ D   +R L + ++ED+  ++ V WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWN-KSHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        HR+   R+ +  I  L+S  G +L    DI +E ++F+  LY+K     +    IDW  I       L+ PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           WD                   N + N T+I L+PK+  S  ++DYRPISL++  YKIIA+VLS RL++V   TI+++Q AFV  R ILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++G+V K+D EKA+D VDW FLD VLQ K                                   G+RQGDPLSPFLF LV+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
          G      +G     V+ LQFADDT+ FS +    + NL  I+ +F + SGL IN  KS + GIN   + +    S F CR+ +WP +YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PV+ERI ++L  WK  Y+S GGR TLIQ+ LS +P Y+LSL+K+P+ +A  ++K+ R+F W G+      H ++WE    P  LGGLG G   L
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKY
        RN ALL KW+WRF  E+  LW K+I + Y
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKY

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]    2.5e-172    38.75
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L    W EL D+AGLA  RW +GGDFNV R S EK     +T SM++F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR + + ++E++  R+ ++WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G  +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R+++V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD V+++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]    2.5e-172    38.75
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L    W EL D+AGLA  RW +GGDFNV R S EK     +T SM++F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR + + ++E++  R+ ++WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G  +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R+++V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD V+++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
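
The alignment blocks above follow the usual BLAST layout: the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the snippet below counts identities and positives for one block from its query and middle lines. The function name `midline_stats` is hypothetical, the label prefix (`Query:  ` and the matching indentation of the middle line) is assumed to have been stripped already, and the per-block figures it returns need not match the page's overall % identity, which is computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns in one BLAST alignment block.

    `query` and `midline` are the sequence portions of the 'Query:' line and
    the unlabeled middle line, with the label prefix already removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the middle line marks an identical residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    columns = len(query)
    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100.0 * identical / columns, 2) if columns else 0.0,
    }


if __name__ == "__main__":
    # Final block of the CAN65484.1 alignment, copied from the record above.
    stats = midline_stats(
        "RNSALLAKWIWRFLHEQESLWRKLITTKYYS",
        "RN ALL KW+WR+  E  +LW ++I + Y S",
    )
    print(stats)
```

Summing the `identical` and `columns` counts over every block of a hit gives an identity estimate for the full alignment, under the assumption that gap columns are counted in the denominator.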

TrEMBL top hits    e value    % identity
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein    4.9e-174    39.81
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGI++LW        E + G +S+++     +   FWL+++YGP        FW EL DL GL   RW +GGDFNV R   EK  +  +T +MR F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID PL+N +FTWS   +      LDRFL S++    F  +  + LPR TSDH PI L    + WGP+PFRFEN WLLH +FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWN-KSHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KLK +K++L  WN  +   + E   ++++ L  +D IE    L+ D   +R L + ++ED+  ++ V WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWN-KSHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        HR+   R+ +  I  L+S  G +L    DI +E ++F+  LY+K     +    IDW  I       L+ PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           WD                   N + N T+I L+PK+  S  ++DYRPISL++  YKIIA+VLS RL++V   TI+++Q AFV  R ILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++G+V K+D EKA+D VDW FLD VLQ K                                   G+RQGDPLSPFLF LV+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
          G      +G     V+ LQFADDT+ FS +    + NL  I+ +F + SGL IN  KS + GIN   + +    S F CR+ +WP +YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PV+ERI ++L  WK  Y+S GGR TLIQ+ LS +P Y+LSL+K+P+ +A  ++K+ R+F W G+      H ++WE    P  LGGLG G   L
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKY
        RN ALL KW+WRF  E+  LW K+I + Y
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKY

A0A438JX47 LINE-1 retrotransposable element ORF2 protein    1.2e-172    38.75
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L    W EL D+AGLA  RW +GGDFNV R S EK     +T SM++F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR + + ++E++  R+ ++WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G  +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R+++V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD V+++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS

A5CAA2 Reverse transcriptase domain-containing protein    1.2e-172    38.75
Query:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN
        GASGGIL++W     +  E + G +S+SI   +      WLSA+YGP+   L    W EL D+AGLA  RW +GGDFNV R S EK     +T SM++F+
Subjt:  GASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFN

Query:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN
         +I    LID+PL++ SFTWS    +     LDRFL SN+  Q F  +    LPR TSDH+PI L      WGP+PFRFEN WL H  FK     WW + 
Subjt:  LWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQN

Query:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF
          +GW GH FM KL+ +K +L  WNK S   + +    ++S L   D++E    LS +   QR + + ++E++  R+ ++WR + ++ W+KEGD N+KFF
Subjt:  PIDGWPGHGFMMKLKILKTELLRWNK-SHRSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFF

Query:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------
        H++   R+ +  I EL + +G  +     I++E + +++KLYT      +    +DWS I    A  LE PFT++E                        
Subjt:  HRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE------------------------

Query:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL
           W+                   N + N ++I L+PK+  SR ++D+RPISLI+  YKIIA+VL+ R+++V   TI   Q AFV  RQILDA LIA+E+
Subjt:  ---WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAF-TIAENQMAFVSNRQILDAALIASEL

Query:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA
        +D+ + + ++GVV K+D EKA+D V WDFLD V+++K                                   G+RQGDPLSPFLF +V+D LSR+L    
Subjt:  IDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRA

Query:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK
            +    +G +   V+HLQFADDT+ FS+S    +  L +++ +F   SGL +N  KS + GIN++  ++        C+   WP  YLGLPLGGNPK
Subjt:  SLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK

Query:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL
           FW PVIERI ++L  W+  Y+S GGR TLIQ+ L+ MP Y+LSL+K+P+ VA  ++++ RDF W G       H + W+    P   GGLG G   +
Subjt:  LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQL

Query:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS
        RN ALL KW+WR+  E  +LW ++I + Y S
Subjt:  RNSALLAKWIWRFLHEQESLWRKLITTKYYS

M5VS59 Reverse transcriptase domain-containing protein (Fragment)    2.4e-176    40.24
Query:  VGASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNF
        +G SGGI +LW     ++ +++ G +S+SI IV   G D+WLS IYGP +    + FW+EL DL G   D+W LGGDFNV R+S EKS++  VT+SMR+F
Subjt:  VGASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNF

Query:  NLWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQ
        N +I   NL D  L N SFTWS    +     LDRFL+S      F   + K LPR+TSDH PI L+   + WGPSPFRFEN WL H DF   +  WW +
Subjt:  NLWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQ

Query:  NPIDGWPGHGFMMKLKILKTELLRWNKSH-RSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKF
        + I GW G+ FM +LK+LK++L  W+K     V  +L    ++L  LD  E  + L    R++R  L  +I D+  ++ V WR R K+ W +EGD NTKF
Subjt:  NPIDGWPGHGFMMKLKILKTELLRWNKSH-RSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKF

Query:  FHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE-----------------------
        FHR+    +++N I +L   D   +  D +IE+E I F++ LY+ ++++ +    ++W  I + +A  LE PF  +E                       
Subjt:  FHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDE-----------------------

Query:  ----WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASE
            W+                   N   NET+ICLIPK+ +S  V D RPISL++  YK+I++VL++RL++V   TI+++Q AFV  RQILDA L+A+E
Subjt:  ----WD------------------YNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASE

Query:  LIDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFR
        ++++ +   +KG+V K+D EKA+D V+W+F+D VL  K                                   G+RQGDPLSPFLF LVSD LSR++   
Subjt:  LIDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFR

Query:  ASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNP
          +  +     G   + V+HLQFADDT+           NL  ++K+F   SG+ IN  KS +LGIN   + +      +GC +G WP  YLGLPLGGNP
Subjt:  ASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNP

Query:  KLPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQ
        +  +FW+PV+++++K+L  WK   +SKGGR TLIQA LSS+P YY+SL+K+P  VA  +++++R+F WEG       H ++WE        GGLGIG+ +
Subjt:  KLPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQ

Query:  LRNSALLAKWIWRFLHEQESLWRKLITTKY
         RN AL AKW+WRF  E  SLW ++I +KY
Subjt:  LRNSALLAKWIWRFLHEQESLWRKLITTKY

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)    7.1e-173    40.72
Query:  VGASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNF
        +G SGGI +LW     ++ +++ G +S+SI I    G D+WLS IYGP +    + FW+EL DL G   D W LGGDFNV R+S EKS++  VT+SMR+F
Subjt:  VGASGGILLLWRDPDFTIRETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNF

Query:  NLWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQ
        N +I   NL D  L N SFTWS    +     LDRFL+S    + F   + K LPR+TSDH PI L+   + WGPSPFRFEN WL H DFK  +  WW +
Subjt:  NLWIDSYNLIDIPLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQ

Query:  NPIDGWPGHGFMMKLKILKTELLRWNKSH-RSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKF
        + I GW G+ FM +LK+LK++L  W+K     V  +L    ++L  LD  E  + L    R++R  L  +I D+  ++ V WR R K+ W ++GD NTKF
Subjt:  NPIDGWPGHGFMMKLKILKTELLRWNKSH-RSVMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKF

Query:  FHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDEWD------------------YNV
        FHR+    +++N I +L   D   +  D +IE+E I F++ LY+ +++        D  + +        + F    W+                   N 
Subjt:  FHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNIDWSRIREAQATTLEVPFTDDEWD------------------YNV

Query:  ALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASELIDDWKTTNKKGVVIKLDLEKAFDKVD
          NET+ICLIPK+ +S  V DYRPISL++  YK+I++VL++ L++V   TI+++Q AFV  RQILDA L+A+E++++ +   +KG+V K+D EKA+D V+
Subjt:  ALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASELIDDWKTTNKKGVVIKLDLEKAFDKVD

Query:  WDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDT
        W+F+D V+  K                                   G+RQGDPLSPFLF LVSD LSRL+     +  +     G   + V+HLQFADDT
Subjt:  WDFLDAVLQIK-----------------------------------GIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDT

Query:  LLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPKLPSFWHPVIERIQKKLHSWKYLYISK
        +           NL  ++K+F   SG+ IN  KS +LGIN     +      +GC +G WP  YLGLPLGGNP+  +FW+PV+E+++K+L  WK   +SK
Subjt:  LLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPKLPSFWHPVIERIQKKLHSWKYLYISK

Query:  GGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLIT
        GGR TLIQA LSS+P YY+SL+K+P  VA  +++++R+F WEG       H ++WE        GGLGIG+ + R  AL AKW+WRF  E  SLW ++I 
Subjt:  GGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLIT

Query:  TKY
        +KY
Subjt:  TKY

SwissProt top hits    e value    % identity
O00370 LINE-1 retrotransposable element ORF2 protein    6.7e-27    20.71
Query:  QELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDI--PLQNGSFTWSRYGS-HRSLSLLDRFLISNDCLQKFGSAQLKRLP
        Q L DL        ++ GDFN      ++S+   V +  +  N  +   +LIDI   L   S  ++ + + H + S +D  + S   L K    ++  + 
Subjt:  QELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDI--PLQNGSFTWSRYGS-HRSLSLLDRFLISNDCLQKFGSAQLKRLP

Query:  RVTSDHYPISLNF--------GEISWGPSPFRFENSWLLHKDFKSVVVSWWNQNPIDG------WPGHGFMMKLKILKTELLRWNKSHRSVMENLFMLIS
           SDH  I L             +W  +     + W +H + K+ +  ++  N          W     + + K +     +  K  RS ++    L S
Subjt:  RVTSDHYPISLNF--------GEISWGPSPFRFENSWLLHKDFKSVVVSWWNQNPIDG------WPGHGFMMKLKILKTELLRWNKSHRSVMENLFMLIS

Query:  QLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKL
        QLK L+  E        +R +   ++ ++++I  +  +   +  +  + +  ++  +   R+I  ++ KN I  + +  G       +I+    ++Y+ L
Subjt:  QLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKL

Query:  Y-TKDQDMHFLPTNID---WSRIREAQATTLEVPFTDDEWDYNV---------------------------------------------ALNETYICLIP
        Y  K +++  + T +D     R+ + +  +L  P T  E    +                                             +  E  I LIP
Subjt:  Y-TKDQDMHFLPTNID---WSRIREAQATTLEVPFTDDEWDYNV---------------------------------------------ALNETYICLIP

Query:  K-RVDSRSVNDYRPISLISCAYKIIARVLSNRLKQ-VAFTIAENQMAFVSNRQILDAALIASELIDDW-KTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQ
        K   D+    ++RPISL++   KI+ ++L+NR++Q +   I  +Q+ F+   Q       +  +I    +  +K  V+I +D EKAFDK+   F+   L 
Subjt:  K-RVDSRSVNDYRPISLISCAYKIIARVLSNRLKQ-VAFTIAENQMAFVSNRQILDAALIASELIDDW-KTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQ

Query:  IKGI-----------------------------------RQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDAS
          GI                                   RQG PLSP LF +V + L+R +     +  I    +G   + ++   FADD +++  +   
Subjt:  IKGI-----------------------------------RQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDAS

Query:  AVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK--LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLI
        +  NL  +I  F K SG  IN  KS+    N + Q       +    +      YLG+ L  + K      + P+++ I++  + WK +  S  GR  ++
Subjt:  AVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK--LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLI

Query:  QASLSSMPIYYLSL--YKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQE
        + ++    IY  +    KLP      L+K    F W     N     I        +  GG+ + +F+L   A + K  W +   ++
Subjt:  QASLSSMPIYYLSL--YKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQE

P0C2F6 Putative ribonuclease H protein At1g65750    3.7e-17    34.96
Query:  VIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLA
        ++ER+  ++  W+   +S  GR TL +A LSSMP++ +S   LP  +   LD++ R F W  +      H +KW     P   GGLG+   +  N AL++
Subjt:  VIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLA

Query:  KWIWRFLHEQESLWRKLITTKYY
        K  WR L E+ SLW  ++  KY+
Subjt:  KWIWRFLHEQESLWRKLITTKYY

P11369 LINE-1 retrotransposable element ORF2 protein; e value 4.2e-21; % identity 20.87
Query:  ILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDI-----PLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLN
        I+ GDFN    S ++S    + R        +   +L DI     P   G   +S    H + S +D  +     L ++ + ++  +P + SDH+ + L 
Subjt:  ILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDI-----PLQNGSFTWSRYGSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLN

Query:  F-GEISWGPSPFRFE------NSWLLHKDFKSVVVSW--WNQNPIDGWPGHGFMMKLKILKTELLRWNKSHRS-VMENLFMLISQLKTLDNIEDYDCLSV
        F   I+ G   F ++      N  L+ +  K  +  +  +N+N    +P     MK   L+ +L+  + S +     +   L + LK L+  ++ +    
Subjt:  F-GEISWGPSPFRFE------NSWLLHKDFKSVVVSW--WNQNPIDGWPGHGFMMKLKILKTELLRWNKSHRS-VMENLFMLISQLKTLDNIEDYDCLSV

Query:  DQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLY-TKDQDMHFLPTNID
         +R +   L+ +I  +  R  +   ++ +  + ++ ++  K   R+    + K  I+++ +  G       +I+     FY++LY TK +++  +   +D
Subjt:  DQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLY-TKDQDMHFLPTNID

Query:  WSRI-----------------REAQATTLEVP-----------------FTDD--------------EWDYNVALNETYICLIPK-RVDSRSVNDYRPIS
          ++                 +E +A    +P                 F +D              E     +  E  I LIPK + D   + ++RPIS
Subjt:  WSRI-----------------REAQATTLEVP-----------------FTDD--------------EWDYNVALNETYICLIPK-RVDSRSVNDYRPIS

Query:  LISCAYKIIARVLSNRLKQ-VAFTIAENQMAFVSNRQ---ILDAALIASELIDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIKGI----------
        L++   KI+ ++L+NR+++ +   I  +Q+ F+   Q    +  ++     I+  K  +K  ++I LD EKAFDK+   F+  VL+  GI          
Subjt:  LISCAYKIIARVLSNRLKQ-VAFTIAENQMAFVSNRQ---ILDAALIASELIDDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIKGI----------

Query:  -------------------------RQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEK
                                 RQG PLSP+LF +V + L+R +  +  +  I    IG   + ++ L  ADD +++ +   ++   L ++I  F +
Subjt:  -------------------------RQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEK

Query:  ASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK--LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSL
          G  IN  KS       + Q            +      YLG+ L    K      +  + + I++ L  WK L  S  GR  +++ ++    IY  + 
Subjt:  ASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPK--LPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSL

Query:  --YKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAK--WIWRFLHEQESLWRKL
           K+P++    L+  I  F W           +K + T      GG+ + + +L   A++ K  W W +   Q   W ++
Subjt:  --YKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAK--WIWRFLHEQESLWRKL

P14381 Transposon TX1 uncharacterized 149 kDa protein; e value 1.3e-27; % identity 23.6
Query:  VMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGL--AQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDIPLQNG----SFTWSRY-G
        V   G  + L  +Y P+       F++ L        + +  I+GGDFN T  + +++       S       I  ++L+D+  +      +FT+ R   
Subjt:  VMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGL--AQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDIPLQNG----SFTWSRY-G

Query:  SHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSP--FRFENSWLLHKDF-KSVVVSW--WNQ-----NPIDGWPGHGFMMKL
         H S S +DR  IS+  + +  S+ ++  P   SDH  +SL        P    + F NS L  + F KSV  +W  W         ++ W   G  + L
Subjt:  SHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSP--FRFENSWLLHKDF-KSVVVSW--WNQ-----NPIDGWPGHGFMMKL

Query:  KILKTELLRWNKSHRSVMENLFMLISQLKTLD----NIEDYDCLSVDQRTQRHLLQ--EQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKR
        K+L  E  +     R+         ++++ L+    ++E     S DQ  Q   L+  E + ++  R       R ++  L + D  ++FF+ +   +  
Subjt:  KILKTELLRWNKSHRSVMENLFMLISQLKTLD----NIEDYDCLSVDQRTQRHLLQ--EQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKR

Query:  KNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNID-----WSR---IREAQATTLEVPFTDDE-------------------------
        +  I+ L + DG  L     I      FYQ L++ D      P + D     W     + E +   LE P T DE                         
Subjt:  KNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMHFLPTNID-----WSR---IREAQATTLEVPFTDDE-------------------------

Query:  --W-----DYNVALNETY-------------ICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASELI
          W     D++  L E +             + L+PK+ D R + ++RP+SL+S  YKI+A+ +S RLK V A  I  +Q   V  R I D   +  +L+
Subjt:  --W-----DYNVALNETY-------------ICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQV-AFTIAENQMAFVSNRQILDAALIASELI

Query:  DDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQI-----------------------------------KGIRQGDPLSPFLFILVSDYLSRLLSFRAS
           + T      + LD EKAFD+VD  +L   LQ                                    +G+RQG PLS  L+ L  +    LL  R +
Subjt:  DDWKTTNKKGVVIKLDLEKAFDKVDWDFLDAVLQI-----------------------------------KGIRQGDPLSPFLFILVSDYLSRLLSFRAS

Query:  LGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPT---TYLGLPLGGN
         G +   P     + V    +ADD +L +  D   ++   +  +++  AS   IN+ KS   G+      +      F  R   W +    YLG+ L   
Subjt:  LGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKASGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPT---TYLGLPLGGN

Query:  --PKLPSFWHPVIERIQKKLHSWKYL--YISKGGRYTLIQASLSSMPIYYLSLYKLPSK--VAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGG
          P   +F   + E +  +L  WK     +S  GR  +I   ++S  I+Y  +   P++  +AK+  +++ DF W G       H +    + LP   GG
Subjt:  --PKLPSFWHPVIERIQKKLHSWKYL--YISKGGRYTLIQASLSSMPIYYLSLYKLPSK--VAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGG

Query:  LGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSRDGFCSFASQYWSIMVEAFGWNM-VLPGTIFDLL
         G+   + +      + I R+L+   S     + + +Y +     +  Q + I  E F  N+  LP    D L
Subjt:  LGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSRDGFCSFASQYWSIMVEAFGWNM-VLPGTIFDLL

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment); e value 2.2e-09; % identity 28.09
Query:  LIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAFTIAENQMAFVSNRQILDAALIASELIDDW----KTTNKKGVVIKLDLEKAFDKVDWDFLD
        LIPK  D  + +++RPI++ S   +++ R+L+ RL + A  +   Q  +      +D  L+ S L+D +    +   K   V+ LD+ KAFD V    + 
Subjt:  LIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAFTIAENQMAFVSNRQILDAALIASELIDDW----KTTNKKGVVIKLDLEKAFDKVDWDFLD

Query:  AVLQ------------------------------------IKGIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFS
          LQ                                     +G++QGDPLSPFLF  V D L  L S +++ G   T  IG   + V  L FADD LL  
Subjt:  AVLQ------------------------------------IKGIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFS

Query:  TSDASAVDNLFDIIKIFEKASGLNINFGKSELLGI
         +D      L  +   F +  G+++N  KS  + +
Subjt:  TSDASAVDNLFDIIKIFEKASGLNINFGKSELLGI

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 5.6e-13; % identity 26.22
Query:  RSMRNFNLWIDSYNLIDIPLQNGSFTWSRY-GSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEI-SWGPSPFRFENSWLLHKDF-K
        R +  F   +   +L+DIP +   +TWS +   +  +  LDR + + D    F SA         SDH P  +    +       FR+ +    H  F  
Subjt:  RSMRNFNLWIDSYNLIDIPLQNGSFTWSRY-GSHRSLSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEI-SWGPSPFRFENSWLLHKDF-K

Query:  SVVVSWWNQNPIDGWPGHGFMM--KLKILK--TELLR----WNKSHRS--VMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYW
        S+ V+W  Q P+     H F +   LK  K   +LL      N  H++   +++L  + SQL  L N  D     V+     H+ +++     A    ++
Subjt:  SVVVSWWNQNPIDGWPGHGFMM--KLKILK--TELLR----WNKSHRS--VMENLFMLISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYW

Query:  RHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDM
        R + ++ WL++GD NT+FFH++I A + KN I  L   D V +     +++  + +Y  L   D D+
Subjt:  RHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDM

AT4G29090.1 Ribonuclease H-like superfamily protein; e value 9.9e-10; % identity 29.79
Query:  SMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSR
        ++P Y ++ + LP  V K +  ++ DF+W       G+H   W+        GG+G  + +  N ALL K +WR L   ESL  K+  ++Y+ +
Subjt:  SMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 3.5e-07; % identity 22.45
Query:  SMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWE-TTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSRDGF--
        ++P+Y +S ++L   + K L   + +F+W        +  + W+   +     GGLG  +    N ALLAK  +R +H+  +L  +L+ ++Y+       
Subjt:  SMPIYYLSLYKLPSKVAKVLDKIIRDFFWEGSTRNGGLHNIKWE-TTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSRDGF--

Query:  CSFASQYWSIMVEAFGWNMVLPGTIFDLLASVFVGHPFHGVKKTLWL
        CS  ++       ++ W  ++ G   +LL+   +     G+   +WL
Subjt:  CSFASQYWSIMVEAFGWNMVLPGTIFDLLASVFVGHPFHGVKKTLWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase); e value 3.5e-07; % identity 47.27
Query:  KGIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDT
        +G+RQGDPLSP+LFIL ++ LS L       G++    +  +S  +NHL FADDT
Subjt:  KGIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDT


Sequences
CDS sequence
ATGGATTTCTTTCTAGAGGAGCCCACCCTTTCGATTGAAGACTTGCACCAACCCAATGAAGTGTTGCCTTTAATTTCAAAGAGCCCAATCGCTCAAGATGAAGCTCCCCC
AGTTATTGAAGAATTACAGAAGGCGATCCTTTCACCCATCGGAAACACCACAGAACCAACGGAAAATTCAAGCTACAATTTACCTCTCCCGATCAGACAACTAGCTCCAA
TACTTATTGAACATGGGTTATGCATCATGGCAATACCTCCTCCGAAAAAGAAAGTGGGCGCTTCGGGTGGTATTCTTTTATTATGGAGAGATCCGGACTTTACTATCAGA
GAAACTATCCAAGGTCTGTACTCTTTATCTATTCACATTGTTATGGCTGATGGGTTTGATTTCTGGTTGTCAGCTATTTATGGCCCCTCTCAATATGAATTGCATGATGG
TTTTTGGCAAGAATTACATGATTTAGCTGGATTGGCACAAGATCGATGGATCCTTGGTGGTGATTTTAATGTGACTCGATGGTCCTGGGAGAAATCTTCTGATCATCCAG
TCACTCGAAGCATGAGAAATTTTAATCTCTGGATTGATTCTTATAATCTTATAGATATCCCTTTGCAAAATGGCAGTTTCACTTGGTCCAGGTACGGAAGCCATAGATCT
CTATCGCTGCTGGACCGCTTTTTAATTTCGAATGATTGCCTCCAGAAGTTCGGATCAGCGCAACTGAAGAGACTACCGAGAGTTACATCTGACCATTATCCAATTAGCCT
AAATTTTGGTGAGATATCGTGGGGTCCCAGCCCATTCAGATTTGAAAATTCTTGGCTGCTTCATAAAGATTTCAAATCTGTAGTTGTTTCATGGTGGAACCAAAACCCCA
TTGATGGATGGCCTGGTCATGGTTTTATGATGAAGTTGAAAATATTAAAAACCGAACTTCTTAGATGGAATAAATCTCACAGGTCAGTTATGGAAAATCTCTTTATGCTT
ATATCTCAACTTAAAACACTGGATAATATAGAAGACTACGACTGCTTATCCGTTGATCAAAGGACACAAAGGCATCTTCTTCAAGAGCAAATTGAGGATATAACTGCAAG
AGATCATGTATACTGGAGACACCGTTGTAAACTTACTTGGTTGAAGGAGGGCGATGAGAATACTAAATTTTTTCATCGCATTATTGCTGCTCGTAAACGGAAAAATTCTA
TTTCTGAATTATTGTCTCGTGATGGAGTGAGTCTTTTAACAGATGGAGACATTGAGCAGGAGTTCATTGATTTTTATCAGAAATTATACACTAAAGATCAAGATATGCAC
TTCCTCCCAACCAACATTGATTGGAGTCGAATTAGGGAAGCTCAAGCGACCACATTAGAAGTTCCTTTTACGGATGATGAGTGGGATTATAATGTGGCCTTAAATGAAAC
TTATATCTGTCTTATTCCAAAAAGAGTGGATTCCAGATCGGTTAATGATTATCGTCCAATTAGCCTTATTTCTTGTGCTTATAAGATTATTGCTCGTGTCTTATCGAATC
GACTAAAGCAAGTTGCATTTACAATTGCTGAAAATCAGATGGCTTTTGTGTCTAATCGACAAATCCTGGATGCTGCATTGATTGCAAGTGAGCTGATTGATGATTGGAAA
ACCACCAATAAGAAAGGTGTGGTTATAAAGCTGGATTTAGAAAAAGCTTTTGATAAAGTTGATTGGGATTTTCTGGATGCGGTCCTTCAGATTAAAGGTATTCGTCAGGG
TGATCCATTATCTCCTTTTCTTTTTATCTTGGTTTCTGACTACTTAAGTCGACTTTTGTCTTTCCGTGCGAGCTTGGGTAAGATTGCCACTCACCCTATTGGTACCTCTT
CTCTCCATGTGAATCATTTGCAATTTGCTGATGATACTTTATTATTCTCTACTTCTGATGCTTCGGCGGTGGATAATCTATTTGATATTATTAAAATTTTTGAGAAGGCA
TCTGGTTTGAACATTAATTTTGGTAAAAGTGAATTATTGGGGATTAATGTTGATGATCAGAATATGGGAGTTTTCACTTCAAAGTTTGGCTGTAGACTTGGAGATTGGCC
AACAACTTATCTTGGCCTTCCTTTAGGTGGCAATCCAAAATTGCCCTCGTTTTGGCACCCAGTGATTGAAAGAATTCAGAAAAAGCTTCATAGTTGGAAGTATTTGTACA
TTTCAAAAGGTGGTAGATATACTCTAATTCAAGCTTCTTTGTCCAGCATGCCCATTTATTATCTCTCTTTGTATAAATTACCTTCAAAGGTGGCTAAAGTTTTGGATAAA
ATTATTCGTGATTTCTTTTGGGAAGGATCAACCAGAAATGGAGGATTACATAATATTAAATGGGAGACAACTCAACTTCCTCATATTTTGGGAGGTCTAGGAATTGGGAA
TTTTCAGCTTCGAAATTCTGCATTGTTGGCTAAATGGATTTGGAGGTTTCTTCATGAACAGGAGAGTCTTTGGCGGAAATTAATAACCACTAAATATTATTCTAGAGATG
GTTTTTGTTCTTTTGCATCACAATATTGGTCCATTATGGTGGAAGCCTTTGGGTGGAATATGGTTTTGCCCGGAACTATTTTTGATCTCCTTGCTTCAGTTTTTGTGGGT
CATCCCTTTCATGGTGTAAAGAAAACTCTTTGGCTTGCCATCAATAGAGTTTTCTTTTGGTATCTTTGGTGTGAGAGGAATGTCAGAATTTTCAGGGACGTCTCTCTCAA
TTTTACTGCTTTTTGA
mRNA sequence
ATGGATTTCTTTCTAGAGGAGCCCACCCTTTCGATTGAAGACTTGCACCAACCCAATGAAGTGTTGCCTTTAATTTCAAAGAGCCCAATCGCTCAAGATGAAGCTCCCCC
AGTTATTGAAGAATTACAGAAGGCGATCCTTTCACCCATCGGAAACACCACAGAACCAACGGAAAATTCAAGCTACAATTTACCTCTCCCGATCAGACAACTAGCTCCAA
TACTTATTGAACATGGGTTATGCATCATGGCAATACCTCCTCCGAAAAAGAAAGTGGGCGCTTCGGGTGGTATTCTTTTATTATGGAGAGATCCGGACTTTACTATCAGA
GAAACTATCCAAGGTCTGTACTCTTTATCTATTCACATTGTTATGGCTGATGGGTTTGATTTCTGGTTGTCAGCTATTTATGGCCCCTCTCAATATGAATTGCATGATGG
TTTTTGGCAAGAATTACATGATTTAGCTGGATTGGCACAAGATCGATGGATCCTTGGTGGTGATTTTAATGTGACTCGATGGTCCTGGGAGAAATCTTCTGATCATCCAG
TCACTCGAAGCATGAGAAATTTTAATCTCTGGATTGATTCTTATAATCTTATAGATATCCCTTTGCAAAATGGCAGTTTCACTTGGTCCAGGTACGGAAGCCATAGATCT
CTATCGCTGCTGGACCGCTTTTTAATTTCGAATGATTGCCTCCAGAAGTTCGGATCAGCGCAACTGAAGAGACTACCGAGAGTTACATCTGACCATTATCCAATTAGCCT
AAATTTTGGTGAGATATCGTGGGGTCCCAGCCCATTCAGATTTGAAAATTCTTGGCTGCTTCATAAAGATTTCAAATCTGTAGTTGTTTCATGGTGGAACCAAAACCCCA
TTGATGGATGGCCTGGTCATGGTTTTATGATGAAGTTGAAAATATTAAAAACCGAACTTCTTAGATGGAATAAATCTCACAGGTCAGTTATGGAAAATCTCTTTATGCTT
ATATCTCAACTTAAAACACTGGATAATATAGAAGACTACGACTGCTTATCCGTTGATCAAAGGACACAAAGGCATCTTCTTCAAGAGCAAATTGAGGATATAACTGCAAG
AGATCATGTATACTGGAGACACCGTTGTAAACTTACTTGGTTGAAGGAGGGCGATGAGAATACTAAATTTTTTCATCGCATTATTGCTGCTCGTAAACGGAAAAATTCTA
TTTCTGAATTATTGTCTCGTGATGGAGTGAGTCTTTTAACAGATGGAGACATTGAGCAGGAGTTCATTGATTTTTATCAGAAATTATACACTAAAGATCAAGATATGCAC
TTCCTCCCAACCAACATTGATTGGAGTCGAATTAGGGAAGCTCAAGCGACCACATTAGAAGTTCCTTTTACGGATGATGAGTGGGATTATAATGTGGCCTTAAATGAAAC
TTATATCTGTCTTATTCCAAAAAGAGTGGATTCCAGATCGGTTAATGATTATCGTCCAATTAGCCTTATTTCTTGTGCTTATAAGATTATTGCTCGTGTCTTATCGAATC
GACTAAAGCAAGTTGCATTTACAATTGCTGAAAATCAGATGGCTTTTGTGTCTAATCGACAAATCCTGGATGCTGCATTGATTGCAAGTGAGCTGATTGATGATTGGAAA
ACCACCAATAAGAAAGGTGTGGTTATAAAGCTGGATTTAGAAAAAGCTTTTGATAAAGTTGATTGGGATTTTCTGGATGCGGTCCTTCAGATTAAAGGTATTCGTCAGGG
TGATCCATTATCTCCTTTTCTTTTTATCTTGGTTTCTGACTACTTAAGTCGACTTTTGTCTTTCCGTGCGAGCTTGGGTAAGATTGCCACTCACCCTATTGGTACCTCTT
CTCTCCATGTGAATCATTTGCAATTTGCTGATGATACTTTATTATTCTCTACTTCTGATGCTTCGGCGGTGGATAATCTATTTGATATTATTAAAATTTTTGAGAAGGCA
TCTGGTTTGAACATTAATTTTGGTAAAAGTGAATTATTGGGGATTAATGTTGATGATCAGAATATGGGAGTTTTCACTTCAAAGTTTGGCTGTAGACTTGGAGATTGGCC
AACAACTTATCTTGGCCTTCCTTTAGGTGGCAATCCAAAATTGCCCTCGTTTTGGCACCCAGTGATTGAAAGAATTCAGAAAAAGCTTCATAGTTGGAAGTATTTGTACA
TTTCAAAAGGTGGTAGATATACTCTAATTCAAGCTTCTTTGTCCAGCATGCCCATTTATTATCTCTCTTTGTATAAATTACCTTCAAAGGTGGCTAAAGTTTTGGATAAA
ATTATTCGTGATTTCTTTTGGGAAGGATCAACCAGAAATGGAGGATTACATAATATTAAATGGGAGACAACTCAACTTCCTCATATTTTGGGAGGTCTAGGAATTGGGAA
TTTTCAGCTTCGAAATTCTGCATTGTTGGCTAAATGGATTTGGAGGTTTCTTCATGAACAGGAGAGTCTTTGGCGGAAATTAATAACCACTAAATATTATTCTAGAGATG
GTTTTTGTTCTTTTGCATCACAATATTGGTCCATTATGGTGGAAGCCTTTGGGTGGAATATGGTTTTGCCCGGAACTATTTTTGATCTCCTTGCTTCAGTTTTTGTGGGT
CATCCCTTTCATGGTGTAAAGAAAACTCTTTGGCTTGCCATCAATAGAGTTTTCTTTTGGTATCTTTGGTGTGAGAGGAATGTCAGAATTTTCAGGGACGTCTCTCTCAA
TTTTACTGCTTTTTGA
Protein sequence
MDFFLEEPTLSIEDLHQPNEVLPLISKSPIAQDEAPPVIEELQKAILSPIGNTTEPTENSSYNLPLPIRQLAPILIEHGLCIMAIPPPKKKVGASGGILLLWRDPDFTIR
ETIQGLYSLSIHIVMADGFDFWLSAIYGPSQYELHDGFWQELHDLAGLAQDRWILGGDFNVTRWSWEKSSDHPVTRSMRNFNLWIDSYNLIDIPLQNGSFTWSRYGSHRS
LSLLDRFLISNDCLQKFGSAQLKRLPRVTSDHYPISLNFGEISWGPSPFRFENSWLLHKDFKSVVVSWWNQNPIDGWPGHGFMMKLKILKTELLRWNKSHRSVMENLFML
ISQLKTLDNIEDYDCLSVDQRTQRHLLQEQIEDITARDHVYWRHRCKLTWLKEGDENTKFFHRIIAARKRKNSISELLSRDGVSLLTDGDIEQEFIDFYQKLYTKDQDMH
FLPTNIDWSRIREAQATTLEVPFTDDEWDYNVALNETYICLIPKRVDSRSVNDYRPISLISCAYKIIARVLSNRLKQVAFTIAENQMAFVSNRQILDAALIASELIDDWK
TTNKKGVVIKLDLEKAFDKVDWDFLDAVLQIKGIRQGDPLSPFLFILVSDYLSRLLSFRASLGKIATHPIGTSSLHVNHLQFADDTLLFSTSDASAVDNLFDIIKIFEKA
SGLNINFGKSELLGINVDDQNMGVFTSKFGCRLGDWPTTYLGLPLGGNPKLPSFWHPVIERIQKKLHSWKYLYISKGGRYTLIQASLSSMPIYYLSLYKLPSKVAKVLDK
IIRDFFWEGSTRNGGLHNIKWETTQLPHILGGLGIGNFQLRNSALLAKWIWRFLHEQESLWRKLITTKYYSRDGFCSFASQYWSIMVEAFGWNMVLPGTIFDLLASVFVG
HPFHGVKKTLWLAINRVFFWYLWCERNVRIFRDVSLNFTAF
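
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the short test fragments are illustrative assumptions, and it assumes the standard nuclear codon table applies to this gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A, C, G, T are reported as "X".
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS sequence listed above.
cds_start = "ATGGATTTCTTTCTAGAGGAGCCCACCCTTTCG"
print(translate(cds_start))
```

Pasting the full CDS block (line breaks included) into `translate` and comparing the result with the protein sequence, after removing its line breaks, gives a quick consistency check between the two listings.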