; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032116 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032116
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:25102446..25110376
RNA-Seq ExpressionLag0032116
SyntenyLag0032116
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07150.1 TatD related DNase [Prunus dulcis]1.5e-6428.9Show/hide
Query:  YTEGFAKVFGV----WLVAGDF---RWIEDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQR
        + E  A ++G     W + GDF   R+  ++    R  +SMR FN F+    L D  + N  FTW  + E     RLD+ L+S  W D F  ++   L R
Subjt:  YTEGFAKVFGV----WLVAGDF---RWIEDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQR

Query:  PTFNHFSIALSIGAMKWGPSPFRFEN----------------------------------------------------------------------RAS-
         T +H  I L    +KWGPSPFRFEN                                                                      R S 
Subjt:  PTFNHFSIALSIGAMKWGPSPFRFEN----------------------------------------------------------------------RAS-

Query:  -----------------------------LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK
                                     LE  F  EE+ +A+   G  KS GP+G +   F++ W ++K  L++V  +F ++GI+N  TNET++CLIPK
Subjt:  -----------------------------LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK

Query:  KKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSST
        K N+  + DFRPISLVTS YK+I+KVLA RL++ L  TIS  Q  FV   Q   +     V+VA        VV E+R   R                  
Subjt:  KKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSST

Query:  THRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRC
                                              +G                               L+ ++ F+                 A+  
Subjt:  THRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRC

Query:  LDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFS
        ++ W F+D +L  K FG  WR WI GCL + NFSI+ING+PRGK RASRGLR GDP+S FL+T+V D +S++I+   +  ++ G   G + VEV+ LQF+
Subjt:  LDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFS

Query:  DDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVD
        DDT+ F   KEE      ++L+L  + +G+ ++ AK  I+GIN +++
Subjt:  DDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVD

CAN70693.1 hypothetical protein VITISV_041975 [Vitis vinifera]4.8e-7130.65Show/hide
Query:  PLLTEDYEWIAALAEVFAFNPHYTEGFAKVFGVWLVAGDFRWIE---DRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLI
        PLL +D+ W+  L ++F+ +          F +W V GDF  I+   ++L  +    SM+ F+ F+  CEL+D P+ N  FTW  M E     RLD+ L 
Subjt:  PLLTEDYEWIAALAEVFAFNPHYTEGFAKVFGVWLVAGDFRWIE---DRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLI

Query:  SRDWADVFKEFKLDRLQRPTFNHFSIALSIGAMKWGPSPFRFE---------------NRASLEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWN
        S +W   F +   + L R T +H+ I L     KWGP+PFRFE               N   L+  F+EEEI++AI  L   K+ GP+G T   F++ W+
Subjt:  SRDWADVFKEFKLDRLQRPTFNHFSIALSIGAMKWGPSPFRFE---------------NRASLEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWN

Query:  ILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRS
        ++K  LV VF EF  + IIN+ TN +++ L+PKK     I +FRPISL+TS YKIIA VL+ RL+  L +TI   Q  FV   Q                
Subjt:  ILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRS

Query:  LSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVV-ATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQ
                                                  I++VV  T+ +V  +R    SG+E+  +               ID  +  +       
Subjt:  LSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVV-ATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQ

Query:  TRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVG
           H+                                W+FLD +L  K F   W+KW++GCL+  +F+I++NG  +G ++ASRGLR GDP+S FL+T+V 
Subjt:  TRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVG

Query:  DAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF
        D +S+++    ER  L+G   G+N   V+ LQF+DDT+ F    EE +     +L +    +GL ++  K +I GIN+  + +   A+ +D K   +
Subjt:  DAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF

RVW21770.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.7e-6328.5Show/hide
Query:  FGVWLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSI
        F +W V GDF  I    ++L  +R   SMR F+ F+S  EL+D P+ N  FTW  M E     RLD+ L S +W  +F +   + L R T +H+ IAL  
Subjt:  FGVWLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSI

Query:  GAMKWGPSPFRFEN----RASLEESFS------------------------------------------------------------------EEEIFRA
            WGP+PFRFEN      S +E+F                                                                   EEEI +A
Subjt:  GAMKWGPSPFRFEN----RASLEESFS------------------------------------------------------------------EEEIFRA

Query:  IKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDC
        I  +   K+ GP+G T   F++ W+++K  LV VF EF ++G+IN+ TN +++ L+PKK     I DFRPISL+TS YKIIAKVL+ RL+  L +TI   
Subjt:  IKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDC

Query:  QATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHG
        Q  FV   Q   +                                                         V+  + +V  RR    SG+E   +      
Subjt:  QATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHG

Query:  NESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPR
                 ID ++  +          H+                                W+FLD +L  K F   WRKW+ GCL++ ++++++NG  +
Subjt:  NESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPR

Query:  GKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI
        G ++ASRGLR GDP+S FL+T+V D +S+++    ER +L+G   G+N   V+ LQF+DDT+ F   +EE +     +L      +GL ++  K +I GI
Subjt:  GKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI

Query:  NMNVDKIKDRADKMDGKVESF
        N++   I   A+ +  K   +
Subjt:  NMNVDKIKDRADKMDGKVESF

RVW50737.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]7.4e-6429.05Show/hide
Query:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM
        W V GDF  I    ++L  +R   SM+ F+ F+S CELID+P+ +  FTW  M       RLD+ L S +W   F +     L R T +H+ I L     
Subjt:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM

Query:  KWGPSPFRFEN------------------------------------------------------------------------RASLEESFSEEEIFRAI
        KWGP+PFRFEN                                                                           LE  F+EEEIF+AI
Subjt:  KWGPSPFRFEN------------------------------------------------------------------------RASLEESFSEEEIFRAI

Query:  KGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQ
          +   K+ GP+G T   F++ W ++K  LV+VF EF ++GIIN+ TN +++ L+PKK  +  I DFRPISL+TS YKIIAKVLA R++  L +TI   Q
Subjt:  KGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQ

Query:  ATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGN
          FV   Q   +                                                         V+  + +V  +R    SG+E   +       
Subjt:  ATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGN

Query:  ESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRG
                ID ++  +          H+                                W+FLD +L +K FG  WRKW++GCL++ +F++++NG  +G
Subjt:  ESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRG

Query:  KIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI
         ++ASRGLR GDP+S FL+TIV D +S+++    ER +L+G   G+N   V+ LQF+DDT+ F  ++EE M     +L ++ E      S   I  +G+
Subjt:  KIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI

RVX23043.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-6428.46Show/hide
Query:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM
        W V GDF  I    ++L  +R   SM+ F+ F+S CELID+P+ +  FTW+ M       RLD+ L S +W   F +     L R T +H+ I L     
Subjt:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM

Query:  KWGPSPFRFE--------------------------------------------NRAS------------------------------------------
        KWGP+PFRFE                                            N+AS                                          
Subjt:  KWGPSPFRFE--------------------------------------------NRAS------------------------------------------

Query:  --LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVL
          LE  F+EEEIF+AI  +   K+ GP+G T   F++ W ++K  LV+VF EF ++GIIN+ TN +++ L+PKK  +  I DFRPISL+TS YKIIAKVL
Subjt:  --LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVL

Query:  AERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSH
        A R++  L +TI   Q  FV   Q   +                                                         V+  + +V  +R   
Subjt:  AERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSH

Query:  LSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGC
         SG+E   +               ID ++  +          H+                                W+FLD +L +K FG  WRKW+ GC
Subjt:  LSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGC

Query:  LTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEG
        L++ +F++++NG  +  ++ASRGLR GDP+S FL+TIV D +S+++    ER +L+G   G+N   V+ LQF DDT+ F  ++EE M     +L +    
Subjt:  LTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEG

Query:  AGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF
        +GL ++  K +I GIN+  + +   A+ +D K   +
Subjt:  AGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF

TrEMBL top hitse value%identityAlignment
A0A438CEZ3 LINE-1 retrotransposable element ORF2 protein1.8e-6328.5Show/hide
Query:  FGVWLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSI
        F +W V GDF  I    ++L  +R   SMR F+ F+S  EL+D P+ N  FTW  M E     RLD+ L S +W  +F +   + L R T +H+ IAL  
Subjt:  FGVWLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSI

Query:  GAMKWGPSPFRFEN----RASLEESFS------------------------------------------------------------------EEEIFRA
            WGP+PFRFEN      S +E+F                                                                   EEEI +A
Subjt:  GAMKWGPSPFRFEN----RASLEESFS------------------------------------------------------------------EEEIFRA

Query:  IKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDC
        I  +   K+ GP+G T   F++ W+++K  LV VF EF ++G+IN+ TN +++ L+PKK     I DFRPISL+TS YKIIAKVL+ RL+  L +TI   
Subjt:  IKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDC

Query:  QATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHG
        Q  FV   Q   +                                                         V+  + +V  RR    SG+E   +      
Subjt:  QATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHG

Query:  NESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPR
                 ID ++  +          H+                                W+FLD +L  K F   WRKW+ GCL++ ++++++NG  +
Subjt:  NESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPR

Query:  GKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI
        G ++ASRGLR GDP+S FL+T+V D +S+++    ER +L+G   G+N   V+ LQF+DDT+ F   +EE +     +L      +GL ++  K +I GI
Subjt:  GKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI

Query:  NMNVDKIKDRADKMDGKVESF
        N++   I   A+ +  K   +
Subjt:  NMNVDKIKDRADKMDGKVESF

A0A438ESU6 Transposon TX1 uncharacterized 149 kDa protein3.6e-6429.05Show/hide
Query:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM
        W V GDF  I    ++L  +R   SM+ F+ F+S CELID+P+ +  FTW  M       RLD+ L S +W   F +     L R T +H+ I L     
Subjt:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM

Query:  KWGPSPFRFEN------------------------------------------------------------------------RASLEESFSEEEIFRAI
        KWGP+PFRFEN                                                                           LE  F+EEEIF+AI
Subjt:  KWGPSPFRFEN------------------------------------------------------------------------RASLEESFSEEEIFRAI

Query:  KGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQ
          +   K+ GP+G T   F++ W ++K  LV+VF EF ++GIIN+ TN +++ L+PKK  +  I DFRPISL+TS YKIIAKVLA R++  L +TI   Q
Subjt:  KGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQ

Query:  ATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGN
          FV   Q   +                                                         V+  + +V  +R    SG+E   +       
Subjt:  ATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGN

Query:  ESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRG
                ID ++  +          H+                                W+FLD +L +K FG  WRKW++GCL++ +F++++NG  +G
Subjt:  ESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRG

Query:  KIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI
         ++ASRGLR GDP+S FL+TIV D +S+++    ER +L+G   G+N   V+ LQF+DDT+ F  ++EE M     +L ++ E      S   I  +G+
Subjt:  KIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGI

A0A438KPB4 LINE-1 retrotransposable element ORF2 protein5.6e-6528.46Show/hide
Query:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM
        W V GDF  I    ++L  +R   SM+ F+ F+S CELID+P+ +  FTW+ M       RLD+ L S +W   F +     L R T +H+ I L     
Subjt:  WLVAGDFRWI---EDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAM

Query:  KWGPSPFRFE--------------------------------------------NRAS------------------------------------------
        KWGP+PFRFE                                            N+AS                                          
Subjt:  KWGPSPFRFE--------------------------------------------NRAS------------------------------------------

Query:  --LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVL
          LE  F+EEEIF+AI  +   K+ GP+G T   F++ W ++K  LV+VF EF ++GIIN+ TN +++ L+PKK  +  I DFRPISL+TS YKIIAKVL
Subjt:  --LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVL

Query:  AERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSH
        A R++  L +TI   Q  FV   Q   +                                                         V+  + +V  +R   
Subjt:  AERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSH

Query:  LSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGC
         SG+E   +               ID ++  +          H+                                W+FLD +L +K FG  WRKW+ GC
Subjt:  LSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGC

Query:  LTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEG
        L++ +F++++NG  +  ++ASRGLR GDP+S FL+TIV D +S+++    ER +L+G   G+N   V+ LQF DDT+ F  ++EE M     +L +    
Subjt:  LTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEG

Query:  AGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF
        +GL ++  K +I GIN+  + +   A+ +D K   +
Subjt:  AGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF

A0A4Y1RS61 TatD related DNase7.3e-6528.9Show/hide
Query:  YTEGFAKVFGV----WLVAGDF---RWIEDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQR
        + E  A ++G     W + GDF   R+  ++    R  +SMR FN F+    L D  + N  FTW  + E     RLD+ L+S  W D F  ++   L R
Subjt:  YTEGFAKVFGV----WLVAGDF---RWIEDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQR

Query:  PTFNHFSIALSIGAMKWGPSPFRFEN----------------------------------------------------------------------RAS-
         T +H  I L    +KWGPSPFRFEN                                                                      R S 
Subjt:  PTFNHFSIALSIGAMKWGPSPFRFEN----------------------------------------------------------------------RAS-

Query:  -----------------------------LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK
                                     LE  F  EE+ +A+   G  KS GP+G +   F++ W ++K  L++V  +F ++GI+N  TNET++CLIPK
Subjt:  -----------------------------LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK

Query:  KKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSST
        K N+  + DFRPISLVTS YK+I+KVLA RL++ L  TIS  Q  FV   Q   +     V+VA        VV E+R   R                  
Subjt:  KKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSST

Query:  THRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRC
                                              +G                               L+ ++ F+                 A+  
Subjt:  THRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRC

Query:  LDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFS
        ++ W F+D +L  K FG  WR WI GCL + NFSI+ING+PRGK RASRGLR GDP+S FL+T+V D +S++I+   +  ++ G   G + VEV+ LQF+
Subjt:  LDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFS

Query:  DDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVD
        DDT+ F   KEE      ++L+L  + +G+ ++ AK  I+GIN +++
Subjt:  DDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVD

A5BY18 Reverse transcriptase domain-containing protein2.3e-7130.65Show/hide
Query:  PLLTEDYEWIAALAEVFAFNPHYTEGFAKVFGVWLVAGDFRWIE---DRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLI
        PLL +D+ W+  L ++F+ +          F +W V GDF  I+   ++L  +    SM+ F+ F+  CEL+D P+ N  FTW  M E     RLD+ L 
Subjt:  PLLTEDYEWIAALAEVFAFNPHYTEGFAKVFGVWLVAGDFRWIE---DRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLI

Query:  SRDWADVFKEFKLDRLQRPTFNHFSIALSIGAMKWGPSPFRFE---------------NRASLEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWN
        S +W   F +   + L R T +H+ I L     KWGP+PFRFE               N   L+  F+EEEI++AI  L   K+ GP+G T   F++ W+
Subjt:  SRDWADVFKEFKLDRLQRPTFNHFSIALSIGAMKWGPSPFRFE---------------NRASLEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWN

Query:  ILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRS
        ++K  LV VF EF  + IIN+ TN +++ L+PKK     I +FRPISL+TS YKIIA VL+ RL+  L +TI   Q  FV   Q                
Subjt:  ILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQATFVHANQSSSSSSTIHVVVAFRS

Query:  LSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVV-ATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQ
                                                  I++VV  T+ +V  +R    SG+E+  +               ID  +  +       
Subjt:  LSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVV-ATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDIDEELNGRDSASQ

Query:  TRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVG
           H+                                W+FLD +L  K F   W+KW++GCL+  +F+I++NG  +G ++ASRGLR GDP+S FL+T+V 
Subjt:  TRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTIVG

Query:  DAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF
        D +S+++    ER  L+G   G+N   V+ LQF+DDT+ F    EE +     +L +    +GL ++  K +I GIN+  + +   A+ +D K   +
Subjt:  DAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESF

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog4.9e-1033.33Show/hide
Query:  LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKK-KNASNIRDFRPISLVTSSYKIIAKVLA
        L    S  EI   I+ L  +KS GP+G T+E ++ F   L P L+ +F    K GI+     E  + LIPK  K+ +   ++RPISL+    KI+ K+L 
Subjt:  LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKK-KNASNIRDFRPISLVTSSYKIIAKVLA

Query:  ERLKKALAQTISDCQATFVHANQ
         R+++ + + I   Q  F+  +Q
Subjt:  ERLKKALAQTISDCQATFVHANQ

P0CT41 Transposon Tf2-12 polyprotein2.5e-0637.66Show/hide
Query:  PIDLLQPPDIPEKIWEDLTMDFIEGLSKANGFDTIPVVVDRLSKAAHVLSLKHPFTAKAIAKVFVKEIVCLHGFLKE
        P   LQP    E+ WE L+MDFI  L +++G++ + VVVDR SK A ++      TA+  A++F + ++   G  KE
Subjt:  PIDLLQPPDIPEKIWEDLTMDFIEGLSKANGFDTIPVVVDRLSKAAHVLSLKHPFTAKAIAKVFVKEIVCLHGFLKE

P11369 LINE-1 retrotransposable element ORF2 protein4.2e-0932.61Show/hide
Query:  LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK-KKNASNIRDFRPISLVTSSYKIIAKVLA
        L    S +EI   I  L ++KS GP+G + E ++ F   L P L ++FH+    G +     E  + LIPK +K+ + I +FRPISL+    KI+ K+LA
Subjt:  LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPK-KKNASNIRDFRPISLVTSSYKIIAKVLA

Query:  ERLKKALAQTISDCQATFVHANQS----SSSSSTIHVV
         R+++ +   I   Q  F+   Q       S + IH +
Subjt:  ERLKKALAQTISDCQATFVHANQS----SSSSSTIHVV

P14381 Transposon TX1 uncharacterized 149 kDa protein6.9e-1234.11Show/hide
Query:  WGPSPFRFENRAS-LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISL
        W   P   E R   LE   + +E+ +A++ +   KS G +G+T E F+ FW+ L P    V  E  K G +        + L+PKK +   I+++RP+SL
Subjt:  WGPSPFRFENRAS-LEESFSEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISL

Query:  VTSSYKIIAKVLAERLKKALAQTISDCQA
        +++ YKI+AK ++ RLK  LA+ I   Q+
Subjt:  VTSSYKIIAKVLAERLKKALAQTISDCQA

Q9UR07 Transposon Tf2-11 polyprotein2.5e-0637.66Show/hide
Query:  PIDLLQPPDIPEKIWEDLTMDFIEGLSKANGFDTIPVVVDRLSKAAHVLSLKHPFTAKAIAKVFVKEIVCLHGFLKE
        P   LQP    E+ WE L+MDFI  L +++G++ + VVVDR SK A ++      TA+  A++F + ++   G  KE
Subjt:  PIDLLQPPDIPEKIWEDLTMDFIEGLSKANGFDTIPVVVDRLSKAAHVLSLKHPFTAKAIAKVFVKEIVCLHGFLKE

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.0e-0739.71Show/hide
Query:  IINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDT
        IING P+G +  SRGLR GDP+S +L+ +  + +S L +   E+  L G     N   +  L F+DDT
Subjt:  IINGRPRGKIRASRGLRPGDPISLFLYTIVGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCATGGAGCGACTGAAAGAGAACAATAGTGACCGGAGTAAACTTAAGAAGGTTGAGATGCCTGTGTTCGGTGGAGATGAACCAAATTCTTGGTTGTTTCATGTCGA
TAGTTATGTTAAGACCCATGTAGCTACGTCCCCAATTGACCTTTTACAACCACCGGATATTCCTGAGAAAATATGGGAGGACCTCACGATGGATTTTATAGAGGGTTTGT
CGAAAGCAAATGGATTTGATACTATACCGGTCGTAGTGGACAGGTTGAGCAAGGCAGCTCATGTTTTGAGCTTGAAACATCCATTCACTGCGAAAGCTATAGCAAAAGTT
TTTGTGAAGGAAATTGTTTGCCTCCATGGCTTCCTAAAAGAACATCCAAAAAATGGATATCTTGGCTTGGCTGGATGGAATACTGGTGGAGTGAGAGGTAGGAGCAGTGG
CTTACCGATTACAACTACCAATAGAGCCATCTATTCACCCTATGTTCCACATATCACAGTTGAAGCAGGCTTTGGGAAATCATCAGACGACAAAGGACGGACCCCATTGC
TGACAGAAGACTATGAGTGGATAGCAGCTCTAGCCGAAGTCTTTGCTTTTAACCCTCACTATACTGAGGGCTTTGCCAAGGTATTTGGTGTGTGGCTGGTGGCTGGAGAT
TTCAGATGGATTGAAGATAGATTGTATGCTAACAGGCCTAATAGAAGTATGAGGAAATTCAATCGATTTGTTTCCCTATGTGAGCTAATTGATATTCCTATGGCTAATGG
CAGGTTCACGTGGATTAGGATGGGTGAAAGAGTGGCGGCTTCGAGGCTGGACAAAATGCTCATTTCTAGAGATTGGGCGGATGTGTTTAAGGAGTTCAAGCTTGACAGAT
TGCAACGCCCCACTTTTAACCACTTTTCGATTGCTCTATCCATAGGTGCTATGAAATGGGGCCCCTCTCCTTTTAGATTCGAGAACAGGGCCTCTCTAGAAGAATCTTTT
AGTGAGGAGGAGATTTTTAGAGCTATTAAAGGTCTTGGTAGTCAGAAATCCTTGGGCCCTAATGGTATGACCAATGAGCATTTCAAAAACTTCTGGAACATTTTGAAGCC
ACATTTAGTAGAGGTGTTCCACGAGTTTTTAAAAAACGGTATCATCAATAGAAGGACGAATGAGACGTACGTATGTTTGATCCCAAAGAAGAAAAATGCCTCCAACATTC
GGGACTTTAGACCTATTAGCTTGGTGACCTCCTCATACAAAATTATAGCAAAAGTCCTTGCTGAAAGATTGAAGAAAGCTCTTGCTCAGACGATAAGTGATTGCCAAGCT
ACTTTTGTTCATGCAAACCAGTCATCGTCGTCGTCGTCGACCATCCACGTTGTCGTAGCTTTCCGATCGCTGTCGTTGCCGAGTGTGGTCCGTGAGCTCAGGTCGGTTCG
TCGCTGCTGTCGTCGTTGTTTCTCCCTCTCACCCGCTTGCTCCTTTCTTTCGTCTACTACCCATCGCCTTAGATCCGCCGCTACCATCGTGAATGTCGTCGCTACTCACA
CTGTTGTGAGCCACCGCCGTCTCTCTCATCTTTCGGGAAAAGAACAAACTTTTTATCCTAGTAGGGGTCATGGCAATGAATCACCTAGGATTGAGATGTTTATAGACATA
GATGAGGAGCTAAATGGGCGGGACTCGGCTTCGCAGACTAGGTGCCATCTACTAATTGTGGAAGTAAAGTTTCAACTAAAAACCCATGGATGCCCCACCAGTGAGATGGT
AGCTAGAGTGTTATGCGCTCATAGGTGCCTTGATGGTTGGGAATTCCTCGATGTCATCCTTAATCTCAAAGCCTTCGGTAAAACTTGGAGAAAATGGATTAAGGGTTGCC
TTACAAACACCAACTTTTCGATTATTATTAATGGTAGACCGAGGGGCAAGATACGAGCTTCTAGAGGGTTGAGACCAGGAGATCCCATTTCCCTCTTCCTTTATACTATT
GTGGGAGATGCTATGAGTCAATTGATTCAATACTGTATAGAGAGGAAAATTCTAAAAGGCCCCTCGGGGGGAAAAAACATAGTTGAAGTAGCCATGCTCCAATTTTCCGA
TGATACCATGATCTTTTGCCCAAACAAGGAGGAGATTATGGACAAATGGTGGGAGATTTTGAGATTGATCTTAGAAGGGGCAGGATTGTCCATGAGCCATGCCAAAATCT
CGATCATTGGTATCAACATGAATGTGGATAAAATCAAAGATAGGGCAGACAAAATGGATGGTAAGGTTGAGAGTTTTGTGAGGTATCCTCACTACGATATATTGATATTT
AGAGAAAGAGTTCCAAAAGGGAATTACAAAACTTTCGGCCAATTTGAGAGGGCCAATAATCTTTCCCCTCTCTCTCTAGCTTTCAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCATGGAGCGACTGAAAGAGAACAATAGTGACCGGAGTAAACTTAAGAAGGTTGAGATGCCTGTGTTCGGTGGAGATGAACCAAATTCTTGGTTGTTTCATGTCGA
TAGTTATGTTAAGACCCATGTAGCTACGTCCCCAATTGACCTTTTACAACCACCGGATATTCCTGAGAAAATATGGGAGGACCTCACGATGGATTTTATAGAGGGTTTGT
CGAAAGCAAATGGATTTGATACTATACCGGTCGTAGTGGACAGGTTGAGCAAGGCAGCTCATGTTTTGAGCTTGAAACATCCATTCACTGCGAAAGCTATAGCAAAAGTT
TTTGTGAAGGAAATTGTTTGCCTCCATGGCTTCCTAAAAGAACATCCAAAAAATGGATATCTTGGCTTGGCTGGATGGAATACTGGTGGAGTGAGAGGTAGGAGCAGTGG
CTTACCGATTACAACTACCAATAGAGCCATCTATTCACCCTATGTTCCACATATCACAGTTGAAGCAGGCTTTGGGAAATCATCAGACGACAAAGGACGGACCCCATTGC
TGACAGAAGACTATGAGTGGATAGCAGCTCTAGCCGAAGTCTTTGCTTTTAACCCTCACTATACTGAGGGCTTTGCCAAGGTATTTGGTGTGTGGCTGGTGGCTGGAGAT
TTCAGATGGATTGAAGATAGATTGTATGCTAACAGGCCTAATAGAAGTATGAGGAAATTCAATCGATTTGTTTCCCTATGTGAGCTAATTGATATTCCTATGGCTAATGG
CAGGTTCACGTGGATTAGGATGGGTGAAAGAGTGGCGGCTTCGAGGCTGGACAAAATGCTCATTTCTAGAGATTGGGCGGATGTGTTTAAGGAGTTCAAGCTTGACAGAT
TGCAACGCCCCACTTTTAACCACTTTTCGATTGCTCTATCCATAGGTGCTATGAAATGGGGCCCCTCTCCTTTTAGATTCGAGAACAGGGCCTCTCTAGAAGAATCTTTT
AGTGAGGAGGAGATTTTTAGAGCTATTAAAGGTCTTGGTAGTCAGAAATCCTTGGGCCCTAATGGTATGACCAATGAGCATTTCAAAAACTTCTGGAACATTTTGAAGCC
ACATTTAGTAGAGGTGTTCCACGAGTTTTTAAAAAACGGTATCATCAATAGAAGGACGAATGAGACGTACGTATGTTTGATCCCAAAGAAGAAAAATGCCTCCAACATTC
GGGACTTTAGACCTATTAGCTTGGTGACCTCCTCATACAAAATTATAGCAAAAGTCCTTGCTGAAAGATTGAAGAAAGCTCTTGCTCAGACGATAAGTGATTGCCAAGCT
ACTTTTGTTCATGCAAACCAGTCATCGTCGTCGTCGTCGACCATCCACGTTGTCGTAGCTTTCCGATCGCTGTCGTTGCCGAGTGTGGTCCGTGAGCTCAGGTCGGTTCG
TCGCTGCTGTCGTCGTTGTTTCTCCCTCTCACCCGCTTGCTCCTTTCTTTCGTCTACTACCCATCGCCTTAGATCCGCCGCTACCATCGTGAATGTCGTCGCTACTCACA
CTGTTGTGAGCCACCGCCGTCTCTCTCATCTTTCGGGAAAAGAACAAACTTTTTATCCTAGTAGGGGTCATGGCAATGAATCACCTAGGATTGAGATGTTTATAGACATA
GATGAGGAGCTAAATGGGCGGGACTCGGCTTCGCAGACTAGGTGCCATCTACTAATTGTGGAAGTAAAGTTTCAACTAAAAACCCATGGATGCCCCACCAGTGAGATGGT
AGCTAGAGTGTTATGCGCTCATAGGTGCCTTGATGGTTGGGAATTCCTCGATGTCATCCTTAATCTCAAAGCCTTCGGTAAAACTTGGAGAAAATGGATTAAGGGTTGCC
TTACAAACACCAACTTTTCGATTATTATTAATGGTAGACCGAGGGGCAAGATACGAGCTTCTAGAGGGTTGAGACCAGGAGATCCCATTTCCCTCTTCCTTTATACTATT
GTGGGAGATGCTATGAGTCAATTGATTCAATACTGTATAGAGAGGAAAATTCTAAAAGGCCCCTCGGGGGGAAAAAACATAGTTGAAGTAGCCATGCTCCAATTTTCCGA
TGATACCATGATCTTTTGCCCAAACAAGGAGGAGATTATGGACAAATGGTGGGAGATTTTGAGATTGATCTTAGAAGGGGCAGGATTGTCCATGAGCCATGCCAAAATCT
CGATCATTGGTATCAACATGAATGTGGATAAAATCAAAGATAGGGCAGACAAAATGGATGGTAAGGTTGAGAGTTTTGTGAGGTATCCTCACTACGATATATTGATATTT
AGAGAAAGAGTTCCAAAAGGGAATTACAAAACTTTCGGCCAATTTGAGAGGGCCAATAATCTTTCCCCTCTCTCTCTAGCTTTCAGCTAA
Protein sequenceShow/hide protein sequence
MVMERLKENNSDRSKLKKVEMPVFGGDEPNSWLFHVDSYVKTHVATSPIDLLQPPDIPEKIWEDLTMDFIEGLSKANGFDTIPVVVDRLSKAAHVLSLKHPFTAKAIAKV
FVKEIVCLHGFLKEHPKNGYLGLAGWNTGGVRGRSSGLPITTTNRAIYSPYVPHITVEAGFGKSSDDKGRTPLLTEDYEWIAALAEVFAFNPHYTEGFAKVFGVWLVAGD
FRWIEDRLYANRPNRSMRKFNRFVSLCELIDIPMANGRFTWIRMGERVAASRLDKMLISRDWADVFKEFKLDRLQRPTFNHFSIALSIGAMKWGPSPFRFENRASLEESF
SEEEIFRAIKGLGSQKSLGPNGMTNEHFKNFWNILKPHLVEVFHEFLKNGIINRRTNETYVCLIPKKKNASNIRDFRPISLVTSSYKIIAKVLAERLKKALAQTISDCQA
TFVHANQSSSSSSTIHVVVAFRSLSLPSVVRELRSVRRCCRRCFSLSPACSFLSSTTHRLRSAATIVNVVATHTVVSHRRLSHLSGKEQTFYPSRGHGNESPRIEMFIDI
DEELNGRDSASQTRCHLLIVEVKFQLKTHGCPTSEMVARVLCAHRCLDGWEFLDVILNLKAFGKTWRKWIKGCLTNTNFSIIINGRPRGKIRASRGLRPGDPISLFLYTI
VGDAMSQLIQYCIERKILKGPSGGKNIVEVAMLQFSDDTMIFCPNKEEIMDKWWEILRLILEGAGLSMSHAKISIIGINMNVDKIKDRADKMDGKVESFVRYPHYDILIF
RERVPKGNYKTFGQFERANNLSPLSLAFS