; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026147 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026147
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:30610110..30626225
RNA-Seq ExpressionLag0026147
SyntenyLag0026147
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR009068 - S15/NS1, RNA-binding
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_041998733.1 uncharacterized protein LOC121748438 isoform X2 [Salvia splendens]2.1e-11636.68Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV
        KL +G+L+ST+I LQ+AD  + YP GI ED+LV V++FIFP DFVVLD+EED  VP+ILGRPFLAT                     T      + R++ 
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV

Query:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL
           G           DE +      + + D+++R                           LD  +E    L            D     +   L+LK L
Subjt:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL

Query:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------
          HLRYAF+G + TFP        +ETNLVLNWEKCHFMV++GIVLGHK+S  GLEVD+ KI AIEQLP P +                           
Subjt:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------

Query:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF
                               LK  L+ A I++  DW+QPFEIMCDASD A+G+ LGQ+RDK FR IYYASRTLDSAQ NYTTTEKEM+AVV++FDKF
Subjt:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF

Query:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------
        RPYL+G K IV TD AAIR                                                                                 
Subjt:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------C
                                                                                                           C
Subjt:  ---------------------------------------------------------------------------------------------------C

Query:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA
        QR G +S+  EMPLT I+EVELFDVWGIDFMG FP S+G+ YI++ VDYVS+WVEAI T+TND++VV++F+ KNIFTRFG P+AIISD GSHF N+  EA
Subjt:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA

Query:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        +L ++GVKH+ +  YH Q N Q E++NREIKQIL+K+V  N +DWA KL DALW   T
Subjt:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

XP_042016328.1 uncharacterized protein LOC121764359 [Salvia splendens]9.5e-11736.81Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV
        KL +G+L+ST+I LQ+AD  + YP GI ED+LV V++FIFP DFVVLD+EED  VP+ILGRPFLAT                     T      + R++V
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV

Query:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL
           G           DE +      + + D+++R                           LD  +E    L            D     +   L+LK L
Subjt:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL

Query:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------
          HLRYAF+G + TFP        +ETNLVLNWEKCHFMV++GIVLGHK+S  GLEVD+ KI AIEQLP P +                           
Subjt:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------

Query:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF
                               LK  L+ A I++  DW+QPFEIMCDASD A+G+ LGQ+RDK FR IYYASRTLDSAQ NYTTTEKEM+AVV++FDKF
Subjt:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF

Query:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------
        RPYL+G K IV TD AAIR                                                                                 
Subjt:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------C
                                                                                                           C
Subjt:  ---------------------------------------------------------------------------------------------------C

Query:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA
        QR G +S+  EMPLT I+EVELFDVWGIDFMG FP S+G+ YI++ VDYVS+WVEAI T+TND++VV++F+ KNIFTRFG P+AIISD GSHF N+  EA
Subjt:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA

Query:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        +L ++GVKH+ +  YH Q N Q E++NREIKQIL K+V  N +DWA KL DALW   T
Subjt:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

XP_042018355.1 uncharacterized protein LOC121766082 isoform X2 [Salvia splendens]2.1e-11636.68Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV
        KL +G+L+ST+I LQ+AD  + YP GI ED+LV V++FIFP DFVVLD+EED  VP+ILGRPFLAT                     T      + R++ 
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV

Query:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL
           G           DE +      + + D+++R                           LD  +E    L            D     +   L+LK L
Subjt:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL

Query:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------
          HLRYAF+G + TFP        +ETNLVLNWEKCHFMV++GIVLGHK+S  GLEVD+ KI AIEQLP P +                           
Subjt:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------

Query:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF
                               LK  L+ A I++  DW+QPFEIMCDASD A+G+ LGQ+RDK FR IYYASRTLDSAQ NYTTTEKEM+AVV++FDKF
Subjt:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF

Query:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------
        RPYL+G K IV TD AAIR                                                                                 
Subjt:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------C
                                                                                                           C
Subjt:  ---------------------------------------------------------------------------------------------------C

Query:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA
        QR G +S+  EMPLT I+EVELFDVWGIDFMG FP S+G+ YI++ VDYVS+WVEAI T+TND++VV++F+ KNIFTRFG P+AIISD GSHF N+  EA
Subjt:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA

Query:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        +L ++GVKH+ +  YH Q N Q E++NREIKQIL+K+V  N +DWA KL DALW   T
Subjt:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

XP_042041734.1 uncharacterized protein LOC121787147 isoform X1 [Salvia splendens]7.3e-11736.54Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV
        KL +G+L+ST+I LQ+AD  + YP GI ED+LV V++FIFP DFVVLD+EED  VP+ILGRPFLAT                     T      + R++ 
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV

Query:  NMSG--------VFGDELIDVTQDCALIWDEIDR----------PLDPGREDKLILDVCAQVDT----------------------------PPLDLKQL
           G           DE +      + + D+++R          P D    +  +L+    +D+                              L+LK L
Subjt:  NMSG--------VFGDELIDVTQDCALIWDEIDR----------PLDPGREDKLILDVCAQVDT----------------------------PPLDLKQL

Query:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------
          HLRYAF+G + TFP        +ETNLVLNWEKCHFMV++GIVLGHK+S  GLEVD+ KI AIEQLP P +                           
Subjt:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------

Query:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF
                               LK  L+ A I++  DW+QPFEIMCDASD A+G+ LGQ+RDK FR IYYASRTLDSAQ NYTTTEKEM+AVV++FDKF
Subjt:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF

Query:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------
        RPYL+G K IV TD AAIR                                                                                 
Subjt:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------C
                                                                                                           C
Subjt:  ---------------------------------------------------------------------------------------------------C

Query:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA
        QR G +S+  EMPLT I+EVELFDVWGIDFMG FP S+G+ YI++ VDYVS+WVEAI T+TND++VV++F+ KNIFTRFG P+AIISD GSHF N+  EA
Subjt:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA

Query:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        +L ++GVKH+ +  YH Q N Q E++NREIKQIL+K+V  N +DWA KL DALW   T
Subjt:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

XP_042055257.1 uncharacterized protein LOC121799825 isoform X2 [Salvia splendens]2.1e-11636.68Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV
        KL +G+L+ST+I LQ+AD  + YP GI ED+LV V++FIFP DFVVLD+EED  VP+ILGRPFLAT                     T      + R++ 
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLAT---------------------TRESESPIDRYEV

Query:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL
           G           DE +      + + D+++R                           LD  +E    L            D     +   L+LK L
Subjt:  NMSG--------VFGDELIDVTQDCALIWDEIDR--------------------------PLDPGREDKLIL------------DVCAQVDTPPLDLKQL

Query:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------
          HLRYAF+G + TFP        +ETNLVLNWEKCHFMV++GIVLGHK+S  GLEVD+ KI AIEQLP P +                           
Subjt:  SAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPIN---------------------------

Query:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF
                               LK  L+ A I++  DW+QPFEIMCDASD A+G+ LGQ+RDK FR IYYASRTLDSAQ NYTTTEKEM+AVV++FDKF
Subjt:  -----------------------LKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKF

Query:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------
        RPYL+G K IV TD AAIR                                                                                 
Subjt:  RPYLLGTKVIVHTDQAAIR---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------C
                                                                                                           C
Subjt:  ---------------------------------------------------------------------------------------------------C

Query:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA
        QR G +S+  EMPLT I+EVELFDVWGIDFMG FP S+G+ YI++ VDYVS+WVEAI T+TND++VV++F+ KNIFTRFG P+AIISD GSHF N+  EA
Subjt:  QRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEA

Query:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        +L ++GVKH+ +  YH Q N Q E++NREIKQIL+K+V  N +DWA KL DALW   T
Subjt:  MLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

TrEMBL top hitse value%identityAlignment
A0A2G9GC30 DNA-directed DNA polymerase4.9e-10334.37Show/hide
Query:  LGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR--------ESESPIDRYEVNMSGVFGDELIDVT
        LGLGE+K T+I LQLAD  +TYP G++ED+LV VDKF F ADFVVLD+E D  + IILGRPFLA  R        E    +   ++  + +   +  + +
Subjt:  LGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR--------ESESPIDRYEVNMSGVFGDELIDVT

Query:  QDCALI----------------WDEIDR-------------------PLDPGREDKLILDVCAQVDTPP-LDLKQLSAHLRYAFIGESSTFPVIIPSDLN
         +C  +                 D ++R                   PL+     K++      ++ PP L+LK    +LRY +IGES T P+II S L+
Subjt:  QDCALI----------------WDEIDR-------------------PLDPGREDKLILDVCAQVDTPP-LDLKQLSAHLRYAFIGESSTFPVIIPSDLN

Query:  ------------------------------------------------------------------------------KETNLVLNWEKCHFMVKEGIVL
                                                                                      K+TNL LNW+KCHFMV+EGIVL
Subjt:  ------------------------------------------------------------------------------KETNLVLNWEKCHFMVKEGIVL

Query:  GHKVSEKGLEVDQTKISAIE----QLPQPINLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIA
         HKV  +G+EVD+ K+S  +     L     LK KLI A II V+DW   FE+MCDASD+AIG  L QR++K F  IYYAS+TL+ AQ NYTTTEKE+  
Subjt:  GHKVSEKGLEVDQTKISAIE----QLPQPINLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIA

Query:  VVFAFDKFRPYLLGTKVIVHTDQAAI--------------------------------------------------------------------------
        +V AFDKFR YL+GTKVIV+TD +AI                                                                          
Subjt:  VVFAFDKFRPYLLGTKVIVHTDQAAI--------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------RCQRTGNISRLHE
                                                                                               RCQR GNIS+ HE
Subjt:  ---------------------------------------------------------------------------------------RCQRTGNISRLHE

Query:  MPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKT
        MPL  ILEVELFD+WG DFMG F  S    YI+V VDY SKWV+A+A  +ND++V++ F+ KNIF  FGT +AIISDEG+HFCN+ F+A+L KYG+KHK 
Subjt:  MPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKT

Query:  SLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDAL
           YH QT+ Q E+SNREIK+ILEKT+    KDW+ +L ++L
Subjt:  SLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDAL

A0A2G9HBV9 DNA-directed DNA polymerase6.6e-11637.92Show/hide
Query:  SIALVLNQILK-LGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR--------ESESPIDRYEVNM
        SI L+   I + LGLGE K T+I LQLAD  +TYPKG++ED+LV VDKFIF ADFVVLD+E D  VPIILGRPFLAT R        E    +   ++  
Subjt:  SIALVLNQILK-LGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR--------ESESPIDRYEVNM

Query:  SGVFGDELIDVTQDC--ALIWDEI-------DRPLDPGREDKLILDVCAQ---------VDTPPLDLKQLSAHLRY-----AFIGESSTFPVIIPSDLN-
        +     +  + + +C    ++D +       ++PLDP   ++ +LD+  +         +   P D ++ +    Y      F+ +   +       LN 
Subjt:  SGVFGDELIDVTQDC--ALIWDEI-------DRPLDPGREDKLILDVCAQ---------VDTPPLDLKQLSAHLRY-----AFIGESSTFPVIIPSDLN-

Query:  --------KETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPI--------------------------------------------
                ++TNL+LNW+KCHFMV+EGIVLGHKVS +G+EVD+ K+  IE+LP P                                             
Subjt:  --------KETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPI--------------------------------------------

Query:  ------NLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHTDQA
              +LK +LI A II V DW+ PFE+MCDASD+A+GA LGQR+DK FR IYYAS+TL+ AQ NYTTTEKE++AVVFAFDKFR YL+GTKVIV+TD A
Subjt:  ------NLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHTDQA

Query:  AI--------------------------------------------------------------------------------------------------
        AI                                                                                                  
Subjt:  AI--------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------RCQRTGNISRLHEMPLTPI
                                                                                         RCQRTGNISR HEMPL  I
Subjt:  ---------------------------------------------------------------------------------RCQRTGNISRLHEMPLTPI

Query:  LEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHL
        LEVELFDVWGIDFMG F  S G  YI+V VDYVSKWVEA+A   ND++VV+ F+ KNIFTRFGTP+AIIS+ G+HFCN+ FEA+L KYGVKHK S  YH 
Subjt:  LEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHL

Query:  QTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        QT+ Q E+SNREIK+ILEKTV    KDW+ +L +ALW   T
Subjt:  QTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

A0A2N9F8G8 Reverse transcriptase4.5e-10436.83Show/hide
Query:  SDSIALVLNQI---------LKLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEE----DPGVPIILGRPFLATTRESESPID
        ++ + L+L Q+         L+LGLGELK TT+VLQL D  +  PKG++EDVLV +DKF +P DF++L+ E     +  +PIILGRPFLAT   + + I+
Subjt:  SDSIALVLNQI---------LKLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEE----DPGVPIILGRPFLATTRESESPID

Query:  RYEVNMSGVFGDELIDVT-----------QDCALIWDEIDRPLDPGREDKLILDVCAQVDTPPLDLKQL--------------------SAHLRYAFIGE
             M   FG   ++V            +DC ++     RP         +  + + ++ P L+LKQL                    + HL Y  +  
Subjt:  RYEVNMSGVFGDELIDVT-----------QDCALIWDEIDRPLDPGREDKLILDVCAQVDTPPLDLKQL--------------------SAHLRYAFIGE

Query:  S-------------STFPVIIPSDLN--------------------KETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPINLKEKL
        +               F  +   DL+                    +E NLVLNWEKCHFMV  GIVLGH VS +G+E D   I           L +KL
Subjt:  S-------------STFPVIIPSDLN--------------------KETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPINLKEKL

Query:  IFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHTDQAAI----------
          A I+   DW+ PFE+MCDASDYAIGA LGQR+DK    IYYASRTL+ AQ NYTTTEKE++A+VFA DKFR YL+G+ ++V TD AA+          
Subjt:  IFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHTDQAAI----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------RCQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFL
                                   RCQ+ G ISR + MPL PIL +E+FD WGIDFMG FP S G  YI+V VDYVSKWVEA+A K ND R+V++FL
Subjt:  ---------------------------RCQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFL

Query:  HKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
         +N+ +RFGTP+AIISD+G+HFCNK FE+++ KYGV HK + +YH QT+ Q E++NREIKQILEKTV  + KDW+ +L DALW   T
Subjt:  HKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

A0A2N9IQ72 Integrase catalytic domain-containing protein2.6e-10437.63Show/hide
Query:  EEEIAQPVPDKSSDSIALVLNQILKLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEE----DPGVPIILGRPFLATTRESES
        + +I Q + D  +    +  +  L+LGLGELK T +VLQLAD  +  PKG++EDVLV +DKF +P DF++L+ E     +  +PIILGRPFLAT   + +
Subjt:  EEEIAQPVPDKSSDSIALVLNQILKLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEE----DPGVPIILGRPFLATTRESES

Query:  PIDRYEVNMSGVFGDELIDVTQDCALIWDEIDRPLDPGREDKLI--LDVCAQ------VDTPPLDLKQLSA-----HLRYAFIGESSTFPVII-------
         I+     M   FG   ++V      I++ I + +    E +++  +D   Q        + PLD    S+      LR     +   +P+++       
Subjt:  PIDRYEVNMSGVFGDELIDVTQDCALIWDEIDRPLDPGREDKLI--LDVCAQ------VDTPPLDLKQLSA-----HLRYAFIGESSTFPVII-------

Query:  ---PSDL---NKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPINLK--------------------------------------
           PS +    +E NLVLNWEKCHFMV  GIVLGH VS +G+EVD++KI  I +LP P  +K                                      
Subjt:  ---PSDL---NKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPINLK--------------------------------------

Query:  ------------EKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHT
                    +KL  A I+   DW+ PFE+MCDASDYAIGA LGQR+D     IYYASRTL+ AQ NYTTTEKE++A+VFA DKF  YL+G+ ++V T
Subjt:  ------------EKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHT

Query:  DQAAI-----------------------------------------------------------------------------------------------
        D AA+                                                                                               
Subjt:  DQAAI-----------------------------------------------------------------------------------------------

Query:  ---------------RCQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQ
                       RCQ+ G ISR + MPL PIL +E+FD WGIDFMG FP S G  YI+V VDYVSKWVEA+A K ND R V++FL +N+ +RFGTP+
Subjt:  ---------------RCQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQ

Query:  AIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        AIISD+G+HFCNK FE+++ KYGV HK   +YH QT+ Q E++NREIKQILEK V  + KDW+ +L DALW   T
Subjt:  AIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

A0A6L2NCM7 Reverse transcriptase domain-containing protein1.4e-10540.87Show/hide
Query:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR------ESESPIDRYEVNMSGVFGD-------
        KL L     T +VL+LAD  ++ P G+ E+V V V KF FPADFVVLD   DP V +IL RPFL+T        E E  +   + ++    GD       
Subjt:  KLGLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTR------ESESPIDRYEVNMSGVFGD-------

Query:  --------ELIDVTQD---------CALIWDEI-------------------------------------DRPL---------DPGREDKLILDVCAQVD
                +LID T +           ++ +EI                                     D P+         DP   D LIL+     D
Subjt:  --------ELIDVTQD---------CALIWDEI-------------------------------------DRPL---------DPGREDKLILDVCAQVD

Query:  ---------------------------------TPPLDLKQLSAHLRYAFIGESSTFPVIIPSDLNK------ETNLVLNWEKCHFMVKEGIVLGHKVSE
                                          P ++LK+L  HL YAF+   ++F   + ++L K      +  L LNWEK HFMVKEGIVLGHK+S+
Subjt:  ---------------------------------TPPLDLKQLSAHLRYAFIGESSTFPVIIPSDLNK------ETNLVLNWEKCHFMVKEGIVLGHKVSE

Query:  KGLEVDQTKISAIEQLPQPINLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRP
        KG EVD+ KI  I ++P P    EKL  A I++  +W+QPFE+MCD +DYA+GA LGQR +K FRPI+YAS+T+  A+ NYTTTEKEM+AVV+A +KFR 
Subjt:  KGLEVDQTKISAIEQLPQPINLKEKLIFATIIVVLDWNQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRP

Query:  YLLGTKVIVHTDQAAIR---------------------CQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKT
        YL+  K IV+ D +A++                         G IS+  EMP   I   E+FDVWGIDFMG FPSS G  YI+V VDY SKWVEA A  T
Subjt:  YLLGTKVIVHTDQAAIR---------------------CQRTGNISRLHEMPLTPILEVELFDVWGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKT

Query:  NDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT
        NDARVV++FL K++F+RFGTP+AIISD+G+HFC   F  ++ KYGV H+ S AYH QT+ Q E++NR +K+ILE+TV  N   W+ KL DALW   T
Subjt:  NDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVRINSKDWAFKLGDALWVMWT

SwissProt top hitse value%identityAlignment
P03356 Gag-Pol polyprotein8.0e-1840.77Show/hide
Query:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI
        W IDF    P   GY Y++V VD  S WVEA  TK   ARVV + L + IF RFG PQ + SD G  F +++ +++    G+  K   AY  Q++ Q E 
Subjt:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI

Query:  SNREIKQILEK-TVRINSKDWAFKLGDALW
         NR IK+ L K T+   ++DW   L  AL+
Subjt:  SNREIKQILEK-TVRINSKDWAFKLGDALW

P08361 Gag-Pol polyprotein8.0e-1840Show/hide
Query:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI
        W IDF    P   GY Y++V VD  S W+EA  TK   A+VV + L + IF RFG PQ + +D G  F +K+ + +    G+  K   AY  Q++ Q E 
Subjt:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI

Query:  SNREIKQILEK-TVRINSKDWAFKLGDALW
         NR IK+ L K T+   S+DW   L  AL+
Subjt:  SNREIKQILEK-TVRINSKDWAFKLGDALW

P26808 Gag-Pol polyprotein4.7e-1841.54Show/hide
Query:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI
        W IDF    P   GY Y++V VD  S WVEA  TK   A+VV + L + IF RFG PQ + +D G  F +K+ + +    GV  K   AY  Q++ Q E 
Subjt:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI

Query:  SNREIKQILEK-TVRINSKDWAFKLGDALW
         NR IK+ L K T+   S+DW   L  AL+
Subjt:  SNREIKQILEK-TVRINSKDWAFKLGDALW

P26809 Gag-Pol polyprotein6.1e-1840.77Show/hide
Query:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI
        W IDF    P   GY Y++V +D  S WVEA  TK   A+VV + L + IF RFG PQ + +D G  F +K+ + +    GV  K   AY  Q++ Q E 
Subjt:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI

Query:  SNREIKQILEK-TVRINSKDWAFKLGDALW
         NR IK+ L K T+   S+DW   L  AL+
Subjt:  SNREIKQILEK-TVRINSKDWAFKLGDALW

P26810 Gag-Pol polyprotein4.7e-1841.54Show/hide
Query:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI
        W IDF    P   GY Y++V VD  S WVEA  TK   A+VV + L + IF RFG PQ + +D G  F +K+ + +    GV  K   AY  Q++ Q E 
Subjt:  WGIDFMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEI

Query:  SNREIKQILEK-TVRINSKDWAFKLGDALW
         NR IK+ L K T+   S+DW   L  AL+
Subjt:  SNREIKQILEK-TVRINSKDWAFKLGDALW

Arabidopsis top hitse value%identityAlignment
AT3G60770.1 Ribosomal protein S13/S151.2e-1184.62Show/hide
Query:  GLAPEIPVDLYHLITEVVSIQKHLERNRKDKDSKFRLIL
        GLAPEIP DLYHLI + V+I+KHLERNRKDKDSKFRLIL
Subjt:  GLAPEIPVDLYHLITEVVSIQKHLERNRKDKDSKFRLIL

AT4G00100.1 ribosomal protein S13A1.2e-1184.62Show/hide
Query:  GLAPEIPVDLYHLITEVVSIQKHLERNRKDKDSKFRLIL
        GLAPEIP DLYHLI + V+I+KHLERNRKDKDSKFRLIL
Subjt:  GLAPEIPVDLYHLITEVVSIQKHLERNRKDKDSKFRLIL

ATMG00750.1 GAG/POL/ENV polyprotein6.1e-0565.62Show/hide
Query:  CQRTGNISRLHEMPLTPILEVELFDVWGIDFM
        CQR GN ++ +EMP   ILEVE+FDVWGI FM
Subjt:  CQRTGNISRLHEMPLTPILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGATCCATTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTAATTCGGGACGCCCCCACTCGCATGTCTCTACATGGACG
ATTTATGATCGCATCGTTTATACCATCTATAAAGTGGGTCGCGTCTCATAGTGTCTCCAGAATAAGAGTCTCGACGCTGGCTGAAGATGGCACCAAGAAAAGGACGTCGA
AGCGTCTCAACGCTGTGACGTCAGCGTCTCGACGCTGGATAACATATGAGCCAATTGATGATGAGCATTTTGAGCTCAAGGACAACCATCAGAAGATCCACACAGTCATT
TGCCTTCATTTTGGATATCTGTGGAATGGCTGGAGGAAACTTTTTAGCCAAAACGTTAAGGATGCTCCAGCTTTAGTAGAAGAGTTGGCTTTAACAAGTTACCAATGGCC
ATCTGAGCAATCAGAGTCCAAACCAAAGGGGGACCATGTGAAGCCAATGAATGGGAAGCAATTAAATGAAATAGAGAGAAAAGAAGGTAAGGAGCCCAACACTAGAAAGA
TATTGGATGAGGACAATGATCAAAACACAATAGAGGAAGAGATAGCACAACCAGTGCCAGATAAGTCTTCTGACTCTATTGCTTTAGTTCTTAACCAGATTCTGAAACTT
GGTTTGGGGGAATTGAAGTCAACCACCATTGTCCTGCAATTAGCAGATCATTTGATGACGTACCCAAAAGGTATATTAGAGGATGTTTTGGTTAATGTTGACAAATTCAT
TTTCCCTGCAGATTTTGTAGTGCTAGACATAGAGGAGGACCCCGGAGTGCCTATCATTCTTGGAAGACCCTTTTTAGCAACCACTCGTGAGTCAGAGTCTCCTATTGATC
GTTATGAAGTAAACATGTCTGGTGTGTTTGGAGATGAGCTAATAGATGTTACTCAGGATTGTGCTTTAATTTGGGATGAGATTGATAGACCTTTAGATCCAGGAAGGGAG
GATAAACTCATATTAGATGTTTGTGCTCAAGTTGATACCCCTCCCCTTGATTTGAAGCAATTATCTGCTCACCTACGTTATGCTTTCATAGGAGAATCTTCTACTTTTCC
TGTTATTATACCTTCTGATTTAAATAAAGAAACTAACCTTGTCCTAAATTGGGAGAAATGCCATTTTATGGTAAAAGAAGGTATAGTTTTGGGCCACAAAGTGTCTGAAA
AAGGTTTGGAGGTAGATCAGACAAAGATCTCTGCTATTGAGCAGTTGCCTCAACCCATTAATCTTAAGGAAAAATTGATTTTTGCAACTATCATTGTTGTGCTTGATTGG
AATCAACCATTTGAGATCATGTGTGATGCTAGTGACTATGCTATAGGAGCTTTTTTAGGGCAACGTCGTGATAAATTCTTTAGGCCTATATATTATGCAAGTAGAACTCT
AGATAGTGCTCAGCAGAATTACACCACTACTGAAAAAGAGATGATTGCTGTTGTATTTGCATTTGATAAATTTAGGCCCTATTTGCTTGGTACAAAGGTAATAGTGCATA
CTGACCAAGCTGCTATTCGTTGCCAGAGGACAGGTAACATTTCTAGACTCCATGAAATGCCCCTAACCCCTATTCTTGAAGTAGAACTTTTTGATGTGTGGGGCATAGAT
TTTATGGGCCATTTCCCCTCTTCCAATGGATACTCGTATATTATTGTTGTTGTTGATTATGTATCTAAATGGGTAGAGGCGATAGCTACCAAGACAAATGATGCACGTGT
TGTGTTGCGTTTTTTGCATAAAAATATTTTCACTAGATTTGGCACACCACAGGCTATAATTAGTGATGAAGGGTCCCATTTTTGCAATAAATTGTTTGAAGCAATGCTTA
AGAAATATGGTGTCAAGCATAAAACTTCTTTAGCTTATCATCTCCAAACTAACAGTCAAGCTGAGATTTCTAATAGGGAGATAAAGCAAATCCTGGAGAAGACAGTTCGC
ATCAACAGCAAAGATTGGGCATTCAAGTTGGGCGATGCTTTGTGGGTGATGTGGACTTTTCCTACTGCCAGCACACGTGTCAAACTTGACCGTTCTATGCCAACAACGCT
CCAACGAACGAAAACGCAACAGCCTCCTTGTCTTTTGATCGCTGGAAATTTCTTCTTGATTGACACAATAACCACAGAGGTTACACCGCTACCAGTGGGACCTAATGGAC
CTGCAGATCAGAAGCTCCAACGATCCCCCACCCTCAAGAGTTGTTCAACCGTCCAAGCTTATCCCTCCACCATTCATTTTCTTCTTCGATTTCCTTTAGCGTTTTCTTCT
CTTATTTCTTCCCCCGTCACAGCCTTCGACCAGCAGCCGCTCAAGCACCTCCTCGAGCGCCACTGTCAACCCTTGCAGCTCGGCGGCGGCGGTACCGTCAAGAGTTGTTC
GACGGGATGCGGCGACAGCAGCATGGAGCAGTTCGGCGGCGACAGCAGCATGGAGCAGTTCGGCGGCGACAGCGTGGGTAGTTTGGCGGCACCTTCACAACAGCGACGAC
AATGTATTCGACCCATCTTTCTTGTGATTTTCGTAAGATTTCGGGTTGTTGGTTCGTTGAGCACACGGATAGCTTCAGTTTTGAGGTCGCTTGTGGCGCTTTCCAGTGAT
TCCTTGCGTGGGTTTCCTTTTGCTGATAAGGGTTTAATTGTAGTTTGGAGTGAACTTAGAAGCTTGGAAAGTGAGCTTTGGGGCCGATTAGGACTAGCTCCTGAAATCCC
GGTGGATCTTTACCATCTCATCACGGAAGTTGTTTCAATCCAGAAGCATTTGGAGAGAAACAGGAAAGACAAGGATTCCAAGTTCAGGTTGATTTTAAGGCACGGCATAA
AACCCAACAGGCATAAAGGATCCTTGTCTTACGGACAAGTAATGAACAATGTTGACAAGGACCAATGGATTAAAACCATGGACCTGGAAATGGAGTCTATATACTTCAAT
TCCGCCTGGGATGTTGTAGATCAACCTGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGATCCATTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTAATTCGGGACGCCCCCACTCGCATGTCTCTACATGGACG
ATTTATGATCGCATCGTTTATACCATCTATAAAGTGGGTCGCGTCTCATAGTGTCTCCAGAATAAGAGTCTCGACGCTGGCTGAAGATGGCACCAAGAAAAGGACGTCGA
AGCGTCTCAACGCTGTGACGTCAGCGTCTCGACGCTGGATAACATATGAGCCAATTGATGATGAGCATTTTGAGCTCAAGGACAACCATCAGAAGATCCACACAGTCATT
TGCCTTCATTTTGGATATCTGTGGAATGGCTGGAGGAAACTTTTTAGCCAAAACGTTAAGGATGCTCCAGCTTTAGTAGAAGAGTTGGCTTTAACAAGTTACCAATGGCC
ATCTGAGCAATCAGAGTCCAAACCAAAGGGGGACCATGTGAAGCCAATGAATGGGAAGCAATTAAATGAAATAGAGAGAAAAGAAGGTAAGGAGCCCAACACTAGAAAGA
TATTGGATGAGGACAATGATCAAAACACAATAGAGGAAGAGATAGCACAACCAGTGCCAGATAAGTCTTCTGACTCTATTGCTTTAGTTCTTAACCAGATTCTGAAACTT
GGTTTGGGGGAATTGAAGTCAACCACCATTGTCCTGCAATTAGCAGATCATTTGATGACGTACCCAAAAGGTATATTAGAGGATGTTTTGGTTAATGTTGACAAATTCAT
TTTCCCTGCAGATTTTGTAGTGCTAGACATAGAGGAGGACCCCGGAGTGCCTATCATTCTTGGAAGACCCTTTTTAGCAACCACTCGTGAGTCAGAGTCTCCTATTGATC
GTTATGAAGTAAACATGTCTGGTGTGTTTGGAGATGAGCTAATAGATGTTACTCAGGATTGTGCTTTAATTTGGGATGAGATTGATAGACCTTTAGATCCAGGAAGGGAG
GATAAACTCATATTAGATGTTTGTGCTCAAGTTGATACCCCTCCCCTTGATTTGAAGCAATTATCTGCTCACCTACGTTATGCTTTCATAGGAGAATCTTCTACTTTTCC
TGTTATTATACCTTCTGATTTAAATAAAGAAACTAACCTTGTCCTAAATTGGGAGAAATGCCATTTTATGGTAAAAGAAGGTATAGTTTTGGGCCACAAAGTGTCTGAAA
AAGGTTTGGAGGTAGATCAGACAAAGATCTCTGCTATTGAGCAGTTGCCTCAACCCATTAATCTTAAGGAAAAATTGATTTTTGCAACTATCATTGTTGTGCTTGATTGG
AATCAACCATTTGAGATCATGTGTGATGCTAGTGACTATGCTATAGGAGCTTTTTTAGGGCAACGTCGTGATAAATTCTTTAGGCCTATATATTATGCAAGTAGAACTCT
AGATAGTGCTCAGCAGAATTACACCACTACTGAAAAAGAGATGATTGCTGTTGTATTTGCATTTGATAAATTTAGGCCCTATTTGCTTGGTACAAAGGTAATAGTGCATA
CTGACCAAGCTGCTATTCGTTGCCAGAGGACAGGTAACATTTCTAGACTCCATGAAATGCCCCTAACCCCTATTCTTGAAGTAGAACTTTTTGATGTGTGGGGCATAGAT
TTTATGGGCCATTTCCCCTCTTCCAATGGATACTCGTATATTATTGTTGTTGTTGATTATGTATCTAAATGGGTAGAGGCGATAGCTACCAAGACAAATGATGCACGTGT
TGTGTTGCGTTTTTTGCATAAAAATATTTTCACTAGATTTGGCACACCACAGGCTATAATTAGTGATGAAGGGTCCCATTTTTGCAATAAATTGTTTGAAGCAATGCTTA
AGAAATATGGTGTCAAGCATAAAACTTCTTTAGCTTATCATCTCCAAACTAACAGTCAAGCTGAGATTTCTAATAGGGAGATAAAGCAAATCCTGGAGAAGACAGTTCGC
ATCAACAGCAAAGATTGGGCATTCAAGTTGGGCGATGCTTTGTGGGTGATGTGGACTTTTCCTACTGCCAGCACACGTGTCAAACTTGACCGTTCTATGCCAACAACGCT
CCAACGAACGAAAACGCAACAGCCTCCTTGTCTTTTGATCGCTGGAAATTTCTTCTTGATTGACACAATAACCACAGAGGTTACACCGCTACCAGTGGGACCTAATGGAC
CTGCAGATCAGAAGCTCCAACGATCCCCCACCCTCAAGAGTTGTTCAACCGTCCAAGCTTATCCCTCCACCATTCATTTTCTTCTTCGATTTCCTTTAGCGTTTTCTTCT
CTTATTTCTTCCCCCGTCACAGCCTTCGACCAGCAGCCGCTCAAGCACCTCCTCGAGCGCCACTGTCAACCCTTGCAGCTCGGCGGCGGCGGTACCGTCAAGAGTTGTTC
GACGGGATGCGGCGACAGCAGCATGGAGCAGTTCGGCGGCGACAGCAGCATGGAGCAGTTCGGCGGCGACAGCGTGGGTAGTTTGGCGGCACCTTCACAACAGCGACGAC
AATGTATTCGACCCATCTTTCTTGTGATTTTCGTAAGATTTCGGGTTGTTGGTTCGTTGAGCACACGGATAGCTTCAGTTTTGAGGTCGCTTGTGGCGCTTTCCAGTGAT
TCCTTGCGTGGGTTTCCTTTTGCTGATAAGGGTTTAATTGTAGTTTGGAGTGAACTTAGAAGCTTGGAAAGTGAGCTTTGGGGCCGATTAGGACTAGCTCCTGAAATCCC
GGTGGATCTTTACCATCTCATCACGGAAGTTGTTTCAATCCAGAAGCATTTGGAGAGAAACAGGAAAGACAAGGATTCCAAGTTCAGGTTGATTTTAAGGCACGGCATAA
AACCCAACAGGCATAAAGGATCCTTGTCTTACGGACAAGTAATGAACAATGTTGACAAGGACCAATGGATTAAAACCATGGACCTGGAAATGGAGTCTATATACTTCAAT
TCCGCCTGGGATGTTGTAGATCAACCTGATGGATAG
Protein sequenceShow/hide protein sequence
MRGRDPFVQAPESALKGTTSLLSLIRDAPTRMSLHGRFMIASFIPSIKWVASHSVSRIRVSTLAEDGTKKRTSKRLNAVTSASRRWITYEPIDDEHFELKDNHQKIHTVI
CLHFGYLWNGWRKLFSQNVKDAPALVEELALTSYQWPSEQSESKPKGDHVKPMNGKQLNEIERKEGKEPNTRKILDEDNDQNTIEEEIAQPVPDKSSDSIALVLNQILKL
GLGELKSTTIVLQLADHLMTYPKGILEDVLVNVDKFIFPADFVVLDIEEDPGVPIILGRPFLATTRESESPIDRYEVNMSGVFGDELIDVTQDCALIWDEIDRPLDPGRE
DKLILDVCAQVDTPPLDLKQLSAHLRYAFIGESSTFPVIIPSDLNKETNLVLNWEKCHFMVKEGIVLGHKVSEKGLEVDQTKISAIEQLPQPINLKEKLIFATIIVVLDW
NQPFEIMCDASDYAIGAFLGQRRDKFFRPIYYASRTLDSAQQNYTTTEKEMIAVVFAFDKFRPYLLGTKVIVHTDQAAIRCQRTGNISRLHEMPLTPILEVELFDVWGID
FMGHFPSSNGYSYIIVVVDYVSKWVEAIATKTNDARVVLRFLHKNIFTRFGTPQAIISDEGSHFCNKLFEAMLKKYGVKHKTSLAYHLQTNSQAEISNREIKQILEKTVR
INSKDWAFKLGDALWVMWTFPTASTRVKLDRSMPTTLQRTKTQQPPCLLIAGNFFLIDTITTEVTPLPVGPNGPADQKLQRSPTLKSCSTVQAYPSTIHFLLRFPLAFSS
LISSPVTAFDQQPLKHLLERHCQPLQLGGGGTVKSCSTGCGDSSMEQFGGDSSMEQFGGDSVGSLAAPSQQRRQCIRPIFLVIFVRFRVVGSLSTRIASVLRSLVALSSD
SLRGFPFADKGLIVVWSELRSLESELWGRLGLAPEIPVDLYHLITEVVSIQKHLERNRKDKDSKFRLILRHGIKPNRHKGSLSYGQVMNNVDKDQWIKTMDLEMESIYFN
SAWDVVDQPDG