; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028093 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028093
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:12760852..12764822
RNA-Seq ExpressionLag0028093
SyntenyLag0028093
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]1.3e-14539.67Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRL---QVEALT
        WER+KD LRKCP H  P W+Q+Q FYNGL +  +T++D AAGG+ +AKT  +A  L+E +   +YQW SE+    +  KP    EV A+ L   Q+  LT
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRL---QVEALT

Query:  KILNKL----------IGSEIPSLPKVKVE---------------------------------------------------PKPEEIVQEKQNPLEEML-
        K ++ L           G E+ + P    E                                                   P P    QEK+  LE+ L 
Subjt:  KILNKL----------IGSEIPSLPKVKVE---------------------------------------------------PKPEEIVQEKQNPLEEML-

Query:  ------GNFINETRNLSVKQDIVIGKME--------AINRSNDNIHEGDP--------------------------------------------------
                F+ ET+     Q   I  +E         I+  N  +    P                                                  
Subjt:  ------GNFINETRNLSVKQDIVIGKME--------AINRSNDNIHEGDP--------------------------------------------------

Query:  ----TRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKC
            T +  QEEE + P+P   S  +   + QIP+PQR RK ++D +F+KFL++ R+L INIPFA+ALEQMP YAKFMK+ILS KRKF  HE + +T +C
Subjt:  ----TRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKC

Query:  SAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEE
        SAI+Q+ LP K KD GSF +PC +G+ F ++ALCDLG+SINL+P  V +K+G+GE+K TT+ LQ+ DRS+TYP G++EDVLV VD  IFPADF+VLDMEE
Subjt:  SAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEE

Query:  DPEGPIILGRPFLATARALIEVHKGTLTLRVEDDES---VSPADHYEISMPDVFGDELID------VNKEN--------------ALIDRFLGREEKLSL
        D +  +ILGRPFL T+R LI+V +G LTLRV ++++   V  A  +     D F  ELID        KEN              +  D  +  E  L L
Subjt:  DPEGPIILGRPFLATARALIEVHKGTLTLRVEDDES---VSPADHYEISMPDVFGDELID------VNKEN--------------ALIDRFLGREEKLSL

Query:  DMC-----------------------ARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-------------------------------------
        D                         + +  P+L LK LP HLRYA+LG S T PVII+A+L                                      
Subjt:  DMC-----------------------ARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-------------------------------------

Query:  -----------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFI
                                           IIY ISDS WVSP QVVPKKGGM VVKNE +E + TRTVTGWRVCIDYRKLN ATRKDHFPLPFI
Subjt:  -----------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFI

Query:  DQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        DQML RLAG+ YYCFLDGYSGYNQ+ IAPEDQEKT FTCP+GT+A+RRMPFG
Subjt:  DQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

XP_017970395.1 PREDICTED: uncharacterized protein LOC108660654 [Theobroma cacao]1.9e-14747.36Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
        WER+K+ L++CP H  PDWLQ+Q FYNGL    KT +D AAGG+ ++K   DA  L+EE+ L +YQW SE+  S       E + +S L +QV AL+K +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL

Query:  NKLIGSEIPSLPKVKVEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVK---QDIVIGKMEAINRSNDNIHEGD----PTRKLIQEE---ETAQPV
        + L                    V   QN + EM G+  +  +     N S K   Q I +   + I   N+ + E +       ++ ++E   E  + V
Subjt:  NKLIGSEIPSLPKVKVEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVK---QDIVIGKMEAINRSNDNIHEGD----PTRKLIQEE---ETAQPV

Query:  PDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSF
          K+     ++    P+PQR +K++L+ +F KF+++ ++LHINIPFA+ALEQMP Y KF+K ILS K K    ET+++T +CSAI+Q  LPLKLKD GSF
Subjt:  PDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSF

Query:  TLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARA
        T+P  +G+ F  KAL DLG SINL+P+ ++ KLGLGE K T + L+L DRS  YP+G++EDVLV VDKFIFP DF++LDMEED + PIILGRPFLATARA
Subjt:  TLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARA

Query:  LIEVHKGTLTLRVEDDE-----------SVSPADHYEISM-----PDVF--------------------GDELIDVNKENALIDRFLGREEKLSLDMCA-
        LI+V KG LTLRV+D +            V+  D + +S+      DVF                     DE I+ +       RF    +  SLD+   
Subjt:  LIEVHKGTLTLRVEDDE-----------SVSPADHYEISM-----PDVF--------------------GDELIDVNKENALIDRFLGREEKLSLDMCA-

Query:  -------RVNVPS-LELKQLPAHLR--YAFLGESSTFPVIISADLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKA
                +  PS LELK LPAHLR     + E     +I   D  IIY ISDS WVSPVQ V KKG MIVV N+ +E I TRTVTGWRVC+DY KLNKA
Subjt:  -------RVNVPS-LELKQLPAHLR--YAFLGESSTFPVIISADLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKA

Query:  TRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        TRKDHF  PFIDQML RLAG  YYCFL+GYS YNQ+AIAP+DQEKT FTCPYGT+AFRRMPFG
Subjt:  TRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.7e-16141.88Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
        WERYKD +R CPQHG PDWLQ+Q FYNGL    +TI+D A+GG+ ++KT + A +L+EE+   +YQW +E+  +       E E  +AL  QV +L+  +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL

Query:  NKLIGSEIP--------SLPKVKVEPKPEEIVQ------------------------------------------------EKQNPLEEMLGNFINETRN
        + L    IP        S   V +    +E VQ                                                EK+  LE+ + +F+ ET+ 
Subjt:  NKLIGSEIP--------SLPKVKVEPKPEEIVQ------------------------------------------------EKQNPLEEMLGNFINETRN

Query:  LSVKQDIVIGKMEA-----------------------------------------------------INRSNDNIHEGDPT--------RKLIQEE----
           K D  +  +E                                                      I RS     E  PT         K+ +EE    
Subjt:  LSVKQDIVIGKMEA-----------------------------------------------------INRSNDNIHEGDPT--------RKLIQEE----

Query:  ---ETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLP
           ET  P      D+ P++   +PYPQRF+K++LD +FSKFLDI +++HINIPFADALEQMP+YAKF+K+I+S KR+ E  ETV ++ +CSAIIQK LP
Subjt:  ---ETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLP

Query:  LKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILG
         KLKDPGSFTLPC +G+ F  K LCDLGASINL+P  VYRKLGLGE+K TTI LQL DRS+ YP+G++EDVLV VDKFIFPADFVVLDMEED E P+ILG
Subjt:  LKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILG

Query:  RPFLATARALIEVHKGTLTLRVEDDESV---------------------------------SPADHYE-------------------ISMPDVFGD----
        RPFLAT RAL++V KG LTLRV  +E                                    PADH E                     +P V  +    
Subjt:  RPFLATARALIEVHKGTLTLRVEDDESV---------------------------------SPADHYE-------------------ISMPDVFGD----

Query:  --------ELIDVNKENALIDRFLGREEK-LSLDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR---------------------------
                ++   +KE   ++  + + +  +S +   +   P  ELKQLP HLRYAFLG+S TFPVI++A L                            
Subjt:  --------ELIDVNKENALIDRFLGREEK-LSLDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR---------------------------

Query:  ---------------------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKAT
                                                     IIYAISDS WVSPVQVVPKKGGM VVKN+ +E I TRTVTGWRVC+DYRKLNKAT
Subjt:  ---------------------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKAT

Query:  RKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        RKDHFPLPFIDQML RLAGY YYCFLDGYSGYNQ+AIAPEDQEKT FTCPYGT+AFRRMPFG
Subjt:  RKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa]2.8e-15145.51Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSAL----------R
        WER+KD LRKCP HG   WLQ+  FYNGL  + +T++D AAGG+F+ K+  +A  L+EE+ +T+ QW +E+  S   +K     EV A+          +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSAL----------R

Query:  LQVEALTKILNKLIGSEIPSLPKVKVEPKPE-EIVQEKQNPLEEML-----GNFINETRNLSVKQDIVIGKMEAINRSNDNIHEGDPTRKLIQEEETAQP
         Q++  T+  +   G +   L +   E +   + +Q +   L   +     GN +  T  ++ K++      +AI   +   ++G P++   + +E  QP
Subjt:  LQVEALTKILNKLIGSEIPSLPKVKVEPKPE-EIVQEKQNPLEEML-----GNFINETRNLSVKQDIVIGKMEAINRSNDNIHEGDPTRKLIQEEETAQP

Query:  VP----DKSSDSI-------PLVLN---QIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAI
         P     K++D +       P+ ++   +IPYPQR RK   D +FSKFL++ R+LHINIPFA+ALEQMP Y KFMKEILS KRK E+ ETV +T +CSAI
Subjt:  VP----DKSSDSI-------PLVLN---QIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAI

Query:  IQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPE
        +QK LP KLKDPGSFT+PC +G +    ALCDLGASINL+P  V+++L LGE K TT+ LQL DRS+ +P+GV+EDVLV VDKFIFPADF+VLDMEED  
Subjt:  IQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPE

Query:  GPIILGRPFLATARALIEVHKG--------TLTLRVEDDESVSPADHYEISM-PDVFGDELIDVNKENALIDRFLGREEKLSLDMCARVNVPSLELKQLP
         PIILGRPFLAT +ALI++ +          LTL  E+       + YE +M  D +G       +E  +I          +    +    P LELK LP
Subjt:  GPIILGRPFLATARALIEVHKG--------TLTLRVEDDESVSPADHYEISM-PDVFGDELIDVNKENALIDRFLGREEKLSLDMCARVNVPSLELKQLP

Query:  AHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DLRIIYAI
         HLRY +LGE+ T PVI+++                                                                        D  + Y I
Subjt:  AHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DLRIIYAI

Query:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
        SDS WVSPVQVVPKKGGM VVKNEK+E I TRTVTGWR+CIDYRKLNKATRKDHFPLPFIDQML +LAG  YYCFLDGYSGY+Q+AIAPEDQEKT FTCP
Subjt:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP

Query:  YGTYAFRRMPFG
        YGT+AFRRMPFG
Subjt:  YGTYAFRRMPFG

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]8.1e-14338.59Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKP--CENEEVSALRLQVEALTK
        WER+KD  RKCP HG PDWL +Q FYNGL    +  +D AAGG+ ++K+ ++A  L+EE+   +YQW +E+   + K+ P   + + ++ L  +V++L K
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKP--CENEEVSALRLQVEALTK

Query:  ILNKL-----IGSEI-----------------------------------------------------------------------PSLPKVK-------
        + +KL     + S +                                                                       PS P+ K       
Subjt:  ILNKL-----IGSEI-----------------------------------------------------------------------PSLPKVK-------

Query:  -------------VEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVKQDI---------------VIGKMEAINRSNDNIHEGDPTRKLIQEEETA
                     +E K +++    +N +E  LG   N        NL  K ++                +G++       D +   +  +K+ +E E  
Subjt:  -------------VEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVKQDI---------------VIGKMEAINRSNDNIHEGDPTRKLIQEEETA

Query:  QPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDP
           P       P V   IP+PQR ++ ++D +F KFL + RQLHINIPFADAL Q+P Y KF+KEI+S KRK E+ ET+ +T +CSAIIQ  LP KL+DP
Subjt:  QPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDP

Query:  GSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLAT
        GSF++PC +G +   +ALCDLGAS++L+P  V RKLGL ELK TTI LQL DRS+ YP G+LE+VL+ V KFI P DF+VL+MEED E PIILGRPFLAT
Subjt:  GSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLAT

Query:  ARALIEVHKGTLTLRVEDDE---SVSPADHYEISMPDVFGDELIDVNK-------------ENALIDRFLGREEKLSLDMCA------------------
        A A+I+V  G LTL+V ++E   ++  A  Y      VF  +++D +              E  L+     +++ L +   A                  
Subjt:  ARALIEVHKGTLTLRVEDDE---SVSPADHYEISMPDVFGDELIDVNK-------------ENALIDRFLGREEKLSLDMCA------------------

Query:  ------------RVNVPSLELKQLPAHLRYAFLGESSTFPVIIS--------------------------ADLR--------------------------
                     V  P LELK LP+HL YAFLGE++T PVI+S                          +DLR                          
Subjt:  ------------RVNVPSLELKQLPAHLRYAFLGESSTFPVIIS--------------------------ADLR--------------------------

Query:  --------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCF
                            IIY ISDS W+SPVQVVPKKGGM VV NE +E I TRTVTGWRVCIDYRKLN  TRKDHFPLPF+DQ+L RLAGY YYCF
Subjt:  --------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCF

Query:  LDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        LDGYSGYNQ++I+PEDQEKT FTCPYGT+AFRRMPFG
Subjt:  LDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

TrEMBL top hitse value%identityAlignment
A0A2G9HYA0 Reverse transcriptase6.2e-14140.46Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
        W R++  LR CP H  P  +Q+  FY+GL    K  LD   G SFL+ T  +   L+  L    Y+ +SE+          E ++V+AL  +++ L + +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL

Query:  NKLIGSEIPSLPKV---------------------------KVEPKP-------------------------------------EEIVQEKQNPLEEMLG
             +++   P                             K +  P                                     ++ +QEK+  LEE L 
Subjt:  NKLIGSEIPSLPKV---------------------------KVEPKP-------------------------------------EEIVQEKQNPLEEMLG

Query:  NFINETRNLSVKQDIVIGKM-EAIN----------------------------RSNDNIHE--GDPTR---KLIQEEETAQPVPDKSSDSIPLVLNQIPY
         F+  T       +  IG++  AIN                            R+   + E   +PT+   K +  EE  + V      S P  L Q P+
Subjt:  NFINETRNLSVKQDIVIGKM-EAIN----------------------------RSNDNIHE--GDPTR---KLIQEEETAQPVPDKSSDSIPLVLNQIPY

Query:  PQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCD
        PQR +K++L+ +F KFL++ ++LHINIPFA+ALEQMP Y KFMK+ILS KR+  ++ETV +T +CSAIIQ  LP KLKDPGSFT+PC +G+ F  +ALCD
Subjt:  PQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCD

Query:  LGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVED--
        LGASINL+P+ +YR LGLGE K T+I LQL DRS+TYPKGV+ED+LV VDKFIFPADFVVLDME D E PIILGRPFLAT R LI+V KG LT+RV+D  
Subjt:  LGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVED--

Query:  -----------------------------DESVS--PADHYEISMPDVFGDELIDVNKENALI------DRFLGREEKLSLDMCARVNV--------PSL
                                     +ES++  P D  E ++ D+  +E    N+E+  +       +FL      SL+      V        P+L
Subjt:  -----------------------------DESVS--PADHYEISMPDVFGDELIDVNKENALI------DRFLGREEKLSLDMCARVNV--------PSL

Query:  ELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DL
        ELK LP+HL YA+LGES T PVIIS+                                                                        D 
Subjt:  ELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DL

Query:  RIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEK
         IIY ISDS WVSPVQ VPKKGG+ VV N  +E I TRTVTGWRVC+DYRKLNKATRKDHFPLPFIDQML RLAG  +YCFLDGYSGYNQ+AIAPEDQEK
Subjt:  RIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEK

Query:  TKFTCPYGTYAFRRMPFG
        T FTCPYGT+AFRRMPFG
Subjt:  TKFTCPYGTYAFRRMPFG

A0A2G9HYD8 Reverse transcriptase5.3e-14041.3Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
        W R++  LR CP H  P  +Q+  FY+GL    K  LD   G SFL+ T  +   L+  L    Y+ +SE+       +  E ++V+AL  +++ L + +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL

Query:  NKL-------IGSEIPSLPKVKVEPKP-----------------------------------------EEIVQEKQNPLEEMLGNFINETRNLSVKQDIV
                      + S+  V    KP                                         ++ +QEK+  LEE L  F+          +  
Subjt:  NKL-------IGSEIPSLPKVKVEPKP-----------------------------------------EEIVQEKQNPLEEMLGNFINETRNLSVKQDIV

Query:  IGKM-EAINR----SNDNIHEGDPT---------------RKLIQEEETAQPVPDKSSDSI----------PLVLN-----QIPYPQRFRKKRLDAEFSK
        IG++  AIN     S  +  E +P                RKL  +E   +P   K  + I          PL ++     Q P+PQ+ +K++L+ +F K
Subjt:  IGKM-EAINR----SNDNIHEGDPT---------------RKLIQEEETAQPVPDKSSDSI----------PLVLN-----QIPYPQRFRKKRLDAEFSK

Query:  FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
        FL++ ++LHINIPFA+ALEQMP Y KFMK+ILS KR+  ++ET  +T +C+AIIQ  LP KLKDPGSFT+PC +G+ F  +ALCDLGASINL+P+ +YR 
Subjt:  FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK

Query:  LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---SVSPADHYEISM
        LGLGE K T+I LQL DRS+TYPKGV+ED+LV VDKFIFPADFVVLDME D E PIILGRPFLAT R LI+V KG LT+RV+D +   +V  A  +    
Subjt:  LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---SVSPADHYEISM

Query:  PDVFGDELID-------------VNKENALIDRF-LGREEKL------------------SLDMCARVNV--------PSLELKQLPAHLRYAFLGESST
         + F   L D              + E AL+D    G EE L                  SL+      V        P+LELK LP HL Y +LGES T
Subjt:  PDVFGDELID-------------VNKENALIDRF-LGREEKL------------------SLDMCARVNV--------PSLELKQLPAHLRYAFLGESST

Query:  FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
         PVIIS+                                                                        D  IIY ISDS WVSPVQ VP
Subjt:  FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP

Query:  KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        KKGG+ VV N  +E I TRTVTGWRVC+DYRKLNKATRKDHFPLPFIDQML RLAG  +YCFLDGYSGYNQ+AIAPEDQEKT FTCPYGT+AFRRMPFG
Subjt:  KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

A0A2P5F9V5 Uncharacterized protein1.6e-14153.21Show/hide
Query:  DPTRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSA
        + T++++   E  +P      D+ P  +  IPYPQRF+K++LD  FSKFLD+ ++LHINIPFADALEQM  Y KFMKEILSNKRK E +ETV +T +CSA
Subjt:  DPTRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSA

Query:  IIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDP
        I+QK LP KLKDPGSFT+PC +G+ F  KALCDLGASINL+P  VY+KLGLGE K TT+ LQL DRS+ YP+G +EDVLV VD+FIFPADF+VLDMEED 
Subjt:  IIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDP

Query:  EGPIILGRPFLATARALIEVHKGTLTLRVEDD------------ESVSPADHYEISMPDVFGDELIDVNKENALIDRF--------------LGREEKLS
        +  IILGRPFLAT R LI+V KG L LRV+++              V       I    +     + + K    + R               LG      
Subjt:  EGPIILGRPFLATARALIEVHKGTLTLRVEDD------------ESVSPADHYEISMPDVFGDELIDVNKENALIDRF--------------LGREEKLS

Query:  LDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-----------------------------------------------------------
        L   + V  P LELK L AHLRYAFLGESST PVIISA L                                                            
Subjt:  LDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-----------------------------------------------------------

Query:  -------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGY
                     IIY ISDS WVSPVQVVPKKGGM VV+N+ +E I TRTVTGWRVCIDYRKLNKATRKDHFPLPF+DQML RLAGYP+YCFLDGYSGY
Subjt:  -------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGY

Query:  NQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        NQ+AIAPEDQEKT FTCPYGT+A+RR+PFG
Subjt:  NQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

A0A6P6GDJ4 Reverse transcriptase5.3e-14039.8Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQP-ESIIKEKPCENEEVSALRLQVEA-LTK
        WER+K+ LR+CP HG+P W+Q+Q FYNGL    K ++D AAGGS + K   +A  L+EE+   +YQ+ SE+      + K  E+ +++ L  Q+     K
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQP-ESIIKEKPCENEEVSALRLQVEA-LTK

Query:  ILNKLIG-----------------SEIPSLPKV--------------------KVEPKPEEIVQ--------EKQNPLEEMLGNFI----NETRNLSVKQ
         LN   G                 ++I ++  +                    + E  P+E VQ        +   P E   G  I    +E   +    
Subjt:  ILNKLIG-----------------SEIPSLPKV--------------------KVEPKPEEIVQ--------EKQNPLEEMLGNFI----NETRNLSVKQ

Query:  DIV--------IGKMEAINRSNDNIHEG--DPTRKLIQ---EEETAQPVPDKSSDSIPLVLNQI-------------------PYPQRFRKKRLDAEFSK
         +V        +   EA  R ++ + +    P  K I     E  + P   K  +  P +   I                   P+P RFR  +LD +F K
Subjt:  DIV--------IGKMEAINRSNDNIHEG--DPTRKLIQ---EEETAQPVPDKSSDSIPLVLNQI-------------------PYPQRFRKKRLDAEFSK

Query:  FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
        FLD+ +QLH+NIPF DALEQMP Y KF+KEILSNKR++EN+E V ++ + SA +   +P KLKDPGSF +PC +G  +   ALCDLGASINL+P+ +YRK
Subjt:  FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK

Query:  LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---------------
        LG+G++K TT+ LQ+ DRS+  P+G+LEDVLV V+KFIFPADFV+LDMEED   PIILGRPFLAT RALI+V +  +TLRV ++E               
Subjt:  LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---------------

Query:  ------SVSPADHY-----------EISMPDVFGDELIDVNKENALID-------RFLGREEKLSLDMCARV----NVPSLELKQLPAHLRYAFLGESST
               V   D              + + ++FG  + +  +E  +ID        ++G+ E L      ++    + P ++LK LP+HL+Y FLG ++T
Subjt:  ------SVSPADHY-----------EISMPDVFGDELIDVNKENALID-------RFLGREEKLSLDMCARV----NVPSLELKQLPAHLRYAFLGESST

Query:  FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
         PVIIS+                                                                        D  IIY ISDS WVSP+QVVP
Subjt:  FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP

Query:  KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
        KKGGM V +N+K E I TR VTGWRVCIDYRKLNKATRKDHFPLPFIDQML RLAG  YYCFLDGYSGYNQ+AIAP+DQEKT FTCPYGT+A+RRMPFG
Subjt:  KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG

A0A6P8DKJ2 uncharacterized protein LOC1162042312.7e-13637.65Show/hide
Query:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
        WER+K+ +RKCP HG PD L I+ FY  L    ++++D AAGG+ + K   +A  L+EE+  +++ WQ+E+ +S +     + + ++ L  Q+ ALT  +
Subjt:  WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL

Query:  NKLIGS-----------EIPSLPKVKVE---------PKPEEI-----------------------------------------------------VQEK
        +KL  +           E+ S P   +E         P  E++                                                      Q+ 
Subjt:  NKLIGS-----------EIPSLPKVKVE---------PKPEEI-----------------------------------------------------VQEK

Query:  QNPLEEMLGNFINETRNLSVKQDIVI----GKMEAINRSNDNIHEGD-------------------------PTRKLIQEEETAQ--------PVPDKSS
        Q+ +EE++ +++ +T  +   Q   I    G++  I++   N   G                            RK   +EE+ +          P + S
Subjt:  QNPLEEMLGNFINETRNLSVKQDIVI----GKMEAINRSNDNIHEGD-------------------------PTRKLIQEEETAQ--------PVPDKSS

Query:  DSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQK---HLPLKLKDPGSFTL
          +   +  +P+P R ++++LDA+F+KFLD+ ++L INIPFA+AL+QMP YA+FMK++L+ KRKF+  E V +T +CS I+QK   +LP K +D GSFT+
Subjt:  DSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQK---HLPLKLKDPGSFTL

Query:  PCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALI
        PC +G+   +  L D GASINL+P  ++RKLGLGE K T I LQL DRS+ YPKG++E+VLV VDKFIFP DF+VL+MEED E P+ILGRPFLAT +ALI
Subjt:  PCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALI

Query:  EVHKGTLTLRVEDD-------ESVSPADHYEISMPDVFGDELIDVN-KENALIDRFLG------------REEKLSLDMCARVNV---------------
        +V +G LTLRV ++       +++   D  +        DELI  + +E A +D                  E+ S++  + +                 
Subjt:  EVHKGTLTLRVEDD-------ESVSPADHYEISMPDVFGDELIDVN-KENALIDRFLG------------REEKLSLDMCARVNV---------------

Query:  -----PSLELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------
             P LELK LP+HL+YA+LG   T P+IIS+                                                                  
Subjt:  -----PSLELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------

Query:  ------DLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVA
              D  IIY ISDS WVSPVQVVPKKGGM VVKNE ++ I TRTVTGWRVCIDYRKLN ATRKDHFPLPFIDQML +LAG+ YYCFLDGYSGYNQ+ 
Subjt:  ------DLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVA

Query:  IAPEDQEKTKFTCPYGTYAFRRMPFG
        IAPEDQEKT FTCPYGT+AFRRMPFG
Subjt:  IAPEDQEKTKFTCPYGTYAFRRMPFG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.6e-1337.93Show/hide
Query:  IYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTK
        I   S+SP+ SP+ VVPKK         +D S   +    +R+ IDYRKLN+ T  D  P+P +D++L +L    Y+  +D   G++Q+ + PE   KT 
Subjt:  IYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTK

Query:  FTCPYGTYAFRRMPFG
        F+  +G Y + RMPFG
Subjt:  FTCPYGTYAFRRMPFG

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.8e-1239.29Show/hide
Query:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
        S SP+ SP  VV KKG        K            R+ ID+RKLN+ T  D +P+P I  +L  L    ++  LD  SGY+Q+ +A  D+EKT F+  
Subjt:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP

Query:  YGTYAFRRMPFG
         G Y F R+PFG
Subjt:  YGTYAFRRMPFG

P20825 Retrovirus-related Pol polyprotein from transposon 2971.4e-1238.39Show/hide
Query:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
        S+SP+ SP  VVPKK          D S + +    +RV IDYRKLN+ T  D +P+P +D++L +L    Y+  +D   G++Q+ +  E   KT F+  
Subjt:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP

Query:  YGTYAFRRMPFG
         G Y + RMPFG
Subjt:  YGTYAFRRMPFG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-1741.13Show/hide
Query:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
        S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+Q+ + P+D+ KT F  P
Subjt:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP

Query:  YGTYAFRRMPFGPGENPQIGSPYM
         G Y +  MPFG    P   + YM
Subjt:  YGTYAFRRMPFGPGENPQIGSPYM

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-1741.13Show/hide
Query:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
        S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+Q+ + P+D+ KT F  P
Subjt:  SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP

Query:  YGTYAFRRMPFGPGENPQIGSPYM
         G Y +  MPFG    P   + YM
Subjt:  YGTYAFRRMPFGPGENPQIGSPYM

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCTTTGCGAACAGCTTGATGCACT
ATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTC
GGTATCCTAAACTAGTTAACGACTTTACATCTAGTGGTTACTATTTTGCGGTCCGGTCTTATGCAAACTCATTGCATAGGATACCCCCACCCGCATGTCTCCTACATGGA
TGCTTTGGATCATTGCATCTGTATCGAATACAAGATAGAGCTAGGGATTGGAGGAGGAGCAATTGGGAAAGATATAAGGATTTTCTAAGGAAATGCCCACAACATGGATA
CCCTGACTGGCTTCAGATACAATTTTTCTACAATGGGTTAAAAAGGGATTTTAAGACAATTCTTGACACTGCGGCTGGAGGAAGCTTCTTAGCCAAAACTGTCAAGGATG
CTCGAACCCTAGTAGAAGAATTGCCCTTAACAAGTTATCAGTGGCAGTCAGAGCAACCAGAATCCATAATTAAGGAGAAACCATGTGAGAATGAAGAAGTGAGTGCTTTA
AGGTTGCAAGTCGAAGCTCTAACCAAGATTCTAAACAAGCTCATAGGATCAGAAATTCCAAGTTTACCTAAGGTTAAAGTGGAGCCTAAACCTGAGGAAATAGTTCAAGA
GAAGCAAAACCCTTTGGAGGAAATGCTAGGGAATTTCATAAATGAGACTAGGAATTTATCAGTAAAACAAGATATTGTCATTGGTAAGATGGAAGCAATAAATAGATCAA
ATGACAATATTCATGAAGGAGATCCAACAAGGAAACTTATCCAAGAGGAAGAGACAGCACAACCAGTGCCAGATAAGTCTTCTGATTCTATTCCTTTAGTTCTTAATCAA
ATTCCGTATCCTCAGAGATTTCGAAAGAAAAGGTTGGATGCTGAATTTTCAAAATTTCTTGATATTTTGAGACAGCTCCATATTAACATTCCTTTTGCTGATGCTTTGGA
ACAGATGCCACACTATGCTAAGTTTATGAAGGAAATATTATCAAATAAAAGGAAGTTCGAGAACCATGAGACAGTAAATATGACAGCCAAGTGCAGTGCCATAATCCAGA
AACATCTACCACTAAAATTAAAAGATCCAGGGAGTTTCACTCTCCCTTGTATCGTTGGTAGTTTGTTCATTAAAAAAGCTTTGTGCGACTTAGGGGCGAGTATAAACCTA
ATACCATTCATTGTTTACAGGAAACTTGGTTTGGGGGAATTAAAGTCAACTACAATTGTTTTGCAATTACCAGATCGTTCAATGACATACCCTAAAGGTGTGTTAGAGGA
TGTTCTGGTCAACGTTGACAAATTCATATTCCCTGCAGACTTTGTAGTTTTAGACATGGAGGAAGACCCCGAAGGCCCGATCATTCTTGGGAGACCATTTTTGGCAACCG
CTCGCGCTTTGATTGAGGTACATAAGGGAACCCTGACTCTCAGAGTGGAAGATGACGAGTCAGTGTCTCCTGCTGATCATTATGAAATAAGCATGCCTGACGTATTTGGA
GATGAATTAATAGATGTTAATAAGGAAAATGCTTTAATTGATAGATTTTTAGGAAGGGAAGAAAAATTATCATTAGATATGTGTGCTCGAGTCAATGTCCCTTCCCTTGA
ATTAAAACAATTGCCTGCTCACCTACGCTATGCTTTCTTAGGAGAATCTTCTACTTTCCCCGTTATCATTTCTGCTGATTTAAGAATTATTTATGCTATATCTGACAGTC
CTTGGGTGAGTCCTGTTCAAGTAGTGCCTAAGAAAGGAGGGATGATTGTTGTTAAGAATGAGAAAGATGAATCAATTTCCACTAGGACAGTCACTGGGTGGCGTGTGTGT
ATTGATTATAGGAAGTTAAACAAAGCCACTAGAAAAGATCATTTCCCTCTCCCCTTTATCGATCAAATGCTTTATAGGCTAGCAGGATATCCGTATTATTGTTTTCTGGA
TGGTTATTCGGGATATAATCAAGTTGCGATTGCGCCTGAAGACCAAGAGAAAACAAAATTCACGTGTCCCTATGGTACTTATGCCTTCCGGCGTATGCCATTTGGACCGG
GTGAAAACCCACAAATTGGCTCTCCATACATGGATGGAAATGGCCTCTTTTATAGTTGTCAAGCAAGCACAACGTGCGACGCTACGTCACAGCATCGCGATGCTACATTG
ACTTCGGCTCCAAGTCGTTACTTTGGCTTGGGCTCCAAGTCGTCGCAGCATCACGATGCTACATTTCCAGAGCATTCTTGCACTGCTGCCAAGCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCTTTGCGAACAGCTTGATGCACT
ATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTC
GGTATCCTAAACTAGTTAACGACTTTACATCTAGTGGTTACTATTTTGCGGTCCGGTCTTATGCAAACTCATTGCATAGGATACCCCCACCCGCATGTCTCCTACATGGA
TGCTTTGGATCATTGCATCTGTATCGAATACAAGATAGAGCTAGGGATTGGAGGAGGAGCAATTGGGAAAGATATAAGGATTTTCTAAGGAAATGCCCACAACATGGATA
CCCTGACTGGCTTCAGATACAATTTTTCTACAATGGGTTAAAAAGGGATTTTAAGACAATTCTTGACACTGCGGCTGGAGGAAGCTTCTTAGCCAAAACTGTCAAGGATG
CTCGAACCCTAGTAGAAGAATTGCCCTTAACAAGTTATCAGTGGCAGTCAGAGCAACCAGAATCCATAATTAAGGAGAAACCATGTGAGAATGAAGAAGTGAGTGCTTTA
AGGTTGCAAGTCGAAGCTCTAACCAAGATTCTAAACAAGCTCATAGGATCAGAAATTCCAAGTTTACCTAAGGTTAAAGTGGAGCCTAAACCTGAGGAAATAGTTCAAGA
GAAGCAAAACCCTTTGGAGGAAATGCTAGGGAATTTCATAAATGAGACTAGGAATTTATCAGTAAAACAAGATATTGTCATTGGTAAGATGGAAGCAATAAATAGATCAA
ATGACAATATTCATGAAGGAGATCCAACAAGGAAACTTATCCAAGAGGAAGAGACAGCACAACCAGTGCCAGATAAGTCTTCTGATTCTATTCCTTTAGTTCTTAATCAA
ATTCCGTATCCTCAGAGATTTCGAAAGAAAAGGTTGGATGCTGAATTTTCAAAATTTCTTGATATTTTGAGACAGCTCCATATTAACATTCCTTTTGCTGATGCTTTGGA
ACAGATGCCACACTATGCTAAGTTTATGAAGGAAATATTATCAAATAAAAGGAAGTTCGAGAACCATGAGACAGTAAATATGACAGCCAAGTGCAGTGCCATAATCCAGA
AACATCTACCACTAAAATTAAAAGATCCAGGGAGTTTCACTCTCCCTTGTATCGTTGGTAGTTTGTTCATTAAAAAAGCTTTGTGCGACTTAGGGGCGAGTATAAACCTA
ATACCATTCATTGTTTACAGGAAACTTGGTTTGGGGGAATTAAAGTCAACTACAATTGTTTTGCAATTACCAGATCGTTCAATGACATACCCTAAAGGTGTGTTAGAGGA
TGTTCTGGTCAACGTTGACAAATTCATATTCCCTGCAGACTTTGTAGTTTTAGACATGGAGGAAGACCCCGAAGGCCCGATCATTCTTGGGAGACCATTTTTGGCAACCG
CTCGCGCTTTGATTGAGGTACATAAGGGAACCCTGACTCTCAGAGTGGAAGATGACGAGTCAGTGTCTCCTGCTGATCATTATGAAATAAGCATGCCTGACGTATTTGGA
GATGAATTAATAGATGTTAATAAGGAAAATGCTTTAATTGATAGATTTTTAGGAAGGGAAGAAAAATTATCATTAGATATGTGTGCTCGAGTCAATGTCCCTTCCCTTGA
ATTAAAACAATTGCCTGCTCACCTACGCTATGCTTTCTTAGGAGAATCTTCTACTTTCCCCGTTATCATTTCTGCTGATTTAAGAATTATTTATGCTATATCTGACAGTC
CTTGGGTGAGTCCTGTTCAAGTAGTGCCTAAGAAAGGAGGGATGATTGTTGTTAAGAATGAGAAAGATGAATCAATTTCCACTAGGACAGTCACTGGGTGGCGTGTGTGT
ATTGATTATAGGAAGTTAAACAAAGCCACTAGAAAAGATCATTTCCCTCTCCCCTTTATCGATCAAATGCTTTATAGGCTAGCAGGATATCCGTATTATTGTTTTCTGGA
TGGTTATTCGGGATATAATCAAGTTGCGATTGCGCCTGAAGACCAAGAGAAAACAAAATTCACGTGTCCCTATGGTACTTATGCCTTCCGGCGTATGCCATTTGGACCGG
GTGAAAACCCACAAATTGGCTCTCCATACATGGATGGAAATGGCCTCTTTTATAGTTGTCAAGCAAGCACAACGTGCGACGCTACGTCACAGCATCGCGATGCTACATTG
ACTTCGGCTCCAAGTCGTTACTTTGGCTTGGGCTCCAAGTCGTCGCAGCATCACGATGCTACATTTCCAGAGCATTCTTGCACTGCTGCCAAGCGTTGA
Protein sequenceShow/hide protein sequence
MDRTWTVVGVGGSLRVPSSGIPGEEDQVGLCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRYPKLVNDFTSSGYYFAVRSYANSLHRIPPPACLLHG
CFGSLHLYRIQDRARDWRRSNWERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSAL
RLQVEALTKILNKLIGSEIPSLPKVKVEPKPEEIVQEKQNPLEEMLGNFINETRNLSVKQDIVIGKMEAINRSNDNIHEGDPTRKLIQEEETAQPVPDKSSDSIPLVLNQ
IPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINL
IPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDESVSPADHYEISMPDVFG
DELIDVNKENALIDRFLGREEKLSLDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVC
IDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFGPGENPQIGSPYMDGNGLFYSCQASTTCDATSQHRDATL
TSAPSRYFGLGSKSSQHHDATFPEHSCTAAKR