; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G006030 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G006030
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBeta-galactosidase
Genome locationCG_Chr04:20327641..20335675
RNA-Seq ExpressionClCG04G006030
SyntenyClCG04G006030
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]6.3e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0045897.1 Beta-galactosidase [Cucumis melo var. makuwa]6.3e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0045977.1 Beta-galactosidase [Cucumis melo var. makuwa]6.1e-28247.63Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS++I++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS
           KL +   VYDF+A          GRIL QRP P+L    F  R +  +T+   V   L    +PL         T  MG+   CVS  RN G  RIS
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS

Query:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------
         G+  V  Q T      T   +A+    QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A                                  
Subjt:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------

Query:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------
                                        D+SSG  IGTA H+RGLY LDDDTS                                           
Subjt:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------

Query:  ------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ
                     SDVWG S+VTTSS K WFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ
Subjt:  ------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ

Query:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------
        + CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                  
Subjt:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------

Query:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR
                                                          GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+
Subjt:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR

Query:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK
              +Q                           ++END++D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK
Subjt:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK

Query:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------
        +P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L       
Subjt:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------

Query:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG
                            VLLSVAVNKDW  YQLDVKNAFLNGDL EE                     K    ++   ++   +     + Q ++QG
Subjt:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG

Query:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN
        HSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N
Subjt:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN

Query:  AKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS
         KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTS
Subjt:  AKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS

Query:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        GYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]6.3e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.0e-27345.73Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +   +     ++T P +++  T            QSL L+S++GK PWILDSGATDHLTGSS+HF+SY   A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  --------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF
                                                     SDVWG S+VTTSS K WFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF
Subjt:  --------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF

Query:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKT
        + K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT  +CLK 
Subjt:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKT

Query:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------
         YP+TRL+ +VPLR                                                                    GE+ SEE+N +       
Subjt:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------

Query:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES
           ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++D+      ++       EV       
Subjt:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES

Query:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK
        E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK
Subjt:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK

Query:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------
         VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAFLNGDL EE                  
Subjt:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------

Query:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE
           K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+E
Subjt:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE

Query:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG
        GISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPG
Subjt:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG

Query:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        KGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

TrEMBL top hitse value%identityAlignment
A0A5A7TVI6 Beta-galactosidase3.1e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

A0A5A7TVR1 Beta-galactosidase3.0e-28247.63Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS++I++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS
           KL +   VYDF+A          GRIL QRP P+L    F  R +  +T+   V   L    +PL         T  MG+   CVS  RN G  RIS
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS

Query:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------
         G+  V  Q T      T   +A+    QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A                                  
Subjt:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------

Query:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------
                                        D+SSG  IGTA H+RGLY LDDDTS                                           
Subjt:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------

Query:  ------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ
                     SDVWG S+VTTSS K WFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ
Subjt:  ------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ

Query:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------
        + CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                  
Subjt:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------

Query:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR
                                                          GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+
Subjt:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR

Query:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK
              +Q                           ++END++D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK
Subjt:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK

Query:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------
        +P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L       
Subjt:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------

Query:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG
                            VLLSVAVNKDW  YQLDVKNAFLNGDL EE                     K    ++   ++   +     + Q ++QG
Subjt:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG

Query:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN
        HSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N
Subjt:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN

Query:  AKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS
         KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTS
Subjt:  AKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS

Query:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        GYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

A0A5A7TW20 Beta-galactosidase3.1e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-27345.73Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +   +     ++T P +++  T            QSL L+S++GK PWILDSGATDHLTGSS+HF+SY   A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  --------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF
                                                     SDVWG S+VTTSS K WFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF
Subjt:  --------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF

Query:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKT
        + K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT  +CLK 
Subjt:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLFECLKT

Query:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------
         YP+TRL+ +VPLR                                                                    GE+ SEE+N +       
Subjt:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------

Query:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES
           ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++D+      ++       EV       
Subjt:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES

Query:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK
        E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK
Subjt:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK

Query:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------
         VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAFLNGDL EE                  
Subjt:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------

Query:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE
           K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+E
Subjt:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE

Query:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG
        GISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPG
Subjt:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG

Query:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        KGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

A0A5A7VLQ7 Beta-galactosidase3.1e-27144.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS K WFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKCWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT  +CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KL +  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-5026.47Show/hide
Query:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDV G     T   K +FV F+D  T     +L+  KS+V S+FQ F    +  FN K+V L  DN R +L+N + +F   KGI +     + PQ N V 
Subjt:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVL-------------------HLQT---------------------------------
        ER  R + E AR++ S   L    WG+ +LTAT+LINR+PSR L                   HL+                                  
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVL-------------------HLQT---------------------------------

Query:  -LFECLKTLYPTTR-LIVD----VPLRG--------ESSSEETNCSSPLAPIDILPTPLP----------VLR--RISERKCYSLLIRRL----------
         L++ +   +   R ++VD    V  R         + S E  N + P     I+ T  P           L+  + SE K +    R++          
Subjt:  -LFECLKTLYPTTR-LIVD----VPLRG--------ESSSEETNCSSPLAPIDILPTPLP----------VLR--RISERKCYSLLIRRL----------

Query:  ---------QSQHLNQHRL-KLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEES-DKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFK--F
                  S+  N++ L +  +  R+D   ++  + N   S  ++ +E ++E   D P + D    + I  R+     TK P  S+   +N   K   
Subjt:  ---------QSQHLNQHRL-KLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEES-DKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFK--F

Query:  RAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV----------------------
         A T+  D       I    +   W+ AI  E+ A + NNTW     P+   +V  +WVF++KY       RYKA LV                      
Subjt:  RAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV----------------------

Query:  -----LLSVAVNKDWSFYQLDVKNAFLNGDLEEE---KDTVWIETVTKSVV---RKIYYLCQVQ--------------EFTQGHSDHTLF-TKRSSLGKV
             +LS+ +  +   +Q+DVK AFLNG L+EE   +    I   + +V    + IY L Q                EF     D  ++   + ++ + 
Subjt:  -----LLSVAVNKDWSFYQLDVKNAFLNGDLEEE---KDTVWIETVTKSVV---RKIYYLCQVQ--------------EFTQGHSDHTLF-TKRSSLGKV

Query:  VVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPM--ECNAKLRDPVDKVPINK
        + +++YVDD+V++  D   +N  K                           + I +SQ  Y   +L +  M  C     P+  + N +L +  +      
Subjt:  VVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPM--ECNAKLRDPVDKVPINK

Query:  ERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKT---DRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWG-N
           +SL+  L+Y ++ TRPD++ A++++S++      E  + + R+LRYLK T    L+F+K    + K I  Y DSDWAGS ID+KST+GY   ++  N
Subjt:  ERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKT---DRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWG-N

Query:  LVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLK
        L+ W +K+Q+ V  SS EAEY A+ + + E +WLK
Subjt:  LVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-6628.55Show/hide
Query:  DTSSSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQ
        D   SDV G   + +     +FVTFIDD +R  WV++L  K +V  +FQ+F+  ++ +   K+  L+SDN   + +    E+ S+ GI H+      PQ 
Subjt:  DTSSSDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQ

Query:  NEVVERKNRHLLEVARS-LCSTSLPSYLWGDPILTATHLINRMPSRVLHL---QTLFECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDIL----
        N V ER NR ++E  RS L    LP   WG+ + TA +LINR PS  L     + ++   +  Y   ++            + T       P   +    
Subjt:  NEVVERKNRHLLEVARS-LCSTSLPSYLWGDPILTATHLINRMPSRVLHL---QTLFECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDIL----

Query:  ---------PTPLPVLR------RISERKCYSLLIRRLQSQHL-------NQHRLKLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEESDKPEEYDAS
                 P    V+R      R SE +  + +  ++++  +       +      +     D   + G      +  G Q  E VEE     +  +  
Subjt:  ---------PTPLPVLR------RISERKCYSLLIRRLQSQHL-------NQHRLKLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEESDKPEEYDAS

Query:  LDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPE---WKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYK
          +  + R   ES  +YP+  +V             +S D    P+++   +  PE      A+ EEM +L+KN T++ V +PKG + + CKWVF +K  
Subjt:  LDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPE---WKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYK

Query:  SDETLDRYKADLV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----------------------KDTVWIETV
         D  L RYKA LV                           +LS+A + D    QLDVK AFL+GDLEEE                      K    ++  
Subjt:  SDETLDRYKADLV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----------------------KDTVWIETV

Query:  TKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKS-----------------------REGIS----VSQRKYT
         +    K     + Q + + +SD  ++ KR S    ++L++YVDD+++ G D   I KLK                        RE  S    +SQ KY 
Subjt:  TKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKS-----------------------REGIS----VSQRKYT

Query:  IDLLKETGMTGCKPADNPMECNAKLRDPV------DKVPINKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGL
          +L+   M   KP   P+  + KL   +      +K  + K  Y S V  L+Y +V TRPDI++A+ VVS+F++ P +EH E V  ILRYL+ T G  L
Subjt:  IDLLKETGMTGCKPADNPMECNAKLRDPV------DKVPINKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGL

Query:  MFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
         F  +D   ++ YTD+D AG + ++KS++GY     G  ++W+SK Q  V  S+ EAEY A ++   E IWLK+
Subjt:  MFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

P92519 Uncharacterized mitochondrial protein AtMg008101.4e-3134.5Show/hide
Query:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQ
        L++YVDDI+L+G     +N L                             G+ +SQ KY   +L   GM  CKP   P+           K P +   ++
Subjt:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQ

Query:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ
        S+V  L YL  TRPDISYA+++V Q M  P     +++ R+LRY+K T   GL   K  +  ++A+ DSDWAG    ++ST+G+CTF+  N+++W +K+Q
Subjt:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ

Query:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR
          V RSS E EYRA++    E  W    R
Subjt:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-6727.51Show/hide
Query:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDVW +  ++  + + ++V F+D  TR TW++ L  KS+V   F  F   ++ +F  ++    SDN   F+   L E+ S  GI H +   + P+ N + 
Subjt:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLFE------------------CLKTLYP------------------------
        ERK+RH++E   +L S  S+P   W      A +LINR+P+ +L L++ F+                  C   L P                        
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLFE------------------CLKTLYP------------------------

Query:  -------TTRLIVDVPLR-GESSSEETNCSSPLAPID--------------ILPTPLPVL---------RRISERKCYSLLIRRLQSQHLN---QHRLKL
               T+RL +   +R  E+    +N  + L+P+                LPT  PVL            +     S   R  Q    N         
Subjt:  -------TTRLIVDVPLR-GESSSEETNCSSPLAPID--------------ILPTPLPVL---------RRISERKCYSLLIRRLQSQHLN---QHRLKL

Query:  AENDRNDMTKDNG---VTRNAEVSTGAQESEIVEEE---SDKPEEYDASLDMPIALRKGTESCTKYPTYSFVS-------------------------YN
          +      + NG    T+  +  T    S+   +    ++ P +   SL  P      + S T   + S  S                          N
Subjt:  AENDRNDMTKDNG---VTRNAEVSTGAQESEIVEEE---SDKPEEYDASLDMPIALRKGTESCTKYPTYSFVS-------------------------YN

Query:  NLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGH-KLVGCKWVFTMKYKSDETLDRYKADLV---
          S   RA              VSL   + P+    A++   W+ A+  E+ A   N+TW+ V  P  H  +VGC+W+FT KY SD +L+RYKA LV   
Subjt:  NLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGH-KLVGCKWVFTMKYKSDETLDRYKADLV---

Query:  ------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVVRK----IYYLCQVQE--------------
                                +L VAV++ W   QLDV NAFL G L ++    +   +I+    + V K    +Y L Q                 
Subjt:  ------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVVRK----IYYLCQVQE--------------

Query:  FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEIN----------KLKSRE---------------GISVSQRKYTIDLLKETGMTGCKPADNP
        F    SD +LF  +     +V ++VYVDDI+++G+D   ++           +K  E               G+ +SQR+Y +DLL  T M   KP   P
Subjt:  FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEIN----------KLKSRE---------------GISVSQRKYTIDLLKETGMTGCKPADNP

Query:  MECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDK
        M  + KL         +   Y+ +V  L YL  TRPDISYA++ +SQFM  P EEH++ + RILRYL  TP  G+  +K +   + AY+D+DWAG   D 
Subjt:  MECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDK

Query:  KSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
         ST+GY  ++  + ++W SKKQ  VVRSS EAEYR+++    E  W+
Subjt:  KSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-6327.32Show/hide
Query:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDVW +  ++  + + ++V F+D  TR TW++ L  KS+V   F  F + ++ +F  ++  L SDN   F+   L ++LS  GI H +   + P+ N + 
Subjt:  SDVWGTSRVTTSSVKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLFE------------------CLKTLYPTTR---------------------
        ERK+RH++E+  +L S  S+P   W      A +LINR+P+ +L LQ+ F+                  C   L P  R                     
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLFE------------------CLKTLYPTTR---------------------

Query:  -LIVDVPLR----------------------GESSSEETNCSS----------PLAPIDILPTP--LPVLRRISERKCYS---LLIRRLQSQHLNQHRLK
         L + +P                        G S+S+E    S          P  P+ +LP P  L      S R   S   L   ++ S +L    + 
Subjt:  -LIVDVPLR----------------------GESSSEETNCSS----------PLAPIDILPTP--LPVLRRISERKCYS---LLIRRLQSQHLNQHRLK

Query:  LAENDRNDMTKDNGVTRNA---EVSTGAQESEIV-------------EEESDKPEEYDASLDMPIALRKGTE------SCTKYPTYSFV-----------
           +        NG    A   +       S I+              + S  P+   +S  +P      +E      S T  P    V           
Subjt:  LAENDRNDMTKDNGVTRNA---EVSTGAQESEIV-------------EEESDKPEEYDASLDMPIALRKGTE------SCTKYPTYSFV-----------

Query:  --SYNNLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV-TIPKGHKLVGCKWVFTMKYKSDETLDRYKAD
            N  S   RA            +  SL   + P+    AM+   W+ A+  E+ A   N+TW+ V   P    +VGC+W+FT K+ SD +L+RYKA 
Subjt:  --SYNNLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV-TIPKGHKLVGCKWVFTMKYKSDETLDRYKAD

Query:  LV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVV----RKIYYLCQVQE---------
        LV                           +L VAV++ W   QLDV NAFL G L +E    +   +++      V    + IY L Q            
Subjt:  LV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVV----RKIYYLCQVQE---------

Query:  -----FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAA----EINKLKSR---------------------EGISVSQRKYTIDLLKETGMTGCK
             F    SD +LF  +     ++ ++VYVDDI+++G+D       ++ L  R                     +G+ +SQR+YT+DLL  T M   K
Subjt:  -----FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAA----EINKLKSR---------------------EGISVSQRKYTIDLLKETGMTGCK

Query:  PADNPMECNAKLR-DPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA
        P   PM  + KL      K+P +   Y+ +V  L YL  TRPD+SYA++ +SQ+M  P ++H   + R+LRYL  TP  G+  +K +   + AY+D+DWA
Subjt:  PADNPMECNAKLR-DPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA

Query:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
        G   D  ST+GY  ++  + ++W SKKQ  VVRSS EAEYR+++    E  W+
Subjt:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-6333.33Show/hide
Query:  SCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV--
        S T +    F+SY  +S  + +F V +     P     A E   W  A+ +E+ A+E  +TWE  T+P   K +GCKWV+ +KY SD T++RYKA LV  
Subjt:  SCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV--

Query:  -------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE-------------KDTVWIETVTKSVVRKIYYLCQVQE--------
                                 +L+++   +++ +QLD+ NAFLNGDL+EE              D++    V   + + IY L Q           
Subjt:  -------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE-------------KDTVWIETVTKSVVRKIYYLCQVQE--------

Query:  ------FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKSR-------------------------EGISVSQRKYTIDLLKETGMTGC
              F Q HSDHT F K ++   + VL VYVDDI++  ++ A +++LKS+                          GI++ QRKY +DLL ETG+ GC
Subjt:  ------FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKSR-------------------------EGISVSQRKYTIDLLKETGMTGC

Query:  KPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA
        KP+  PM+ +           ++ + Y+ L+ RL+YL  TR DIS+A++ +SQF +AP   H + V +IL Y+K T G+GL +       ++ ++D+ + 
Subjt:  KPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA

Query:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
             ++ST+GYC F+  +L++W+SKKQ +V +SSAEAEYRA+S    E +WL
Subjt:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

ATMG00240.1 Gag-Pol-related retrotransposon family protein7.5e-1237.8Show/hide
Query:  IYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFV
        +YL  TRPD+++A++ +SQF  A     M+ V ++L Y+K T G+GL +  T    ++A+ DSDWA     ++S +G+C+ V
Subjt:  IYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.0e-3234.5Show/hide
Query:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQ
        L++YVDDI+L+G     +N L                             G+ +SQ KY   +L   GM  CKP   P+           K P +   ++
Subjt:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQ

Query:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ
        S+V  L YL  TRPDISYA+++V Q M  P     +++ R+LRY+K T   GL   K  +  ++A+ DSDWAG    ++ST+G+CTF+  N+++W +K+Q
Subjt:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ

Query:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR
          V RSS E EYRA++    E  W    R
Subjt:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1141.3Show/hide
Query:  TYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV
        T S    N L+ K+ + T++      PK++  A++ P W  A+ EE+ AL +N TW  V  P    ++GCKWVF  K  SD TLDR KA LV
Subjt:  TYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCGTGGTGACGAAACCCTAGAAACCCTAGAAACCCTAGCAGTCATTGACGAAAACGAATCTCAGCACGATCTGCCGCCATCGTCGATGGAGTCCGTTGT
CACCGTCGAATCCGCCGTCCAAGAAGCTGTCAACAGCTTCAAGGCTGCCGCAAGTCAAAACCCTAACGCCACGGCGCCGATGGGATATGTAGCAGCCGCAACTCCCACCG
TTCGGAACGCCACCGCCGCTGCTGTCGGGTCGTCAGAACAAATACAGTCGTCGAAGTCTCGCGTGTCTGCTCCCTTCTTCGTCGCGAGTTCCTCATCTTCGTCGTCGGGG
TTTGGTGGCCAGCCGAGGTCCGATTGTCTTCCATATGCTGCTACCAGCACTGCCCTCTGTCCCAAAGTCGCCGGCGCCCAGTCTTTACTTCCGACCACCGTGGCTACCAC
CAGATCTGGTGGCTTTTACTCGATTCCAAGGCTACTGCCGGATCTGGAGCGCATCTGGAAGGGAGAGGACTCCTTACTTCGTTCTCTGTTGATTCACAACATGGAACCTC
AAATCGTAAAACCTCTGTTGTATGCCGCTACTGCTCTAGACATCTGGGATGCAGTTCAGAAGCTATATTCCAAACGACAGAATGCATCGTGGCTTTACACCTTAAGGAAA
CAGATTCATGAATGCAAGCAAGGAACTATGGATGTCACATCATATTTTAACAAACTATCCCTTATTTGGCAAGAAATGGATTGTGCCGAGAACTCATCTGGGATTGGCTA
TATGGAGGAGTTCAATATTATAAAATTGAGGAAATTGACCGTGTATGATTTTGTAGCAGGAAGGATACTGGGACAGAGACCGAGACCAACTCTTAAGTTTGTTTCGAGGT
TCGACTTGAGGAAGACAGATCCACCTTCAGTGCAAAAACATCTGGTTCCACTGGAGACAAGCAAAATGGGAAACCACCTCCAGTGTGTGAGCGCTGTAAGAAACCCTGGC
ATAGAAAGGATCAGTGCTGGAAGTTACATGGTCGACCACCAAATGACAAACGAGGACCTTCGAACGACAAGCCCAATCCTAGCTAGGCCCTATACATCTCAATCCTTGAG
TCTCCTTAGTGTTAATGGCAAGAAGCCCTGGATTCTTGATTCAGGGGCTACAGATCATCTGACTGGATCCTCTGATCATTTTCTCTCATATCATCCACATGCTGACTTAA
GCTCGGGGACGATGATTGGCACTGCCTGGCACAATAGGGGACTCTACTCCCTTGATGATGATACTTCTTCTAGTGATGTTTGGGGAACCTCGAGAGTTACTACCTCTTCG
GTTAAGTGCTGGTTTGTGACCTTCATCGATGACCACACTCGTCTTACTTGGGTCTTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTTACACTAC
CATTAAGACTCAATTTAATGCCAAAATGGTCATCCTTCAGAGTGACAATGATCGTGGGTTCCTTACTAATACCCTTAGTGAGTTTTTATCCACTAAAGGTATTGTTCATC
AGAGCTTGTGTGCCTATAACCCCCAACAAAATGAAGTGGTTGAAAGAAAAAATCGTCATCTCCTCGAGGTTGCCCGATCCCTATGCTCTACCTCTCTTCCATCGTACTTG
TGGGGGGACCCAATTTTGACTGCAACCCACCTTATAAATCGGATGCCTTCCCGTGTGCTCCACCTCCAAACCCTTTTTGAGTGCCTCAAAACCCTCTACCCTACCACACG
CCTTATTGTTGATGTTCCTCTCCGGGGAGAGAGTAGTAGTGAAGAGACTAACTGTTCATCACCTTTAGCTCCCATTGATATTCTTCCTACACCCCTTCCTGTCCTAAGAA
GAATCTCAGAAAGGAAGTGTTATTCCCTACTGATTCGCCGGCTCCAGTCCCAACACCTGAACCAACACAGGCTTAAGTTGGCTGAGAATGACAGGAATGATATGACAAAA
GACAACGGGGTTACTAGAAATGCTGAGGTGTCAACAGGAGCACAGGAAAGTGAGATCGTTGAGGAGGAATCCGACAAACCAGAGGAGTATGATGCCTCTCTTGACATGCC
CATTGCTCTGAGAAAGGGCACCGAGTCCTGCACCAAATATCCCACGTATAGCTTCGTCTCTTATAATAATCTGTCTTTTAAGTTCAGAGCATTTACAGTCAGTCTTGACA
CTGTAACAATACCAAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGACTGCCATCATGGAAGAAATGAGGGCTCTTGAGAAGAATAATACATGGGAATGTGTA
ACCATCCCTAAAGGGCATAAACTAGTTGGGTGCAAGTGGGTGTTTACAATGAAGTATAAATCTGATGAAACTCTAGACAGATATAAGGCAGACTTGGTCCTTCTTTCGGT
AGCTGTCAACAAAGACTGGTCTTTTTATCAACTGGATGTGAAGAATGCATTTTTAAATGGTGATTTAGAAGAAGAGAAAGACACTGTATGGATTGAAACAGTCACCAAGA
GTGTGGTTCGAAAGATTTACTACCTTTGTCAAGTTCAAGAGTTCACTCAAGGACACTCTGATCACACTTTGTTTACAAAAAGGTCAAGTTTGGGTAAAGTTGTCGTCCTA
ATTGTGTATGTTGATGACATTGTCTTATCCGGAGACGATGCTGCTGAGATCAACAAACTAAAGTCGAGAGAAGGAATCTCAGTTTCACAACGGAAATACACTATTGATCT
TCTGAAAGAAACAGGTATGACTGGATGTAAGCCTGCTGACAACCCTATGGAATGTAATGCCAAATTGAGAGACCCTGTTGACAAAGTCCCAATTAATAAAGAAAGGTACC
AAAGCCTTGTTGCAAGACTAATTTACCTAGTTCACACTCGACCAGACATTTCCTATGCTATCAGTGTTGTTAGCCAATTTATGCAGGCACCCTACGAAGAACACATGGAA
GTTGTAAACCGAATTCTGAGGTATTTGAAAAGTACTCCAGGAAAAGGTTTGATGTTTAGAAAAACCGACAGGAAATGCATTGAGGCCTATACCGACTCTGACTGGGCAGG
ATCAGTAATTGACAAAAAATCTACATCTGGGTATTGTACCTTTGTTTGGGGTAATCTAGTAACGTGGAGAAGTAAGAAGCAAAGTATGGTGGTTAGAAGCAGCGCTGAAG
CTGAATATAGAGCCATGAGTAAGGGTATTTGTGAAGAAATCTGGTTGAAGAAAAAGAGAGATGAAACAAACGAAGTATCACTTCGAAAGATCAAAGATTGTGCTCACCTA
AAATCTGTTGAAGGATCGGTACCCTGCCAACCCATGTCCTTCCAGAGCTCAGATTTAAGAGGTGGAAGCTCCCTGTCTGGATATGCCAACCTCCATAACCTTTTCAGAGC
ATCCTACGCAATAGCAAGATTTGGAGCTAAAACGAGTCAAAATGGTCCAATAATGAATACTAAAGACAGACCGGGACCAAAGGATCAGTGGGGAAGAAAACGAGGAAAAG
AAGCAAAGCAGCCCAAAAACAAGGCAGTGCCGAGGCACAAAAACCCAAGACCTACCTTATCAGGCAGCGCCACTCTGTGTTGCGGCAAAAGGTCTATTTTTGGCAACAAC
TTGGTGATCGAGCTATGTTCTTTCTACAACACACTCCCCAACAAACCTCAAGAAATGAACTTCCTCTCTGAAAGGCGGACAGAGCCTAAAGAAAGGCTTACAACAAAAGC
CCTCAATAGGGTTAGGACTCCAAAGACGAGTATCCCATCTCCCAACCCTAAAATTGCCTCTTACAACAAAGAAAGAAGAGAGGCAACCTCCGTCGTTTCCCTAAGCGTTC
CATTGATCTTCGTAATGCAACGGACCAAATGGGAAAAAGAAGAGAGCTCAAGTGAAATATCTTTCCACGGGTTTCAGTGTGTGCCTTTAGCCCCTTTAACCACCCAATCA
AAAGGATATTTCCAATCTCCAACACTCCCTGATTCAGGAGGCACCCAACAACCTCCCAACTTACCAGATGAGAACCCTTCCCTTCATCCATCCATTCCCAAAGAAAGTCT
CTCATGTACTTCTCAAGGCTCTTACAGATTGAACTCAGAGATCAAAATAGGGAGAAAACCTAATTCTTTCCTTGTTCACAAGGAAAGACAAGTTTCTCAAAAAAAGCGGC
CAAAGGGCATTGGGGGTAAACTTTTGGAGCTGTTGGTTGAGAGAATTCAAAGTCTCCGGGTTTATAATAGAGATGGCATAGCTCAACTGAAGGAGGCTAACTTCTCTGCA
ATCTTGTGGGGCATTTTGATTGAAAATTTGAGAGGAATAATAGATTTTTTAAAGAGGTGGAGAGATCAGGAGAGGAAGTTCCAGAGGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCGTGGTGACGAAACCCTAGAAACCCTAGAAACCCTAGCAGTCATTGACGAAAACGAATCTCAGCACGATCTGCCGCCATCGTCGATGGAGTCCGTTGT
CACCGTCGAATCCGCCGTCCAAGAAGCTGTCAACAGCTTCAAGGCTGCCGCAAGTCAAAACCCTAACGCCACGGCGCCGATGGGATATGTAGCAGCCGCAACTCCCACCG
TTCGGAACGCCACCGCCGCTGCTGTCGGGTCGTCAGAACAAATACAGTCGTCGAAGTCTCGCGTGTCTGCTCCCTTCTTCGTCGCGAGTTCCTCATCTTCGTCGTCGGGG
TTTGGTGGCCAGCCGAGGTCCGATTGTCTTCCATATGCTGCTACCAGCACTGCCCTCTGTCCCAAAGTCGCCGGCGCCCAGTCTTTACTTCCGACCACCGTGGCTACCAC
CAGATCTGGTGGCTTTTACTCGATTCCAAGGCTACTGCCGGATCTGGAGCGCATCTGGAAGGGAGAGGACTCCTTACTTCGTTCTCTGTTGATTCACAACATGGAACCTC
AAATCGTAAAACCTCTGTTGTATGCCGCTACTGCTCTAGACATCTGGGATGCAGTTCAGAAGCTATATTCCAAACGACAGAATGCATCGTGGCTTTACACCTTAAGGAAA
CAGATTCATGAATGCAAGCAAGGAACTATGGATGTCACATCATATTTTAACAAACTATCCCTTATTTGGCAAGAAATGGATTGTGCCGAGAACTCATCTGGGATTGGCTA
TATGGAGGAGTTCAATATTATAAAATTGAGGAAATTGACCGTGTATGATTTTGTAGCAGGAAGGATACTGGGACAGAGACCGAGACCAACTCTTAAGTTTGTTTCGAGGT
TCGACTTGAGGAAGACAGATCCACCTTCAGTGCAAAAACATCTGGTTCCACTGGAGACAAGCAAAATGGGAAACCACCTCCAGTGTGTGAGCGCTGTAAGAAACCCTGGC
ATAGAAAGGATCAGTGCTGGAAGTTACATGGTCGACCACCAAATGACAAACGAGGACCTTCGAACGACAAGCCCAATCCTAGCTAGGCCCTATACATCTCAATCCTTGAG
TCTCCTTAGTGTTAATGGCAAGAAGCCCTGGATTCTTGATTCAGGGGCTACAGATCATCTGACTGGATCCTCTGATCATTTTCTCTCATATCATCCACATGCTGACTTAA
GCTCGGGGACGATGATTGGCACTGCCTGGCACAATAGGGGACTCTACTCCCTTGATGATGATACTTCTTCTAGTGATGTTTGGGGAACCTCGAGAGTTACTACCTCTTCG
GTTAAGTGCTGGTTTGTGACCTTCATCGATGACCACACTCGTCTTACTTGGGTCTTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTTACACTAC
CATTAAGACTCAATTTAATGCCAAAATGGTCATCCTTCAGAGTGACAATGATCGTGGGTTCCTTACTAATACCCTTAGTGAGTTTTTATCCACTAAAGGTATTGTTCATC
AGAGCTTGTGTGCCTATAACCCCCAACAAAATGAAGTGGTTGAAAGAAAAAATCGTCATCTCCTCGAGGTTGCCCGATCCCTATGCTCTACCTCTCTTCCATCGTACTTG
TGGGGGGACCCAATTTTGACTGCAACCCACCTTATAAATCGGATGCCTTCCCGTGTGCTCCACCTCCAAACCCTTTTTGAGTGCCTCAAAACCCTCTACCCTACCACACG
CCTTATTGTTGATGTTCCTCTCCGGGGAGAGAGTAGTAGTGAAGAGACTAACTGTTCATCACCTTTAGCTCCCATTGATATTCTTCCTACACCCCTTCCTGTCCTAAGAA
GAATCTCAGAAAGGAAGTGTTATTCCCTACTGATTCGCCGGCTCCAGTCCCAACACCTGAACCAACACAGGCTTAAGTTGGCTGAGAATGACAGGAATGATATGACAAAA
GACAACGGGGTTACTAGAAATGCTGAGGTGTCAACAGGAGCACAGGAAAGTGAGATCGTTGAGGAGGAATCCGACAAACCAGAGGAGTATGATGCCTCTCTTGACATGCC
CATTGCTCTGAGAAAGGGCACCGAGTCCTGCACCAAATATCCCACGTATAGCTTCGTCTCTTATAATAATCTGTCTTTTAAGTTCAGAGCATTTACAGTCAGTCTTGACA
CTGTAACAATACCAAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGACTGCCATCATGGAAGAAATGAGGGCTCTTGAGAAGAATAATACATGGGAATGTGTA
ACCATCCCTAAAGGGCATAAACTAGTTGGGTGCAAGTGGGTGTTTACAATGAAGTATAAATCTGATGAAACTCTAGACAGATATAAGGCAGACTTGGTCCTTCTTTCGGT
AGCTGTCAACAAAGACTGGTCTTTTTATCAACTGGATGTGAAGAATGCATTTTTAAATGGTGATTTAGAAGAAGAGAAAGACACTGTATGGATTGAAACAGTCACCAAGA
GTGTGGTTCGAAAGATTTACTACCTTTGTCAAGTTCAAGAGTTCACTCAAGGACACTCTGATCACACTTTGTTTACAAAAAGGTCAAGTTTGGGTAAAGTTGTCGTCCTA
ATTGTGTATGTTGATGACATTGTCTTATCCGGAGACGATGCTGCTGAGATCAACAAACTAAAGTCGAGAGAAGGAATCTCAGTTTCACAACGGAAATACACTATTGATCT
TCTGAAAGAAACAGGTATGACTGGATGTAAGCCTGCTGACAACCCTATGGAATGTAATGCCAAATTGAGAGACCCTGTTGACAAAGTCCCAATTAATAAAGAAAGGTACC
AAAGCCTTGTTGCAAGACTAATTTACCTAGTTCACACTCGACCAGACATTTCCTATGCTATCAGTGTTGTTAGCCAATTTATGCAGGCACCCTACGAAGAACACATGGAA
GTTGTAAACCGAATTCTGAGGTATTTGAAAAGTACTCCAGGAAAAGGTTTGATGTTTAGAAAAACCGACAGGAAATGCATTGAGGCCTATACCGACTCTGACTGGGCAGG
ATCAGTAATTGACAAAAAATCTACATCTGGGTATTGTACCTTTGTTTGGGGTAATCTAGTAACGTGGAGAAGTAAGAAGCAAAGTATGGTGGTTAGAAGCAGCGCTGAAG
CTGAATATAGAGCCATGAGTAAGGGTATTTGTGAAGAAATCTGGTTGAAGAAAAAGAGAGATGAAACAAACGAAGTATCACTTCGAAAGATCAAAGATTGTGCTCACCTA
AAATCTGTTGAAGGATCGGTACCCTGCCAACCCATGTCCTTCCAGAGCTCAGATTTAAGAGGTGGAAGCTCCCTGTCTGGATATGCCAACCTCCATAACCTTTTCAGAGC
ATCCTACGCAATAGCAAGATTTGGAGCTAAAACGAGTCAAAATGGTCCAATAATGAATACTAAAGACAGACCGGGACCAAAGGATCAGTGGGGAAGAAAACGAGGAAAAG
AAGCAAAGCAGCCCAAAAACAAGGCAGTGCCGAGGCACAAAAACCCAAGACCTACCTTATCAGGCAGCGCCACTCTGTGTTGCGGCAAAAGGTCTATTTTTGGCAACAAC
TTGGTGATCGAGCTATGTTCTTTCTACAACACACTCCCCAACAAACCTCAAGAAATGAACTTCCTCTCTGAAAGGCGGACAGAGCCTAAAGAAAGGCTTACAACAAAAGC
CCTCAATAGGGTTAGGACTCCAAAGACGAGTATCCCATCTCCCAACCCTAAAATTGCCTCTTACAACAAAGAAAGAAGAGAGGCAACCTCCGTCGTTTCCCTAAGCGTTC
CATTGATCTTCGTAATGCAACGGACCAAATGGGAAAAAGAAGAGAGCTCAAGTGAAATATCTTTCCACGGGTTTCAGTGTGTGCCTTTAGCCCCTTTAACCACCCAATCA
AAAGGATATTTCCAATCTCCAACACTCCCTGATTCAGGAGGCACCCAACAACCTCCCAACTTACCAGATGAGAACCCTTCCCTTCATCCATCCATTCCCAAAGAAAGTCT
CTCATGTACTTCTCAAGGCTCTTACAGATTGAACTCAGAGATCAAAATAGGGAGAAAACCTAATTCTTTCCTTGTTCACAAGGAAAGACAAGTTTCTCAAAAAAAGCGGC
CAAAGGGCATTGGGGGTAAACTTTTGGAGCTGTTGGTTGAGAGAATTCAAAGTCTCCGGGTTTATAATAGAGATGGCATAGCTCAACTGAAGGAGGCTAACTTCTCTGCA
ATCTTGTGGGGCATTTTGATTGAAAATTTGAGAGGAATAATAGATTTTTTAAAGAGGTGGAGAGATCAGGAGAGGAAGTTCCAGAGGTGGTAG
Protein sequenceShow/hide protein sequence
MVSERGDETLETLETLAVIDENESQHDLPPSSMESVVTVESAVQEAVNSFKAAASQNPNATAPMGYVAAATPTVRNATAAAVGSSEQIQSSKSRVSAPFFVASSSSSSSG
FGGQPRSDCLPYAATSTALCPKVAGAQSLLPTTVATTRSGGFYSIPRLLPDLERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRK
QIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEFNIIKLRKLTVYDFVAGRILGQRPRPTLKFVSRFDLRKTDPPSVQKHLVPLETSKMGNHLQCVSAVRNPG
IERISAGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHADLSSGTMIGTAWHNRGLYSLDDDTSSSDVWGTSRVTTSS
VKCWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSLCSTSLPSYL
WGDPILTATHLINRMPSRVLHLQTLFECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQHRLKLAENDRNDMTK
DNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV
TIPKGHKLVGCKWVFTMKYKSDETLDRYKADLVLLSVAVNKDWSFYQLDVKNAFLNGDLEEEKDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVL
IVYVDDIVLSGDDAAEINKLKSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLRDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHME
VVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKKKRDETNEVSLRKIKDCAHL
KSVEGSVPCQPMSFQSSDLRGGSSLSGYANLHNLFRASYAIARFGAKTSQNGPIMNTKDRPGPKDQWGRKRGKEAKQPKNKAVPRHKNPRPTLSGSATLCCGKRSIFGNN
LVIELCSFYNTLPNKPQEMNFLSERRTEPKERLTTKALNRVRTPKTSIPSPNPKIASYNKERREATSVVSLSVPLIFVMQRTKWEKEESSSEISFHGFQCVPLAPLTTQS
KGYFQSPTLPDSGGTQQPPNLPDENPSLHPSIPKESLSCTSQGSYRLNSEIKIGRKPNSFLVHKERQVSQKKRPKGIGGKLLELLVERIQSLRVYNRDGIAQLKEANFSA
ILWGILIENLRGIIDFLKRWRDQERKFQRW