; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G06365 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G06365
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr04:19569546..19577579
RNA-Seq ExpressionClc04G06365
SyntenyClc04G06365
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]4.0e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0045897.1 Beta-galactosidase [Cucumis melo var. makuwa]4.0e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0045977.1 Beta-galactosidase [Cucumis melo var. makuwa]3.8e-28447.87Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS++I++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS
           KL +   VYDF+A          GRIL QRP P+L    F  R +  +T+   V   L    +PL         T  MG+   CVS  RN G  RIS
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS

Query:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------
         G+  V  Q T      T   +A+    QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A                                  
Subjt:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------

Query:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------
                                        D+SSG  IGTA H+RGLY LDDDTS                                           
Subjt:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------

Query:  ------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ
                     SDVWG S+VTTSS KRWFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ
Subjt:  ------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ

Query:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------
        + CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                  
Subjt:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------

Query:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR
                                                          GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+
Subjt:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR

Query:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK
              +Q                           ++END++D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK
Subjt:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK

Query:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------
        +P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L       
Subjt:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------

Query:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG
                            VLLSVAVNKDW  YQLDVKNAFLNGDL EE                     K    ++   ++   +     + Q ++QG
Subjt:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG

Query:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN
        HSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N
Subjt:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN

Query:  AKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS
         KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTS
Subjt:  AKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS

Query:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        GYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]4.0e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-27545.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +   +     ++T P +++  T            QSL L+S++GK PWILDSGATDHLTGSS+HF+SY   A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  --------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF
                                                     SDVWG S+VTTSS KRWFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF
Subjt:  --------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF

Query:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKT
        + K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT L+CLK 
Subjt:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKT

Query:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------
         YP+TRL+ +VPLR                                                                    GE+ SEE+N +       
Subjt:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------

Query:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES
           ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++D+      ++       EV       
Subjt:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES

Query:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK
        E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK
Subjt:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK

Query:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------
         VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAFLNGDL EE                  
Subjt:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------

Query:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE
           K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+E
Subjt:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE

Query:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG
        GISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPG
Subjt:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG

Query:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        KGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

TrEMBL top hitse value%identityAlignment
A0A5A7TVI6 Beta-galactosidase1.9e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

A0A5A7TVR1 Beta-galactosidase1.9e-28447.87Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS++I++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS
           KL +   VYDF+A          GRIL QRP P+L    F  R +  +T+   V   L    +PL         T  MG+   CVS  RN G  RIS
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLK---FVSRFDLRKTDPPSVQKHL----VPLE--------TSKMGNHLQCVSAVRNPGIERIS

Query:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------
         G+  V  Q T      T   +A+    QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A                                  
Subjt:  AGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA----------------------------------

Query:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------
                                        D+SSG  IGTA H+RGLY LDDDTS                                           
Subjt:  --------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-------------------------------------------

Query:  ------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ
                     SDVWG S+VTTSS KRWFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ
Subjt:  ------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQ

Query:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------
        + CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                  
Subjt:  SLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------

Query:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR
                                                          GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+
Subjt:  --------------------------------------------------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRR

Query:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK
              +Q                           ++END++D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK
Subjt:  LQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTK

Query:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------
        +P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L       
Subjt:  YPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL-------

Query:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG
                            VLLSVAVNKDW  YQLDVKNAFLNGDL EE                     K    ++   ++   +     + Q ++QG
Subjt:  --------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQG

Query:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN
        HSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N
Subjt:  HSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECN

Query:  AKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS
         KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTS
Subjt:  AKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTS

Query:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        GYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  GYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

A0A5A7TW20 Beta-galactosidase1.9e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-27545.97Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +   +     ++T P +++  T            QSL L+S++GK PWILDSGATDHLTGSS+HF+SY   A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  --------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF
                                                     SDVWG S+VTTSS KRWFVTFIDDHTRLTWV+L++DKSEV SIFQ FY TIKTQF
Subjt:  --------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQF

Query:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKT
        + K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD ILTA HLINRMPSR+LHLQT L+CLK 
Subjt:  NAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPILTATHLINRMPSRVLHLQTLLECLKT

Query:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------
         YP+TRL+ +VPLR                                                                    GE+ SEE+N +       
Subjt:  LYPTTRLIVDVPLR--------------------------------------------------------------------GESSSEETNCS-------

Query:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES
           ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++D+      ++       EV       
Subjt:  -SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRNDMT-----KDNGVTRNAEVSTGAQES

Query:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK
        E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK A+MEEM+ALEKN TWE   +PKGHK
Subjt:  EIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHK

Query:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------
         VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAFLNGDL EE                  
Subjt:  LVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAFLNGDLEEE------------------

Query:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE
           K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK                         S+E
Subjt:  ---KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK-------------------------SRE

Query:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG
        GISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVSQFMQAPYE+HME VNRILRYLK+TPG
Subjt:  GISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPG

Query:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
        KGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIWL+K
Subjt:  KGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

A0A5A7VLQ7 Beta-galactosidase1.9e-27345.2Show/hide
Query:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF
        LER+WKGEDSL+RS+LI++MEPQI KPLLYAATA D+WD  Q LYSKRQNAS LYTLRKQ+H CKQGT+DVT+YFNKLSL+WQEMD    +      +  
Subjt:  LERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRKQIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEF

Query:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q
           KL +   VYDF+A          GRILGQRP P+L  V                        + F  R ++  S +   K +   E  K   H   Q
Subjt:  NIIKLRKL-TVYDFVA----------GRILGQRPRPTLKFV------------------------SRFDLRKTDPPSVQ---KHLVPLETSKMGNHL--Q

Query:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--
        C      P         E+ ++G   +         ++T P +++  T            QSL L+SV+GK PWILDSGATDHLTGSS+HF+SY P A  
Subjt:  CVSAVRNP-------GIERISAGSYMVDHQMTNEDLRTTSPILARPYT-----------SQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHA--

Query:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------
                                                                        D+SSG  IGTA H+RGLY LDDDTS           
Subjt:  ----------------------------------------------------------------DLSSGTMIGTAWHNRGLYSLDDDTS-----------

Query:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT
                                                                               SDVWG S+VTTSS KRWFVTFIDDHTRLT
Subjt:  ----------------------------------------------------------------------SSDVWGTSRVTTSSVKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI
        WV+L++DKSEV SIFQ FY TIKTQF+ K+ IL+SDN R F  + LSEFL++KGIVHQ+ CAY PQQN V ERKNRHL+EVARSL  STSLPSYLWGD I
Subjt:  WVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSL-CSTSLPSYLWGDPI

Query:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------
        LTA HLINRMPSR+LHLQT L+CLK  YP+TRL+ +VPLR                                                            
Subjt:  LTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLR------------------------------------------------------------

Query:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN
                GE+ SEE+N +          ++ ID  P  LP   ++  +  Y   +R+      +Q                           ++END++
Subjt:  --------GESSSEETNCS--------SPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQ-----------------------HRLKLAENDRN

Query:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK
        D+      ++       EV       E  +  + K +EYD SLD+PIALRKGT SCTK+P  ++VSY+NLS +FRAFT +LD+  IPKNIY A+E PEWK
Subjt:  DMT-----KDNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWK

Query:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF
         A+MEEM+ALEKN TWE   +PKGHK VGCKWVF++KYK+D TLDR+KA L                           VLLSVAVNKDW  YQLDVKNAF
Subjt:  TAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADL---------------------------VLLSVAVNKDWSFYQLDVKNAF

Query:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--
        LNGDL EE                     K    ++   ++   +     + Q ++QGHSDHTLFTK S  GK+ +LIVYVDDIVL+GDD  EI++LK  
Subjt:  LNGDLEEE---------------------KDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLK--

Query:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS
                               S+EGISVSQRKYT+DLL ETGM GC+PAD P+E N KLG+  D+VP++KE+YQ LV +LIYL HTRPDIS+A+SVVS
Subjt:  -----------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVS

Query:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW
        QFMQAPYE+HME VNRILRYLK+TPGKGLMFRKT+RK IEAYTDSDWAGSVID+KSTSGYCTFVWGNLVTWRSKKQS+V RSSAEAEYRAMS GICEEIW
Subjt:  QFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIW

Query:  LKK
        L+K
Subjt:  LKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-5026.64Show/hide
Query:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDV G     T   K +FV F+D  T     +L+  KS+V S+FQ F    +  FN K+V L  DN R +L+N + +F   KGI +     + PQ N V 
Subjt:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVL-------------------HLQT---------------------------------
        ER  R + E AR++ S   L    WG+ +LTAT+LINR+PSR L                   HL+                                  
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVL-------------------HLQT---------------------------------

Query:  -LLECLKTLYPTTR-LIVD----VPLRG--------ESSSEETNCSSPLAPIDILPTPLP----------VLR--RISERKCYSLLIRRL----------
         L + +   +   R ++VD    V  R         + S E  N + P     I+ T  P           L+  + SE K +    R++          
Subjt:  -LLECLKTLYPTTR-LIVD----VPLRG--------ESSSEETNCSSPLAPIDILPTPLP----------VLR--RISERKCYSLLIRRL----------

Query:  ---------QSQHLNQHRL-KLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEES-DKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFK--F
                  S+  N++ L +  +  R+D   ++  + N   S  ++ +E ++E   D P + D    + I  R+     TK P  S+   +N   K   
Subjt:  ---------QSQHLNQHRL-KLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEES-DKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFK--F

Query:  RAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV----------------------
         A T+  D       I    +   W+ AI  E+ A + NNTW     P+   +V  +WVF++KY       RYKA LV                      
Subjt:  RAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV----------------------

Query:  -----LLSVAVNKDWSFYQLDVKNAFLNGDLEEE---KDTVWIETVTKSVV---RKIYYLCQVQ--------------EFTQGHSDHTLF-TKRSSLGKV
             +LS+ +  +   +Q+DVK AFLNG L+EE   +    I   + +V    + IY L Q                EF     D  ++   + ++ + 
Subjt:  -----LLSVAVNKDWSFYQLDVKNAFLNGDLEEE---KDTVWIETVTKSVV---RKIYYLCQVQ--------------EFTQGHSDHTLF-TKRSSLGKV

Query:  VVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPM--ECNAKL--GDPVDKVPI
        + +++YVDD+V++  D   +N  K                           + I +SQ  Y   +L +  M  C     P+  + N +L   D     P 
Subjt:  VVLIVYVDDIVLSGDDAAEINKLK-------------------------SREGISVSQRKYTIDLLKETGMTGCKPADNPM--ECNAKL--GDPVDKVPI

Query:  NKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKT---DRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWG
             +SL+  L+Y ++ TRPD++ A++++S++      E  + + R+LRYLK T    L+F+K    + K I  Y DSDWAGS ID+KST+GY   ++ 
Subjt:  NKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKT---DRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWG

Query:  -NLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLK
         NL+ W +K+Q+ V  SS EAEY A+ + + E +WLK
Subjt:  -NLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-6628.55Show/hide
Query:  DTSSSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQ
        D   SDV G   + +    ++FVTFIDD +R  WV++L  K +V  +FQ+F+  ++ +   K+  L+SDN   + +    E+ S+ GI H+      PQ 
Subjt:  DTSSSDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQ

Query:  NEVVERKNRHLLEVARS-LCSTSLPSYLWGDPILTATHLINRMPSRVLHL---QTLLECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDIL----
        N V ER NR ++E  RS L    LP   WG+ + TA +LINR PS  L     + +    +  Y   ++            + T       P   +    
Subjt:  NEVVERKNRHLLEVARS-LCSTSLPSYLWGDPILTATHLINRMPSRVLHL---QTLLECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDIL----

Query:  ---------PTPLPVLR------RISERKCYSLLIRRLQSQHL-------NQHRLKLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEESDKPEEYDAS
                 P    V+R      R SE +  + +  ++++  +       +      +     D   + G      +  G Q  E VEE     +  +  
Subjt:  ---------PTPLPVLR------RISERKCYSLLIRRLQSQHL-------NQHRLKLAENDRNDMTKDNGVTRNAEVSTGAQESEIVEEESDKPEEYDAS

Query:  LDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPE---WKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYK
          +  + R   ES  +YP+  +V             +S D    P+++   +  PE      A+ EEM +L+KN T++ V +PKG + + CKWVF +K  
Subjt:  LDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPE---WKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYK

Query:  SDETLDRYKADLV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----------------------KDTVWIETV
         D  L RYKA LV                           +LS+A + D    QLDVK AFL+GDLEEE                      K    ++  
Subjt:  SDETLDRYKADLV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----------------------KDTVWIETV

Query:  TKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKS-----------------------REGIS----VSQRKYT
         +    K     + Q + + +SD  ++ KR S    ++L++YVDD+++ G D   I KLK                        RE  S    +SQ KY 
Subjt:  TKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKS-----------------------REGIS----VSQRKYT

Query:  IDLLKETGMTGCKPADNPMECNAKLGDPV------DKVPINKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGL
          +L+   M   KP   P+  + KL   +      +K  + K  Y S V  L+Y +V TRPDI++A+ VVS+F++ P +EH E V  ILRYL+ T G  L
Subjt:  IDLLKETGMTGCKPADNPMECNAKLGDPV------DKVPINKERYQSLVARLIY-LVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGL

Query:  MFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK
         F  +D   ++ YTD+D AG + ++KS++GY     G  ++W+SK Q  V  S+ EAEY A ++   E IWLK+
Subjt:  MFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKK

P92519 Uncharacterized mitochondrial protein AtMg008108.4e-3234.5Show/hide
Query:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQ
        L++YVDDI+L+G     +N L                             G+ +SQ KY   +L   GM  CKP   P+           K P +   ++
Subjt:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQ

Query:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ
        S+V  L YL  TRPDISYA+++V Q M  P     +++ R+LRY+K T   GL   K  +  ++A+ DSDWAG    ++ST+G+CTF+  N+++W +K+Q
Subjt:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ

Query:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR
          V RSS E EYRA++    E  W    R
Subjt:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-6727.63Show/hide
Query:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDVW +S + +    R++V F+D  TR TW++ L  KS+V   F  F   ++ +F  ++    SDN   F+   L E+ S  GI H +   + P+ N + 
Subjt:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLLE------------------CLKTLYP------------------------
        ERK+RH++E   +L S  S+P   W      A +LINR+P+ +L L++  +                  C   L P                        
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLLE------------------CLKTLYP------------------------

Query:  -------TTRLIVDVPLR-GESSSEETNCSSPLAPID--------------ILPTPLPVL---------RRISERKCYSLLIRRLQSQHLN---QHRLKL
               T+RL +   +R  E+    +N  + L+P+                LPT  PVL            +     S   R  Q    N         
Subjt:  -------TTRLIVDVPLR-GESSSEETNCSSPLAPID--------------ILPTPLPVL---------RRISERKCYSLLIRRLQSQHLN---QHRLKL

Query:  AENDRNDMTKDNG---VTRNAEVSTGAQESEIVEEE---SDKPEEYDASLDMPIALRKGTESCTKYPTYSFVS-------------------------YN
          +      + NG    T+  +  T    S+   +    ++ P +   SL  P      + S T   + S  S                          N
Subjt:  AENDRNDMTKDNG---VTRNAEVSTGAQESEIVEEE---SDKPEEYDASLDMPIALRKGTESCTKYPTYSFVS-------------------------YN

Query:  NLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGH-KLVGCKWVFTMKYKSDETLDRYKADLV---
          S   RA              VSL   + P+    A++   W+ A+  E+ A   N+TW+ V  P  H  +VGC+W+FT KY SD +L+RYKA LV   
Subjt:  NLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGH-KLVGCKWVFTMKYKSDETLDRYKADLV---

Query:  ------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVVRK----IYYLCQVQE--------------
                                +L VAV++ W   QLDV NAFL G L ++    +   +I+    + V K    +Y L Q                 
Subjt:  ------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVVRK----IYYLCQVQE--------------

Query:  FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEIN----------KLKSRE---------------GISVSQRKYTIDLLKETGMTGCKPADNP
        F    SD +LF  +     +V ++VYVDDI+++G+D   ++           +K  E               G+ +SQR+Y +DLL  T M   KP   P
Subjt:  FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEIN----------KLKSRE---------------GISVSQRKYTIDLLKETGMTGCKPADNP

Query:  MECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDK
        M  + KL         +   Y+ +V  L YL  TRPDISYA++ +SQFM  P EEH++ + RILRYL  TP  G+  +K +   + AY+D+DWAG   D 
Subjt:  MECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDK

Query:  KSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
         ST+GY  ++  + ++W SKKQ  VVRSS EAEYR+++    E  W+
Subjt:  KSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.0e-6327.43Show/hide
Query:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV
        SDVW +S + +    R++V F+D  TR TW++ L  KS+V   F  F + ++ +F  ++  L SDN   F+   L ++LS  GI H +   + P+ N + 
Subjt:  SDVWGTSRVTTSSVKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVV

Query:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLLE------------------CLKTLYPTTR---------------------
        ERK+RH++E+  +L S  S+P   W      A +LINR+P+ +L LQ+  +                  C   L P  R                     
Subjt:  ERKNRHLLEVARSLCS-TSLPSYLWGDPILTATHLINRMPSRVLHLQTLLE------------------CLKTLYPTTR---------------------

Query:  -LIVDVPLR----------------------GESSSEETNCSS----------PLAPIDILPTP--LPVLRRISERKCYS---LLIRRLQSQHLNQHRLK
         L + +P                        G S+S+E    S          P  P+ +LP P  L      S R   S   L   ++ S +L    + 
Subjt:  -LIVDVPLR----------------------GESSSEETNCSS----------PLAPIDILPTP--LPVLRRISERKCYS---LLIRRLQSQHLNQHRLK

Query:  LAENDRNDMTKDNGVTRNA---EVSTGAQESEIV-------------EEESDKPEEYDASLDMPIALRKGTE------SCTKYPTYSFV-----------
           +        NG    A   +       S I+              + S  P+   +S  +P      +E      S T  P    V           
Subjt:  LAENDRNDMTKDNGVTRNA---EVSTGAQESEIV-------------EEESDKPEEYDASLDMPIALRKGTE------SCTKYPTYSFV-----------

Query:  --SYNNLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV-TIPKGHKLVGCKWVFTMKYKSDETLDRYKAD
            N  S   RA            +  SL   + P+    AM+   W+ A+  E+ A   N+TW+ V   P    +VGC+W+FT K+ SD +L+RYKA 
Subjt:  --SYNNLSFKFRA------------FTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV-TIPKGHKLVGCKWVFTMKYKSDETLDRYKAD

Query:  LV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVV----RKIYYLCQVQE---------
        LV                           +L VAV++ W   QLDV NAFL G L +E    +   +++      V    + IY L Q            
Subjt:  LV---------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE----KDTVWIETVTKSVV----RKIYYLCQVQE---------

Query:  -----FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAA----EINKLKSR---------------------EGISVSQRKYTIDLLKETGMTGCK
             F    SD +LF  +     ++ ++VYVDDI+++G+D       ++ L  R                     +G+ +SQR+YT+DLL  T M   K
Subjt:  -----FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAA----EINKLKSR---------------------EGISVSQRKYTIDLLKETGMTGCK

Query:  PADNPMECNAKLG-DPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA
        P   PM  + KL      K+P +   Y+ +V  L YL  TRPD+SYA++ +SQ+M  P ++H   + R+LRYL  TP  G+  +K +   + AY+D+DWA
Subjt:  PADNPMECNAKLG-DPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA

Query:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
        G   D  ST+GY  ++  + ++W SKKQ  VVRSS EAEYR+++    E  W+
Subjt:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.5e-6433.33Show/hide
Query:  SCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV--
        S T +    F+SY  +S  + +F V +     P     A E   W  A+ +E+ A+E  +TWE  T+P   K +GCKWV+ +KY SD T++RYKA LV  
Subjt:  SCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV--

Query:  -------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE-------------KDTVWIETVTKSVVRKIYYLCQVQE--------
                                 +L+++   +++ +QLD+ NAFLNGDL+EE              D++    V   + + IY L Q           
Subjt:  -------------------------LLSVAVNKDWSFYQLDVKNAFLNGDLEEE-------------KDTVWIETVTKSVVRKIYYLCQVQE--------

Query:  ------FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKSR-------------------------EGISVSQRKYTIDLLKETGMTGC
              F Q HSDHT F K ++   + VL VYVDDI++  ++ A +++LKS+                          GI++ QRKY +DLL ETG+ GC
Subjt:  ------FTQGHSDHTLFTKRSSLGKVVVLIVYVDDIVLSGDDAAEINKLKSR-------------------------EGISVSQRKYTIDLLKETGMTGC

Query:  KPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA
        KP+  PM+ +           ++ + Y+ L+ RL+YL  TR DIS+A++ +SQF +AP   H + V +IL Y+K T G+GL +       ++ ++D+ + 
Subjt:  KPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWA

Query:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL
             ++ST+GYC F+  +L++W+SKKQ +V +SSAEAEYRA+S    E +WL
Subjt:  GSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWL

ATMG00240.1 Gag-Pol-related retrotransposon family protein7.5e-1237.8Show/hide
Query:  IYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFV
        +YL  TRPD+++A++ +SQF  A     M+ V ++L Y+K T G+GL +  T    ++A+ DSDWA     ++S +G+C+ V
Subjt:  IYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein5.9e-3334.5Show/hide
Query:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQ
        L++YVDDI+L+G     +N L                             G+ +SQ KY   +L   GM  CKP   P+           K P +   ++
Subjt:  LIVYVDDIVLSGDDAAEINKL-------------------------KSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQ

Query:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ
        S+V  L YL  TRPDISYA+++V Q M  P     +++ R+LRY+K T   GL   K  +  ++A+ DSDWAG    ++ST+G+CTF+  N+++W +K+Q
Subjt:  SLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHMEVVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQ

Query:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR
          V RSS E EYRA++    E  W    R
Subjt:  SMVVRSSAEAEYRAMSKGICEEIWLKKKR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1141.3Show/hide
Query:  TYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV
        T S    N L+ K+ + T++      PK++  A++ P W  A+ EE+ AL +N TW  V  P    ++GCKWVF  K  SD TLDR KA LV
Subjt:  TYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECVTIPKGHKLVGCKWVFTMKYKSDETLDRYKADLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCGTGGTGACGAAACCCTAGAAACCCTAGAAACCCTAGCAGTCATTGACGAAAACGAATCTCAGCACGATCTGCCGCCATCGTCGATGGAGTCCGTTGT
CACCGTCGAATCCGCCGTCCAAGAAGCTGTCAACAGCTTCAAGGCTGCCGCAAGTCAAAACCCTAACGCCACGGCGCCGATGGGATATGTAGCAGCCGCAACTCCCACCG
TTCGGAACGCCACCGCCGCTGCTGTCGGGTCGTCAGAACAAATACAGTCGTCGAAGTCTCGCGTGTCTGCTCCCTTCTTCGTCGCGAGTTCCTCATCTTCGTCGTCGGGG
TTTGGTGGCCAGCCGAGGTCCGATTGTCTTCCATATGCTGCTACCAGCACTGCCCTCTGTCCCAAAGTCGCCGGCGCCCAGTCTTTACTTCCGACCACCGTGGCTACCAC
CAGATCTGGTGGCTTTTACTCGATTCCAAGGCTACTGCCGGATCTGGAGCGCATCTGGAAGGGAGAGGACTCCTTACTTCGTTCTCTGTTGATTCACAACATGGAACCTC
AAATCGTAAAACCTCTGTTGTATGCCGCTACTGCTCTAGACATCTGGGATGCAGTTCAGAAGCTATATTCCAAACGACAGAATGCATCGTGGCTTTACACCTTAAGGAAA
CAGATTCATGAATGCAAGCAAGGAACTATGGATGTCACATCATATTTTAACAAACTATCCCTTATTTGGCAAGAAATGGATTGTGCCGAGAACTCATCTGGGATTGGCTA
TATGGAGGAGTTCAATATTATAAAATTGAGGAAATTGACCGTGTATGATTTTGTAGCAGGAAGGATACTGGGACAGAGACCGAGACCAACTCTTAAGTTTGTTTCGAGGT
TCGACTTGAGGAAGACAGATCCACCTTCAGTGCAAAAACATCTGGTTCCACTGGAGACAAGCAAAATGGGAAACCACCTCCAGTGTGTGAGCGCTGTAAGAAACCCTGGC
ATAGAAAGGATCAGTGCTGGAAGTTACATGGTCGACCACCAAATGACAAACGAGGACCTTCGAACGACAAGCCCAATCCTAGCTAGGCCCTATACATCTCAATCCTTGAG
TCTCCTTAGTGTTAATGGCAAGAAGCCCTGGATTCTTGATTCAGGGGCTACAGATCATCTGACTGGATCCTCTGATCATTTTCTCTCATATCATCCACATGCTGACTTAA
GCTCGGGGACGATGATTGGCACTGCCTGGCACAATAGGGGACTCTACTCCCTTGATGATGATACTTCTTCTAGTGATGTTTGGGGAACCTCGAGAGTTACTACCTCTTCG
GTTAAGCGCTGGTTTGTGACCTTCATCGATGACCACACTCGTCTTACTTGGGTCTTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTTACACTAC
CATTAAGACTCAATTTAATGCCAAAATGGTCATCCTTCAGAGTGACAATGATCGTGGGTTCCTTACTAATACCCTTAGTGAGTTTTTATCCACTAAAGGTATTGTTCATC
AGAGCTTGTGTGCCTATAACCCCCAACAAAATGAAGTGGTTGAAAGAAAAAATCGTCATCTCCTCGAGGTTGCCCGATCCCTATGCTCTACCTCTCTTCCATCGTACTTG
TGGGGGGACCCAATTTTGACTGCAACCCACCTTATAAATCGGATGCCTTCCCGTGTGCTCCACCTCCAAACCCTTCTTGAGTGCCTCAAAACCCTCTACCCTACCACACG
CCTTATTGTTGATGTTCCTCTCCGGGGAGAGAGTAGTAGTGAAGAGACTAACTGTTCATCACCTTTAGCTCCCATTGATATTCTTCCTACACCCCTTCCTGTCCTAAGAA
GAATCTCAGAAAGGAAGTGTTATTCCCTACTGATTCGCCGGCTCCAGTCCCAACACCTGAACCAACACAGGCTTAAGTTGGCTGAGAATGACAGGAATGATATGACAAAA
GACAACGGGGTTACTAGAAATGCTGAGGTGTCAACAGGAGCACAGGAAAGTGAGATCGTTGAGGAGGAATCCGACAAACCAGAGGAGTATGATGCCTCTCTTGACATGCC
CATTGCTCTGAGAAAGGGCACCGAGTCCTGCACCAAATATCCCACGTATAGCTTCGTCTCTTATAATAATCTGTCTTTTAAGTTCAGAGCATTTACAGTCAGTCTTGACA
CTGTAACAATACCAAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGACTGCCATCATGGAAGAAATGAGGGCTCTTGAGAAGAATAATACATGGGAATGTGTA
ACCATCCCTAAAGGACATAAACTAGTTGGGTGCAAGTGGGTGTTTACAATGAAGTATAAATCTGATGAAACTCTAGACAGATATAAGGCAGACTTGGTCCTTCTTTCGGT
AGCTGTCAACAAAGACTGGTCTTTTTATCAACTGGATGTGAAGAATGCATTTTTAAATGGTGATTTAGAAGAAGAGAAAGACACTGTATGGATTGAAACAGTCACCAAGA
GTGTGGTTCGAAAGATTTACTACCTTTGTCAAGTTCAAGAGTTCACTCAAGGACACTCTGATCACACTTTGTTTACAAAAAGGTCAAGTTTGGGTAAAGTTGTCGTCCTA
ATTGTGTATGTTGATGACATTGTCTTATCCGGAGACGATGCTGCTGAGATCAACAAACTAAAGTCGAGAGAAGGAATCTCAGTTTCACAACGGAAATACACTATTGATCT
TCTGAAAGAAACAGGTATGACTGGATGTAAGCCTGCTGACAACCCTATGGAATGTAATGCCAAATTGGGAGACCCTGTTGACAAAGTCCCAATTAATAAAGAAAGGTACC
AAAGCCTTGTTGCAAGACTAATTTACCTAGTTCACACTCGACCAGACATTTCCTATGCTATCAGTGTTGTTAGCCAATTTATGCAGGCACCCTACGAAGAACACATGGAA
GTTGTAAACCGAATTCTGAGGTATTTGAAAAGTACTCCAGGAAAAGGTTTGATGTTTAGAAAAACCGACAGGAAATGCATTGAGGCCTATACCGACTCTGACTGGGCAGG
ATCAGTAATTGACAAAAAATCTACATCTGGGTATTGTACCTTTGTTTGGGGTAATCTAGTAACGTGGAGAAGTAAGAAGCAAAGTATGGTGGTTAGAAGCAGCGCTGAAG
CTGAATATAGAGCCATGAGTAAGGGTATTTGTGAAGAAATCTGGTTGAAGAAAAAGAGAGATGAAACAAACGAAGTATCACTTCGAAAGATCAAAGATTGTGCTCACCTA
AAATCTGTTGAAGGATCGGTACCCTGCCAACCCATGTCCTTCCAGAGCTCAGATTTAAGAGGTGGAAGCTCCCTGTCTGGATATGCCAACCTCCATAACCTTTTCAGAGC
ATCCTACGCAATAGCAAGATTTGGAGCTAAAACGAGTCAAAATGGTCCAATAATGAATACTAAAGACAGACCGGGACCAAAGGATCAGTGGGGAAGAAAACGAGGAAAAG
AAGCAAAGCAGCCCAAAAACAAGGCAGTGCCGAGGCACAAAAACCCAAGACCTACCTTATCAGGCAGCGCCACTCTGTGTTGCGGCAAAAGGTCTATTTTTGGCAACAAC
TTGGTGATCGAGCTATGTTCTTTCTACAACACACTCCCCAACAAACCTCAAGAAATGAACTTCCTCTCTGAAAGGCGGACAGAGCCTAAAGAAAGGCTTACAACAAAAGC
CCTCAATAGGGTTAGGACTCCAAAGACGAGTATCCCATCTCCCAACCCTAAAATTGCCTCTTACAACAAAGAAAGAAGAGAGGCAACCTCCGTCGTTTCCCTAAGCGTTC
CATTGATCTTCGTAATGCAACGGACCAAATGGGAAAAAGAAGAGAGCTCAAGTGAAATATCTTTCCACGGGTTTCAGTGTGTGCCTTTAGCCCCTTTAACCACCCAATCA
AAAGGATATTTCCAATCTCCAACACTCCCTGATTCAGGAGGCACCCAACAACCTCCCAACTTACCAGATGAGAACCCTTCCCTTCATCCATCCATTCCCAAAGAAAGTCT
CTCATGTACTTCTCAAGGCTCTTACAGATTGAACTCAGAGATCAAAATAGGGAGAAAACCTAATTCTTTCCTTGTTCACAAGGAAAGACAAGTTTCTCAAAAAAAGCGGC
CAAAGGGCATTGGGGGTAAACTTTTGGAGCTGTTGGTTGAGAGAATTCAAAGTCTCCGGGTTTATAATAGAGATGGCATAGCTCAACTGAAGGAGGCTAACTTCTCTGCA
ATCTTGTGGGGCATTTTGATTGAAAATTTGAGAGGAATAATAGATTTTTTAAAGAGGTGGAGAGATCAGGAGAGGAAGTTCCAGAGGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCGTGGTGACGAAACCCTAGAAACCCTAGAAACCCTAGCAGTCATTGACGAAAACGAATCTCAGCACGATCTGCCGCCATCGTCGATGGAGTCCGTTGT
CACCGTCGAATCCGCCGTCCAAGAAGCTGTCAACAGCTTCAAGGCTGCCGCAAGTCAAAACCCTAACGCCACGGCGCCGATGGGATATGTAGCAGCCGCAACTCCCACCG
TTCGGAACGCCACCGCCGCTGCTGTCGGGTCGTCAGAACAAATACAGTCGTCGAAGTCTCGCGTGTCTGCTCCCTTCTTCGTCGCGAGTTCCTCATCTTCGTCGTCGGGG
TTTGGTGGCCAGCCGAGGTCCGATTGTCTTCCATATGCTGCTACCAGCACTGCCCTCTGTCCCAAAGTCGCCGGCGCCCAGTCTTTACTTCCGACCACCGTGGCTACCAC
CAGATCTGGTGGCTTTTACTCGATTCCAAGGCTACTGCCGGATCTGGAGCGCATCTGGAAGGGAGAGGACTCCTTACTTCGTTCTCTGTTGATTCACAACATGGAACCTC
AAATCGTAAAACCTCTGTTGTATGCCGCTACTGCTCTAGACATCTGGGATGCAGTTCAGAAGCTATATTCCAAACGACAGAATGCATCGTGGCTTTACACCTTAAGGAAA
CAGATTCATGAATGCAAGCAAGGAACTATGGATGTCACATCATATTTTAACAAACTATCCCTTATTTGGCAAGAAATGGATTGTGCCGAGAACTCATCTGGGATTGGCTA
TATGGAGGAGTTCAATATTATAAAATTGAGGAAATTGACCGTGTATGATTTTGTAGCAGGAAGGATACTGGGACAGAGACCGAGACCAACTCTTAAGTTTGTTTCGAGGT
TCGACTTGAGGAAGACAGATCCACCTTCAGTGCAAAAACATCTGGTTCCACTGGAGACAAGCAAAATGGGAAACCACCTCCAGTGTGTGAGCGCTGTAAGAAACCCTGGC
ATAGAAAGGATCAGTGCTGGAAGTTACATGGTCGACCACCAAATGACAAACGAGGACCTTCGAACGACAAGCCCAATCCTAGCTAGGCCCTATACATCTCAATCCTTGAG
TCTCCTTAGTGTTAATGGCAAGAAGCCCTGGATTCTTGATTCAGGGGCTACAGATCATCTGACTGGATCCTCTGATCATTTTCTCTCATATCATCCACATGCTGACTTAA
GCTCGGGGACGATGATTGGCACTGCCTGGCACAATAGGGGACTCTACTCCCTTGATGATGATACTTCTTCTAGTGATGTTTGGGGAACCTCGAGAGTTACTACCTCTTCG
GTTAAGCGCTGGTTTGTGACCTTCATCGATGACCACACTCGTCTTACTTGGGTCTTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTTACACTAC
CATTAAGACTCAATTTAATGCCAAAATGGTCATCCTTCAGAGTGACAATGATCGTGGGTTCCTTACTAATACCCTTAGTGAGTTTTTATCCACTAAAGGTATTGTTCATC
AGAGCTTGTGTGCCTATAACCCCCAACAAAATGAAGTGGTTGAAAGAAAAAATCGTCATCTCCTCGAGGTTGCCCGATCCCTATGCTCTACCTCTCTTCCATCGTACTTG
TGGGGGGACCCAATTTTGACTGCAACCCACCTTATAAATCGGATGCCTTCCCGTGTGCTCCACCTCCAAACCCTTCTTGAGTGCCTCAAAACCCTCTACCCTACCACACG
CCTTATTGTTGATGTTCCTCTCCGGGGAGAGAGTAGTAGTGAAGAGACTAACTGTTCATCACCTTTAGCTCCCATTGATATTCTTCCTACACCCCTTCCTGTCCTAAGAA
GAATCTCAGAAAGGAAGTGTTATTCCCTACTGATTCGCCGGCTCCAGTCCCAACACCTGAACCAACACAGGCTTAAGTTGGCTGAGAATGACAGGAATGATATGACAAAA
GACAACGGGGTTACTAGAAATGCTGAGGTGTCAACAGGAGCACAGGAAAGTGAGATCGTTGAGGAGGAATCCGACAAACCAGAGGAGTATGATGCCTCTCTTGACATGCC
CATTGCTCTGAGAAAGGGCACCGAGTCCTGCACCAAATATCCCACGTATAGCTTCGTCTCTTATAATAATCTGTCTTTTAAGTTCAGAGCATTTACAGTCAGTCTTGACA
CTGTAACAATACCAAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGACTGCCATCATGGAAGAAATGAGGGCTCTTGAGAAGAATAATACATGGGAATGTGTA
ACCATCCCTAAAGGACATAAACTAGTTGGGTGCAAGTGGGTGTTTACAATGAAGTATAAATCTGATGAAACTCTAGACAGATATAAGGCAGACTTGGTCCTTCTTTCGGT
AGCTGTCAACAAAGACTGGTCTTTTTATCAACTGGATGTGAAGAATGCATTTTTAAATGGTGATTTAGAAGAAGAGAAAGACACTGTATGGATTGAAACAGTCACCAAGA
GTGTGGTTCGAAAGATTTACTACCTTTGTCAAGTTCAAGAGTTCACTCAAGGACACTCTGATCACACTTTGTTTACAAAAAGGTCAAGTTTGGGTAAAGTTGTCGTCCTA
ATTGTGTATGTTGATGACATTGTCTTATCCGGAGACGATGCTGCTGAGATCAACAAACTAAAGTCGAGAGAAGGAATCTCAGTTTCACAACGGAAATACACTATTGATCT
TCTGAAAGAAACAGGTATGACTGGATGTAAGCCTGCTGACAACCCTATGGAATGTAATGCCAAATTGGGAGACCCTGTTGACAAAGTCCCAATTAATAAAGAAAGGTACC
AAAGCCTTGTTGCAAGACTAATTTACCTAGTTCACACTCGACCAGACATTTCCTATGCTATCAGTGTTGTTAGCCAATTTATGCAGGCACCCTACGAAGAACACATGGAA
GTTGTAAACCGAATTCTGAGGTATTTGAAAAGTACTCCAGGAAAAGGTTTGATGTTTAGAAAAACCGACAGGAAATGCATTGAGGCCTATACCGACTCTGACTGGGCAGG
ATCAGTAATTGACAAAAAATCTACATCTGGGTATTGTACCTTTGTTTGGGGTAATCTAGTAACGTGGAGAAGTAAGAAGCAAAGTATGGTGGTTAGAAGCAGCGCTGAAG
CTGAATATAGAGCCATGAGTAAGGGTATTTGTGAAGAAATCTGGTTGAAGAAAAAGAGAGATGAAACAAACGAAGTATCACTTCGAAAGATCAAAGATTGTGCTCACCTA
AAATCTGTTGAAGGATCGGTACCCTGCCAACCCATGTCCTTCCAGAGCTCAGATTTAAGAGGTGGAAGCTCCCTGTCTGGATATGCCAACCTCCATAACCTTTTCAGAGC
ATCCTACGCAATAGCAAGATTTGGAGCTAAAACGAGTCAAAATGGTCCAATAATGAATACTAAAGACAGACCGGGACCAAAGGATCAGTGGGGAAGAAAACGAGGAAAAG
AAGCAAAGCAGCCCAAAAACAAGGCAGTGCCGAGGCACAAAAACCCAAGACCTACCTTATCAGGCAGCGCCACTCTGTGTTGCGGCAAAAGGTCTATTTTTGGCAACAAC
TTGGTGATCGAGCTATGTTCTTTCTACAACACACTCCCCAACAAACCTCAAGAAATGAACTTCCTCTCTGAAAGGCGGACAGAGCCTAAAGAAAGGCTTACAACAAAAGC
CCTCAATAGGGTTAGGACTCCAAAGACGAGTATCCCATCTCCCAACCCTAAAATTGCCTCTTACAACAAAGAAAGAAGAGAGGCAACCTCCGTCGTTTCCCTAAGCGTTC
CATTGATCTTCGTAATGCAACGGACCAAATGGGAAAAAGAAGAGAGCTCAAGTGAAATATCTTTCCACGGGTTTCAGTGTGTGCCTTTAGCCCCTTTAACCACCCAATCA
AAAGGATATTTCCAATCTCCAACACTCCCTGATTCAGGAGGCACCCAACAACCTCCCAACTTACCAGATGAGAACCCTTCCCTTCATCCATCCATTCCCAAAGAAAGTCT
CTCATGTACTTCTCAAGGCTCTTACAGATTGAACTCAGAGATCAAAATAGGGAGAAAACCTAATTCTTTCCTTGTTCACAAGGAAAGACAAGTTTCTCAAAAAAAGCGGC
CAAAGGGCATTGGGGGTAAACTTTTGGAGCTGTTGGTTGAGAGAATTCAAAGTCTCCGGGTTTATAATAGAGATGGCATAGCTCAACTGAAGGAGGCTAACTTCTCTGCA
ATCTTGTGGGGCATTTTGATTGAAAATTTGAGAGGAATAATAGATTTTTTAAAGAGGTGGAGAGATCAGGAGAGGAAGTTCCAGAGGTGGTAG
Protein sequenceShow/hide protein sequence
MVSERGDETLETLETLAVIDENESQHDLPPSSMESVVTVESAVQEAVNSFKAAASQNPNATAPMGYVAAATPTVRNATAAAVGSSEQIQSSKSRVSAPFFVASSSSSSSG
FGGQPRSDCLPYAATSTALCPKVAGAQSLLPTTVATTRSGGFYSIPRLLPDLERIWKGEDSLLRSLLIHNMEPQIVKPLLYAATALDIWDAVQKLYSKRQNASWLYTLRK
QIHECKQGTMDVTSYFNKLSLIWQEMDCAENSSGIGYMEEFNIIKLRKLTVYDFVAGRILGQRPRPTLKFVSRFDLRKTDPPSVQKHLVPLETSKMGNHLQCVSAVRNPG
IERISAGSYMVDHQMTNEDLRTTSPILARPYTSQSLSLLSVNGKKPWILDSGATDHLTGSSDHFLSYHPHADLSSGTMIGTAWHNRGLYSLDDDTSSSDVWGTSRVTTSS
VKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIKTQFNAKMVILQSDNDRGFLTNTLSEFLSTKGIVHQSLCAYNPQQNEVVERKNRHLLEVARSLCSTSLPSYL
WGDPILTATHLINRMPSRVLHLQTLLECLKTLYPTTRLIVDVPLRGESSSEETNCSSPLAPIDILPTPLPVLRRISERKCYSLLIRRLQSQHLNQHRLKLAENDRNDMTK
DNGVTRNAEVSTGAQESEIVEEESDKPEEYDASLDMPIALRKGTESCTKYPTYSFVSYNNLSFKFRAFTVSLDTVTIPKNIYVAMEIPEWKTAIMEEMRALEKNNTWECV
TIPKGHKLVGCKWVFTMKYKSDETLDRYKADLVLLSVAVNKDWSFYQLDVKNAFLNGDLEEEKDTVWIETVTKSVVRKIYYLCQVQEFTQGHSDHTLFTKRSSLGKVVVL
IVYVDDIVLSGDDAAEINKLKSREGISVSQRKYTIDLLKETGMTGCKPADNPMECNAKLGDPVDKVPINKERYQSLVARLIYLVHTRPDISYAISVVSQFMQAPYEEHME
VVNRILRYLKSTPGKGLMFRKTDRKCIEAYTDSDWAGSVIDKKSTSGYCTFVWGNLVTWRSKKQSMVVRSSAEAEYRAMSKGICEEIWLKKKRDETNEVSLRKIKDCAHL
KSVEGSVPCQPMSFQSSDLRGGSSLSGYANLHNLFRASYAIARFGAKTSQNGPIMNTKDRPGPKDQWGRKRGKEAKQPKNKAVPRHKNPRPTLSGSATLCCGKRSIFGNN
LVIELCSFYNTLPNKPQEMNFLSERRTEPKERLTTKALNRVRTPKTSIPSPNPKIASYNKERREATSVVSLSVPLIFVMQRTKWEKEESSSEISFHGFQCVPLAPLTTQS
KGYFQSPTLPDSGGTQQPPNLPDENPSLHPSIPKESLSCTSQGSYRLNSEIKIGRKPNSFLVHKERQVSQKKRPKGIGGKLLELLVERIQSLRVYNRDGIAQLKEANFSA
ILWGILIENLRGIIDFLKRWRDQERKFQRW