CuGenDBv2

Clc01G16210 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G16210
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Integrase catalytic domain-containing protein
Genome location: ClcChr01:29028489..29031431
RNA-Seq Expression: Clc01G16210
Synteny: Clc01G16210
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera] (e value 6.9e-157, 40.04% identity)
Query:  FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
        +  ++ +++ D +H++KT++D RI+KFLVGLNVEFDEVR RI+ +  LP++ + FS+VRREES+RNVM+GKK    +++ S LVT      K +    K+
Subjt:  FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT

Query:  HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
         ++P VWCD CNKP HTRE CWK+HGK  NWK  K  ++        AN  ++S    EQ++ +L LLKSN T G  SVSLA TGN   ALSC   S+PW
Subjt:  HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW

Query:  IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
        IIDSGA+DHMT+ S +F+SYSP                                                         V   E   +  D+ S +TIG 
Subjt:  IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR

Query:  ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
        ARMI+GLYYF++   S+K  QGLSS+SSL V++ IM WH +LG P+F YLKHLFP LF+ +D   FQCE C   K  R T++ K Y  S PFYL H+DVW
Subjt:  ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW

Query:  GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
        GPSKV T +GK+W                                 IE QFQTKI IL SDNGT++FN+   TF + KGI+HQ++C DTPQQNG+A+RKN
Subjt:  GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN

Query:  RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
        +HLLE+ARA+MF M++PKYL GDA+LTA+YLINRMPTK  + ++P                                                       
Subjt:  RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------

Query:  ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
                         +S MEN                                                S  G    E L+      + E   Y+R+ 
Subjt:  ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN

Query:  RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
         + R ++Q +     Q                   TP++   +S   LS PS     P +S                 DLD+PIA RKG+  CTK+LIA 
Subjt:  RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN

Query:  YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        Y+SY  LSDNH+AFT+ I+ L +PRNIQEAL++ +WKLAV +EMNAL K+GTW+ VDLP +KK VGCKWVFTIK  ADGS+ERYKARLVAKGFT
Subjt:  YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

CAN79134.1 hypothetical protein VITISV_000843 [Vitis vinifera] (e value 5.8e-180, 41.76% identity)
Query:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
        ENSMVMTWL+NSM EDIN NYMCY T +ELW++V QMY DLGNQSQ+FEL LKLG++RQG ++VT+YF+SLK+IWQ+LD F TYEWKS  D  H++KT++
Subjt:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD

Query:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
        D RI+KFL GLNVEFDE                                                           K+ ++P  WCD CNKP HTRE CW
Subjt:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW

Query:  KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
        K+HGKP NWK  K  ++         N  ++SP   EQ++  L LLKSN T G  SVSLA TGN   ALSC   S+PWI+D GA+DHMT+ S +F+SYSP
Subjt:  KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP

Query:  VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
            +                          KSVL    +DQ SG+TIG ARMIDGLYYF++   S+K  QGLSS+SSL V++ IM WH RLGHP+F YL
Subjt:  VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL

Query:  KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
        KHLFP LF+ +D   FQCE C   K  R T++PK Y  S PFYL H+DVWGPSKV T +GK+W                                 IE Q
Subjt:  KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ

Query:  FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
        FQTKI IL SDNG E+FN+   TF ++KGI+HQ++C DT +QNG+AE KN+HLLE+ARA+MF M++PKYL  DA+LTA+YLINRMPTK  + ++P     
Subjt:  FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----

Query:  ------------------------------------------------------------------SISSMEN---------------------------
                                                                           +S MEN                           
Subjt:  ------------------------------------------------------------------SISSMEN---------------------------

Query:  -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
              S   E  +T  T  + E        L+    RN  +     R +    ++DQ   P +G PK   N  +++S                      
Subjt:  -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------

Query:  PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
        PS     P +S                 DLD+PIA RKG+  CTK+ I+ Y+SY  LSDN++AFT+ I+ L +PRNIQE L++ +WKLAV EEMNAL K+
Subjt:  PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH

Query:  GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
        GTW+++DLP +KK VGCKWVFTIK   DGS+ERYKARLVAK
Subjt:  GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK

GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum] (e value 1.5e-204, 46.35% identity)
Query:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
        ++SVRMY+RG+                         ENSMVMTWL+NSM E+I++NY+CY TAK+LWD+V+QMYSDL NQSQV+EL L+LG ++QG +SV
Subjt:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV

Query:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
        T+YF+ LKRIWQ+LDLF+ YEWKS  D KHY KTVD  R++KFL GLNVEFDEVRGRILG++ +P + +VF++VRREESRR VM+GKK V +   V+ SA
Subjt:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA

Query:  LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
        L       K+        DK H++CD+C +  H RE C+KLHG+P N K+ K    + ++  ++AN   SSP  KEQ+D + KLL+SN + N P  ++AQ
Subjt:  LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ

Query:  TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
        TG    ALS  N S+PWIIDSGA++HMT+ S LF SY      EK                                                       
Subjt:  TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------

Query:  --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
          S +  DQ+SG+ IG AR I+GLYY DE    +KK   L S S  L V + +M WHRRLGHP+F YLK+LFP   K I+ S   CE C   K HR +F 
Subjt:  --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL

Query:  PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
         K Y  S PFYL H+DVWGPSK+ T +GK+W                                 IETQFQTKI IL SDNGTE+FN+   TFL  KGIIH
Subjt:  PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH

Query:  QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
        Q+TCRDTPQQNG+AERKNRHLLE+ RA+M SM+VPKYL G+A+LTA YLINRMPT+                                    SC SS+  
Subjt:  QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--

Query:  ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
                          P+   ME             S TGGET    LTG R+ ELK Y R+   +      +     QSD+P  GP  NS  + SP 
Subjt:  ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS

Query:  ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
            S N LP               N+ DLD+PIA RK    CTK+ I+NYLSY +LS  HKA+ S+I+NLF+PR +QEAL D NWKLAV EEM+AL K+
Subjt:  ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH

Query:  GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
         TW I D LP+ KKAVGCKWVFT+KC ADGS+ERYKARLVAKGFT
Subjt:  GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina] (e value 4.4e-164, 37.61% identity)
Query:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
        +NSM+M+WL+NSM ++I   Y+   TAK+LWD+VT+ YSDLGN +Q+++L  ++ + +QG   VT+Y++ LK +WQELD +   EW+   D   Y+K ++
Subjt:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD

Query:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
          R+++FL GL+ + DEVRGR+LGK  LP+  +VFS VRREESR+NVM+G  + ++    ++  E+  +  +    K+ +K  VWCD+C+KP HTR+ CW
Subjt:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW

Query:  KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
        KLHGKPPN K++K   ++S        +Q  +N    +S    KEQ++Q+ + L +S    NPS   SLAQ GN   AL  +     PWIIDSGATDHMT
Subjt:  KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT

Query:  SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
        S S LF SY P    +K                          SVL +                               D  SG+ IG AR +DGLYYF+
Subjt:  SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD

Query:  EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
        E  +   + Q  ++  +  +++ IM WH RLGHP+F YL+HLFP LFK  + S+FQCE C   KHHR++F  + YK S+PF LIH+D+WGPS+V   +G 
Subjt:  EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK

Query:  RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
        +W                                 I+TQFQ KI++  +DNG E+F      +  + GI+HQ++C DTPQQNGVAERKNRHLLE+AR+LM
Subjt:  RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM

Query:  FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
        F+  VPK   G+A+LTA+YLINRMPT+     SP                             +     S      L                       
Subjt:  FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------

Query:  -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
                         +T L G D                                          PEL+ YTRRN ++R  + +    QD      N 
Subjt:  -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG

Query:  PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
             +  +P S + L   +DLD+PIAQRKG+  CT + I+ Y+SYHRLS   +AFT+ ++ + +P+++Q+AL+   W+ AV  EM AL K+ TW++V L
Subjt:  PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL

Query:  PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        PE+KK VGCKW+FT+K  ADGS+ERYKARLVAKGFT
Subjt:  PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina] (e value 4.4e-164, 37.61% identity)
Query:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
        +NSM+M+WL+NSM ++I   Y+   TAK+LWD+VT+ YSDLGN +Q+++L  ++ + +QG   VT+Y++ LK +WQELD +   EW+   D   Y+K ++
Subjt:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD

Query:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
          R+++FL GL+ + DEVRGR+LGK  LP+  +VFS VRREESR+NVM+G  + ++    ++  E+  +  +    K+ +K  VWCD+C+KP HTR+ CW
Subjt:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW

Query:  KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
        KLHGKPPN K++K   ++S        +Q  +N    +S    KEQ++Q+ + L +S    NPS   SLAQ GN   AL  +     PWIIDSGATDHMT
Subjt:  KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT

Query:  SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
        S S LF SY P    +K                          SVL +                               D  SG+ IG AR +DGLYYF+
Subjt:  SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD

Query:  EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
        E  +   + Q  ++  +  +++ IM WH RLGHP+F YL+HLFP LFK  + S+FQCE C   KHHR++F  + YK S+PF LIH+D+WGPS+V   +G 
Subjt:  EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK

Query:  RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
        +W                                 I+TQFQ KI++  +DNG E+F      +  + GI+HQ++C DTPQQNGVAERKNRHLLE+AR+LM
Subjt:  RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM

Query:  FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
        F+  VPK   G+A+LTA+YLINRMPT+     SP                             +     S      L                       
Subjt:  FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------

Query:  -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
                         +T L G D                                          PEL+ YTRRN ++R  + +    QD      N 
Subjt:  -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG

Query:  PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
             +  +P S + L   +DLD+PIAQRKG+  CT + I+ Y+SYHRLS   +AFT+ ++ + +P+++Q+AL+   W+ AV  EM AL K+ TW++V L
Subjt:  PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL

Query:  PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        PE+KK VGCKW+FT+K  ADGS+ERYKARLVAKGFT
Subjt:  PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GQ49 Uncharacterized protein (e value 6.0e-159, 45.01% identity)
Query:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
        ++SVRMYIRG+                         ENSMVMTWL+NSM EDI+SNYMCY TA+ELW++V QMYSDLGNQSQ+FEL LKLG+MRQG +SV
Subjt:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV

Query:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALV
        T+YF+SLKR+WQ+LDLF TYEWKS  D +H++K V+D RI+KFL GLN+E DEVRGR++G+  +P + DVFS+VRREESRRNVM+GKK    +V+SSALV
Subjt:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALV

Query:  -TESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPLKEQIDQILKLLKSN-YTGNPSVSLAQTG
          ++ + KA     +T DKP VWCD+CNKP HTRETCWK+HGKP NWKSSK  +R      +      +S  KEQ++ +L LLKSN  +G PSVS+AQTG
Subjt:  -TESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPLKEQIDQILKLLKSN-YTGNPSVSLAQTG

Query:  NYPQALS-CLNSS-PWIIDSGATDHMTSFSCLFDSYSPVYSKE--------------------------KSVLPM---------DQDSGETIGRARMIDG
        N P ALS CLNSS PWIIDSGA+DHMTS    F+SYSP    E                          KSVL +         DQ SG TIG ARMI+G
Subjt:  NYPQALS-CLNSS-PWIIDSGATDHMTSFSCLFDSYSPVYSKE--------------------------KSVLPM---------DQDSGETIGRARMIDG

Query:  LYYFDEVSTSHKKIQGLSSVSSLPVQETIM---------FWHRRLGHPNFV-----YLKHLFPGLFKGIDCSVFQCEDCKHHRSTFLPKSYKPSSPFYLI
        LYYFD+  +S KK QG SS+SS+ V+E IM         FW +    PN +           P +F  I+ S+   +D    R +FLP   +      ++
Subjt:  LYYFDEVSTSHKKIQGLSSVSSLPVQETIM---------FWHRRLGHPNFV-----YLKHLFPGLFKGIDCSVFQCEDCKHHRSTFLPKSYKPSSPFYLI

Query:  HTDVWGP-SKVLTKNGKRWIETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLT
          D   P S++L    KR  E      I                                  P QN  +E  N   L I+                    
Subjt:  HTDVWGP-SKVLTKNGKRWIETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLT

Query:  AAYLINRMPTKSCRSSSPSISSMENSSTGGETLQTDLTGRDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGPKNSGISLSPSSHNTLPNVSDLDIPI
             N  P     +S P +SS   S +                                          P+  PKN                SDLDIPI
Subjt:  AAYLINRMPTKSCRSSSPSISSMENSSTGGETLQTDLTGRDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGPKNSGISLSPSSHNTLPNVSDLDIPI

Query:  AQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITN-LFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIER
        A RKG   CTKY IA Y+SY RLS+NH+AF S I++ + +PRNIQEAL+D NWKLAV+EEMNAL K+GTW++VDLP DKK VGCKWVF++K  ADGSIER
Subjt:  AQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITN-LFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIER

Query:  YKARLVAKGFT
        YKARLVAKGFT
Subjt:  YKARLVAKGFT

A0A2Z6NTX3 Integrase catalytic domain-containing protein (e value 7.3e-205, 46.35% identity)
Query:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
        ++SVRMY+RG+                         ENSMVMTWL+NSM E+I++NY+CY TAK+LWD+V+QMYSDL NQSQV+EL L+LG ++QG +SV
Subjt:  TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV

Query:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
        T+YF+ LKRIWQ+LDLF+ YEWKS  D KHY KTVD  R++KFL GLNVEFDEVRGRILG++ +P + +VF++VRREESRR VM+GKK V +   V+ SA
Subjt:  TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA

Query:  LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
        L       K+        DK H++CD+C +  H RE C+KLHG+P N K+ K    + ++  ++AN   SSP  KEQ+D + KLL+SN + N P  ++AQ
Subjt:  LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ

Query:  TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
        TG    ALS  N S+PWIIDSGA++HMT+ S LF SY      EK                                                       
Subjt:  TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------

Query:  --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
          S +  DQ+SG+ IG AR I+GLYY DE    +KK   L S S  L V + +M WHRRLGHP+F YLK+LFP   K I+ S   CE C   K HR +F 
Subjt:  --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL

Query:  PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
         K Y  S PFYL H+DVWGPSK+ T +GK+W                                 IETQFQTKI IL SDNGTE+FN+   TFL  KGIIH
Subjt:  PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH

Query:  QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
        Q+TCRDTPQQNG+AERKNRHLLE+ RA+M SM+VPKYL G+A+LTA YLINRMPT+                                    SC SS+  
Subjt:  QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--

Query:  ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
                          P+   ME             S TGGET    LTG R+ ELK Y R+   +      +     QSD+P  GP  NS  + SP 
Subjt:  ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS

Query:  ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
            S N LP               N+ DLD+PIA RK    CTK+ I+NYLSY +LS  HKA+ S+I+NLF+PR +QEAL D NWKLAV EEM+AL K+
Subjt:  ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH

Query:  GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
         TW I D LP+ KKAVGCKWVFT+KC ADGS+ERYKARLVAKGFT
Subjt:  GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

A5B9Y8 Integrase catalytic domain-containing protein (e value 3.3e-157, 40.04% identity)
Query:  FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
        +  ++ +++ D +H++KT++D RI+KFLVGLNVEFDEVR RI+ +  LP++ + FS+VRREES+RNVM+GKK    +++ S LVT      K +    K+
Subjt:  FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT

Query:  HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
         ++P VWCD CNKP HTRE CWK+HGK  NWK  K  ++        AN  ++S    EQ++ +L LLKSN T G  SVSLA TGN   ALSC   S+PW
Subjt:  HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW

Query:  IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
        IIDSGA+DHMT+ S +F+SYSP                                                         V   E   +  D+ S +TIG 
Subjt:  IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR

Query:  ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
        ARMI+GLYYF++   S+K  QGLSS+SSL V++ IM WH +LG P+F YLKHLFP LF+ +D   FQCE C   K  R T++ K Y  S PFYL H+DVW
Subjt:  ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW

Query:  GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
        GPSKV T +GK+W                                 IE QFQTKI IL SDNGT++FN+   TF + KGI+HQ++C DTPQQNG+A+RKN
Subjt:  GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN

Query:  RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
        +HLLE+ARA+MF M++PKYL GDA+LTA+YLINRMPTK  + ++P                                                       
Subjt:  RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------

Query:  ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
                         +S MEN                                                S  G    E L+      + E   Y+R+ 
Subjt:  ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN

Query:  RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
         + R ++Q +     Q                   TP++   +S   LS PS     P +S                 DLD+PIA RKG+  CTK+LIA 
Subjt:  RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN

Query:  YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        Y+SY  LSDNH+AFT+ I+ L +PRNIQEAL++ +WKLAV +EMNAL K+GTW+ VDLP +KK VGCKWVFTIK  ADGS+ERYKARLVAKGFT
Subjt:  YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

A5BNN1 Integrase catalytic domain-containing protein (e value 2.8e-180, 41.76% identity)
Query:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
        ENSMVMTWL+NSM EDIN NYMCY T +ELW++V QMY DLGNQSQ+FEL LKLG++RQG ++VT+YF+SLK+IWQ+LD F TYEWKS  D  H++KT++
Subjt:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD

Query:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
        D RI+KFL GLNVEFDE                                                           K+ ++P  WCD CNKP HTRE CW
Subjt:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW

Query:  KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
        K+HGKP NWK  K  ++         N  ++SP   EQ++  L LLKSN T G  SVSLA TGN   ALSC   S+PWI+D GA+DHMT+ S +F+SYSP
Subjt:  KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP

Query:  VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
            +                          KSVL    +DQ SG+TIG ARMIDGLYYF++   S+K  QGLSS+SSL V++ IM WH RLGHP+F YL
Subjt:  VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL

Query:  KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
        KHLFP LF+ +D   FQCE C   K  R T++PK Y  S PFYL H+DVWGPSKV T +GK+W                                 IE Q
Subjt:  KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ

Query:  FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
        FQTKI IL SDNG E+FN+   TF ++KGI+HQ++C DT +QNG+AE KN+HLLE+ARA+MF M++PKYL  DA+LTA+YLINRMPTK  + ++P     
Subjt:  FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----

Query:  ------------------------------------------------------------------SISSMEN---------------------------
                                                                           +S MEN                           
Subjt:  ------------------------------------------------------------------SISSMEN---------------------------

Query:  -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
              S   E  +T  T  + E        L+    RN  +     R +    ++DQ   P +G PK   N  +++S                      
Subjt:  -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------

Query:  PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
        PS     P +S                 DLD+PIA RKG+  CTK+ I+ Y+SY  LSDN++AFT+ I+ L +PRNIQE L++ +WKLAV EEMNAL K+
Subjt:  PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH

Query:  GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
        GTW+++DLP +KK VGCKWVFTIK   DGS+ERYKARLVAK
Subjt:  GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK

A5BR93 Integrase catalytic domain-containing protein (e value 1.8e-150, 38.62% identity)
Query:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
        ENSM+M+WLINSM  DI  N++ + TAK++WD+  + YS   N S++F++   L D RQG  SVTQY+++L R WQ+LDLFET+ WK ++D   YR+ V+
Subjt:  ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD

Query:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKK--AVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRET
          R++KF +GLN E D+VRGRI+G   LP+L + FS+VRREESR+ VM+G K     ++D+SAL   S      D+  +  D+P  WCD+C KP H +ET
Subjt:  DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKK--AVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRET

Query:  CWKLHGKPPNWKSSKQYERYSHQH----ASNANVVDSSPL-KEQIDQILKLLKSNYTGNPSVSLAQTGNYPQALSCLNSSPWIIDSGATDHMTSFSCLFD
        CWKLHGKP +WK   +++R    H    + + +V + SP  KEQ++ + KLL    +G+ +  +A T N           PWI+D+GA+DHMT  + +  
Subjt:  CWKLHGKPPNWKSSKQYERYSHQH----ASNANVVDSSPL-KEQIDQILKLLKSNYTGNPSVSLAQTGNYPQALSCLNSSPWIIDSGATDHMTSFSCLFD

Query:  SYSPVY----------SKEK----------------SVLPM-------------------------------DQDSGETIGRARMIDGLY------YFDE
        +Y P            SK K                SVL +                               D  SG+ IG A +  GLY      + ++
Subjt:  SYSPVY----------SKEK----------------SVLPM-------------------------------DQDSGETIGRARMIDGLY------YFDE

Query:  VS-----TSHKKIQGLSSVSSLPVQE--TIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKV
        VS      S    +  +SVS+  V +   I+  H RLGHP+FVYL  LFP LF   + + + CE C   KH R+ +    YKPS+ F L+H+DVWGPS++
Subjt:  VS-----TSHKKIQGLSSVSSLPVQE--TIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKV

Query:  LTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLE
           +G RW                                 ++ QF +KI++L SDN  E+F    +T+L + GIIH ++C DTPQQNGVAERKNRHLLE
Subjt:  LTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLE

Query:  IARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSPS---ISSMENSSTGGETLQTDLTG--------RDPELKFYTRRNR--------TQRGRNQ
        +AR LMFS +VP Y  G+A+LTA YLINRMP++     SP    +    ++      L   + G             KF  R N+        TQ+GR +
Subjt:  IARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSPS---ISSMENSSTGGETLQTDLTG--------RDPELKFYTRRNR--------TQRGRNQ

Query:  TVELTQDQSDT------PVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWK
          EL      T        +      I    +    +P++ D  +PIA RKG  +CT + I NY++Y  LS +++AF + + +  +P  IQEAL  S WK
Subjt:  TVELTQDQSDT------PVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWK

Query:  LAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
         AV +E++AL K+GTW I DLP  K+ VGCKW+FTIK  ADGS+ER+KARLVA+GFT
Subjt:  LAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value 1.1e-16, 29.63% identity)
Query:  WHRRLGHPN-----FVYLKHLF--PGLFKGIDCSVFQCEDCKHHRSTFLP-KSYKPSS----PFYLIHTDVWGPSKVLTKNGKRWI--------------
        WH R GH +      +  K++F    L   ++ S   CE C + +   LP K  K  +    P +++H+DV GP   +T + K +               
Subjt:  WHRRLGHPN-----FVYLKHLF--PGLFKGIDCSVFQCEDCKHHRSTFLP-KSYKPSS----PFYLIHTDVWGPSKVLTKNGKRWI--------------

Query:  -------------------ETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTA
                           E  F  K+  L+ DNG E+ +     F   KGI +  T   TPQ NGV+ER  R + E AR ++    + K   G+AVLTA
Subjt:  -------------------ETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTA

Query:  AYLINRMPTKSCRSSS
         YLINR+P+++   SS
Subjt:  AYLINRMPTKSCRSSS

P04146 Copia protein | 2.9e-09 | 48.53
Query:  IQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        IQ   + S+W+ A+  E+NA K + TW I   PE+K  V  +WVF++K N  G+  RYKARLVA+GFT
Subjt:  IQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.1e-23 | 24.68
Query:  TIMFWHRRLGHPN----FVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW----------------
        ++  WH+R+GH +     +  K       KG   +V  C+ C   K HR +F   S +  +   L+++DV GP ++ +  G ++                
Subjt:  TIMFWHRRLGHPN----FVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW----------------

Query:  -----------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAA
                         +E +   K++ L SDNG E+ +     +    GI H+ T   TPQ NGVAER NR ++E  R+++    +PK   G+AV TA 
Subjt:  -----------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAA

Query:  YLINRMPTK------------------------SCRSSSPSISSMENSSTGGETLQTDLTGRDPE---------LKFYTRRNRTQRGRNQTVELTQDQSD
        YLINR P+                          CR+ +  +   + +    +++     G   E         +K    R+R    R   V    D S+
Subjt:  YLINRMPTK------------------------SCRSSSPSISSMENSSTGGETLQTDLTGRDPE---------LKFYTRRNRTQRGRNQTVELTQDQSD

Query:  TPVNGPKNSGISLSPSSHN------TLPNVSD--------------LDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSK---ITNLFLPRNIQEA
           NG   + +++  +S+N      T   VS+              LD  + + +   Q  +       S     ++ +  +++   I++   P +++E 
Subjt:  TPVNGPKNSGISLSPSSHN------TLPNVSD--------------LDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSK---ITNLFLPRNIQEA

Query:  LN---DSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
        L+    +    A+ EEM +L K+GT+ +V+LP+ K+ + CKWVF +K + D  + RYKARLV KGF
Subjt:  LN---DSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF

P92520 Uncharacterized mitochondrial protein AtMg00820 | 2.4e-11 | 47.14
Query:  PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
        P+++  AL D  W  A+ EE++AL ++ TW +V  P ++  +GCKWVF  K ++DG+++R KARLVAKGF
Subjt:  PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.7e-17 | 22.71
Query:  DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGI---DCSVFQCEDC---KHHRSTFLPKSYK
        D ++G  + + +  D LY +   S+     Q +S  +S   + T   WH RLGHP    L  +       +         C DC   K ++  F   +  
Subjt:  DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGI---DCSVFQCEDC---KHHRSTFLPKSYK

Query:  PSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCR
         + P   I++DVW  S +L+ +  R+                                 +E +FQT+I   +SDNG EF       +    GI H  +  
Subjt:  PSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCR

Query:  DTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPT--------------------------------------------------
         TP+ NG++ERK+RH++E    L+    +PK     A   A YLINR+PT                                                  
Subjt:  DTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPT--------------------------------------------------

Query:  -----------------------------------------------------------------------KSCRS--------SSPSISSMENSSTGGE
                                                                                SC          SSPS +   NS     
Subjt:  -----------------------------------------------------------------------KSCRS--------SSPSISSMENSSTGGE

Query:  TLQTDLTGRDPELKFYT--RRNRTQRGRNQTVELTQDQS--DTPVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTK---------------Y
         L +  +   P     T  R+N  Q     T   TQ  S  +T  N P N   S    S +T P  S    P      S   T                 
Subjt:  TLQTDLTGRDPELKFYT--RRNRTQRGRNQTVELTQDQS--DTPVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTK---------------Y

Query:  LIANY----LSYHRLSDNHKAFTSKITNLFL----------PRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNADGSI
        ++ N     L+ H +    KA   K    +           PR   +AL D  W+ A+  E+NA + + TWD+V  P      VGC+W+FT K N+DGS+
Subjt:  LIANY----LSYHRLSDNHKAFTSKITNLFL----------PRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNADGSI

Query:  ERYKARLVAKGF
         RYKARLVAKG+
Subjt:  ERYKARLVAKGF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.7e-20 | 23.54
Query:  DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHP-----NFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKS
        D ++G  + + +  D LY +   S+     Q +S  +S   + T   WH RLGHP     N V   H  P L       +  C DC   K H+  F   +
Subjt:  DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHP-----NFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKS

Query:  YKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQAT
           S P   I++DVW  S +L+ +  R+                                 +E +FQT+I  L+SDNG EF       +L   GI H  +
Subjt:  YKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQAT

Query:  CRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGG
           TP+ NG++ERK+RH++E+   L+    VPK     A   A YLINR+PT   +  SP                            +   +E+ S   
Subjt:  CRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGG

Query:  ETLQTDLTG------RDPELKFYTRR-----------NRTQRGRNQTVELTQDQS----------DTPV-------------------------------
          +   LT         P  + YT R           + T  G + + E   D +           TP+                               
Subjt:  ETLQTDLTG------RDPELKFYTRR-----------NRTQRGRNQTVELTQDQS----------DTPV-------------------------------

Query:  ----------------------NGPK-------------NSGISLSPSSHNTLPNVSDLDIPIAQR--------------------KGSCQCTKYL----
                              NGP+             NS I  +P+ ++  PN  + + P+ Q                       S   T  L    
Subjt:  ----------------------NGPK-------------NSGISLSPSSHNTLPNVSDLDIPIAQR--------------------KGSCQCTKYL----

Query:  -------------IANYLSYHRLSD-----NHK-AFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNA
                     +  +    R  D     N K ++ + +     PR   +A+ D  W+ A+  E+NA + + TWD+V  P      VGC+W+FT K N+
Subjt:  -------------IANYLSYHRLSD-----NHK-AFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNA

Query:  DGSIERYKARLVAKGF
        DGS+ RYKARLVAKG+
Subjt:  DGSIERYKARLVAKGF

Arabidopsis top hits | e value | %identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 4.5e-13 | 31.13
Query:  QENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETY-EWKSTNDQKHYRKT
        Q N+MVM WL+NSM + +  + M   TA ++W+ + +++    +  ++++L  +L  +RQGG+SV +YF  L ++W EL  +    E K         K 
Subjt:  QENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETY-EWKSTNDQKHYRKT

Query:  VDDGR----IYKFLVG--LNVEFDEVRGRILGKSILPNLNDVFSKVRREES
         ++ R     Y+FL+G  LN  F+ V  +I+ +   P+L++ F+ V+  ES
Subjt:  VDDGR----IYKFLVG--LNVEFDEVRGRILGKSILPNLNDVFSKVRREES

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.7e-17 | 45.36
Query:  IANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
        I+ +LSY ++S  + +F   I     P    EA     W  A+ +E+ A++   TW+I  LP +KK +GCKWV+ IK N+DG+IERYKARLVAKG+T
Subjt:  IANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 1.7e-12 | 47.14
Query:  PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
        P+++  AL D  W  A+ EE++AL ++ TW +V  P ++  +GCKWVF  K ++DG+++R KARLVAKGF
Subjt:  PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF


Sequences
CDS sequence
atgtccgaaactaagagtgttcggatgtatattcgtggtcaagaaaactccatggttatgacgtggcttatcaactctatggtagaagacatcaacagtaactacatgtg
ctacactacggccaaggaattatgggatagtgtgacccaaatgtactctgatttggggaaccaatcacaagtgttcgagctaaaccttaagttgggtgatatgcgacaag
gaggcaattcagttacacaatattttcactctctgaaaaggatatggcaagaacttgatctgtttgagacgtatgagtggaaatccacaaacgaccaaaaacattatcgg
aaaactgttgatgatggtcgcatttacaaatttcttgttggcctcaatgttgagtttgatgaggttagaggcaggatacttgggaaaagtattcttccaaatcttaatga
tgttttttctaaagttcgcagggaagaaagccgcaggaatgttatgattgggaaaaaggcagttgactcagtggacagttctgcactagtgactgaaagtactgcaatga
aagcttctgatcaatccaacaaaactcatgacaagccccatgtatggtgtgatcattgcaacaaaccctgtcatacgagggaaacttgttggaaactacatggcaaacct
ccaaattggaagagttcgaaacaatatgagagatattctcatcagcatgcctccaatgcaaatgttgttgattccagtccactcaaagagcaaattgatcaaatcctgaa
gctgctaaaatccaattatacgggtaatcctagtgtttccttggcacaaacaggtaattaccctcaagctctctcgtgtctaaattcctctccgtggatcattgattccg
gagctactgatcacatgactagtttctcgtgtttatttgattcatactcccctgtttatagtaaagaaaagtctgtattgccgatggatcaggattcgggagagacgatt
ggacgtgctaggatgattgatggtctctattactttgatgaagtttcaactagtcataaaaagattcagggcttgagtagtgtcagttctcttcctgttcaagaaactat
tatgttttggcatcgtagattaggacatcctaatttcgtttatttaaaacatttgtttcctggtttatttaaaggaattgattgttctgtgtttcaatgtgaagattgca
aacatcatcgatctacgtttttacccaaatcctataaaccctcatcacccttttacttaattcatactgatgtttgggggccatctaaggttttgactaaaaatggcaag
cgctggattgagactcaatttcaaactaaaattcgcattcttcactctgataatgggactgaattttttaacgaaccacaaaccacctttttacatgacaagggcattat
tcaccaagcgacatgtcgcgatacccctcagcaaaatggtgttgctgaacggaaaaatcgacacttgcttgaaattgctcgtgccctcatgttttcgatgcatgttccaa
aatatctgttgggggatgcagtcctaacagctgcttacctaatcaatagaatgcctactaagtcctgtaggagctctagtccttcgatctcaagcatggaaaactcttcg
acagggggagaaacactacaaacagatctgacaggtcgagatcctgaacttaagttttatactagaagaaacagaactcaaaggggtagaaatcagacagtcgaactaac
acaggaccaatctgatactccagtaaatggtcctaaaaattcgggtatctctcttagtccttcctctcataatacattgcctaatgtctctgatcttgatattccaattg
cccagagaaaaggttcctgccaatgtacaaaatatctcattgcgaactatctctcctatcatagattgtctgataatcataaagctttcacatccaaaataaccaaccta
tttcttccaaggaatatacaagaagctctaaatgattcgaattggaaattagcagtgatagaagagatgaatgcgctgaaacatggtacttgggacatagttgatctacc
agaagacaagaaagcagtgggatgtaagtgggttttcacgataaaatgtaatgcggatggtagtatcgaaaggtacaaggccaggctagtggctaagggattcacctag
mRNA sequence
atgtccgaaactaagagtgttcggatgtatattcgtggtcaagaaaactccatggttatgacgtggcttatcaactctatggtagaagacatcaacagtaactacatgtg
ctacactacggccaaggaattatgggatagtgtgacccaaatgtactctgatttggggaaccaatcacaagtgttcgagctaaaccttaagttgggtgatatgcgacaag
gaggcaattcagttacacaatattttcactctctgaaaaggatatggcaagaacttgatctgtttgagacgtatgagtggaaatccacaaacgaccaaaaacattatcgg
aaaactgttgatgatggtcgcatttacaaatttcttgttggcctcaatgttgagtttgatgaggttagaggcaggatacttgggaaaagtattcttccaaatcttaatga
tgttttttctaaagttcgcagggaagaaagccgcaggaatgttatgattgggaaaaaggcagttgactcagtggacagttctgcactagtgactgaaagtactgcaatga
aagcttctgatcaatccaacaaaactcatgacaagccccatgtatggtgtgatcattgcaacaaaccctgtcatacgagggaaacttgttggaaactacatggcaaacct
ccaaattggaagagttcgaaacaatatgagagatattctcatcagcatgcctccaatgcaaatgttgttgattccagtccactcaaagagcaaattgatcaaatcctgaa
gctgctaaaatccaattatacgggtaatcctagtgtttccttggcacaaacaggtaattaccctcaagctctctcgtgtctaaattcctctccgtggatcattgattccg
gagctactgatcacatgactagtttctcgtgtttatttgattcatactcccctgtttatagtaaagaaaagtctgtattgccgatggatcaggattcgggagagacgatt
ggacgtgctaggatgattgatggtctctattactttgatgaagtttcaactagtcataaaaagattcagggcttgagtagtgtcagttctcttcctgttcaagaaactat
tatgttttggcatcgtagattaggacatcctaatttcgtttatttaaaacatttgtttcctggtttatttaaaggaattgattgttctgtgtttcaatgtgaagattgca
aacatcatcgatctacgtttttacccaaatcctataaaccctcatcacccttttacttaattcatactgatgtttgggggccatctaaggttttgactaaaaatggcaag
cgctggattgagactcaatttcaaactaaaattcgcattcttcactctgataatgggactgaattttttaacgaaccacaaaccacctttttacatgacaagggcattat
tcaccaagcgacatgtcgcgatacccctcagcaaaatggtgttgctgaacggaaaaatcgacacttgcttgaaattgctcgtgccctcatgttttcgatgcatgttccaa
aatatctgttgggggatgcagtcctaacagctgcttacctaatcaatagaatgcctactaagtcctgtaggagctctagtccttcgatctcaagcatggaaaactcttcg
acagggggagaaacactacaaacagatctgacaggtcgagatcctgaacttaagttttatactagaagaaacagaactcaaaggggtagaaatcagacagtcgaactaac
acaggaccaatctgatactccagtaaatggtcctaaaaattcgggtatctctcttagtccttcctctcataatacattgcctaatgtctctgatcttgatattccaattg
cccagagaaaaggttcctgccaatgtacaaaatatctcattgcgaactatctctcctatcatagattgtctgataatcataaagctttcacatccaaaataaccaaccta
tttcttccaaggaatatacaagaagctctaaatgattcgaattggaaattagcagtgatagaagagatgaatgcgctgaaacatggtacttgggacatagttgatctacc
agaagacaagaaagcagtgggatgtaagtgggttttcacgataaaatgtaatgcggatggtagtatcgaaaggtacaaggccaggctagtggctaagggattcacctag
Protein sequence
MSETKSVRMYIRGQENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYR
KTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKP
PNWKSSKQYERYSHQHASNANVVDSSPLKEQIDQILKLLKSNYTGNPSVSLAQTGNYPQALSCLNSSPWIIDSGATDHMTSFSCLFDSYSPVYSKEKSVLPMDQDSGETI
GRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDCKHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
RWIETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSPSISSMENSS
TGGETLQTDLTGRDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNL
FLPRNIQEALNDSNWKLAVIEEMNALKHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT