; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026387 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026387
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:35894305..35903707
RNA-Seq ExpressionLag0026387
SyntenyLag0026387
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
DAD31904.1 TPA_asm: hypothetical protein HUJ06_010755 [Nelumbo nucifera]4.6e-9935.04Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        +EYI+HVR ++S K++W+TLERLF QKNT RLQYLENE+    Q N+ +S+YFLKVK +C++I+ELD  +PIS+ARL RYLIR LRKEFMPFISSIQ W+
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
        NQPSI+ELEN LSNQEAL+KQM G        VE  ++ KDK K  ++ K SS+D K  K EG+     K         +  Y C     G++   F V 
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
             +++    +    +    RV + +   ++A  + E                  +D  N    + +E                          V+ +
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
         D    N ++    ND                 +DY+++WI+D G +HHATG+ +LL D   +                             N   ++LE
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
         VY VPGLKK ++ +S I D                                   E V++  DK  +  + +    D         F G  K  L     
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC
         +   G     T   + +S  H   G      V   LL  +   R+        + GV L ++    G                              VC
Subjt:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC

Query:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF-MLSDQCQNYCKEHGIQ
          CQ+ KSHRL F  S N A+    L+H DLMGPT TPSY+G  Y+ V VDDFS ++WVYFL+ KSE  + F QFK  VE EF  LS +   + +EHGI+
Subjt:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF-MLSDQCQNYCKEHGIQ

Query:  CQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
         Q T P+TPQQNGVAERKLAHLT++ L WLH KNLPRELWA A+  ACHVI RLPPW+G++ S FE+    KP+ + F+
Subjt:  CQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

KAA8537014.1 hypothetical protein F0562_029492 [Nyssa sinensis]7.6e-10255.01Show/hide
Query:  YDEDWIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPND----
        Y +DWI+D GCSHHAT NVSLL +V  H GKR I T +NSL PVV+EG  NVK+D  N  GVSL+DVYHVPGLKKNL SVSQIAD GRYV+F PND    
Subjt:  YDEDWIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPND----

Query:  ----------------------------------------------------------------------EIHHDVVCPGCQFGKSHRLHFPNSNNKATG
                                                                              EIH DVVC GCQ+GKSHRL FPNS N+AT 
Subjt:  ----------------------------------------------------------------------EIHHDVVCPGCQFGKSHRLHFPNSNNKATG

Query:  TLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQNYCKEHGIQCQMTSPDTP
         L LVHSDLMGPTRTPSY G  YV V VD FS FTWV+FL+ KSETFSKF+QFKEQVE EF               +S+Q  NYC+EH IQCQMT P TP
Subjt:  TLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQNYCKEHGIQCQMTSPDTP

Query:  QQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKF
        QQNGVAERKLAHLTSMCL WL  KNLPRELWA AI + CHVINRLPPW+G+  SPFE     KP  + F
Subjt:  QQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKF

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]2.9e-12540.48Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        QEYI+HVRD  S KQVW+TLERLF QKNT RLQ+L+NE+ G  Q NLS+ EYFLK+KTLCSEI+ELD  +P+S+ARLHRYLI GLRKEFMPFISSIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
        NQP IIELEN LSNQEALMKQ+  N+K+   +VE  +Y KDK K NS +K SS DSK SK +GQ +GN K +          Y C         L  D H
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
             +V+       + + +  RVN+     NVA  +           KF                         ++   +Q LS        +EAV   
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
         D+  + +S  + +N      VE     +    +DY +DWI+D+GCSHHA GN  LL +   H GKR I T +NSL P+V+E   NVK D  NV+GVSL+
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
        DVYHVP LKK ++ +S I D                                     V++  D  K+ S+ K    D         F G  K     D  
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSH-HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV
        +++    ++   TG  + ++  H   G                                      HV       +S  ++ D     +F    EIH DVV
Subjt:  WIIDYGCSH-HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV

Query:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML
        CPGCQ+GKSH L F NS NKAT  L L                                V+FL+ KSETFSKF+QFKEQVE EF               +
Subjt:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML

Query:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
        SDQ  NYC+EH I+C+MT P+TPQQNGVAERKLAHLTSMCL WLH K+LPRELWA AI  ACHVINRLPPW G+  S FE     KP  + FR
Subjt:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

KAG6437849.1 hypothetical protein SASPL_102779 [Salvia splendens]1.6e-10436.01Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        +E+I+HVRDV+S K+VW+TLE +  +KNT RLQ LENE+    QG +SVSEYFL++K+ C+EI+E+D  + IS A L RYLIRGLRKE+ PF++SIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
        NQPS+ ELEN L NQEAL KQM  N      + +AV++   KGK N     +SN  K                  DE+   D G S              
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
            K+ I       L  +++    VK    NVA  +  D                          IE +                     + VEA+  +
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVK-DDAPNVAGVSL
             VN                         +++  E+WIID GCSHHATGN +L  +   H G+RV+ T +NS  PV +E  V +     P+   V L
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVK-DDAPNVAGVSL

Query:  EDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDE
         DVYHVPGLK+ +  ++ I +                                   + V++  +  KV  + K   N S D    G+ KG+L        
Subjt:  EDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDE

Query:  DWIIDYGCSH----HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIH
         +++  G ++      T N S+      H G +++  +++  L                VAG+              LV+V +                 
Subjt:  DWIIDYGCSH----HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIH

Query:  HDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------------
         DV+C GCQ+GKSHRL F  S N+ +    LVH+DLMGPTRTPS S  RYV V VDD S FTWV FLK KSE  SKF++F++ VE EF            
Subjt:  HDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------------

Query:  --MLSDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
           +SD    YC+++GIQ QMT PDTPQQNGVAERKLAHLTS+CL WLH KNLPRELWA A+  ACHV NRLPPW G+  SPFE+     P+ + FR
Subjt:  --MLSDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

RWR74934.1 Integrase, catalytic core [Cinnamomum micranthum f. kanehirae]9.2e-14042.17Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        Q+YI  VRDV+S KQVWE LERLF QKNT RLQYLENE+ G  QG LS+ EYFLKVKTLC+EI+ELD  +P+S+ARLHRYLIRGLRKEFMPFISSIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
         QPSIIELEN LSNQEAL+KQM  N KK    VE  +Y KD+G  N   K+ S+D++ S  EG+F+GN KG     +   I   C  HA           
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
             RV+         +    RV + +   NVA                                                                 E
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
        KD+ + ++     S  +  S +    + ++   +DY++ WI+D GCSHHATGN SLL D   H GK+ I T +NSL PV  E+  + + D  N  GVSL 
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
        +VYHV GLKK ++ +S I D   +  +   +N          Q+L N K I  +V      K+   V S++                            D
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC
          ++  C +    + +L      H G +++  ++   L                        +  +P  K                      EIHHDVVC
Subjt:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC

Query:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEFML--------------S
        PGCQ+ KSHRL FP S N+A+  L LVHSDLMGPT+T SYS  RYV + VDDFS FTWVYFL+ KSE FSKFVQFKEQVE EF L              S
Subjt:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEFML--------------S

Query:  DQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
        DQ  NYCKEHGIQ QMT P+TPQQNGVAERKLAHLTSMCL WLH KNLPRELWA A+ SACHVINRLP W G+  SPFE     KP  + F+
Subjt:  DQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

TrEMBL top hitse value%identityAlignment
A0A1J3CK86 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-8031.47Show/hide
Query:  EHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWVNQPS
        EHV    S   +W+ L++LF++KN  RLQ LENE+    QG  S+SE+F+KVK LCSEI  L+  + IS ARL R +IRGLR E+ PF++S+Q W  QPS
Subjt:  EHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWVNQPS

Query:  IIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVHTHQG
        + E EN L++QE+L  QM G   KIH          D G    + ++ S           FKG  K    YD                        T+ G
Subjt:  IIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVHTHQG

Query:  KRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYEKDKG
        +   +T +                                   K+F  +              +L +F  +    +K+      K H E E      + G
Subjt:  KRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYEKDKG

Query:  KVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLEDVYH
        K  +    SS  S    +     GN         DWI+D GCSHH TGN  L      H+GK  I T +NS+  V +E  V +  D  +   ++L++VYH
Subjt:  KVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLEDVYH

Query:  VPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIID
        VPG+KK                     N LS   A     + +   + +  + V + K+  ++ +    +    KD  V       ++     D D+I  
Subjt:  VPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIID

Query:  YGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVCPGCQ
            H   G++++          ++   VN  L                            V GL K      +I D G+              +C GCQ
Subjt:  YGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVCPGCQ

Query:  FGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQ
        +GKSHRL F NS ++    L  VHSDLMGPTRT SYSG RY+ +FVDDFS +TWVYF+K KSE FSKF +FK  VE E                +S++  
Subjt:  FGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQ

Query:  NYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
        ++C++ GI+ + T P TPQQNGVAERK+ HL+  C  WLH KNLP+ LWA  +  A +VINR+P    +  SP+E+    KP    FR
Subjt:  NYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

A0A443N8T5 Integrase, catalytic core4.5e-14042.17Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        Q+YI  VRDV+S KQVWE LERLF QKNT RLQYLENE+ G  QG LS+ EYFLKVKTLC+EI+ELD  +P+S+ARLHRYLIRGLRKEFMPFISSIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
         QPSIIELEN LSNQEAL+KQM  N KK    VE  +Y KD+G  N   K+ S+D++ S  EG+F+GN KG     +   I   C  HA           
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
             RV+         +    RV + +   NVA                                                                 E
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
        KD+ + ++     S  +  S +    + ++   +DY++ WI+D GCSHHATGN SLL D   H GK+ I T +NSL PV  E+  + + D  N  GVSL 
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
        +VYHV GLKK ++ +S I D   +  +   +N          Q+L N K I  +V      K+   V S++                            D
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC
          ++  C +    + +L      H G +++  ++   L                        +  +P  K                      EIHHDVVC
Subjt:  WIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVC

Query:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEFML--------------S
        PGCQ+ KSHRL FP S N+A+  L LVHSDLMGPT+T SYS  RYV + VDDFS FTWVYFL+ KSE FSKFVQFKEQVE EF L              S
Subjt:  PGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEFML--------------S

Query:  DQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
        DQ  NYCKEHGIQ QMT P+TPQQNGVAERKLAHLTSMCL WLH KNLPRELWA A+ SACHVINRLP W G+  SPFE     KP  + F+
Subjt:  DQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

A0A5J5B3A5 Integrase catalytic domain-containing protein3.7e-10255.01Show/hide
Query:  YDEDWIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPND----
        Y +DWI+D GCSHHAT NVSLL +V  H GKR I T +NSL PVV+EG  NVK+D  N  GVSL+DVYHVPGLKKNL SVSQIAD GRYV+F PND    
Subjt:  YDEDWIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPND----

Query:  ----------------------------------------------------------------------EIHHDVVCPGCQFGKSHRLHFPNSNNKATG
                                                                              EIH DVVC GCQ+GKSHRL FPNS N+AT 
Subjt:  ----------------------------------------------------------------------EIHHDVVCPGCQFGKSHRLHFPNSNNKATG

Query:  TLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQNYCKEHGIQCQMTSPDTP
         L LVHSDLMGPTRTPSY G  YV V VD FS FTWV+FL+ KSETFSKF+QFKEQVE EF               +S+Q  NYC+EH IQCQMT P TP
Subjt:  TLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------MLSDQCQNYCKEHGIQCQMTSPDTP

Query:  QQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKF
        QQNGVAERKLAHLTSMCL WL  KNLPRELWA AI + CHVINRLPPW+G+  SPFE     KP  + F
Subjt:  QQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKF

A0A5J5BCB3 Uncharacterized protein7.7e-8438.92Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        QEYI+HVRD  S KQVWETLERLF QKNT RLQ+LEN++ G  Q NLS+SEYFLK+KTLCSEI+ELD  +P+S+ARL RYLIRGLRKEFMPFISSIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
        NQPSIIELEN LSNQEALMKQM  ++K+   +VE  +Y KDK K NS +K SS D+K SK EGQ +GN +               S +  G +      H
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
          +  RV    N       + G +  ++   N+ G    +V H                        E   F   ++   +Q LS        +EAV   
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
         D+  + +S  + +N      VE     +    +DY +DWI+D GCSHHATGN SLL +   H GKR I T +NSL PVV+E   NVK D  N  GVSL+
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
        DVYHVPGLKK ++ +S I D                                     V++  D  K+ S+ K    D         F G  K  L     
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSHHATGNVSLLSDVHTHQ-GKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV
                      V   SD +  + G+    T+ ++ L     G V  +             + H    KK L  V                EIH DVV
Subjt:  WIIDYGCSHHATGNVSLLSDVHTHQ-GKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV

Query:  CPGCQFGKSHRLHFPNSNNKATGTLHL
        CPGCQ+GKSHR  FPNS N+AT  L L
Subjt:  CPGCQFGKSHRLHFPNSNNKATGTLHL

A0A5J5C3K7 Uncharacterized protein1.4e-12540.48Show/hide
Query:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV
        QEYI+HVRD  S KQVW+TLERLF QKNT RLQ+L+NE+ G  Q NLS+ EYFLK+KTLCSEI+ELD  +P+S+ARLHRYLI GLRKEFMPFISSIQ W 
Subjt:  QEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWV

Query:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH
        NQP IIELEN LSNQEALMKQ+  N+K+   +VE  +Y KDK K NS +K SS DSK SK +GQ +GN K +          Y C         L  D H
Subjt:  NQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDVH

Query:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE
             +V+       + + +  RVN+     NVA  +           KF                         ++   +Q LS        +EAV   
Subjt:  THQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLSNSKKIHYEVEAVVYE

Query:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE
         D+  + +S  + +N      VE     +    +DY +DWI+D+GCSHHA GN  LL +   H GKR I T +NSL P+V+E   NVK D  NV+GVSL+
Subjt:  KDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVKDDAPNVAGVSLE

Query:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED
        DVYHVP LKK ++ +S I D                                     V++  D  K+ S+ K    D         F G  K     D  
Subjt:  DVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDED

Query:  WIIDYGCSH-HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV
        +++    ++   TG  + ++  H   G                                      HV       +S  ++ D     +F    EIH DVV
Subjt:  WIIDYGCSH-HATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVV

Query:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML
        CPGCQ+GKSH L F NS NKAT  L L                                V+FL+ KSETFSKF+QFKEQVE EF               +
Subjt:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML

Query:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR
        SDQ  NYC+EH I+C+MT P+TPQQNGVAERKLAHLTSMCL WLH K+LPRELWA AI  ACHVINRLPPW G+  S FE     KP  + FR
Subjt:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEV----KPEAAKFR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-1730.65Show/hide
Query:  VCPGCQFGKSHRLHFPNSNNKA--TGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF-------------
        +C  C  GK  RL F    +K      L +VHSD+ GP    +     Y  +FVD F+ +   Y +K KS+ FS F  F  + E  F             
Subjt:  VCPGCQFGKSHRLHFPNSNNKA--TGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF-------------

Query:  -MLSDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLP--PWMGSNLSPFEV
          LS++ + +C + GI   +T P TPQ NGV+ER +  +T      +    L +  W  A+ +A ++INR+P    + S+ +P+E+
Subjt:  -MLSDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLP--PWMGSNLSPFEV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2537.28Show/hide
Query:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML
        C  C FGK HR+ F  S+ +    L LV+SD+ GP    S  G +Y   F+DD S   WVY LK K + F  F +F   VE E                 
Subjt:  CPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF--------------ML

Query:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLP
        S + + YC  HGI+ + T P TPQ NGVAER    +       L +  LP+  W  A+ +AC++INR P
Subjt:  SDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.9e-2031.75Show/hide
Query:  HHDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------MLSDQ
        H  + C  C   KS+++ F  S   +T  L  ++SD+   +   S+   RY  +FVD F+ +TW+Y LK KS+    F+ FK  +EN F        SD 
Subjt:  HHDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------MLSDQ

Query:  ------CQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFE----VKPEAAKFRA---
                 Y  +HGI    + P TP+ NG++ERK  H+    L  L   ++P+  W  A   A ++INRLP  +    SPF+      P   K R    
Subjt:  ------CQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFE----VKPEAAKFRA---

Query:  ---KWLQNANQ
            WL+  NQ
Subjt:  ---KWLQNANQ

Q9FFJ8 L10-interacting MYB domain-containing protein3.7e-0636.62Show/hide
Query:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ
        +++ P  +++     P  SI E + +L  I ++ +G EL+M A+D+F KRE RE+F+ L+KP  ++AWL++
Subjt:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.2e-2134.43Show/hide
Query:  HHDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------MLSDQ
        H  + C  C   KSH++ F NS   ++  L  ++SD+   +   S    RY  +FVD F+ +TW+Y LK KS+    F+ FK  VEN F      + SD 
Subjt:  HHDVVCPGCQFGKSHRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEF------MLSDQ

Query:  ------CQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFE
               ++Y  +HGI    + P TP+ NG++ERK  H+  M L  L   ++P+  W  A   A ++INRLP  +    SPF+
Subjt:  ------CQNYCKEHGIQCQMTSPDTPQQNGVAERKLAHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFE

Arabidopsis top hitse value%identityAlignment
AT2G19220.1 unknown protein7.8e-0429.17Show/hide
Query:  PRSIEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWL
        P  +E  P    V++++    +  E +  L  I ++ +GG+L+M A+D+F  ++ R +F+ L+K   ++AWL
Subjt:  PRSIEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWL

AT3G11310.1 unknown protein4.6e-0437.04Show/hide
Query:  SILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQK
        +I E M  L  + ++ +G EL+M A+D+F  +E REMF+ LE    +++WL ++
Subjt:  SILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQK

AT5G05800.1 unknown protein2.6e-0736.62Show/hide
Query:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ
        +++ P  +++     P  SI E + +L  I ++ +G EL+M A+D+F KRE RE+F+ L+KP  ++AWL++
Subjt:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ

AT5G05800.2 unknown protein2.6e-0736.62Show/hide
Query:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ
        +++ P  +++     P  SI E + +L  I ++ +G EL+M A+D+F KRE RE+F+ L+KP  ++AWL++
Subjt:  IEDDPSRIDVRRRDVPGCSILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.2e-0525.68Show/hide
Query:  STKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWVNQPSIIELENF
        + + +W +LE LF      R    ENE+  T   +LSV EY  K+K+L   +T +D   PIS+  L  +L+ GL +++   ++ I+     PS  E  + 
Subjt:  STKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKTLCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWVNQPSIIELENF

Query:  LSNQEALM--KQMLGNSKKIHYEVEAVV---------YEKDKGKVNSSTKRSSNDSKD---SKVEGQFKGNLKGFLDYDEDWI
        L  +E+ +  K     S   H  +  V+         Y ++    NS+  R  +  K+      +G++  N    L+    WI
Subjt:  LSNQEALM--KQMLGNSKKIHYEVEAVV---------YEKDKGKVNSSTKRSSNDSKD---SKVEGQFKGNLKGFLDYDEDWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATACGTGTTCGTCGGCGTACCGGATGAAGGTTGCGGGGACGGCAGGGGGGGCGTTGACGTGATGCGAGAGGTGCAGGTAGTAAATATGCAGATGGACAGGAATGT
TGTTCCTGATCAATGGAGTTCAGAGGTAGTAAATATGCAGATGGACAGGAATATTGTTCCTGATCAATGGAGTTCAGAGGTGTTTCTCTCTACTGACGGGGTGGTTGAAC
TTTGTACTCATTGTCCGACAAGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACATTTTTACCCTTGTATTTACGGTTATTTACAGACGCCGTT
TACGTCATTTTCGAGGACAAAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACAGGGGGTGTTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTAT
ACACCCCGTGGGGACAGTTTCAAAGGGAGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGGCAGATGAAGGTCGTTGGCACGGCGGAGATGGTTCGTCGG
CGTGGTATGAGAGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGAGGGGAGAAACGACCGAGCATGCAA
GAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGGAGATGGTTCATCGGCGTGGTGTGAGAGAACTCAATTTGATTCACTTCCTTCTTGCAATTTCTATCT
TGTTTCCACTATTTGTTCTTCTTTGTTTATATGGTATCAAAGCTTCCAGGAGTATATTGAGCATGTTCGTGATGTGAATTCTACAAAGCAAGTGTGGGAAACACTTGAAA
GATTGTTTGATCAAAAGAACACGACAAGGTTGCAGTATTTGGAGAATGAAATTGTCGGAACTATTCAAGGTAACTTGTCGGTTTCAGAATATTTTCTAAAAGTTAAGACT
TTGTGTTCTGAAATTACAGAATTGGACAAAGCGAAACCTATTAGTAATGCCCGGTTGCATCGGTATCTTATTCGTGGACTGCGAAAGGAGTTTATGCCATTTATTTCTTC
GATACAAGATTGGGTAAATCAACCTTCTATCATTGAGCTGGAAAACTTTCTCTCAAATCAGGAAGCATTGATGAAACAAATGCTTGGCAACAGCAAAAAAATTCACTATG
AGGTGGAAGCTGTTGTTTATGAAAAAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAAT
TTAAAAGGATTTTTAGATTATGATGAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTGGAAATGTTTCTCTTCTCTTTGATGTTCATACCCATCAGGGAAA
AAGAGTTATTGCAACGGTCAATAATTCCTTACTTCCTGTTGTTGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTT
ATCATGTTCCAGGTCTAAAGAAGTTTATGTCATTTATTTCTTCGATACAAGATTGGGTAAATCAACCTTCTATCATTGAGCTAGAAAACTTTCTCTCAAATCAGGAAGCA
TTGATGAAACAAATGCTTAGCAACAGCAAAAAAATTCACTATGAGGTGGAAGCTGTTGTTTATGAAAAAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAA
TGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAATTTAAAAGGATTTTTAGATTATGATGAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTG
GAAATGTTTCTCTTCTCTTTGATGCTCATACCCATCAGGGAAAAAGAGTTATTGCAACGGTCAATAATTCCTTACTTCCTGTTGTTGAAGAAAGGCGTGTTAATGTTAAG
GATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTTATCATGTTCCAGGTCTAAAGAAGTTTATGTCATTTATTTCTTCGATACAAGATTGGGTAAATCAACC
TTCTATCATTGAGCTAGAAAACTTTCTCTCAAATCAGGAAGCATTGATGAAACAAATGCTTGGCAACAGCAAAAAAATTCACTATGAGGTGGAAGCTGTTGTTTATGAAA
AAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAATTTAAAAGGATTTTTAGATTATGAT
GAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTGGAAATGTTTCTCTTCTCTCTGATGTTCATACCCATCAGGGAAAAAGAGTTATTGCAACGGTCAATAA
TTCCTTACTTCCTGTTGTTGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTTATCATGTTCCAGGTCTAAAGAAGA
ATTTGGTTTCAGTATCTCAGATTGCTGATTCTGGGAGGTATGTTATCTTTGGTCCAAATGATGAAATTCATCACGATGTGGTTTGTCCTGGTTGTCAATTTGGAAAATCA
CATCGCCTTCATTTCCCAAATTCAAATAACAAGGCTACTGGTACATTGCATTTGGTTCATTCAGATTTGATGGGGCCAACTAGAACACCCAGTTATTCTGGTTGTCGTTA
TGTGAGGGTTTTTGTGGACGATTTTTCTCCATTCACGTGGGTGTATTTCTTGAAAGCTAAAAGTGAGACTTTCTCCAAGTTTGTCCAGTTCAAGGAGCAAGTAGAAAATG
AATTCATGTTGTCTGATCAATGTCAGAATTATTGCAAAGAGCATGGAATTCAATGCCAAATGACAAGTCCTGACACTCCACAACAGAATGGAGTTGCTGAACGTAAATTA
GCACATCTTACATCTATGTGCTTGTGTTGGTTGCATGTAAAGAACCTTCCAAGGGAGCTTTGGGCAACAGCTATTCATTCAGCTTGTCACGTCATAAACCGTCTACCTCC
ATGGATGGGATCGAATCTGTCTCCTTTTGAGGTTAAGCCTGAGGCAGCAAAATTTCGTGCCAAGTGGTTACAAAATGCAAATCAATTAGACATTCTTTTTAAAGATATTG
CAGTTACAGGAGATGGAGCATGGGCACCCTCTCAAGGATTTGTACCTCGAAGTATTGAGGATGATCCATCTAGAATAGATGTTAGGAGAAGAGATGTACCAGGTTGTAGC
ATTCTTGAAGTGATGAATGCCTTACGAGGCATACCTAAAATTGTGGAAGGCGGTGAGCTCTTCATGAAAGCTGTAGATATCTTTACAAAGAGAGAAAATAGAGAAATGTT
TGTAGCATTGGAAAAGCCTGAGACTCAAGTTGCGTGGCTCAAGCAGAAGAGAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATACGTGTTCGTCGGCGTACCGGATGAAGGTTGCGGGGACGGCAGGGGGGGCGTTGACGTGATGCGAGAGGTGCAGGTAGTAAATATGCAGATGGACAGGAATGT
TGTTCCTGATCAATGGAGTTCAGAGGTAGTAAATATGCAGATGGACAGGAATATTGTTCCTGATCAATGGAGTTCAGAGGTGTTTCTCTCTACTGACGGGGTGGTTGAAC
TTTGTACTCATTGTCCGACAAGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACATTTTTACCCTTGTATTTACGGTTATTTACAGACGCCGTT
TACGTCATTTTCGAGGACAAAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACAGGGGGTGTTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTAT
ACACCCCGTGGGGACAGTTTCAAAGGGAGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGGCAGATGAAGGTCGTTGGCACGGCGGAGATGGTTCGTCGG
CGTGGTATGAGAGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGAGGGGAGAAACGACCGAGCATGCAA
GAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGGAGATGGTTCATCGGCGTGGTGTGAGAGAACTCAATTTGATTCACTTCCTTCTTGCAATTTCTATCT
TGTTTCCACTATTTGTTCTTCTTTGTTTATATGGTATCAAAGCTTCCAGGAGTATATTGAGCATGTTCGTGATGTGAATTCTACAAAGCAAGTGTGGGAAACACTTGAAA
GATTGTTTGATCAAAAGAACACGACAAGGTTGCAGTATTTGGAGAATGAAATTGTCGGAACTATTCAAGGTAACTTGTCGGTTTCAGAATATTTTCTAAAAGTTAAGACT
TTGTGTTCTGAAATTACAGAATTGGACAAAGCGAAACCTATTAGTAATGCCCGGTTGCATCGGTATCTTATTCGTGGACTGCGAAAGGAGTTTATGCCATTTATTTCTTC
GATACAAGATTGGGTAAATCAACCTTCTATCATTGAGCTGGAAAACTTTCTCTCAAATCAGGAAGCATTGATGAAACAAATGCTTGGCAACAGCAAAAAAATTCACTATG
AGGTGGAAGCTGTTGTTTATGAAAAAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAAT
TTAAAAGGATTTTTAGATTATGATGAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTGGAAATGTTTCTCTTCTCTTTGATGTTCATACCCATCAGGGAAA
AAGAGTTATTGCAACGGTCAATAATTCCTTACTTCCTGTTGTTGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTT
ATCATGTTCCAGGTCTAAAGAAGTTTATGTCATTTATTTCTTCGATACAAGATTGGGTAAATCAACCTTCTATCATTGAGCTAGAAAACTTTCTCTCAAATCAGGAAGCA
TTGATGAAACAAATGCTTAGCAACAGCAAAAAAATTCACTATGAGGTGGAAGCTGTTGTTTATGAAAAAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAA
TGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAATTTAAAAGGATTTTTAGATTATGATGAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTG
GAAATGTTTCTCTTCTCTTTGATGCTCATACCCATCAGGGAAAAAGAGTTATTGCAACGGTCAATAATTCCTTACTTCCTGTTGTTGAAGAAAGGCGTGTTAATGTTAAG
GATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTTATCATGTTCCAGGTCTAAAGAAGTTTATGTCATTTATTTCTTCGATACAAGATTGGGTAAATCAACC
TTCTATCATTGAGCTAGAAAACTTTCTCTCAAATCAGGAAGCATTGATGAAACAAATGCTTGGCAACAGCAAAAAAATTCACTATGAGGTGGAAGCTGTTGTTTATGAAA
AAGATAAAGGAAAAGTTAATTCTTCTACCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTTCAAGGGCAATTTAAAAGGATTTTTAGATTATGAT
GAAGATTGGATTATTGATTATGGATGTTCTCATCATGCTACTGGAAATGTTTCTCTTCTCTCTGATGTTCATACCCATCAGGGAAAAAGAGTTATTGCAACGGTCAATAA
TTCCTTACTTCCTGTTGTTGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGTTTATCATGTTCCAGGTCTAAAGAAGA
ATTTGGTTTCAGTATCTCAGATTGCTGATTCTGGGAGGTATGTTATCTTTGGTCCAAATGATGAAATTCATCACGATGTGGTTTGTCCTGGTTGTCAATTTGGAAAATCA
CATCGCCTTCATTTCCCAAATTCAAATAACAAGGCTACTGGTACATTGCATTTGGTTCATTCAGATTTGATGGGGCCAACTAGAACACCCAGTTATTCTGGTTGTCGTTA
TGTGAGGGTTTTTGTGGACGATTTTTCTCCATTCACGTGGGTGTATTTCTTGAAAGCTAAAAGTGAGACTTTCTCCAAGTTTGTCCAGTTCAAGGAGCAAGTAGAAAATG
AATTCATGTTGTCTGATCAATGTCAGAATTATTGCAAAGAGCATGGAATTCAATGCCAAATGACAAGTCCTGACACTCCACAACAGAATGGAGTTGCTGAACGTAAATTA
GCACATCTTACATCTATGTGCTTGTGTTGGTTGCATGTAAAGAACCTTCCAAGGGAGCTTTGGGCAACAGCTATTCATTCAGCTTGTCACGTCATAAACCGTCTACCTCC
ATGGATGGGATCGAATCTGTCTCCTTTTGAGGTTAAGCCTGAGGCAGCAAAATTTCGTGCCAAGTGGTTACAAAATGCAAATCAATTAGACATTCTTTTTAAAGATATTG
CAGTTACAGGAGATGGAGCATGGGCACCCTCTCAAGGATTTGTACCTCGAAGTATTGAGGATGATCCATCTAGAATAGATGTTAGGAGAAGAGATGTACCAGGTTGTAGC
ATTCTTGAAGTGATGAATGCCTTACGAGGCATACCTAAAATTGTGGAAGGCGGTGAGCTCTTCATGAAAGCTGTAGATATCTTTACAAAGAGAGAAAATAGAGAAATGTT
TGTAGCATTGGAAAAGCCTGAGACTCAAGTTGCGTGGCTCAAGCAGAAGAGAGTTTAA
Protein sequenceShow/hide protein sequence
MQYVFVGVPDEGCGDGRGGVDVMREVQVVNMQMDRNVVPDQWSSEVVNMQMDRNIVPDQWSSEVFLSTDGVVELCTHCPTRHGYRHFRELFTGLFWTFLPLYLRLFTDAV
YVIFEDKDVFWMVNEFQNGTYRGCCVVVDIPSVNTGIHPVGTVSKGGEKRPSMQEMVVDAADEGRWHGGDGSSAWYERGCCVVVDIPSVNTGIHPVGTVSKGGEKRPSMQ
EMVVDAADEGRWHGGDGSSAWCERTQFDSLPSCNFYLVSTICSSLFIWYQSFQEYIEHVRDVNSTKQVWETLERLFDQKNTTRLQYLENEIVGTIQGNLSVSEYFLKVKT
LCSEITELDKAKPISNARLHRYLIRGLRKEFMPFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGN
LKGFLDYDEDWIIDYGCSHHATGNVSLLFDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEA
LMKQMLSNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYDEDWIIDYGCSHHATGNVSLLFDAHTHQGKRVIATVNNSLLPVVEERRVNVK
DDAPNVAGVSLEDVYHVPGLKKFMSFISSIQDWVNQPSIIELENFLSNQEALMKQMLGNSKKIHYEVEAVVYEKDKGKVNSSTKRSSNDSKDSKVEGQFKGNLKGFLDYD
EDWIIDYGCSHHATGNVSLLSDVHTHQGKRVIATVNNSLLPVVEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADSGRYVIFGPNDEIHHDVVCPGCQFGKS
HRLHFPNSNNKATGTLHLVHSDLMGPTRTPSYSGCRYVRVFVDDFSPFTWVYFLKAKSETFSKFVQFKEQVENEFMLSDQCQNYCKEHGIQCQMTSPDTPQQNGVAERKL
AHLTSMCLCWLHVKNLPRELWATAIHSACHVINRLPPWMGSNLSPFEVKPEAAKFRAKWLQNANQLDILFKDIAVTGDGAWAPSQGFVPRSIEDDPSRIDVRRRDVPGCS
ILEVMNALRGIPKIVEGGELFMKAVDIFTKRENREMFVALEKPETQVAWLKQKRV