CuGenDBv2

Lag0028665 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028665
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr8:27871227..27873282
RNA-Seq Expression: Lag0028665
Synteny: Lag0028665
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
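
The genome location in this record is written as chromosome:start..end. A minimal sketch of splitting that string into its parts follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr8:27871227..27873282")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene region spans 2056 bp, which is longer than the CDS listed further down this record.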


Homology
GenBank top hits (e value, % identity, alignment)
PNY00673.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]; e value: 1.1e-35; % identity: 42.74
Query:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSS-PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENTSTN
        C+FLGY L    Y+CY+    K YIS HV F E++FPF+  ++  PSST +P E  P   T+ S +S ++P      PT  +    T +  P   N S  
Subjt:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSS-PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENTSTN

Query:  VIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISK-KRVFAAVSKTLP--EEPTSFTQASEVPNWQTAMIDEYTTL
        ++   P+++TPE +LS ++ + +  P P+  P            + H M TRSK+GI K K++F A    LP   EPT  +QA +   W+ AM DE+T L
Subjt:  VIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISK-KRVFAAVSKTLP--EEPTSFTQASEVPNWQTAMIDEYTTL

Query:  INQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        +N  TW L P  PH  VIG KWV+RIKR+PDGSIARYK RLVAKG+HQ
Subjt:  INQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

TQD88914.1 hypothetical protein C1H46_025506 [Malus baccata]; e value: 1.9e-38; % identity: 42.08
Query:  NPSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFP--FALPSS----PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFE
        +P    C+FLGY   YKGY+CY +S  K  +S HVLFDES+FP  + L SS      SS S+P  L P             P+     P +  P P    
Subjt:  NPSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFP--FALPSS----PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFE

Query:  PSPSVENTSTNVIAS-SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSV--SNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPNW
          PSV  +    +    P      +  + SL   + E  P +   ++    H SV  SN HPMQTRSKSGI KK+VF+AV     +EP SF+ A+    W
Subjt:  PSPSVENTSTNVIAS-SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSV--SNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPNW

Query:  QTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        + AM +E   LI Q+TW L PLP HK ++GCKW+Y++KR+PDGS+ARYK RLVAKG+ Q
Subjt:  QTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]; e value: 3.6e-37; % identity: 42.59
Query:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSEL-LPFVSTDPSFASCNLPSPSFTTPTVQHPTP-ATFEPSPSV
        P   +C+F+GY   YKGYLC +    K Y+S HVLFDE+ FP+   SS  +S S  S +  P +   P      LPS   +  TV  P P    E SPS 
Subjt:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSEL-LPFVSTDPSFASCNLPSPSFTTPTVQHPTP-ATFEPSPSV

Query:  ENTSTNVIAS----------SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVFAA--VSKTLPEEPTSFTQASE
           ++    S          SPT+    T+L       +    P  +P +++      + + HPMQTRSKSGISKK+VF+A   S     EP +F  A +
Subjt:  ENTSTNVIAS----------SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVFAA--VSKTLPEEPTSFTQASE

Query:  VPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        +P W  AM DE T L +QNTW L PLP  K ++GCKWVYRIK +PDGS+ARYK RLVAKGY Q
Subjt:  VPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

TQE09310.1 hypothetical protein C1H46_005046 [Malus baccata]; e value: 4.6e-37; % identity: 40.7
Query:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPS----STSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP
        P   EC+FLGY   YKG++C++    K  +S HVLFDE  FP    SS  S    S S+ S + P     P       P   F+  +   PT     P  
Subjt:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPS----STSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP

Query:  SVENTSTNVIASSPTNTTPETSL--SHSLPTDIHEPSPSLEPPEINGTDHPSVS--NHHPMQTRSKSGISKKR-VFAAVSKTLPEEPTSFTQASEVPNWQ
             S  +++SSP  + P  S+   HS P    E  P +   + +     S+   N HPMQTRSKSGISKK+  F+   ++  +EP S++ A ++ +W+
Subjt:  SVENTSTNVIASSPTNTTPETSL--SHSLPTDIHEPSPSLEPPEINGTDHPSVS--NHHPMQTRSKSGISKKR-VFAAVSKTLPEEPTSFTQASEVPNWQ

Query:  TAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
         AM +E T L  QNTW L PLPP K ++GCKW+Y+IKRHPDG++ARYK RLVAKG+ Q
Subjt:  TAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

WP_081894301.1 DDE-type integrase/transposase/recombinase [Acetobacter malorum]; e value: 4.4e-35; % identity: 39.5
Query:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPF----ALPS-------SPPSSTSLPSELLPFVS--TDPSFASCNLPSPSFTTPTVQHP
        P  V+C+FLGY   YKGY+C++    +FY+S HV+F E+ FP+      PS       +PPS T L +     VS  T  S AS  L  P+  +  +  P
Subjt:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPF----ALPS-------SPPSSTSLPSELLPFVS--TDPSFASCNLPSPSFTTPTVQHP

Query:  TPAT-FEPSP---------------SVENTSTNVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVFAAVS
        T A+ F  SP                + + S ++ + SPT      S    +P D     P  +P  +         + HPMQTRSKSGI KK+ F A  
Subjt:  TPAT-FEPSP---------------SVENTSTNVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVFAAVS

Query:  KTLPE---EPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
         ++ +   EP++F  AS++  WQ+AM DE   L  Q+TW+L PLP  K ++GCKWVYR+K++PDGSIARYK RLVAKGY+Q
Subjt:  KTLPE---EPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FMC6 Integrase catalytic domain-containing protein; e value: 5.9e-38; % identity: 45.2
Query:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPS---PSFTTPTVQHPTPATFEPSPSVENTS
        CVFLGY L  KGYLC +L   K  IS HV F E+ FPF   +S PSS S PS    ++S+   F  C  PS   P  + P +   TP     S S+ + S
Subjt:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPS---PSFTTPTVQHPTPATFEPSPSVENTS

Query:  TNVIASSP-TNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVF--AAVSKTLPEEPTSFTQASEVPNWQTAMIDEYT
              SP  +TT    +S  +P+    PS  + PP           N HPMQTR KSGISK+++         L  EP S+  AS+ P WQ+AM+DEYT
Subjt:  TNVIASSP-TNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVF--AAVSKTLPEEPTSFTQASEVPNWQTAMIDEYT

Query:  TLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
         L  Q TW L P P +  ++GCKWVY+IKR PDGS+ARYK RLVAKGYHQ
Subjt:  TLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

A0A2N9FSZ9 Uncharacterized protein; e value: 1.0e-37; % identity: 41.41
Query:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLP----SELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP
        P  V+CVFLGYP   KG+LC+D    +F++S HV FDE++FPF   SS PS    P    S    ++ST   F  C++PS     P   + TP T  P  
Subjt:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLP----SELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP

Query:  SVENTSTNVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKR---VFAAVSKTLPEEPTSFTQASEVPNWQTA
         V   ST    S+P           S+PT +H P        ++ +  P   N HPMQTR+KSGISKK+   + A     L  EP SF  A  +P W  A
Subjt:  SVENTSTNVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKR---VFAAVSKTLPEEPTSFTQASEVPNWQTA

Query:  MIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        M  E+  L  Q+TW L P  P + +IGC WV+++KR+ DGS+ARYK RLVA+G HQ
Subjt:  MIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

A0A2N9GRJ0 Uncharacterized protein; e value: 5.9e-38; % identity: 45.2
Query:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPS---PSFTTPTVQHPTPATFEPSPSVENTS
        CVFLGY L  KGYLC +L   K  IS HV F E+ FPF   +S PSS S PS    ++S+   F  C  PS   P  + P +   TP     S S+ + S
Subjt:  CVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPS---PSFTTPTVQHPTPATFEPSPSVENTS

Query:  TNVIASSP-TNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVF--AAVSKTLPEEPTSFTQASEVPNWQTAMIDEYT
              SP  +TT    +S  +P+    PS  + PP           N HPMQTR KSGISK+++         L  EP S+  AS+ P WQ+AM+DEYT
Subjt:  TNVIASSP-TNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVF--AAVSKTLPEEPTSFTQASEVPNWQTAMIDEYT

Query:  TLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
         L  Q TW L P P +  ++GCKWVY+IKR PDGS+ARYK RLVAKGYHQ
Subjt:  TLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

A0A2N9HKM9 Uncharacterized protein; e value: 5.4e-39; % identity: 41.79
Query:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELL----PFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP
        P  V+C+FLGYP   KG+LC+D    +F++S HV FDES+FPF   SS PS + LP+        ++S    F SC+LPS     PT      +T  P P
Subjt:  PSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELL----PFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSP

Query:  SVENTSTNVIAS---SPTNTTP-ETSLSHSLPTDIHEPSPSLEPPEINGT-------DHPSVSNHHPMQTRSKSGISKKRVFAAVSKTLPE----EPTSF
         +  TS  V++S    P++T P  TS +  +PT    P PS     +  +         P V+N HPMQTR KSGI+KK+    ++K+ P+    EP SF
Subjt:  SVENTSTNVIAS---SPTNTTP-ETSLSHSLPTDIHEPSPSLEPPEINGT-------DHPSVSNHHPMQTRSKSGISKKRVFAAVSKTLPE----EPTSF

Query:  TQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        + A  +P W  AM  E+  L  Q+TW L P  P   +IGC WV+++KR+ DGS+ARYK RLVAKG HQ
Subjt:  TQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

A0A540LQZ0 Integrase catalytic domain-containing protein; e value: 9.1e-39; % identity: 42.08
Query:  NPSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFP--FALPSS----PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFE
        +P    C+FLGY   YKGY+CY +S  K  +S HVLFDES+FP  + L SS      SS S+P  L P             P+     P +  P P    
Subjt:  NPSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFP--FALPSS----PPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFE

Query:  PSPSVENTSTNVIAS-SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSV--SNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPNW
          PSV  +    +    P      +  + SL   + E  P +   ++    H SV  SN HPMQTRSKSGI KK+VF+AV     +EP SF+ A+    W
Subjt:  PSPSVENTSTNVIAS-SPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSV--SNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPNW

Query:  QTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        + AM +E   LI Q+TW L PLP HK ++GCKW+Y++KR+PDGS+ARYK RLVAKG+ Q
Subjt:  QTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 8.3e-05; % identity: 26.9
Query:  PTDIHEPSPSLEPPEINGTDHPS------VSNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPN-------------WQTAMIDEYTTLINQ
        P +  E   +    EI G D+P+      + N    + ++K  IS      +++K +    T F   ++VPN             W+ A+  E       
Subjt:  PTDIHEPSPSLEPPEINGTDHPS------VSNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVPN-------------WQTAMIDEYTTLINQ

Query:  NTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        NTW +T  P +K ++  +WV+ +K +  G+  RYK RLVA+G+ Q
Subjt:  NTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.0e-07; % identity: 24.9
Query:  VECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENTST
        + C+F+GY     GY  +D   KK   S  V+F ES     + ++   S  + + ++P   T PS              T  +PT               
Subjt:  VECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENTST

Query:  NVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPS--VSNHHPMQTRSKSGISKKRVFAAVSKTLPE--EPTSFTQASEVP---NWQTAMID
             S  +TT E S     P ++ E    L+   +   +HP+     H P++   +  +  +R  +     + +  EP S  +    P       AM +
Subjt:  NVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPS--VSNHHPMQTRSKSGISKKRVFAAVSKTLPE--EPTSFTQASEVP---NWQTAMID

Query:  EYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        E  +L    T++L  LP  K+ + CKWV+++K+  D  + RYK RLV KG+ Q
Subjt:  EYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

P92520 Uncharacterized mitochondrial protein AtMg00820; e value: 1.6e-16; % identity: 43.43
Query:  MQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        M TRSK+GI+K   +    ++ T+ +EP S   A + P W  AM +E   L    TW L P P ++ ++GCKWV++ K H DG++ R K RLVAKG+HQ
Subjt:  MQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 5.0e-26; % identity: 32.81
Query:  ECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFA--------------------------------LP----------SSPPSSTSLP--SELL
        +CVFLGY L    YLC  L   + YIS HV FDE+ FPF+                                LP          ++PPSS S P  +  +
Subjt:  ECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFA--------------------------------LP----------SSPPSSTSLP--SELL

Query:  PFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENT--STNVIASSPTNTTPE---TSLSHSLPTDIHEPSPS----------------LEPP--
           + D SF+S    SP  T P    P P T +P+ +   T  S N   ++PTN +P     SLS    +    PSP+                + PP  
Subjt:  PFVSTDPSFASCNLPSPSFTTPTVQHPTPATFEPSPSVENT--STNVIASSPTNTTPE---TSLSHSLPTDIHEPSPS----------------LEPP--

Query:  --EINGTDHPSVSNHHPMQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPP-HKQVIGCKWVYRIKRHPD
          +I   ++ +  N H M TR+K+GI K   +   AVS     EP +  QA +   W+ AM  E    I  +TW+L P PP H  ++GC+W++  K + D
Subjt:  --EINGTDHPSVSNHHPMQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPP-HKQVIGCKWVYRIKRHPD

Query:  GSIARYKVRLVAKGYHQ
        GS+ RYK RLVAKGY+Q
Subjt:  GSIARYKVRLVAKGYHQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.2e-16; % identity: 33.77
Query:  PFALPSSPPSSTSLPSELLPF-VSTDPSFASCNLPSP----------SFTTPTVQHPTPATFEP-SPSVENTSTNVIASSPTNTTPETSLSH-SLPTDIH
        P  L ++  SS++LPS  +    S++P+  S N P P          +  +P + +P P +  P SP+  +       SSP   TP TS+S  + P+   
Subjt:  PFALPSSPPSSTSLPSELLPF-VSTDPSFASCNLPSP----------SFTTPTVQHPTPATFEP-SPSVENTSTNVIASSPTNTTPETSLSH-SLPTDIH

Query:  EPSPSLEP----PEINGTDHPSVSNHHPMQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHK-QVIGC
          +P L P    P I   +  +  N H M TR+K GI K  ++   A S     EP +  QA +   W+ AM  E    I  +TW+L P PP    ++GC
Subjt:  EPSPSLEP----PEINGTDHPSVSNHHPMQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHK-QVIGC

Query:  KWVYRIKRHPDGSIARYKVRLVAKGYHQ
        +W++  K + DGS+ RYK RLVAKGY+Q
Subjt:  KWVYRIKRHPDGSIARYKVRLVAKGYHQ

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 5.7e-17; % identity: 52.7
Query:  EEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        +EP+++ +A E   W  AM DE   +   +TWE+  LPP+K+ IGCKWVY+IK + DG+I RYK RLVAKGY Q
Subjt:  EEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value: 1.1e-17; % identity: 43.43
Query:  MQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
        M TRSK+GI+K   +    ++ T+ +EP S   A + P W  AM +E   L    TW L P P ++ ++GCKWV++ K H DG++ R K RLVAKG+HQ
Subjt:  MQTRSKSGISK--KRVFAAVSKTLPEEPTSFTQASEVPNWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ


Sequences
CDS sequence
ATGATCGGAAAAGAAAACGAGAGAATATCAGCCGTTAACAGCGGAGAGTCGTTGGAGGAAGCCTCCGAAAGTGATACCACAGCTCCGAATCAGAATCCAAGCTGGGTTGA
GTGTGTGTTTCTTGGTTATCCTCTAGGGTATAAGGGCTACTTGTGTTATGACTTGTCCATCAAGAAATTCTACATCTCTTGCCATGTTTTGTTTGATGAAAGCCTTTTTC
CCTTTGCTTTACCATCCTCACCACCCTCATCCACATCCTTGCCATCTGAACTTCTTCCCTTTGTTTCCACTGATCCGTCTTTTGCCTCATGTAACTTGCCTAGTCCTTCA
TTTACCACTCCTACTGTTCAGCATCCAACCCCTGCCACTTTTGAACCTTCTCCTTCTGTTGAAAATACTTCCACCAATGTTATTGCATCATCACCTACTAATACTACCCC
TGAAACTTCTCTTTCCCATTCCTTGCCCACTGATATTCATGAACCTTCTCCCTCTCTTGAACCACCTGAAATTAATGGGACTGACCATCCCTCTGTGAGCAATCATCATC
CAATGCAGACACGCTCCAAATCAGGCATTTCCAAGAAAAGAGTTTTTGCTGCAGTTTCAAAAACTTTACCTGAGGAACCTACCTCCTTCACACAAGCATCCGAGGTTCCA
AACTGGCAAACTGCTATGATTGATGAATACACAACCTTGATCAATCAAAATACTTGGGAACTCACTCCTCTACCTCCTCATAAACAAGTCATTGGCTGCAAATGGGTATA
TAGGATCAAAAGGCATCCTGATGGTTCAATTGCTCGATATAAAGTCAGGCTCGTTGCTAAAGGATATCATCAATAA
mRNA sequence
ATGATCGGAAAAGAAAACGAGAGAATATCAGCCGTTAACAGCGGAGAGTCGTTGGAGGAAGCCTCCGAAAGTGATACCACAGCTCCGAATCAGAATCCAAGCTGGGTTGA
GTGTGTGTTTCTTGGTTATCCTCTAGGGTATAAGGGCTACTTGTGTTATGACTTGTCCATCAAGAAATTCTACATCTCTTGCCATGTTTTGTTTGATGAAAGCCTTTTTC
CCTTTGCTTTACCATCCTCACCACCCTCATCCACATCCTTGCCATCTGAACTTCTTCCCTTTGTTTCCACTGATCCGTCTTTTGCCTCATGTAACTTGCCTAGTCCTTCA
TTTACCACTCCTACTGTTCAGCATCCAACCCCTGCCACTTTTGAACCTTCTCCTTCTGTTGAAAATACTTCCACCAATGTTATTGCATCATCACCTACTAATACTACCCC
TGAAACTTCTCTTTCCCATTCCTTGCCCACTGATATTCATGAACCTTCTCCCTCTCTTGAACCACCTGAAATTAATGGGACTGACCATCCCTCTGTGAGCAATCATCATC
CAATGCAGACACGCTCCAAATCAGGCATTTCCAAGAAAAGAGTTTTTGCTGCAGTTTCAAAAACTTTACCTGAGGAACCTACCTCCTTCACACAAGCATCCGAGGTTCCA
AACTGGCAAACTGCTATGATTGATGAATACACAACCTTGATCAATCAAAATACTTGGGAACTCACTCCTCTACCTCCTCATAAACAAGTCATTGGCTGCAAATGGGTATA
TAGGATCAAAAGGCATCCTGATGGTTCAATTGCTCGATATAAAGTCAGGCTCGTTGCTAAAGGATATCATCAATAA
Protein sequence
MIGKENERISAVNSGESLEEASESDTTAPNQNPSWVECVFLGYPLGYKGYLCYDLSIKKFYISCHVLFDESLFPFALPSSPPSSTSLPSELLPFVSTDPSFASCNLPSPS
FTTPTVQHPTPATFEPSPSVENTSTNVIASSPTNTTPETSLSHSLPTDIHEPSPSLEPPEINGTDHPSVSNHHPMQTRSKSGISKKRVFAAVSKTLPEEPTSFTQASEVP
NWQTAMIDEYTTLINQNTWELTPLPPHKQVIGCKWVYRIKRHPDGSIARYKVRLVAKGYHQ
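
The CDS and protein sequences in this record can be cross-checked by translating the CDS. A minimal sketch follows. It assumes the standard nuclear genetic code, which this page does not state explicitly, and `translate` is a hypothetical helper rather than anything CuGenDB provides. The two short inputs are the first 18 and the last 9 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("ATGATCGGAAAAGAAAAC"))  # start of the CDS
print(translate("CATCAATAA"))           # end of the CDS, including the stop codon
```

Pasting the full CDS into `translate` and stripping the trailing `*` gives a string that can be compared directly with the protein sequence listed here.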