CuGenDBv2

Lag0022947 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022947
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr7:41453837..41455418
RNA-Seq Expression: Lag0022947
Synteny: Lag0022947
Gene Ontology terms: NA
InterPro domains: NA
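
The genome location above is given as `chromosome:start..end`. A minimal sketch of how such a string could be split and the genomic span computed, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB:

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:41453837..41455418")
print(chrom, span_length(start, end))  # chr7 1582
```

Under that assumption the gene spans 1582 bp on chr7.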


Homology
GenBank top hits (e-value, % identity, alignment)
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | e-value: 2.5e-47 | identity: 33.05%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y  W  QD+A++TL+NATLS+ A S VIG  +S+E W ALE+RFS+ T S+I ++KS+LH I KG  +SID Y+ +I++  D L++VSV ++DED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI---GQQNKATAAPG-----------------------GRGSNSPPIQNQGNQ----GGR
        +Y LNGL  EYN+F+TSI T+S+ +TL+E++++LK E + I    +QN +   PG                       GRG       N+G +    G  
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI---GQQNKATAAPG-----------------------GRGSNSPPIQNQGNQ----GGR

Query:  GPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTP--TTTTWLADSGCNTHVTPE--
           NFGQ+N     +     N  +     N    VVCQIC K  H ALDC++ ++ SYQG+ P+ +L AM+  Y+T    +   W  D+G   H+T +  
Subjt:  GPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTP--TTTTWLADSGCNTHVTPE--

Query:  ---FKLQRGGCHHI----GKWTGTSHYSGWIWYY-----FTISGLLTTIKL--------------------------IMYKDTRQILYKGKSKDELYPII
           F ++  G  +I    G+    SH SG    +     F ++ +L    +                          I  K T+Q+L++G S   LYP+ 
Subjt:  ---FKLQRGGCHHI----GKWTGTSHYSGWIWYY-----FTISGLLTTIKL--------------------------IMYKDTRQILYKGKSKDELYPII

Query:  VDGVPKPRQVSGLPS------RPVCA---------------SAVVSKVSHSDLWHFRLGHPSTFILNKV
           + K    S  P          CA               +A + K   + LWH RLGHPST  L  +
Subjt:  VDGVPKPRQVSGLPS------RPVCA---------------SAVVSKVSHSDLWHFRLGHPSTFILNKV

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] | e-value: 3.6e-46 | identity: 41.05%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y  WIA+D+A++T+INATLS  A ++V+G  SS++VW  L K +SS + S++  +KS L TI K   ESID Y+ RI+E+ DKL+ VS  +++ED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKA----------TAAPGGRGSNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNG
        +Y LNGL  EYN+FRTS+ TRS PVT +ELH LL++E   + +Q+K            ++     S +P   N   +G    +N+G         G F+ 
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKA----------TAAPGGRGSNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNG

Query:  NSSTTSIGGNQGGRVV------CQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT---TPTTTTWLADSGCNTHVTPE
        ++ T   G +Q  + V      CQIC++  H ALDCFN +N ++QGRHP  +LAAM  + +    +   ++ L DSGCNTH+T +
Subjt:  NSSTTSIGGNQGGRVV------CQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT---TPTTTTWLADSGCNTHVTPE

KAF8406331.1 hypothetical protein HHK36_008417 [Tetracentron sinense] | e-value: 7.6e-49 | identity: 37.47%
Query:  IFNPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDED
        + + ++  W+ QD+AI+T+INATLS  A + +IG ++S+EVW ALE+RFSS++ S+I ++K++L T++KG S+++D Y+ +I+E  D L+T +V+ DDED
Subjt:  IFNPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDED

Query:  VLLYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNK----------ATAAP----GGRGSNSPPIQN----QGNQGGRGPQNFGQTNQ
        +L+ TLNGL  EYN+F TSI TRS P++L++LH LLK+E + I   NK           TAA     G RG  +   +     +G +GGR  Q    +N 
Subjt:  VLLYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNK----------ATAAP----GGRGSNSPPIQN----QGNQGGRGPQNFGQTNQ

Query:  STQNRGNFN--------GNSST---TSIGGN-QGG--RVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT-----TPTTTTWLADSGCNT
         +   GNF+         NSST   +S G N QGG  R+ CQIC K  H A+DC+N +N ++QGRHP ++LAAM  ++       TP    WL DS    
Subjt:  STQNRGNFN--------GNSST---TSIGGN-QGG--RVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT-----TPTTTTWLADSGCNT

Query:  HVTPE-----FKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLTTIKLIMYKDTRQILYKGKSKDELYPIIVDGVPKPRQVSGLPS
        H+T +     F     G   +    G    SG                        +ILY+GKS++ LYP+     P    V   PS
Subjt:  HVTPE-----FKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLTTIKLIMYKDTRQILYKGKSKDELYPIIVDGVPKPRQVSGLPS

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] | e-value: 3.6e-46 | identity: 41.05%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y  WIA+D+A++T+INATLS  A ++V+G  SS++VW  L K +SS + S++  +KS L TI K   ESID Y+ RI+E+ DKL+ VS  +++ED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKA----------TAAPGGRGSNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNG
        +Y LNGL  EYN+FRTS+ TRS PVT +ELH LL++E   + +Q+K            ++     S +P   N   +G    +N+G         G F+ 
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKA----------TAAPGGRGSNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNG

Query:  NSSTTSIGGNQGGRVV------CQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT---TPTTTTWLADSGCNTHVTPE
        ++ T   G +Q  + V      CQIC++  H ALDCFN +N ++QGRHP  +LAAM  + +    +   ++ L DSGCNTH+T +
Subjt:  NSSTTSIGGNQGGRVV------CQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDT---TPTTTTWLADSGCNTHVTPE

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | e-value: 4.2e-47 | identity: 40.82%
Query:  SSSSTTEALSGGSLQSNTSIFLLSNIFNPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESI
        +SSS TE+      Q  T+  L   + NP +  WIA+D+A++TLINATLS  A ++V+   +S++VW  LEK +SS + +++  +KS L +I+K + ESI
Subjt:  SSSSTTEALSGGSLQSNTSIFLLSNIFNPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESI

Query:  DDYLIRIRELVDKLSTVSVKVDDEDVLLYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNK---ATAAPGGRGSNSPPIQN-------
        D Y+ RI+E+ DK + VS+ ++DE +L+Y LNGLS EYN+  TS+ TR+  V+ +ELH  +KSE   I +Q K       P    ++SP  QN       
Subjt:  DDYLIRIRELVDKLSTVSVKVDDEDVLLYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNK---ATAAPGGRGSNSPPIQN-------

Query:  -------QGNQGGRGPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTT------PTT
               +G   GRG  NF  T  + Q RG  +GN  T+    N   R  CQIC K  H ALDC+N +N  +QGRHP  +LAAM    + +       + 
Subjt:  -------QGNQGGRGPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTT------PTT

Query:  TTWLADSGCNTHVTPE
        TTWLADS CNTH+T +
Subjt:  TTWLADSGCNTHVTPE

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9F3A0 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 3.1e-48 | identity: 36.19%
Query:  PAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVLL
        P Y QWIA+D+A+++LI+ATLS  A S +IG  S+  +W+ L KR++S++ S+I  +K  LH  +K +++S+  YL +I+E  DKL+ V   VDDED+L 
Subjt:  PAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVLL

Query:  YTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI------GQQNKATAAPGGRG---SNSPPIQ--NQGNQGGRGPQNFGQTNQSTQNRGN---
          L GL +EY SF +++ T+S+ V  +ELH L+ ++ + +       ++N   A    RG   +N+P  Q  N   Q   G QN G+   + + RGN   
Subjt:  YTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI------GQQNKATAAPGGRG---SNSPPIQ--NQGNQGGRGPQNFGQTNQSTQNRGN---

Query:  ---------------FNGNSSTTSIGGNQG------GRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTPTTTTWLADSGCNTHVTPE
                       F+ N+S  S   N         R  CQIC KP H A+DC+N +N SYQGRHP +KLAAMA A   +P+   W++D+G   H TP+
Subjt:  ---------------FNGNSSTTSIGGNQG------GRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTPTTTTWLADSGCNTHVTPE

Query:  FKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLTTIKLIMYKDTRQ--ILYKGKSKDELYPIIVDGVPKP-RQVSGL----PSRPVCASAVVSKVSHSD
                                      S L    +L+   + +Q  I + G S+D LYP  V G+  P R  S L     S P C  +V SKVS +D
Subjt:  FKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLTTIKLIMYKDTRQ--ILYKGKSKDELYPIIVDGVPKP-RQVSGL----PSRPVCASAVVSKVSHSD

Query:  LWHFRLGHP
        LWH RLGHP
Subjt:  LWHFRLGHP

A0A2N9J112 gag_pre-integrs domain-containing protein | e-value: 3.1e-48 | identity: 34.18%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y+QW A+D+A+ +LI++TLS  A S V+G  S+  +W  L  R++SI+ S+I  +K  L++ IK +S+S+ DYL +I+E+ +KL +V + +DDE++L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSE----------------------------------AKFIGQQNKATAAPGGRGSNSPPIQNQGN
           L GL +E++SF +++ T+++ V  +ELH+L+K+E                                   +F G + +      GRGSN    QN  N
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSE----------------------------------AKFIGQQNKATAAPGGRGSNSPPIQNQGN

Query:  -QGGRGPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTT--PTTTTWLADSGCNTHV
         +GG  P NF        N  NF  NS       N   R  CQIC K  H A+DC+  +N SYQGRHP +KLAAMA A   +  P  TTW++D+G   H 
Subjt:  -QGGRGPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTT--PTTTTWLADSGCNTHV

Query:  TPEFKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLT----TIKLIMYKD--TRQILYKGKSKDELYPIIVDGVPK--------------PRQVSGLPS
        TP+                           +T S L++    T   I + D  T + LYKG SKD LYPI    +P               P Q      
Subjt:  TPEFKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLLT----TIKLIMYKD--TRQILYKGKSKDELYPIIVDGVPK--------------PRQVSGLPS

Query:  RPVCASAVVSKVSHSDLWHFRLGHPSTFILNKV
            AS   S +S +DLWH RLGHP   +L+ V
Subjt:  RPVCASAVVSKVSHSDLWHFRLGHPSTFILNKV

A0A5B7BD59 Retrotran_gag_3 domain-containing protein | e-value: 3.6e-52 | identity: 42.37%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP+Y+ WI QD+A++ +INATL+  A S VIG K+S++VW ALE+ FSS + S+I ++K++L ++ KG  ++ID Y+ +++   D L+ VSV +DDED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKS-----EAKFIGQQNKATAAPGGRGSNSPPI-QNQGNQGGRG------------------PQNFG
        +YTLNGL +E+N+FRT+I TR+ P+TL+E+H LL++     EA F   QN+ T +     S+  P   N+GN  GRG                  PQ F 
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKS-----EAKFIGQQNKATAAPGGRGSNSPPI-QNQGNQGGRG------------------PQNFG

Query:  QTNQST---QNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTPTT-TTWLADSGCNTHVTPEF
          N S    + +  F  ++ +++   NQ  ++ CQIC KP H ALDC++ +N +YQGRHP S+LAAMA +Y ++ T    WL DSG   HVT +F
Subjt:  QTNQST---QNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTPTT-TTWLADSGCNTHVTPEF

A0A5B7C9B1 Retrotran_gag_3 domain-containing protein | e-value: 1.2e-52 | identity: 42.72%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y +WI  D+A++TLINATLS  A ++VIG  +S+EVW ALE+RFSS +  +I ++KS+LHT+ KG ++S+D Y+ RI++  D L+ VSV ++DED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQN--KATAAP------GGRGSNSPPIQNQGNQG---------GRGPQNFGQTNQSTQ
        ++TLNGL  +YN+F+TSIHTRS P+TL+ELH+LLK+E + +   +  K+ + P          + +P   N+G  G         GR  QNFG++ Q  Q
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQN--KATAAP------GGRGSNSPPIQNQGNQG---------GRGPQNFGQTNQSTQ

Query:  NRG--------NFNGNSSTTSIGGNQG---------GRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYD-TTPTTTTWLADSGCNTHVTP
        N G        +F+GN S    G +QG          ++ CQIC KP H ALDC++ ++ SYQGRHP  +L AMA  ++  +     WL D+G   H+T 
Subjt:  NRG--------NFNGNSSTTSIGGNQG---------GRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYD-TTPTTTTWLADSGCNTHVTP

Query:  EF
        +F
Subjt:  EF

A0A5J5A1U7 Integrase catalytic domain-containing protein | e-value: 1.2e-47 | identity: 33.05%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y  W  QD+A++TL+NATLS+ A S VIG  +S+E W ALE+RFS+ T S+I ++KS+LH I KG  +SID Y+ +I++  D L++VSV ++DED+L
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI---GQQNKATAAPG-----------------------GRGSNSPPIQNQGNQ----GGR
        +Y LNGL  EYN+F+TSI T+S+ +TL+E++++LK E + I    +QN +   PG                       GRG       N+G +    G  
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFI---GQQNKATAAPG-----------------------GRGSNSPPIQNQGNQ----GGR

Query:  GPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTP--TTTTWLADSGCNTHVTPE--
           NFGQ+N     +     N  +     N    VVCQIC K  H ALDC++ ++ SYQG+ P+ +L AM+  Y+T    +   W  D+G   H+T +  
Subjt:  GPQNFGQTNQSTQNRGNFNGNSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTP--TTTTWLADSGCNTHVTPE--

Query:  ---FKLQRGGCHHI----GKWTGTSHYSGWIWYY-----FTISGLLTTIKL--------------------------IMYKDTRQILYKGKSKDELYPII
           F ++  G  +I    G+    SH SG    +     F ++ +L    +                          I  K T+Q+L++G S   LYP+ 
Subjt:  ---FKLQRGGCHHI----GKWTGTSHYSGWIWYY-----FTISGLLTTIKL--------------------------IMYKDTRQILYKGKSKDELYPII

Query:  VDGVPKPRQVSGLPS------RPVCA---------------SAVVSKVSHSDLWHFRLGHPSTFILNKV
           + K    S  P          CA               +A + K   + LWH RLGHPST  L  +
Subjt:  VDGVPKPRQVSGLPS------RPVCA---------------SAVVSKVSHSDLWHFRLGHPSTFILNKV

SwissProt top hits (e-value, % identity, alignment)
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 1.2e-09 | identity: 24.23%
Query:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL
        NP Y++W  QD+ I + I   +S      V    ++ ++W  L K +++ ++ H+ +++                ++ R     D+L+ +   +D ++ +
Subjt:  NPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVL

Query:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHS-LLKSEAKFIGQQNKATAAPGGRG----SNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNGNSSTT
           L  L  +Y      I  +  P +L E+H  L+  E+K +   N A   P         N+   +NQ N+G    +N+   N    NR N    SS+ 
Subjt:  LYTLNGLSAEYNSFRTSIHTRSDPVTLDELHS-LLKSEAKFIGQQNKATAAPGGRG----SNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNGNSSTT

Query:  SIGGNQGGRVV---CQICTKPRHGALDC-----FNCLNLSYQGRHPTSKLAAMA-LAYDTTPTTTTWLADSGCNTHVTPEFKLQRGGCHHIGKWTGTSHY
        S   N+  +     CQIC+   H A  C     F       Q   P +     A LA ++      WL DSG   H+T +F        + G        
Subjt:  SIGGNQGGRVV---CQICTKPRHGALDC-----FNCLNLSYQGRHPTSKLAAMA-LAYDTTPTTTTWLADSGCNTHVTPEFKLQRGGCHHIGKWTGTSHY

Query:  SGWIWYYFTISGLLTTIKLIMYKDTRQILYKGKSKDELYPII---------VDGVPKPRQV----SGLP-----------SRPVCASAVV-------SKV
           I    T S  L T    +  D  ++LY       L  +          V+  P   QV    +G+P             P+ +S  V       SK 
Subjt:  SGWIWYYFTISGLLTTIKLIMYKDTRQILYKGKSKDELYPII---------VDGVPKPRQV----SGLP-----------SRPVCASAVV-------SKV

Query:  SHSDLWHFRLGHPSTFILNKV
        +HS  WH RLGHPS  ILN V
Subjt:  SHSDLWHFRLGHPSTFILNKV

Arabidopsis top hits (e-value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e-value: 1.6e-04 | identity: 24.48%
Query:  WIAQDRAIITLINATLSKVAF--SFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVLLYTL
        W  +D  +   +  TL+   F  SFV    +S+++W  ++ +F +   +    + S L T   G    + DY  ++++L D L  V V V D ++++Y L
Subjt:  WIAQDRAIITLINATLSKVAF--SFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRIRELVDKLSTVSVKVDDEDVLLYTL

Query:  NGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKATAAPGGRGSNS--------PPIQNQGNQGGR--GPQNFGQTNQSTQNRG
        NGL+ ++++    I  R    + D+  ++L+ E   + +  K         S+S        PP+ N    GG   G +  G+ N   + RG
Subjt:  NGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKATAAPGGRGSNS--------PPIQNQGNQGGR--GPQNFGQTNQSTQNRG


Sequences
CDS sequence
ATGGCTTCTTCTTCCTCGAGCACCACTGAAGCCCTATCTGGCGGATCACTTCAGTCCAATACCTCAATCTTCCTCTTGTCAAACATTTTCAATCCCGCTTACTCACAATG
GATAGCTCAGGATCGCGCCATCATCACCCTAATCAATGCTACGTTGTCTAAGGTGGCCTTCTCCTTTGTAATTGGTTGCAAATCTTCTCAAGAGGTATGGACTGCTTTAG
AAAAACGTTTTTCATCTATTACTTGGTCTCATATTCATGAAATGAAATCATCCCTACATACTATTATAAAAGGCTCAAGTGAGTCGATTGATGATTACTTAATTCGCATA
AGGGAATTAGTCGATAAACTCTCTACCGTCTCTGTGAAAGTTGACGATGAAGATGTCTTACTTTACACACTTAATGGCCTATCAGCTGAGTATAACTCGTTTCGTACCTC
TATTCATACAAGAAGTGACCCTGTAACTTTAGATGAACTTCATTCCTTGTTGAAATCAGAAGCTAAGTTCATTGGGCAACAAAACAAAGCCACTGCTGCTCCTGGCGGCC
GTGGATCAAATTCTCCGCCGATTCAGAATCAAGGAAATCAGGGAGGTCGTGGACCTCAAAATTTTGGCCAAACTAATCAGTCTACTCAGAATCGAGGGAATTTCAATGGA
AATTCCTCTACTACCTCAATTGGTGGAAATCAAGGAGGAAGAGTTGTTTGCCAAATTTGCACTAAGCCAAGACATGGTGCACTAGATTGCTTTAATTGCCTCAACCTGTC
TTACCAAGGGCGCCACCCTACTTCAAAGTTAGCTGCAATGGCACTCGCCTATGACACTACACCTACCACAACAACTTGGCTTGCTGACAGTGGCTGCAACACACATGTTA
CCCCTGAATTCAAGCTACAACGGGGAGGATGCCATCACATTGGCAAATGGACAGGGACTTCCCATTACTCAGGCTGGATTTGGTACTATTTCACAATCTCGGGGCTCCTT
ACAACTATCAAACTTATTATGTACAAGGATACGAGGCAAATACTCTACAAGGGCAAGAGTAAGGACGAATTATATCCCATTATTGTTGATGGTGTTCCAAAGCCAAGACA
AGTTTCTGGTTTGCCTTCTAGACCCGTATGTGCTTCTGCTGTTGTTTCTAAAGTGTCTCATAGTGATTTGTGGCATTTTCGGTTGGGTCACCCTTCTACTTTTATTTTGA
ATAAAGTGTAG
mRNA sequence
ATGGCTTCTTCTTCCTCGAGCACCACTGAAGCCCTATCTGGCGGATCACTTCAGTCCAATACCTCAATCTTCCTCTTGTCAAACATTTTCAATCCCGCTTACTCACAATG
GATAGCTCAGGATCGCGCCATCATCACCCTAATCAATGCTACGTTGTCTAAGGTGGCCTTCTCCTTTGTAATTGGTTGCAAATCTTCTCAAGAGGTATGGACTGCTTTAG
AAAAACGTTTTTCATCTATTACTTGGTCTCATATTCATGAAATGAAATCATCCCTACATACTATTATAAAAGGCTCAAGTGAGTCGATTGATGATTACTTAATTCGCATA
AGGGAATTAGTCGATAAACTCTCTACCGTCTCTGTGAAAGTTGACGATGAAGATGTCTTACTTTACACACTTAATGGCCTATCAGCTGAGTATAACTCGTTTCGTACCTC
TATTCATACAAGAAGTGACCCTGTAACTTTAGATGAACTTCATTCCTTGTTGAAATCAGAAGCTAAGTTCATTGGGCAACAAAACAAAGCCACTGCTGCTCCTGGCGGCC
GTGGATCAAATTCTCCGCCGATTCAGAATCAAGGAAATCAGGGAGGTCGTGGACCTCAAAATTTTGGCCAAACTAATCAGTCTACTCAGAATCGAGGGAATTTCAATGGA
AATTCCTCTACTACCTCAATTGGTGGAAATCAAGGAGGAAGAGTTGTTTGCCAAATTTGCACTAAGCCAAGACATGGTGCACTAGATTGCTTTAATTGCCTCAACCTGTC
TTACCAAGGGCGCCACCCTACTTCAAAGTTAGCTGCAATGGCACTCGCCTATGACACTACACCTACCACAACAACTTGGCTTGCTGACAGTGGCTGCAACACACATGTTA
CCCCTGAATTCAAGCTACAACGGGGAGGATGCCATCACATTGGCAAATGGACAGGGACTTCCCATTACTCAGGCTGGATTTGGTACTATTTCACAATCTCGGGGCTCCTT
ACAACTATCAAACTTATTATGTACAAGGATACGAGGCAAATACTCTACAAGGGCAAGAGTAAGGACGAATTATATCCCATTATTGTTGATGGTGTTCCAAAGCCAAGACA
AGTTTCTGGTTTGCCTTCTAGACCCGTATGTGCTTCTGCTGTTGTTTCTAAAGTGTCTCATAGTGATTTGTGGCATTTTCGGTTGGGTCACCCTTCTACTTTTATTTTGA
ATAAAGTGTAG
Protein sequence
MASSSSSTTEALSGGSLQSNTSIFLLSNIFNPAYSQWIAQDRAIITLINATLSKVAFSFVIGCKSSQEVWTALEKRFSSITWSHIHEMKSSLHTIIKGSSESIDDYLIRI
RELVDKLSTVSVKVDDEDVLLYTLNGLSAEYNSFRTSIHTRSDPVTLDELHSLLKSEAKFIGQQNKATAAPGGRGSNSPPIQNQGNQGGRGPQNFGQTNQSTQNRGNFNG
NSSTTSIGGNQGGRVVCQICTKPRHGALDCFNCLNLSYQGRHPTSKLAAMALAYDTTPTTTTWLADSGCNTHVTPEFKLQRGGCHHIGKWTGTSHYSGWIWYYFTISGLL
TTIKLIMYKDTRQILYKGKSKDELYPIIVDGVPKPRQVSGLPSRPVCASAVVSKVSHSDLWHFRLGHPSTFILNKV
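
As a consistency check, the CDS listed above can be translated and compared with the protein sequence. The following is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tooling:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight and last four codons of the CDS above.
print(translate("ATGGCTTCTTCTTCCTCGAGCACC"))  # MASSSSST
print(translate("AATAAAGTGTAG"))              # NKV
```

The two fragments reproduce the start (`MASSSSST`) and end (`NKV`) of the protein sequence listed above; passing the full CDS to `translate` would allow the same comparison over the whole sequence.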