CuGenDBv2

Lag0021948 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021948
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:14396564..14397670
RNA-Seq Expression: Lag0021948
Synteny: Lag0021948
Gene Ontology terms: NA
InterPro domains: NA
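
As a minimal sketch, the genome location string above can be split into chromosome, start, and end, and the length of the gene span derived from it. This assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr7:14396564..14397670")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole gene model on chr7, so it need not equal the CDS length if the model contains non-coding sequence.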


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 2.3e-59 | % identity: 37.5
Query:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA
        M S S        E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ ++E PSKYL S +SS       PNPAY  W RQD LI +
Subjt:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV
        WLLGSMS  +L++ML C++ +E+W+ LQ  FSSR +A+ M  ++KL ++KKG++ L++YFL++   VD L +  + +S +DH+LYIL GLG+ Y S ++V
Subjt:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV

Query:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ
        I+ + ++P +Q+ +S                                              + S N R   G G +NR    N NKPQCQ+C + G++  
Subjt:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ

Query:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA
        RC++R+          SN+   SP     S  + N    M+A V   DLN D++WYPDSGA+NH+T    N+ IG+
Subjt:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 2.3e-59 | % identity: 37.5
Query:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA
        M S S        E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ ++E PSKYL S +SS       PNPAY  W RQD LI +
Subjt:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV
        WLLGSMS  +L++ML C++ +E+W+ LQ  FSSR +A+ M  ++KL ++KKG++ L++YFL++   VD L +  + +S +DH+LYIL GLG+ Y S ++V
Subjt:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV

Query:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ
        I+ + ++P +Q+ +S                                              + S N R   G G +NR    N NKPQCQ+C + G++  
Subjt:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ

Query:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA
        RC++R+          SN+   SP     S  + N    M+A V   DLN D++WYPDSGA+NH+T    N+ IG+
Subjt:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] | e value: 5.1e-51 | % identity: 42.91
Query:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG
        ++QD LI +WL  SM   +L EM+ C T REVW+IL+N ++SRN+AR+M L+SKLE++KKGNL L+DYF +V+ LVD L AAG+K++ EDH+++IL GL 
Subjt:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG

Query:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF
        ++++STV+VI+ + +T  LQ+  S                              +SN             N+R+ N    N RR+WN+NN+PQCQ+  +F
Subjt:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF

Query:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN
        GHT  RCY RFE+ F GPNG S+ Q    G    S  SN     NQQ A            M AF+ Q D N+D +WYPDSGA+NHVT+
Subjt:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] | e value: 5.1e-51 | % identity: 42.91
Query:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG
        ++QD LI +WL  SM   +L EM+ C T REVW+IL+N ++SRN+AR+M L+SKLE++KKGNL L+DYF +V+ LVD L AAG+K++ EDH+++IL GL 
Subjt:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG

Query:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF
        ++++STV+VI+ + +T  LQ+  S                              +SN             N+R+ N    N RR+WN+NN+PQCQ+  +F
Subjt:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF

Query:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN
        GHT  RCY RFE+ F GPNG S+ Q    G    S  SN     NQQ A            M AF+ Q D N+D +WYPDSGA+NHVT+
Subjt:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 1.6e-68 | % identity: 40.85
Query:  NSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYL-----TSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNS
        NS+     Q+ K INPG+K++ ++L++DN LLWK QI T L+G+GL+ ++D + + P++++      S+ SS   NPAY  W++QD LI AWLLGSM+  
Subjt:  NSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYL-----TSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNS

Query:  LLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPP
        +LS+MLDC++ RE+W +L+  F+SR +AR+M L+ KLE+ KKGNL L+DYFL+++NLVD L  AG+K+S EDH+++IL GLG ++D+ ++VIT ++    
Subjt:  LLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPP

Query:  LQKTVS----SDSNNHRN--------------------------------------SNGKGNN----NRRSWNNNNKPQCQLCRRFGHTVQRCYYRFERW
        LQ+  S     +  N RN                                        G+G N    NRR+W  NNKPQCQ+C RFGHT  RCY RFER 
Subjt:  LQKTVS----SDSNNHRN--------------------------------------SNGKGNN----NRRSWNNNNKPQCQLCRRFGHTVQRCYYRFERW

Query:  FQGPNGN------------------SNNQQVSPGLGQQSSGSNN-QQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN
        F GPN N                  S+N   SP     +   N+   + M A +V  D N+D++WY DSG +NHVTN
Subjt:  FQGPNGN------------------SNNQQVSPGLGQQSSGSNN-QQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.1e-59 | % identity: 37.5
Query:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA
        M S S        E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ ++E PSKYL S +SS       PNPAY  W RQD LI +
Subjt:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV
        WLLGSMS  +L++ML C++ +E+W+ LQ  FSSR +A+ M  ++KL ++KKG++ L++YFL++   VD L +  + +S +DH+LYIL GLG+ Y S ++V
Subjt:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV

Query:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ
        I+ + ++P +Q+ +S                                              + S N R   G G +NR    N NKPQCQ+C + G++  
Subjt:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ

Query:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA
        RC++R+          SN+   SP     S  + N    M+A V   DLN D++WYPDSGA+NH+T    N+ IG+
Subjt:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.1e-59 | % identity: 37.5
Query:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA
        M S S        E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ ++E PSKYL S +SS       PNPAY  W RQD LI +
Subjt:  MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSK-----IPNPAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV
        WLLGSMS  +L++ML C++ +E+W+ LQ  FSSR +A+ M  ++KL ++KKG++ L++YFL++   VD L +  + +S +DH+LYIL GLG+ Y S ++V
Subjt:  WLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNV

Query:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ
        I+ + ++P +Q+ +S                                              + S N R   G G +NR    N NKPQCQ+C + G++  
Subjt:  ITEKDETPPLQKTVS----------------------------------------------SDSNNHRNSNGKGNNNRRSWNNNNKPQCQLCRRFGHTVQ

Query:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA
        RC++R+          SN+   SP     S  + N    M+A V   DLN D++WYPDSGA+NH+T    N+ IG+
Subjt:  RCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVT----NVGIGA

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 | e value: 2.5e-51 | % identity: 42.91
Query:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG
        ++QD LI +WL  SM   +L EM+ C T REVW+IL+N ++SRN+AR+M L+SKLE++KKGNL L+DYF +V+ LVD L AAG+K++ EDH+++IL GL 
Subjt:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG

Query:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF
        ++++STV+VI+ + +T  LQ+  S                              +SN             N+R+ N    N RR+WN+NN+PQCQ+  +F
Subjt:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF

Query:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN
        GHT  RCY RFE+ F GPNG S+ Q    G    S  SN     NQQ A            M AF+ Q D N+D +WYPDSGA+NHVT+
Subjt:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 | e value: 2.5e-51 | % identity: 42.91
Query:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG
        ++QD LI +WL  SM   +L EM+ C T REVW+IL+N ++SRN+AR+M L+SKLE++KKGNL L+DYF +V+ LVD L AAG+K++ EDH+++IL GL 
Subjt:  VRQDSLIIAWLLGSMSNSLLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLG

Query:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF
        ++++STV+VI+ + +T  LQ+  S                              +SN             N+R+ N    N RR+WN+NN+PQCQ+  +F
Subjt:  TKYDSTVNVITEKDETPPLQKTVS-----------------------------SDSN-------------NHRNSNGKGNNNRRSWNNNNKPQCQLCRRF

Query:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN
        GHT  RCY RFE+ F GPNG S+ Q    G    S  SN     NQQ A            M AF+ Q D N+D +WYPDSGA+NHVT+
Subjt:  GHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSN-----NQQAA------------MNAFVVQNDLNKDNHWYPDSGASNHVTN

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 7.6e-69 | % identity: 40.85
Query:  NSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYL-----TSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNS
        NS+     Q+ K INPG+K++ ++L++DN LLWK QI T L+G+GL+ ++D + + P++++      S+ SS   NPAY  W++QD LI AWLLGSM+  
Subjt:  NSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYL-----TSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNS

Query:  LLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPP
        +LS+MLDC++ RE+W +L+  F+SR +AR+M L+ KLE+ KKGNL L+DYFL+++NLVD L  AG+K+S EDH+++IL GLG ++D+ ++VIT ++    
Subjt:  LLSEMLDCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPP

Query:  LQKTVS----SDSNNHRN--------------------------------------SNGKGNN----NRRSWNNNNKPQCQLCRRFGHTVQRCYYRFERW
        LQ+  S     +  N RN                                        G+G N    NRR+W  NNKPQCQ+C RFGHT  RCY RFER 
Subjt:  LQKTVS----SDSNNHRN--------------------------------------SNGKGNN----NRRSWNNNNKPQCQLCRRFGHTVQRCYYRFERW

Query:  FQGPNGN------------------SNNQQVSPGLGQQSSGSNN-QQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN
        F GPN N                  S+N   SP     +   N+   + M A +V  D N+D++WY DSG +NHVTN
Subjt:  FQGPNGN------------------SNNQQVSPGLGQQSSGSNN-QQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 6.9e-19 | % identity: 23.8
Query:  EISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCE
        E+   +  I+N  N     KL   N+L+W  Q+     G+ L   LD    +P   +   D++   NP Y  W RQD LI + +LG++S S+   +    
Subjt:  EISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCE

Query:  TTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPPL--------
        T  ++W+ L+  +++ +   +  L+++L+   KG   ++DY   +    D L   G+ + H++ V  +L+ L  +Y   ++ I  KD  P L        
Subjt:  TTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPPL--------

Query:  ------------------------QKTVSSDSNNHRNSNGK-----GNNNRRSW----------NNNNKP---QCQLCRRFGHTVQRCYYRFERWFQGPN
                                + T ++++NN+ N N +      NNN + W          NN +KP   +CQ+C   GH+ +RC            
Subjt:  ------------------------QKTVSSDSNNHRNSNGK-----GNNNRRSW----------NNNNKP---QCQLCRRFGHTVQRCYYRFERWFQGPN

Query:  GNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN
          S  Q     +  Q   S        A +        N+W  DSGA++H+T+
Subjt:  GNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVTN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.1e-13 | % identity: 23.89
Query:  EISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCE
        EI   +  I+N  N     KL   N+L+W  Q+     G+ L   LD    +P   +   D+    NP Y  W RQD LI + +LG++S S+   +    
Subjt:  EISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCE

Query:  TTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPPL----QKTV
        T  ++W+ L+  +++ +   +                     LR     D L   G+ + H++ V  +L+ L   Y   ++ I  KD  P L    ++ +
Subjt:  TTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPPL----QKTV

Query:  SSDS-----------------------NNHRNSNGKGNNNRRSWNNNN--------------------KP---QCQLCRRFGHTVQRCYYRFERWFQGPN
        + +S                       N +RN N +G+N  R++NNNN                    KP   +CQ+C   GH+ +RC            
Subjt:  SSDS-----------------------NNHRNSNGKGNNNRRSWNNNN--------------------KP---QCQLCRRFGHTVQRCYYRFERWFQGPN

Query:  GNSNNQQVSPGLGQQSSGSNNQQAAM-------NAFVVQNDLNKDNHWYPDSGASNHVTN
                 P L Q  S +N QQ+          A +  N     N+W  DSGA++H+T+
Subjt:  GNSNNQQVSPGLGQQSSGSNNQQAAM-------NAFVVQNDLNKDNHWYPDSGASNHVTN

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 3.5e-10 | % identity: 26.28
Query:  PKIINPGN-KITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCETTREV
        P I +P +  I  +  DEDN++ WK++  + LR       +D     P  +          +P Y  W + +++++ WL+ SM++ LL  ++  ET  ++
Subjt:  PKIINPGN-KITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEMLDCETTREV

Query:  WKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYF
        W+ L+  F      +I  L+ +L +L++G   +E+YF
Subjt:  WKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYF

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.5e-13 | % identity: 23.56
Query:  MKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMLDCETTREVWKILQNRFSSR
        + ++E N+  W+   LT      +  H+              D + +P  A D +W ++D ++   L G+++        +   T+R++W  ++N+F + 
Subjt:  MKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMLDCETTREVWKILQNRFSSR

Query:  NVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETP
          AR + L S+L +   G++++ DY+ +++ L D L      ++  + V+Y+L GL  K+D+ +NVI  +   P
Subjt:  NVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETP
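
In the alignment blocks above, the unlabeled middle row appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following minimal sketch counts identities and positives for a single row, using the last row of the AT1G21280.1 alignment as input. It covers one row only, so its percentage is not expected to reproduce the whole-alignment % identity reported for the hit; `row_identity` is a hypothetical helper name.

```python
def row_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one alignment row.

    Assumes BLAST-style midline markup: letters are identities and '+' marks
    conservative substitutions. Counting marks rather than comparing columns
    keeps the result stable if spacing is altered when the page is copied.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent = 100.0 * identities / len(query)
    return identities, positives, percent


# Last row of the AT1G21280.1 alignment, copied from the record above.
query = "WKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYF"
midline = "W+ L+  F      +I  L+ +L +L++G   +E+YF"

identities, positives, percent = row_identity(query, midline)
print(identities, positives, round(percent, 1))
```

Summing identities and row lengths over every row of a hit would give a figure closer to the reported % identity, assuming that value is computed over the full aligned length.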


Sequences
CDS sequence
ATGGAGTCTCCTTCAATAGAGAAGAGCAATTCTGAGATTGAAATCTCATCTCAGAGTCCGAAAATCATAAACCCAGGCAACAAGATCACTACGATGAAGCTTGACGAAGA
CAATTTCCTATTGTGGAAGTTGCAGATCCTCACCACTTTGCGAGGACATGGGTTGAAACACCATCTCGATGAAGATGCGGAAGTTCCGTCGAAGTATCTCACAAGTGCAG
ATTCTTCAAAAATCCCTAATCCGGCGTATGATCATTGGGTTCGACAGGATAGCCTAATTATCGCCTGGTTGTTAGGCTCAATGTCCAACTCTCTTCTTTCGGAAATGCTG
GATTGTGAAACGACTCGAGAAGTGTGGAAAATATTGCAAAATCGCTTTTCCTCACGGAATGTTGCTCGAATTATGGACCTCCAGTCGAAATTAGAATCCCTCAAGAAAGG
TAACCTGAAACTCGAAGATTACTTTCTCAGAGTCAGAAATCTTGTTGATTTGTTAAATGCTGCTGGAAGGAAAATTTCACATGAAGATCATGTCTTATACATTCTTAAGG
GACTAGGAACGAAATATGACTCCACGGTGAATGTAATTACGGAAAAAGACGAGACTCCCCCGTTGCAGAAGACTGTTTCATCTGACTCCAATAATCACAGGAACTCAAAT
GGAAAGGGTAACAACAATCGTCGATCTTGGAACAATAACAATAAACCACAGTGTCAGTTGTGCAGACGATTTGGACATACGGTTCAAAGGTGTTATTATCGGTTCGAACG
TTGGTTTCAAGGACCAAATGGTAATTCAAACAATCAACAAGTATCACCTGGCCTTGGACAACAATCCTCTGGTTCTAATAACCAGCAAGCAGCCATGAATGCCTTCGTAG
TGCAAAATGATCTTAACAAAGACAATCACTGGTATCCGGATTCAGGAGCATCGAATCACGTCACCAATGTTGGGATTGGTGCCCTAAAACTCGTAGATAATGAATGTTAA
mRNA sequence
ATGGAGTCTCCTTCAATAGAGAAGAGCAATTCTGAGATTGAAATCTCATCTCAGAGTCCGAAAATCATAAACCCAGGCAACAAGATCACTACGATGAAGCTTGACGAAGA
CAATTTCCTATTGTGGAAGTTGCAGATCCTCACCACTTTGCGAGGACATGGGTTGAAACACCATCTCGATGAAGATGCGGAAGTTCCGTCGAAGTATCTCACAAGTGCAG
ATTCTTCAAAAATCCCTAATCCGGCGTATGATCATTGGGTTCGACAGGATAGCCTAATTATCGCCTGGTTGTTAGGCTCAATGTCCAACTCTCTTCTTTCGGAAATGCTG
GATTGTGAAACGACTCGAGAAGTGTGGAAAATATTGCAAAATCGCTTTTCCTCACGGAATGTTGCTCGAATTATGGACCTCCAGTCGAAATTAGAATCCCTCAAGAAAGG
TAACCTGAAACTCGAAGATTACTTTCTCAGAGTCAGAAATCTTGTTGATTTGTTAAATGCTGCTGGAAGGAAAATTTCACATGAAGATCATGTCTTATACATTCTTAAGG
GACTAGGAACGAAATATGACTCCACGGTGAATGTAATTACGGAAAAAGACGAGACTCCCCCGTTGCAGAAGACTGTTTCATCTGACTCCAATAATCACAGGAACTCAAAT
GGAAAGGGTAACAACAATCGTCGATCTTGGAACAATAACAATAAACCACAGTGTCAGTTGTGCAGACGATTTGGACATACGGTTCAAAGGTGTTATTATCGGTTCGAACG
TTGGTTTCAAGGACCAAATGGTAATTCAAACAATCAACAAGTATCACCTGGCCTTGGACAACAATCCTCTGGTTCTAATAACCAGCAAGCAGCCATGAATGCCTTCGTAG
TGCAAAATGATCTTAACAAAGACAATCACTGGTATCCGGATTCAGGAGCATCGAATCACGTCACCAATGTTGGGATTGGTGCCCTAAAACTCGTAGATAATGAATGTTAA
Protein sequence
MESPSIEKSNSEIEISSQSPKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDAEVPSKYLTSADSSKIPNPAYDHWVRQDSLIIAWLLGSMSNSLLSEML
DCETTREVWKILQNRFSSRNVARIMDLQSKLESLKKGNLKLEDYFLRVRNLVDLLNAAGRKISHEDHVLYILKGLGTKYDSTVNVITEKDETPPLQKTVSSDSNNHRNSN
GKGNNNRRSWNNNNKPQCQLCRRFGHTVQRCYYRFERWFQGPNGNSNNQQVSPGLGQQSSGSNNQQAAMNAFVVQNDLNKDNHWYPDSGASNHVTNVGIGALKLVDNEC
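
The protein sequence above should correspond to the translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (the record does not name a translation table). It translates only the first and last stretches of the CDS, copied from the record, so the results can be compared by eye with the start and end of the protein sequence; `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

# Standard genetic code, listed in TCAG order (assumed, not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1 and drop any trailing stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First 30 and last 39 nucleotides of the CDS, both in frame.
cds_start = "ATGGAGTCTCCTTCAATAGAGAAGAGCAAT"
cds_end = "GGGATTGGTGCCCTAAAACTCGTAGATAATGAATGTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS, with its line breaks, to `translate` would allow the same comparison against the complete protein sequence.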