; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038419 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038419
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:17022756..17023677
RNA-Seq ExpressionLag0038419
SyntenyLag0038419
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6754633.1 hypothetical protein POTOM_040425 [Populus tomentosa]4.0e-1032.14Show/hide
Query:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEY----IPIVVQSQQWEETMEAIKGTLSESSRETTTME
        F ++S+AR  QLR  LQ+ KKGS+ MI+Y+  +K+A ++L  IG PV   D ++N L  L S+Y    +P+ + ++  E    A++ T +++S +   + 
Subjt:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEY----IPIVVQSQQWEETMEAIKGTLSESSRETTTME

Query:  EVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN
          E      +  K+ P+  AS      + L D  W  +SR ++H+T + GNL     Y   D +IV N
Subjt:  EVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]9.5e-1234.5Show/hide
Query:  ATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-SQQWEETMEAIKGTL--------SESSRETT
        A +K+R   L+    NT+KG MKM +YL  MK  ++ L+L G+P+S  DL++  L  LD++Y P+VV+ S Q     + ++  L         E++RE  
Subjt:  ATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-SQQWEETMEAIKGTL--------SESSRETT

Query:  TMEEVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN
        T EEVEE++ +        +Q  S    +P    D +W  +S  +NHVT     L   ++ NGK+SL+V N
Subjt:  TMEEVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]6.8e-1028.38Show/hide
Query:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQSQ-----QWEET-------------MEAIK
        F   S+A  + L++V Q T KGS++MIEYL +MK  ++NL L G+ VS+ DL+   L  LD EY PIVV  Q      W E                ++K
Subjt:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQSQ-----QWEET-------------MEAIK

Query:  GTLSESSRETTTMEEVEEDIIM--------------------------------------TINDKKTPSQP---------ASCVGTTPDILTDPKWLANS
          +  +  +T ++  V+                                           T +   TPS            S   TTP+ + DP W A+S
Subjt:  GTLSESSRETTTMEEVEEDIIM--------------------------------------TINDKKTPSQP---------ASCVGTTPDILTDPKWLANS

Query:  RTTNHVTADVGNLAVKNKYNGKDSLIVRN
          T+HVTA+  N+  K  Y+G +++IV N
Subjt:  RTTNHVTADVGNLAVKNKYNGKDSLIVRN

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]7.0e-1531.43Show/hide
Query:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS
        D + ATSKAR NQLR VLQNTKK S+KM EYL +MKQASE+L+L G PV+   L+   L+ L++EY+PIV Q     S  W+E   T+   + TL   + 
Subjt:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS

Query:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------
          T T E + +     ++ K+                                              S+P+                             
Subjt:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------

Query:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK
            S     P+I+ +P WLA+S  T+HVT+D+ NL VK+ YNGK
Subjt:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]2.0e-1431.02Show/hide
Query:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS
        D + ATSKAR NQLR VLQNTKK S+KM EYL +MKQASE+L+L G PV+   L+   L+ L++EY+PIV Q     S  W+E   T+   + TL   + 
Subjt:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS

Query:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------
          T T E + +     ++ K+                                              S+P+                             
Subjt:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------

Query:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK
            S     P+I+ +P WLA+S  T+HVT+D+ NL VK+ YNG+
Subjt:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK

TrEMBL top hitse value%identityAlignment
A0A445H1W7 Retrovirus-related Pol polyprotein from transposon RE14.6e-1234.5Show/hide
Query:  ATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-SQQWEETMEAIKGTL--------SESSRETT
        A +K+R   L+    NT+KG MKM +YL  MK  ++ L+L G+P+S  DL++  L  LD++Y P+VV+ S Q     + ++  L         E++RE  
Subjt:  ATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-SQQWEETMEAIKGTL--------SESSRETT

Query:  TMEEVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN
        T EEVEE++ +        +Q  S    +P    D +W  +S  +NHVT     L   ++ NGK+SL+V N
Subjt:  TMEEVEEDIIMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRN

A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like9.6e-1057.35Show/hide
Query:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIV
        F +TSKAR NQLR  LQNTKKG+MKM  YL  MKQ SE+L+L G PV+L  L    L   ++EY+PI+
Subjt:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIV

A0A6J1DCW4 uncharacterized protein LOC1110195983.3e-1028.38Show/hide
Query:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQSQ-----QWEET-------------MEAIK
        F   S+A  + L++V Q T KGS++MIEYL +MK  ++NL L G+ VS+ DL+   L  LD EY PIVV  Q      W E                ++K
Subjt:  FRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQSQ-----QWEET-------------MEAIK

Query:  GTLSESSRETTTMEEVEEDIIM--------------------------------------TINDKKTPSQP---------ASCVGTTPDILTDPKWLANS
          +  +  +T ++  V+                                           T +   TPS            S   TTP+ + DP W A+S
Subjt:  GTLSESSRETTTMEEVEEDIIM--------------------------------------TINDKKTPSQP---------ASCVGTTPDILTDPKWLANS

Query:  RTTNHVTADVGNLAVKNKYNGKDSLIVRN
          T+HVTA+  N+  K  Y+G +++IV N
Subjt:  RTTNHVTADVGNLAVKNKYNGKDSLIVRN

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X29.9e-1531.02Show/hide
Query:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS
        D + ATSKAR NQLR VLQNTKK S+KM EYL +MKQASE+L+L G PV+   L+   L+ L++EY+PIV Q     S  W+E   T+   + TL   + 
Subjt:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS

Query:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------
          T T E + +     ++ K+                                              S+P+                             
Subjt:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------

Query:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK
            S     P+I+ +P WLA+S  T+HVT+D+ NL VK+ YNG+
Subjt:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X13.4e-1531.43Show/hide
Query:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS
        D + ATSKAR NQLR VLQNTKK S+KM EYL +MKQASE+L+L G PV+   L+   L+ L++EY+PIV Q     S  W+E   T+   + TL   + 
Subjt:  DNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQ-----SQQWEE---TMEAIKGTLSE-SS

Query:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------
          T T E + +     ++ K+                                              S+P+                             
Subjt:  RETTTMEEVEEDIIMTINDKKTP--------------------------------------------SQPA-----------------------------

Query:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK
            S     P+I+ +P WLA+S  T+HVT+D+ NL VK+ YNGK
Subjt:  ----SCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTATGGTGGATCAGACAATTTCAGAGCTACTAGTAAGGCTCGAGCGAATCAACTCCGAAGAGTTCTACAAAATACCAAGAAAGGCTCGATGAAAATGATCGAATA
CTTGACTATTATGAAACAAGCGTCAGAGAACCTTCAGCTGATTGGTAATCCTGTATCTCTAGGTGATCTAATTTTGAATGCTCTTGCTAGTTTAGACTCTGAATATATCC
CTATAGTTGTTCAATCCCAACAATGGGAAGAGACAATGGAAGCAATCAAGGGAACTCTCAGCGAGAGTTCTCGAGAAACAACAACAATGGAAGAGGTAGAGGAAGATATC
ATAATGACTATCAACGACAAGAAAACTCCAAGCCAACCTGCTAGCTGTGTGGGAACTACCCCTGATATTTTGACTGATCCAAAATGGTTAGCAAATAGTAGGACAACCAA
TCATGTAACTGCCGATGTTGGCAATCTTGCAGTCAAGAACAAATACAATGGTAAGGACTCCTTAATAGTTCGCAATAACGCTGTAGACCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTATGGTGGATCAGACAATTTCAGAGCTACTAGTAAGGCTCGAGCGAATCAACTCCGAAGAGTTCTACAAAATACCAAGAAAGGCTCGATGAAAATGATCGAATA
CTTGACTATTATGAAACAAGCGTCAGAGAACCTTCAGCTGATTGGTAATCCTGTATCTCTAGGTGATCTAATTTTGAATGCTCTTGCTAGTTTAGACTCTGAATATATCC
CTATAGTTGTTCAATCCCAACAATGGGAAGAGACAATGGAAGCAATCAAGGGAACTCTCAGCGAGAGTTCTCGAGAAACAACAACAATGGAAGAGGTAGAGGAAGATATC
ATAATGACTATCAACGACAAGAAAACTCCAAGCCAACCTGCTAGCTGTGTGGGAACTACCCCTGATATTTTGACTGATCCAAAATGGTTAGCAAATAGTAGGACAACCAA
TCATGTAACTGCCGATGTTGGCAATCTTGCAGTCAAGAACAAATACAATGGTAAGGACTCCTTAATAGTTCGCAATAACGCTGTAGACCCCTAA
Protein sequenceShow/hide protein sequence
MDYGGSDNFRATSKARANQLRRVLQNTKKGSMKMIEYLTIMKQASENLQLIGNPVSLGDLILNALASLDSEYIPIVVQSQQWEETMEAIKGTLSESSRETTTMEEVEEDI
IMTINDKKTPSQPASCVGTTPDILTDPKWLANSRTTNHVTADVGNLAVKNKYNGKDSLIVRNNAVDP