CuGenDBv2

Lag0026022 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026022
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr10:26980815..26982256
RNA-Seq Expression: Lag0026022
Synteny: Lag0026022
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
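
The Genome location field above can be split into its parts programmatically. The following is a minimal sketch, not the database's own tooling: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr10:26980815..26982256")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 1442 bp on chr10.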


Homology
GenBank top hits (e value, %identity, alignment)
XP_022149380.1 uncharacterized protein LOC111017810 [Momordica charantia] (e value: 4.3e-37, %identity: 34.53)
Query:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC
        +R+H+ E P  R +  A+       V P++P G      PP P AAP         + + +EALQ + DN         + P       EE QFIRDFK 
Subjt:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC

Query:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN--RWQQPKTMLI------------------------------EWKIKRFIKG
        +GPP F+G SE P  AE W+ +LEAL+  + C++  K+RG VFML+G    W + +   +                              + KI +FI G
Subjt:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN--RWQQPKTMLI------------------------------EWKIKRFIKG

Query:  LREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLINPP-IESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKG
        LR EI+G + L    T+  A+  AL+MDK + ++PQ     GS+S VKRK +   +       Q  V+     P CPSC K H G CWL +++CFKC K 
Subjt:  LREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLINPP-IESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKG

Query:  GHYAKDC
        GH+A++C
Subjt:  GHYAKDC

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] (e value: 1.1e-32, %identity: 34.66)
Query:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------
        E+ FI+DFK YGPP+FDG+SE    AE WI +LEA +  + C D  K++G VFML+G                     W + K +L ++           
Subjt:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------

Query:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSP-LI
                                            KIKRF+KGL + IRG + L R A++ EA+  ALIMDK+VS K     E GS+S VKRK  P   
Subjt:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSP-LI

Query:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
        +P + + Q Q +     P CP+C K H G+CW   K CF+C +  H+A++C
Subjt:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] (e value: 1.4e-32, %identity: 32.65)
Query:  PPPTPSAAPLLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKG
        PP  P    LL  +EALQ + DN         + P+    + EE QFIRDFK +GPP F+G SE P  AE W+ +LEAL+  + C+D  K+RG VFML+G
Subjt:  PPPTPSAAPLLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKG

Query:  N-------------------RWQQPKTMLIEW-----------------------------------------------KIKRFIKGLREEIRGSIALSR
                             W + K +L E+                                               KI +FI GLR EI+G + L  
Subjt:  N-------------------RWQQPKTMLIEW-----------------------------------------------KIKRFIKGLREEIRGSIALSR

Query:  HATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
          T+  A+  AL+MDK + ++PQ     GS+S VKRK +    + P    Q   +     P CPSC K H G CW+ +++C++C K GH+A++C
Subjt:  HATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 2.6e-34, %identity: 35.06)
Query:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------
        E++FI+DFK YGPP+FDG+SE     E WI +LEAL+  + C D  K++G VFML+G                     W + K +L ++           
Subjt:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------

Query:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKL-SPLI
                                            KIKRF+KGLR+ IRG + L R  T+ EA+  AL+MDK+VS K  P  E GS+S VKRK  S   
Subjt:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKL-SPLI

Query:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
        +  + + Q+Q +     P CP+C K H G+CW   K CF+C + GH+A++C
Subjt:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] (e value: 1.5e-34, %identity: 31.38)
Query:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC
        +R+H+ E P +R ++ A+       V+P++P G      PP P AAP         + + +EALQ +  N         + P       +E QFIRDFKC
Subjt:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC

Query:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN-------------------RWQQPKTMLIEW---------------------
        +GPP F+G SE P  AE W+ +LEAL+  + C+D  K+RG VFML+G                     W + K +L E+                     
Subjt:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN-------------------RWQQPKTMLIEW---------------------

Query:  --------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQ
                                  KI +FI GLR EI+G + L    T+  A+  AL+MDK + ++PQ     GS S VKRK +    +      Q  
Subjt:  --------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQ

Query:  VKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
         +     P CPSC K H   CWL +K+CFKC K GH+ ++C
Subjt:  VKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

TrEMBL top hits (e value, %identity, alignment)
A0A6J1D5J7 uncharacterized protein LOC111017810 (e value: 2.1e-37, %identity: 34.53)
Query:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC
        +R+H+ E P  R +  A+       V P++P G      PP P AAP         + + +EALQ + DN         + P       EE QFIRDFK 
Subjt:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC

Query:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN--RWQQPKTMLI------------------------------EWKIKRFIKG
        +GPP F+G SE P  AE W+ +LEAL+  + C++  K+RG VFML+G    W + +   +                              + KI +FI G
Subjt:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN--RWQQPKTMLI------------------------------EWKIKRFIKG

Query:  LREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLINPP-IESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKG
        LR EI+G + L    T+  A+  AL+MDK + ++PQ     GS+S VKRK +   +       Q  V+     P CPSC K H G CWL +++CFKC K 
Subjt:  LREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLINPP-IESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKG

Query:  GHYAKDC
        GH+A++C
Subjt:  GHYAKDC

A0A6J1DL73 uncharacterized protein LOC111022144 (e value: 5.3e-33, %identity: 34.66)
Query:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------
        E+ FI+DFK YGPP+FDG+SE    AE WI +LEA +  + C D  K++G VFML+G                     W + K +L ++           
Subjt:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------

Query:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSP-LI
                                            KIKRF+KGL + IRG + L R A++ EA+  ALIMDK+VS K     E GS+S VKRK  P   
Subjt:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSP-LI

Query:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
        +P + + Q Q +     P CP+C K H G+CW   K CF+C +  H+A++C
Subjt:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

A0A6J1DNV8 uncharacterized protein LOC111022925 (e value: 7.0e-33, %identity: 32.65)
Query:  PPPTPSAAPLLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKG
        PP  P    LL  +EALQ + DN         + P+    + EE QFIRDFK +GPP F+G SE P  AE W+ +LEAL+  + C+D  K+RG VFML+G
Subjt:  PPPTPSAAPLLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKG

Query:  N-------------------RWQQPKTMLIEW-----------------------------------------------KIKRFIKGLREEIRGSIALSR
                             W + K +L E+                                               KI +FI GLR EI+G + L  
Subjt:  N-------------------RWQQPKTMLIEW-----------------------------------------------KIKRFIKGLREEIRGSIALSR

Query:  HATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
          T+  A+  AL+MDK + ++PQ     GS+S VKRK +    + P    Q   +     P CPSC K H G CW+ +++C++C K GH+A++C
Subjt:  HATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

A0A6J1DQB9 Reverse transcriptase (e value: 7.5e-35, %identity: 31.38)
Query:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC
        +R+H+ E P +R ++ A+       V+P++P G      PP P AAP         + + +EALQ +  N         + P       +E QFIRDFKC
Subjt:  SRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQRRVDPPPTPSAAP---------LLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKC

Query:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN-------------------RWQQPKTMLIEW---------------------
        +GPP F+G SE P  AE W+ +LEAL+  + C+D  K+RG VFML+G                     W + K +L E+                     
Subjt:  YGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGN-------------------RWQQPKTMLIEW---------------------

Query:  --------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQ
                                  KI +FI GLR EI+G + L    T+  A+  AL+MDK + ++PQ     GS S VKRK +    +      Q  
Subjt:  --------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPL-INPPIESTQKQ

Query:  VKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
         +     P CPSC K H   CWL +K+CFKC K GH+ ++C
Subjt:  VKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 1.3e-34, %identity: 35.06)
Query:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------
        E++FI+DFK YGPP+FDG+SE     E WI +LEAL+  + C D  K++G VFML+G                     W + K +L ++           
Subjt:  ESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNR-------------------WQQPKTMLIEW-----------

Query:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKL-SPLI
                                            KIKRF+KGLR+ IRG + L R  T+ EA+  AL+MDK+VS K  P  E GS+S VKRK  S   
Subjt:  ------------------------------------KIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKL-SPLI

Query:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC
        +  + + Q+Q +     P CP+C K H G+CW   K CF+C + GH+A++C
Subjt:  NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDC

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGTACGTCGTCATGCTGCTGAAATTTTCGGTGCACACGGTTAGTTTTGGGTTAAGAGTTAGTAATGTCGCTAGGATAGCTATTAAAATCCTGGGGCGTTACAGTTGGTA
TCAGAGCGGACTTTTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCATCGTTCTCCTCTCCATCACCAGTACCACCTTCTCAGG
CAATGTCTCGTAGTCATGATCCTGAAGTTCCAACTGTCAGGGAAGATGACCAAGCAGAGGAAGTTACTATGCTGCAAGAGGTTAATCCCCTGATTCCTCCCGGTCAGCGT
AGGGTTGATCCTCCTCCAACCCCTTCTGCAGCTCCTTTGCTGATCACTTCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACGGCGGAA
CCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGACTTCAAGTGCTACGGGCCTCCCTCCTTTGATGGGCAATCCGAAAATCCGTTGGTAGCAGAAC
GGTGGATCGCTGATTTGGAGGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAGATTAGAGGACCAGTCTTCATGCTCAAGGGCAATCGATGGCAGCAGCCAAAG
ACCATGCTAATCGAGTGGAAGATCAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGTGGCTCTATAGCCCTGAGTAGGCATGCGACCTTTGTTGAAGCACTCATAAG
TGCATTGATCATGGATAAGAATGTTTCCAAGAAGCCACAACCTTATCTTGAGAAGGGATCAACCTCTAGAGTTAAAAGAAAGTTGTCTCCCCTGATAAACCCACCTATTG
AGTCTACTCAGAAGCAAGTGAAAGAGTACATTCCATATCCTCCTTGCCCTTCTTGTCACAAGCTTCACAAAGGAGAGTGTTGGCTAAAAAGAAAAGTTTGCTTCAAGTGC
AATAAGGGAGGTCACTATGCTAAGGATTGTTCATCATGA
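
As a consistency check, the CDS can be translated and compared with the listed protein sequence. The sketch below is illustrative only: it assumes the standard nuclear genetic code, the helper name `translate` is hypothetical, and it uses just the first 24 and last 39 nucleotides of the CDS rather than the full sequence.

```python
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 24 nt and last 39 nt of the CDS shown above.
cds_head = "ATGTACGTCGTCATGCTGCTGAAA"
cds_tail = "AATAAGGGAGGTCACTATGCTAAGGATTGTTCATCATGA"

print(translate(cds_head))  # MYVVMLLK
print(translate(cds_tail))  # NKGGHYAKDCSS*
```

Both fragments agree with the start (MYVVMLLK) and end (NKGGHYAKDCSS) of the protein sequence in this record, with the final TGA codon read as the stop.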
mRNA sequence
ATGTACGTCGTCATGCTGCTGAAATTTTCGGTGCACACGGTTAGTTTTGGGTTAAGAGTTAGTAATGTCGCTAGGATAGCTATTAAAATCCTGGGGCGTTACAGTTGGTA
TCAGAGCGGACTTTTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCATCGTTCTCCTCTCCATCACCAGTACCACCTTCTCAGG
CAATGTCTCGTAGTCATGATCCTGAAGTTCCAACTGTCAGGGAAGATGACCAAGCAGAGGAAGTTACTATGCTGCAAGAGGTTAATCCCCTGATTCCTCCCGGTCAGCGT
AGGGTTGATCCTCCTCCAACCCCTTCTGCAGCTCCTTTGCTGATCACTTCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACGGCGGAA
CCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGACTTCAAGTGCTACGGGCCTCCCTCCTTTGATGGGCAATCCGAAAATCCGTTGGTAGCAGAAC
GGTGGATCGCTGATTTGGAGGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAGATTAGAGGACCAGTCTTCATGCTCAAGGGCAATCGATGGCAGCAGCCAAAG
ACCATGCTAATCGAGTGGAAGATCAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGTGGCTCTATAGCCCTGAGTAGGCATGCGACCTTTGTTGAAGCACTCATAAG
TGCATTGATCATGGATAAGAATGTTTCCAAGAAGCCACAACCTTATCTTGAGAAGGGATCAACCTCTAGAGTTAAAAGAAAGTTGTCTCCCCTGATAAACCCACCTATTG
AGTCTACTCAGAAGCAAGTGAAAGAGTACATTCCATATCCTCCTTGCCCTTCTTGTCACAAGCTTCACAAAGGAGAGTGTTGGCTAAAAAGAAAAGTTTGCTTCAAGTGC
AATAAGGGAGGTCACTATGCTAAGGATTGTTCATCATGA
Protein sequence
MYVVMLLKFSVHTVSFGLRVSNVARIAIKILGRYSWYQSGLFPVDWPRKSRLFGCLGLWSSSFSSPSPVPPSQAMSRSHDPEVPTVREDDQAEEVTMLQEVNPLIPPGQR
RVDPPPTPSAAPLLITSEALQTMFDNMAQRNARPRRNPNWVPENAEESQFIRDFKCYGPPSFDGQSENPLVAERWIADLEALFDLMNCNDSLKIRGPVFMLKGNRWQQPK
TMLIEWKIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLINPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKC
NKGGHYAKDCSS
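
The InterPro annotation for this gene (IPR001878, Zinc finger, CCHC-type) can be loosely illustrated by scanning the protein for the commonly cited CCHC consensus C-X2-C-X4-H-X4-C. This is a rough sketch, not how InterPro assigns domains (InterPro relies on curated signatures, not a single regular expression); the function name `find_cchc` is hypothetical, and only the C-terminal part of the protein is used.

```python
import re

# Commonly cited CCHC zinc-knuckle consensus: C-X2-C-X4-H-X4-C.
# A lookahead is used so that overlapping candidates would also be reported.
CCHC_PATTERN = re.compile(r"(?=(C.{2}C.{4}H.{4}C))")

def find_cchc(protein):
    """Return every substring of the protein that matches the consensus."""
    return CCHC_PATTERN.findall(protein.upper())

# C-terminal portion of the protein sequence shown above.
c_terminal = (
    "TMLIEWKIKRFIKGLREEIRGSIALSRHATFVEALISALIMDKNVSKKPQPYLEKGSTSRVKRKLSPLI"
    "NPPIESTQKQVKEYIPYPPCPSCHKLHKGECWLKRKVCFKCNKGGHYAKDCSS"
)

print(find_cchc(c_terminal))  # ['CFKCNKGGHYAKDC']
```

The single match, CFKCNKGGHYAKDC, sits at the very end of the protein and overlaps the CFKCNKGGHYAKDC stretch shown in the alignments above.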