; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005427 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005427
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr6:17479300..17482731
RNA-Seq ExpressionLag0005427
SyntenyLag0005427
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

KAA0050437.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-5957.87Show/hide
Query:  VLTVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG--------
        V T V+ILNN     VPSKSVSETPFE+WR      ++TRG  FFDPQEN+              ++NH+        E    STRVVD VG        
Subjt:  VLTVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG--------

Query:  -TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEEL
         TSG SHPSQ LR P+R GR+++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E +
Subjt:  -TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEEL

Query:  KPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
        KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  KPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

TYJ98102.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-5955.41Show/hide
Query:  TVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQ------------QEVEQAGTSTRVVDRVGTSG
        TV+YILNN     VPSKSVS TP+E+W+ RK YPKE++   F+DPQENK              ++NHQ               ++  +ST++VD+    G
Subjt:  TVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQ------------QEVEQAGTSTRVVDRVGTSG

Query:  PSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIG
         +HPSQ+ REPRR GRV+ QPD YLGL+E  IVIPDDG+EDPLTY+QAM D+D +QWIKAMDLE             MESMY N VW LVDQ   +KPIG
Subjt:  PSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIG

Query:  CKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
        CKWIYKRKRD  GKVQTFKARL+AKGYTQ++
Subjt:  CKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein2.1e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

A0A5A7U7T0 Gag/pol protein1.4e-5957.87Show/hide
Query:  VLTVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG--------
        V T V+ILNN     VPSKSVSETPFE+WR      ++TRG  FFDPQEN+              ++NH+        E    STRVVD VG        
Subjt:  VLTVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG--------

Query:  -TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEEL
         TSG SHPSQ LR P+R GR+++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E +
Subjt:  -TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEEL

Query:  KPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
        KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  KPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

A0A5A7UYE8 Gag/pol protein2.1e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

A0A5D3BEB1 Gag/pol protein1.4e-5955.41Show/hide
Query:  TVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQ------------QEVEQAGTSTRVVDRVGTSG
        TV+YILNN     VPSKSVS TP+E+W+ RK YPKE++   F+DPQENK              ++NHQ               ++  +ST++VD+    G
Subjt:  TVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENK-------------KLQNHQ------------QEVEQAGTSTRVVDRVGTSG

Query:  PSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIG
         +HPSQ+ REPRR GRV+ QPD YLGL+E  IVIPDDG+EDPLTY+QAM D+D +QWIKAMDLE             MESMY N VW LVDQ   +KPIG
Subjt:  PSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIG

Query:  CKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
        CKWIYKRKRD  GKVQTFKARL+AKGYTQ++
Subjt:  CKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

A0A5D3BUN8 Gag/pol protein2.1e-6046.29Show/hide
Query:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR
        K+  S  GG  +D  F+  +    +++ + A    + N +S             + S+ +  S F    V T V+ILNN     VPSKSVSETPFE+WR 
Subjt:  KVNSSSTGGRVID-PFRSTLAPSTVEALICAQNWLRSNAIS-----------IDLQSHLEEVSRF--EEVLTVVYILNNVPSKSVPSKSVSETPFEMWRR

Query:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------
        RK                                  YPKETRG  FFDPQEN+              ++NH+        E    STRVVD VG      
Subjt:  RK---------------------------------SYPKETRGEYFFDPQENK-------------KLQNHQQE-----VEQAGTSTRVVDRVG------

Query:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE
           TSG SHPSQ LR PRR GRV++QP+ YLGL ET +VIPDDGVEDPL+Y+QAM DVDK+QW+KAMDLE             MESMYFN VW+LVD  E
Subjt:  ---TSGPSHPSQELREPRRGGRVITQPDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLE

Query:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         +KPIGCKWIYKRKRD+ GKVQTFKARLVAKGYTQR+
Subjt:  ELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-0628.43Show/hide
Query:  SVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENKKLQNHQQEVEQAGTSTRVVDRVGTSGPSHPSQ--ELREP-RRGGRVITQPDCYLGLAETPIVI
        ++PS S + T  E      S   E  GE     ++ ++L    +EVE                  HP+Q  E  +P RR  R   +   Y   +   ++I
Subjt:  SVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENKKLQNHQQEVEQAGTSTRVVDRVGTSGPSHPSQ--ELREP-RRGGRVITQPDCYLGLAETPIVI

Query:  PDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
         DD   +P + ++ +   +KNQ +KAM  E             MES+  N  + LV+  +  +P+ CKW++K K+D   K+  +KARLV KG+ Q+K
Subjt:  PDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 21.1e-0531.82Show/hide
Query:  TKVDVYLLECRVKGDDNFDILEWWKVN-----------------------------SSSTGGRVIDPFRSTLAPSTVEALICAQNWLR
        ++++ YL E        FDIL WWK+N                             S+ TG R++D +RS+L P  VEAL+CA++WL+
Subjt:  TKVDVYLLECRVKGDDNFDILEWWKVN-----------------------------SSSTGGRVIDPFRSTLAPSTVEALICAQNWLR

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 34.9e-0630.34Show/hide
Query:  TKVDVYLLECRVKGDDNFDILEWWKVN------------------------------SSSTGGRVIDPFRSTLAPSTVEALICAQNWLR
        ++++ YL E  +    +F+ILEWWK+N                              +++TG +++D +RS+L P TVEAL CA++WL+
Subjt:  TKVDVYLLECRVKGDDNFDILEWWKVN------------------------------SSSTGGRVIDPFRSTLAPSTVEALICAQNWLR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-0633.33Show/hide
Query:  DPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQR
        +P T  QA++D    +W  AM  E+ +   N+ WDL            +      +  +GC+WI+ +K ++ G +  +KARLVAKGY QR
Subjt:  DPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-0634.44Show/hide
Query:  DPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQR
        +P T  QAM+D   ++W +AM  E+ +   N+ WDL            +      +  +GC+WI+ +K ++ G +  +KARLVAKGY QR
Subjt:  DPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQR

Arabidopsis top hitse value%identityAlignment
AT3G42170.1 BED zinc finger ;hAT family dimerisation domain9.4e-0530.85Show/hide
Query:  DVDTKVDVYLLECRVKGDDNFDILEWWKVNS--------------------------SSTGGRVIDPFRSTLAPSTVEALICAQNW-LRSNAIS
        ++ +++D YL E  +     FD+L+WWK N                                R +D ++++L P TVEALICA+ W L SNA S
Subjt:  DVDTKVDVYLLECRVKGDDNFDILEWWKVNS--------------------------SSTGGRVIDPFRSTLAPSTVEALICAQNW-LRSNAIS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.4e-1033.08Show/hide
Query:  EDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRKAQLVAKGE
        ++P TY +A E +    W  AMD E             + +M   + W++       KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+        E
Subjt:  EDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRKAQLVAKGE

Query:  GLKFLNFQWKLVTATSVAFFFIVDSRRRNFLFH
        G+ F+     +   TSV     + S   NF  H
Subjt:  GLKFLNFQWKLVTATSVAFFFIVDSRRRNFLFH

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.1e-0436.36Show/hide
Query:  KMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK
        +++++  N  W LV        +GCKW++K K  + G +   KARLVAKG+ Q +
Subjt:  KMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAKGYTQRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCAATCCAATTCTACTGATTCTATTGCTAGAACAAGTAATCCTTTTCGCTTTCAACATGCTGATATGGAAGATGATGAGGATGATTTTGATGTGGACACA
AAGGTGGATGTTTACTTATTAGAATGTCGTGTAAAAGGGGATGATAATTTTGACATCTTAGAATGGTGGAAAGTGAACTCTTCTAGTACTGGTGGAAGAGTGATA
GATCCTTTTCGCTCCACATTGGCTCCATCCACAGTTGAGGCACTTATTTGCGCACAAAATTGGCTACGTTCTAATGCTATCAGTATTGATCTTCAATCACATTTG
GAGGAAGTTTCTCGCTTTGAAGAAGTGTTAACTGTTGTTTATATCTTGAACAACGTTCCATCCAAGAGCGTTCCATCCAAGAGTGTTTCTGAAACACCGTTTGAA
ATGTGGAGAAGGCGTAAAAGCTACCCAAAGGAAACGAGAGGTGAATACTTCTTCGATCCACAAGAAAATAAAAAGCTACAAAATCATCAACAAGAGGTTGAACAG
GCTGGTACTTCAACAAGGGTTGTTGATAGAGTAGGTACCTCAGGTCCGTCACATCCTTCTCAAGAGTTGAGAGAGCCTCGACGTGGTGGGAGAGTTATTACTCAA
CCTGACTGCTATCTGGGTTTAGCTGAAACTCCAATCGTTATACCTGATGATGGTGTTGAGGATCCATTGACCTATAGACAAGCAATGGAGGATGTAGACAAGAAC
CAGTGGATCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTATGTCTGGGACCTTAAAATGGAGTCTATGTACTTCAATTATGTTTGGGACCTTGTA
GATCAACTTGAAGAGTTAAAACCCATAGGGTGTAAATGGATCTACAAGAGGAAACGAGACACAGTTGGAAAGGTACAGACCTTTAAGGCTCGACTAGTGGCAAAG
GGTTATACCCAGAGGAAGGCTCAACTAGTGGCAAAGGGTGAAGGATTGAAATTCCTAAATTTCCAGTGGAAGCTTGTCACCGCCACTTCCGTCGCTTTCTTCTTC
ATCGTCGACAGCCGCCGCCGCAACTTTCTTTTTCATTGTCAACATTCGCCGTCGCTTTCTCCTTTAACAGTCGCCGCCGTCCCTTTCTTCTTCATCGTCGACAGC
CACCACCTTTCTTCATCGTCGACAATCGCCACCGTCGTTGCTTTCTTCTTCAAAACCAAATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCAATCCAATTCTACTGATTCTATTGCTAGAACAAGTAATCCTTTTCGCTTTCAACATGCTGATATGGAAGATGATGAGGATGATTTTGATGTGGACACA
AAGGTGGATGTTTACTTATTAGAATGTCGTGTAAAAGGGGATGATAATTTTGACATCTTAGAATGGTGGAAAGTGAACTCTTCTAGTACTGGTGGAAGAGTGATA
GATCCTTTTCGCTCCACATTGGCTCCATCCACAGTTGAGGCACTTATTTGCGCACAAAATTGGCTACGTTCTAATGCTATCAGTATTGATCTTCAATCACATTTG
GAGGAAGTTTCTCGCTTTGAAGAAGTGTTAACTGTTGTTTATATCTTGAACAACGTTCCATCCAAGAGCGTTCCATCCAAGAGTGTTTCTGAAACACCGTTTGAA
ATGTGGAGAAGGCGTAAAAGCTACCCAAAGGAAACGAGAGGTGAATACTTCTTCGATCCACAAGAAAATAAAAAGCTACAAAATCATCAACAAGAGGTTGAACAG
GCTGGTACTTCAACAAGGGTTGTTGATAGAGTAGGTACCTCAGGTCCGTCACATCCTTCTCAAGAGTTGAGAGAGCCTCGACGTGGTGGGAGAGTTATTACTCAA
CCTGACTGCTATCTGGGTTTAGCTGAAACTCCAATCGTTATACCTGATGATGGTGTTGAGGATCCATTGACCTATAGACAAGCAATGGAGGATGTAGACAAGAAC
CAGTGGATCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTATGTCTGGGACCTTAAAATGGAGTCTATGTACTTCAATTATGTTTGGGACCTTGTA
GATCAACTTGAAGAGTTAAAACCCATAGGGTGTAAATGGATCTACAAGAGGAAACGAGACACAGTTGGAAAGGTACAGACCTTTAAGGCTCGACTAGTGGCAAAG
GGTTATACCCAGAGGAAGGCTCAACTAGTGGCAAAGGGTGAAGGATTGAAATTCCTAAATTTCCAGTGGAAGCTTGTCACCGCCACTTCCGTCGCTTTCTTCTTC
ATCGTCGACAGCCGCCGCCGCAACTTTCTTTTTCATTGTCAACATTCGCCGTCGCTTTCTCCTTTAACAGTCGCCGCCGTCCCTTTCTTCTTCATCGTCGACAGC
CACCACCTTTCTTCATCGTCGACAATCGCCACCGTCGTTGCTTTCTTCTTCAAAACCAAATCTTAA
Protein sequenceShow/hide protein sequence
MSQSNSTDSIARTSNPFRFQHADMEDDEDDFDVDTKVDVYLLECRVKGDDNFDILEWWKVNSSSTGGRVIDPFRSTLAPSTVEALICAQNWLRSNAISIDLQSHL
EEVSRFEEVLTVVYILNNVPSKSVPSKSVSETPFEMWRRRKSYPKETRGEYFFDPQENKKLQNHQQEVEQAGTSTRVVDRVGTSGPSHPSQELREPRRGGRVITQ
PDCYLGLAETPIVIPDDGVEDPLTYRQAMEDVDKNQWIKAMDLEMESMYFNYVWDLKMESMYFNYVWDLVDQLEELKPIGCKWIYKRKRDTVGKVQTFKARLVAK
GYTQRKAQLVAKGEGLKFLNFQWKLVTATSVAFFFIVDSRRRNFLFHCQHSPSLSPLTVAAVPFFFIVDSHHLSSSSTIATVVAFFFKTKS