; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025793 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025793
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:20731332..20738258
RNA-Seq ExpressionLag0025793
SyntenyLag0025793
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8523936.1 hypothetical protein F0562_010359 [Nyssa sinensis]8.2e-8342.59Show/hide
Query:  MILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTY
        M++    KNK+GF+DG++P+P G   ++  SWI  N+IV +WILNS+S EIS S+ FA SAREIWLDL+ R Q++N PRIF+L+RE+ NL Q+Q+SV  Y
Subjt:  MILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTY

Query:  FAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTAL
        F KLKT+W ELS+  P CSCG CSC GVK LN + Q EY+M+FLMGL+DSFSQ+R QLLLM+P P I + FSLI QE +QR +   P+  +S+ T   A 
Subjt:  FAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTAL

Query:  LVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-----------------------------------------
         VK            ++ NS S +  N+K+++P+CTHC I  HT+DRCYKI GYPP Y+                                         
Subjt:  LVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-----------------------------------------

Query:  ------------AKATPTESTSATTHVAGIC-----SSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRN
                     K T     S T  +AGIC     S L  S + W++DS A+ HI  + S F +L  +    V+LPN  +I V F GD++L++ + L++
Subjt:  ------------AKATPTESTSATTHVAGIC-----SSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRN

Query:  VMFIPDFSFNLISISALTNDQNFVV
        V+F+P F FNLIS+SAL       +
Subjt:  VMFIPDFSFNLISISALTNDQNFVV

XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata]1.8e-8533.07Show/hide
Query:  VEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGE---MKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREI
        V+   + ++LH SD   L LVS  L+E N+ SWSRAM +  T KNK+GFI+GT+ +P+ +   +  +W+  NSIV +WILN++S +I  S+ ++DSA E+
Subjt:  VEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGE---MKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREI

Query:  WLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPE
        W DL  R  + N PRIF+L+RE+SNL QD  SV  YF KLK +W+ELS++ P C+CG C+C GV++LN ++  E+VMAFLMGLN+S +  R Q+LLM+P 
Subjt:  WLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPE

Query:  PTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRS-----QSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQAK---------
        P I K F+L++QE  QR+       + +   N  A  ++      RS      + + K+KER FCTHCNI  HTID+CYK+ GYPP Y+AK         
Subjt:  PTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRS-----QSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQAK---------

Query:  ----------ATPTE-STSATTH-----------------------------------------------------VAGIC--SSLMHSLKS--WVLDSS
                   +P + +TS +T                                                      V+GIC  +SL  S +   W++DS 
Subjt:  ----------ATPTE-STSATTH-----------------------------------------------------VAGIC--SSLMHSLKS--WVLDSS

Query:  ASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRNVMFIPDFSFNLISISALTNDQNFVVKF------------------------
        AS HI  +++ F++L  V    V LP+   + V+++GD+ L+  + L+NV ++P F FNLIS+SAL +     V F                        
Subjt:  ASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRNVMFIPDFSFNLISISALTNDQNFVVKF------------------------

Query:  ------------------------------------------------------------------------------------VVDDFSRYTWVHLMKQ
                                                                                            +VDD+SR+TWVHL+K 
Subjt:  ------------------------------------------------------------------------------------VVDDFSRYTWVHLMKQ

Query:  KSDALNIVPKFFKLVQTQYGVCIKKFRPDNAPELVFKELIGCSG
        KSD L  +P FF +V+TQ+   IK FR DNA EL F +L    G
Subjt:  KSDALNIVPKFFKLVQTQYGVCIKKFRPDNAPELVFKELIGCSG

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]2.0e-8956.37Show/hide
Query:  DSSSPTR----STSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTE
        DS +PT       +  +VEQ+ N YFLHHSD+TSL LVSD LT+ NYTSWSR++++  T KNK+GF+DG++ +PT     SWIICN++V +WI NSLS +
Subjt:  DSSSPTR----STSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTE

Query:  ISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDS
        IS SV F+DSA EIWLDL++R QR+NRPRIF+L+RE+SNL QDQ SV  YF +LKTLW+EL+ Y P CSCG CS  GVK +  ++Q EYVMAFLMGLN S
Subjt:  ISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDS

Query:  FSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGS---NKKKERPFCTHCNILRHTIDRCYKICGYPPRY
        FSQIR QLLLMEP PTI +AF+L+AQE++QR+ +L   P+ +SPT        NS+ NSR  S S    K+K++  CTHC I  HT+D+CYK+  YPP Y
Subjt:  FSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGS---NKKKERPFCTHCNILRHTIDRCYKICGYPPRY

Query:  QAKATPTESTSATT
        ++    T S++AT+
Subjt:  QAKATPTESTSATT

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida]2.0e-8450.7Show/hide
Query:  DSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTS
        +S    R++  SI+EQY N YFLH  DSTSL  +S+ LTE+NY SWS+AM +  T KNK+GFI+  +P P+GE+  SWIICN +VTAWILNSLS EISTS
Subjt:  DSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTS

Query:  VNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQI
        +NF++S +EIW+D Q+R Q KNRPR+F+L+ EISNL+Q+Q+SV  Y+ KLK LWNEL SY P CSCG  +C  VK L TYFQTEYVMAFLMGLNDS + I
Subjt:  VNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQI

Query:  RTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQAKATPT
        R+QLLLMEPEP+I +AFSL+ QE++Q+A        +SSP            G + S S                                  ++  T  
Subjt:  RTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQAKATPT

Query:  ESTSATTHVAGICSSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPN
        ES ++  HV GICS    S   W+LDS ASTHI Y ++ FT+LRP+    VSLPN
Subjt:  ESTSATTHVAGICSSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPN

XP_038904477.1 uncharacterized protein LOC120090845 [Benincasa hispida]1.4e-8259.93Show/hide
Query:  MILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAK
        M +G T KNK+GFI+G + +P+GE+  SWIICN IVT WILNSLS EIS S+NF+DSA+EIW+DLQ+R QRKNRPR+F+L+RE SNL Q+Q+S+ TY+AK
Subjt:  MILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAK

Query:  LKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK
        LKTLWNEL SY P CSCG C+C GVK L TYFQTEYV+AFLMGLNDS + IR+QLLLMEP+PTI +AFSL+AQE++Q+A +      A S +N TALLVK
Subjt:  LKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK

Query:  NSNGNSRS-------QSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-------AKATPTESTSATT
        N +  S          +  NKKK+RP CTHC+I  HT+DRCYK+ GYPP ++       A+A  ++S S+ T
Subjt:  NSNGNSRS-------QSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-------AKATPTESTSATT

TrEMBL top hitse value%identityAlignment
A0A5J5A1K4 Retrotrans_gag domain-containing protein4.0e-8342.59Show/hide
Query:  MILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTY
        M++    KNK+GF+DG++P+P G   ++  SWI  N+IV +WILNS+S EIS S+ FA SAREIWLDL+ R Q++N PRIF+L+RE+ NL Q+Q+SV  Y
Subjt:  MILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTY

Query:  FAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTAL
        F KLKT+W ELS+  P CSCG CSC GVK LN + Q EY+M+FLMGL+DSFSQ+R QLLLM+P P I + FSLI QE +QR +   P+  +S+ T   A 
Subjt:  FAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTAL

Query:  LVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-----------------------------------------
         VK            ++ NS S +  N+K+++P+CTHC I  HT+DRCYKI GYPP Y+                                         
Subjt:  LVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQ-----------------------------------------

Query:  ------------AKATPTESTSATTHVAGIC-----SSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRN
                     K T     S T  +AGIC     S L  S + W++DS A+ HI  + S F +L  +    V+LPN  +I V F GD++L++ + L++
Subjt:  ------------AKATPTESTSATTHVAGIC-----SSLMHSLKSWVLDSSASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRN

Query:  VMFIPDFSFNLISISALTNDQNFVV
        V+F+P F FNLIS+SAL       +
Subjt:  VMFIPDFSFNLISISALTNDQNFVV

A0A5J5B2C5 Uncharacterized protein6.3e-8150Show/hide
Query:  TNSTIPRDSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWIL
        T S++   +S    ++++S +E+  N Y+LHHS+S    LVS  LT  NYT+WSRAM++  + KNK+GF+DG +P+P G    +  SWI  N+IV +WIL
Subjt:  TNSTIPRDSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWIL

Query:  NSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFL
        NS+S EIS S+ FA  AREIWLDL+ R Q++N PRIF+L+RE+ NL Q+Q+SV  YF K+KT+W ELS+Y P CSCG C C GVK LN Y QTEY+M+FL
Subjt:  NSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFL

Query:  MGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHT
        MGL+DSFSQ+  QLLLM+  P I + FSLI QE +QR + L  +  +S+ T   A +VK            ++ NS S +  N+K++RP+CTHC IL HT
Subjt:  MGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHT

Query:  IDRCYKICGYPPRYQAKATPTESTSA
        +DRCYKI GYPP Y+ ++    + +A
Subjt:  IDRCYKICGYPPRYQAKATPTESTSA

A0A5J5BKC2 Uncharacterized protein7.0e-8050.31Show/hide
Query:  TNSTIPRDSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWIL
        T S++   +S    ++++S +E+  N Y+LHHSDS    LVS  LT  NYT+WSRAM++  + KNK+GF+DG++ +P G    +  SWI  N+IV +WIL
Subjt:  TNSTIPRDSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTG---EMKKSWIICNSIVTAWIL

Query:  NSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFL
        NS+S EIS S+ FA SAREIWLDL+ R Q++NRPRIF+L+RE+ NL Q+Q+SV  YF KLKT+W ELS+Y   CSCG CSC GVK LN + Q EY+M+FL
Subjt:  NSLSTEISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFL

Query:  MGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHT
        MGL+DSFSQ+R QLLLM+P P I + FSLI QE +QR +    +  +S+ T   A  VK            ++ NS S +  N+K++R +C HC IL HT
Subjt:  MGLNDSFSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVK-----------NSNGNSRSQSGSNKKKERPFCTHCNILRHT

Query:  IDRCYKICGYPPRYQAKATPTESTSA
        +DRCYKI GYPP Y+ K+    + +A
Subjt:  IDRCYKICGYPPRYQAKATPTESTSA

A0A6J1DIP8 uncharacterized protein LOC1110203993.5e-7960.31Show/hide
Query:  TRSTSQSI----VEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTSV
        T STS  I    +EQY N YFLHHSD+TSL LVSDPLT  NYTSWSR+M++  T KNK+GF+DG++ +PTG++  SWIICN++V +WILNSLS EIS S+
Subjt:  TRSTSQSI----VEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTSV

Query:  NFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIR
         F+DSAREIWLDL++R +++NRPRIF+L+R++SNLVQDQ SV  YF  LKTLW EL+SY P C+ G CSC GVK++  + Q E+VM FLMGLN+SFSQ+R
Subjt:  NFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIR

Query:  TQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSP-TNDTALLVKNSNGNSRSQSGSN
         QLLLMEPEPTI + FSL++QE +QRA       T++SP T  TAL+ ++S+ +  S+S SN
Subjt:  TQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSP-TNDTALLVKNSNGNSRSQSGSN

A0A6J1DNP7 uncharacterized protein LOC1110220659.8e-9056.37Show/hide
Query:  DSSSPTR----STSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTE
        DS +PT       +  +VEQ+ N YFLHHSD+TSL LVSD LT+ NYTSWSR++++  T KNK+GF+DG++ +PT     SWIICN++V +WI NSLS +
Subjt:  DSSSPTR----STSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTE

Query:  ISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDS
        IS SV F+DSA EIWLDL++R QR+NRPRIF+L+RE+SNL QDQ SV  YF +LKTLW+EL+ Y P CSCG CS  GVK +  ++Q EYVMAFLMGLN S
Subjt:  ISTSVNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDS

Query:  FSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGS---NKKKERPFCTHCNILRHTIDRCYKICGYPPRY
        FSQIR QLLLMEP PTI +AF+L+AQE++QR+ +L   P+ +SPT        NS+ NSR  S S    K+K++  CTHC I  HT+D+CYK+  YPP Y
Subjt:  FSQIRTQLLLMEPEPTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGS---NKKKERPFCTHCNILRHTIDRCYKICGYPPRY

Query:  QAKATPTESTSATT
        ++    T S++AT+
Subjt:  QAKATPTESTSATT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.6e-2934.58Show/hide
Query:  DNSYFL-----HHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPT--GEMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREI
        D+ Y+L     H SD +  KL  D   E NY +W            K GFIDGTLP+P     + + W  CN++V  W++NS++ ++  SV +A++A ++
Subjt:  DNSYFL-----HHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPT--GEMKKSWIICNSIVTAWILNSLSTEISTSVNFADSAREI

Query:  WLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPI--CSCGLCSCDGVKQLNTYFQTEYVMAFLMG--LNDSFSQIRTQLLL
        W DL++        +I++L+R ++ L Q  +SV  YF KL  +W ELS Y PI  C CG C+C+  K+     + E    FLMG  LN  F  + T+++ 
Subjt:  WLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPI--CSCGLCSCDGVKQLNTYFQTEYVMAFLMG--LNDSFSQIRTQLLL

Query:  MEPEPTIQKAFSLI
         +P P++ +AF+++
Subjt:  MEPEPTIQKAFSLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGAAACGAACTCCACCATTCCACGAGATTCTTCTTCTCCAACTCGTTCAACCAGTCAATCGATCGTTGAACAATATGATAATTCATACTTTTTACATCATTCTGA
TAGCACAAGTCTGAAATTAGTTTCTGATCCTCTTACAGAAACTAATTATACCTCTTGGAGTCGTGCTATGATTCTTGGATTCACTTTCAAGAATAAAATGGGTTTCATCG
ATGGCACTCTACCACAACCTACTGGAGAAATGAAGAAATCATGGATCATCTGCAATAGTATCGTCACGGCTTGGATTCTAAATTCTCTTTCTACGGAGATCTCTACGAGT
GTGAATTTTGCTGATTCCGCAAGAGAAATATGGCTTGATCTTCAACAGCGCAACCAACGGAAAAATCGCCCACGAATCTTTAAATTACAGCGCGAGATTTCCAATCTTGT
TCAAGATCAGAACTCTGTGATGACTTACTTTGCCAAGCTTAAGACATTATGGAATGAACTCAGTTCTTATTGCCCCATCTGTTCATGTGGACTCTGTTCGTGTGATGGTG
TTAAGCAATTGAATACGTATTTCCAGACTGAATACGTTATGGCCTTCCTGATGGGTTTGAATGATTCGTTTTCTCAAATTCGAACCCAACTATTACTGATGGAGCCTGAG
CCAACAATTCAGAAAGCCTTCTCCTTGATTGCTCAAGAAGTAGAACAACGTGCCTCTGCCCTACTTCCAACTCCCACTGCTTCATCGCCTACTAACGACACTGCGTTACT
TGTTAAGAATTCCAATGGTAACTCTCGTTCTCAGTCTGGATCCAACAAGAAGAAGGAGCGCCCGTTTTGCACTCATTGCAACATTCTAAGGCACACTATTGACCGCTGCT
ACAAAATTTGTGGCTATCCGCCTAGATACCAGGCGAAGGCGACCCCTACGGAATCAACCTCTGCTACCACTCATGTAGCAGGTATTTGTTCTTCACTTATGCATTCTCTC
AAATCTTGGGTTCTTGACTCAAGTGCTTCTACGCACATCTCATATGAACGGTCATTCTTTACTGCTTTACGGCCTGTAACTGGAAAATATGTCTCCTTGCCCAATCTTAT
TCGCATTGCTGTTCAGTTCATTGGTGATATTCAGCTTAATGCTCATATTTGTCTTCGAAACGTCATGTTCATTCCAGACTTCAGCTTCAACCTTATTTCTATCAGTGCCT
TGACTAATGATCAGAATTTTGTTGTCAAATTTGTTGTCGATGATTTCTCACGCTATACCTGGGTACATTTGATGAAACAGAAATCCGATGCACTGAACATTGTTCCAAAG
TTTTTTAAACTAGTGCAAACCCAATATGGAGTATGCATTAAGAAGTTTAGGCCTGACAATGCCCCTGAGCTAGTGTTCAAAGAGTTAATCGGGTGCTCGGGGCGTGAAAG
GATGCAAAGGAATGAAAAGAGTAAAAGTGAAAAAAAGTCAAATCTCGGTCAACAGCAGGCTAGCGTCAAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATC
AGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGAAACGAACTCCACCATTCCACGAGATTCTTCTTCTCCAACTCGTTCAACCAGTCAATCGATCGTTGAACAATATGATAATTCATACTTTTTACATCATTCTGA
TAGCACAAGTCTGAAATTAGTTTCTGATCCTCTTACAGAAACTAATTATACCTCTTGGAGTCGTGCTATGATTCTTGGATTCACTTTCAAGAATAAAATGGGTTTCATCG
ATGGCACTCTACCACAACCTACTGGAGAAATGAAGAAATCATGGATCATCTGCAATAGTATCGTCACGGCTTGGATTCTAAATTCTCTTTCTACGGAGATCTCTACGAGT
GTGAATTTTGCTGATTCCGCAAGAGAAATATGGCTTGATCTTCAACAGCGCAACCAACGGAAAAATCGCCCACGAATCTTTAAATTACAGCGCGAGATTTCCAATCTTGT
TCAAGATCAGAACTCTGTGATGACTTACTTTGCCAAGCTTAAGACATTATGGAATGAACTCAGTTCTTATTGCCCCATCTGTTCATGTGGACTCTGTTCGTGTGATGGTG
TTAAGCAATTGAATACGTATTTCCAGACTGAATACGTTATGGCCTTCCTGATGGGTTTGAATGATTCGTTTTCTCAAATTCGAACCCAACTATTACTGATGGAGCCTGAG
CCAACAATTCAGAAAGCCTTCTCCTTGATTGCTCAAGAAGTAGAACAACGTGCCTCTGCCCTACTTCCAACTCCCACTGCTTCATCGCCTACTAACGACACTGCGTTACT
TGTTAAGAATTCCAATGGTAACTCTCGTTCTCAGTCTGGATCCAACAAGAAGAAGGAGCGCCCGTTTTGCACTCATTGCAACATTCTAAGGCACACTATTGACCGCTGCT
ACAAAATTTGTGGCTATCCGCCTAGATACCAGGCGAAGGCGACCCCTACGGAATCAACCTCTGCTACCACTCATGTAGCAGGTATTTGTTCTTCACTTATGCATTCTCTC
AAATCTTGGGTTCTTGACTCAAGTGCTTCTACGCACATCTCATATGAACGGTCATTCTTTACTGCTTTACGGCCTGTAACTGGAAAATATGTCTCCTTGCCCAATCTTAT
TCGCATTGCTGTTCAGTTCATTGGTGATATTCAGCTTAATGCTCATATTTGTCTTCGAAACGTCATGTTCATTCCAGACTTCAGCTTCAACCTTATTTCTATCAGTGCCT
TGACTAATGATCAGAATTTTGTTGTCAAATTTGTTGTCGATGATTTCTCACGCTATACCTGGGTACATTTGATGAAACAGAAATCCGATGCACTGAACATTGTTCCAAAG
TTTTTTAAACTAGTGCAAACCCAATATGGAGTATGCATTAAGAAGTTTAGGCCTGACAATGCCCCTGAGCTAGTGTTCAAAGAGTTAATCGGGTGCTCGGGGCGTGAAAG
GATGCAAAGGAATGAAAAGAGTAAAAGTGAAAAAAAGTCAAATCTCGGTCAACAGCAGGCTAGCGTCAAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATC
AGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTAA
Protein sequenceShow/hide protein sequence
MTETNSTIPRDSSSPTRSTSQSIVEQYDNSYFLHHSDSTSLKLVSDPLTETNYTSWSRAMILGFTFKNKMGFIDGTLPQPTGEMKKSWIICNSIVTAWILNSLSTEISTS
VNFADSAREIWLDLQQRNQRKNRPRIFKLQREISNLVQDQNSVMTYFAKLKTLWNELSSYCPICSCGLCSCDGVKQLNTYFQTEYVMAFLMGLNDSFSQIRTQLLLMEPE
PTIQKAFSLIAQEVEQRASALLPTPTASSPTNDTALLVKNSNGNSRSQSGSNKKKERPFCTHCNILRHTIDRCYKICGYPPRYQAKATPTESTSATTHVAGICSSLMHSL
KSWVLDSSASTHISYERSFFTALRPVTGKYVSLPNLIRIAVQFIGDIQLNAHICLRNVMFIPDFSFNLISISALTNDQNFVVKFVVDDFSRYTWVHLMKQKSDALNIVPK
FFKLVQTQYGVCIKKFRPDNAPELVFKELIGCSGRERMQRNEKSKSEKKSNLGQQQASVKTLALERLDAHIPYQIRRVKLTASRRYDRKRPDASVFPY