; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031290 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031290
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr11:6651420..6652334
RNA-Seq ExpressionLag0031290
SyntenyLag0031290
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]2.5e-7659.78Show/hide
Query:  ASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTL
        +S S S+  +L SP+ LL+NICNLISI+LDS+NYV+WKFQ+ ++LKAHK+FGF+DG+   P +                    NPSY+DW AKD ALMT+
Subjt:  ASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTL

Query:  INATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTF
        INATL+ EALAY+VG  +SK+   VLA  YSS+SRSN+VNLK++LQ ISKK +E ID+Y+KRIKEIKD+L N+S+VVN+EDL+IYALNGLP EYNTFRT 
Subjt:  INATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTF

Query:  MRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG
        MRTRS  VTF+ELHVL+K+EESAL KQSKR+DL  QPTA+LAS  +    +  S+ ++   RGRGRGR  G
Subjt:  MRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-7659.78Show/hide
Query:  ASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTL
        +S S S+  +L SP+ LL+NICNLISI+LDS+NYV+WKFQ+ ++LKAHK+FGF+DG+   P +                    NPSY+DW AKD ALMT+
Subjt:  ASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTL

Query:  INATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTF
        INATL+ EALAY+VG  +SK+   VLA  YSS+SRSN+VNLK++LQ ISKK +E ID+Y+KRIKEIKD+L N+S+VVN+EDL+IYALNGLP EYNTFRT 
Subjt:  INATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTF

Query:  MRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG
        MRTRS  VTF+ELHVL+K+EESAL KQSKR+DL  QPTA+LAS  +    +  S+ ++   RGRGRGR  G
Subjt:  MRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]5.5e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]5.5e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]5.0e-7758.61Show/hide
Query:  MASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTA-SEASSSSSVTIIEINPSYEDWMAKDHALM
        M S S ++ KDL+SP+FLL+NICNL+SIRLDS+++++WKFQ+ +ILKAHK+FGF+DGS  AP+ FL S++ +E+  +++ ++  INP +EDW+AKD ALM
Subjt:  MASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTA-SEASSSSSVTIIEINPSYEDWMAKDHALM

Query:  TLINATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFR
        TLINATL+ EALAY+V   +SK+  EVL  HYSS SR+N+VNLK++LQ+I KK  E ID+YVKRIKEIKD+  N+S  +N+E L+IYALNGL  EYNT  
Subjt:  TLINATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFR

Query:  TFMRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG
        T MRTR+  V+F+ELHV MKSEESA+EKQ KREDL  QP A+ AS   S   N TS+        RGRG+N+G
Subjt:  TFMRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.7e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.7e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.7e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

A0A5D3CLI6 T4.52.7e-7658.28Show/hide
Query:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN
        PS S+ KD  SP+FLL+NICNLIS+RLDS+N+V+WKFQ+ +ILKAHK++GF+DG+N  P        +  SSS+S    + NPSYEDW+AKD ALMT+IN
Subjt:  PSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLIN

Query:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR
        ATL+ EALAY+VG  SSK+  +VLA  YSS SRSN+VNLK++LQ I KKP+E ID+Y+KRIKEIKD+L N+S+ +NEEDL+IYALNGLP EYNTFRT MR
Subjt:  ATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMR

Query:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE
        TRS  VTF+ELHVL+++EESAL KQSK +D   QPT +L+S  +  S  P   ++      RG G G++ G   F SF    R +G S E
Subjt:  TRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNT--SPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLE

A0A6J1D9L6 uncharacterized protein LOC1110188922.4e-7758.61Show/hide
Query:  MASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTA-SEASSSSSVTIIEINPSYEDWMAKDHALM
        M S S ++ KDL+SP+FLL+NICNL+SIRLDS+++++WKFQ+ +ILKAHK+FGF+DGS  AP+ FL S++ +E+  +++ ++  INP +EDW+AKD ALM
Subjt:  MASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTA-SEASSSSSVTIIEINPSYEDWMAKDHALM

Query:  TLINATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFR
        TLINATL+ EALAY+V   +SK+  EVL  HYSS SR+N+VNLK++LQ+I KK  E ID+YVKRIKEIKD+  N+S  +N+E L+IYALNGL  EYNT  
Subjt:  TLINATLTLEALAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFR

Query:  TFMRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG
        T MRTR+  V+F+ELHV MKSEESA+EKQ KREDL  QP A+ AS   S   N TS+        RGRG+N+G
Subjt:  TFMRTRSGIVTFDELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1022.31Show/hide
Query:  LNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEAL
        LN+   L  N+ N+   +L S+NY+MW  Q+ ++   +++ GFLDGS   P A + + A+            +NP Y  W  +D  + + +   +++   
Subjt:  LNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEAL

Query:  AYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTF
          +    ++ +  E L   Y++ S  ++  L+T L+  + K  + ID Y++ +    D+L  +   ++ ++ +   L  LP EY      +  +    T 
Subjt:  AYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTF

Query:  DELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNS
         E+H  + + ES +   S    + I   A ++ +NT+   N  +   +  +  R    NS
Subjt:  DELHVLMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-0520.39Show/hide
Query:  TNICNLIS---IRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEALAYIVG
        TNI N+      +L S+NY+MW  Q+ ++   +++ GFLDGS   P A + + A          +  +NP Y  W  +D  + + I   +++     +  
Subjt:  TNICNLIS---IRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEALAYIVG

Query:  CRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTFDELHV
          ++ +  E L   Y++ S  ++  L+            FI  +        D+L  +   ++ ++ +   L  LP +Y      +  +    +  E+H 
Subjt:  CRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTFDELHV

Query:  LMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNS
         + + ES L   +  E + I    ++  +NT+   N  +   + N+       NS
Subjt:  LMKSEESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0721.24Show/hide
Query:  LTNICNLISIRLD--SSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEAL-AYIV
        ++NI + I + LD   SNY  W+   ++   +  + G +DG                      T++  N +  +W  +D  +   +  TLT +      V
Subjt:  LTNICNLISIRLD--SSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEAL-AYIV

Query:  GCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTFDELH
           +S++    +   + +   +  + L + L+          D Y +++K++ D L N+   V + +L++Y LNGL  +++     ++ R    +FD+  
Subjt:  GCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTFDELH

Query:  VLMKSEESALEKQSKREDLTI---QPTAMLASQNTSPRPNL-TSSVSSPNFRGRGRGRN
         +++ EE  L++  K     +     + +LA     P  N   S  +   +RGRGRG N
Subjt:  VLMKSEESALEKQSKREDLTI---QPTAMLASQNTSPRPNL-TSSVSSPNFRGRGRGRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCCTAGTCTTTCTTCGACCAAGGATTTGAATTCGCCGATGTTTTTGTTGACAAACATCTGCAATCTGATTTCAATCCGCCTGGATTCTTCTAATTATGTGAT
GTGGAAGTTCCAGATGATGTCAATCCTGAAAGCACACAAAATTTTTGGATTTCTGGATGGATCGAACGTTGCTCCTACTGCATTCTTAACTTCAACTGCTTCTGAAGCCT
CATCATCTTCTTCAGTCACAATCATTGAAATTAATCCGTCATACGAAGATTGGATGGCCAAGGATCACGCTTTGATGACTCTTATCAACGCTACGCTCACTCTTGAGGCA
CTCGCATATATTGTTGGCTGTCGATCCTCTAAGGAAGAAGGAGAGGTACTTGCTCCCCATTATTCCTCCACATCCCGATCCAATATTGTTAATCTCAAAACAAACCTTCA
AGCTATTTCCAAGAAGCCAAACGAATTCATTGATTCTTATGTTAAACGGATTAAAGAGATCAAAGACAGATTGGAAAATATTTCTTCTGTGGTGAATGAAGAAGATCTCA
TGATTTATGCACTAAATGGATTGCCAGTGGAATATAACACCTTCAGAACCTTTATGCGCACAAGGTCAGGAATTGTTACTTTTGATGAACTTCATGTCCTTATGAAGTCA
GAGGAATCTGCTCTGGAAAAACAGTCCAAAAGAGAGGATCTCACTATACAACCTACTGCTATGCTTGCATCTCAGAACACTTCGCCTCGACCAAATTTGACTTCTTCTGT
TTCTTCTCCTAATTTTCGAGGTCGTGGTAGAGGAAGGAACTCTGGGGATATATCTTTTATTAGTTTTTGGATTTACATGAGGATTAATGGGTTTAGTTTGGAGGCGATTG
TGCTGTCCCCCCTACTCAGTTGCGTACCTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCCTAGTCTTTCTTCGACCAAGGATTTGAATTCGCCGATGTTTTTGTTGACAAACATCTGCAATCTGATTTCAATCCGCCTGGATTCTTCTAATTATGTGAT
GTGGAAGTTCCAGATGATGTCAATCCTGAAAGCACACAAAATTTTTGGATTTCTGGATGGATCGAACGTTGCTCCTACTGCATTCTTAACTTCAACTGCTTCTGAAGCCT
CATCATCTTCTTCAGTCACAATCATTGAAATTAATCCGTCATACGAAGATTGGATGGCCAAGGATCACGCTTTGATGACTCTTATCAACGCTACGCTCACTCTTGAGGCA
CTCGCATATATTGTTGGCTGTCGATCCTCTAAGGAAGAAGGAGAGGTACTTGCTCCCCATTATTCCTCCACATCCCGATCCAATATTGTTAATCTCAAAACAAACCTTCA
AGCTATTTCCAAGAAGCCAAACGAATTCATTGATTCTTATGTTAAACGGATTAAAGAGATCAAAGACAGATTGGAAAATATTTCTTCTGTGGTGAATGAAGAAGATCTCA
TGATTTATGCACTAAATGGATTGCCAGTGGAATATAACACCTTCAGAACCTTTATGCGCACAAGGTCAGGAATTGTTACTTTTGATGAACTTCATGTCCTTATGAAGTCA
GAGGAATCTGCTCTGGAAAAACAGTCCAAAAGAGAGGATCTCACTATACAACCTACTGCTATGCTTGCATCTCAGAACACTTCGCCTCGACCAAATTTGACTTCTTCTGT
TTCTTCTCCTAATTTTCGAGGTCGTGGTAGAGGAAGGAACTCTGGGGATATATCTTTTATTAGTTTTTGGATTTACATGAGGATTAATGGGTTTAGTTTGGAGGCGATTG
TGCTGTCCCCCCTACTCAGTTGCGTACCTTGTTGA
Protein sequenceShow/hide protein sequence
MASPSLSSTKDLNSPMFLLTNICNLISIRLDSSNYVMWKFQMMSILKAHKIFGFLDGSNVAPTAFLTSTASEASSSSSVTIIEINPSYEDWMAKDHALMTLINATLTLEA
LAYIVGCRSSKEEGEVLAPHYSSTSRSNIVNLKTNLQAISKKPNEFIDSYVKRIKEIKDRLENISSVVNEEDLMIYALNGLPVEYNTFRTFMRTRSGIVTFDELHVLMKS
EESALEKQSKREDLTIQPTAMLASQNTSPRPNLTSSVSSPNFRGRGRGRNSGDISFISFWIYMRINGFSLEAIVLSPLLSCVPC