CuGenDBv2

Moc11g14690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g14690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:10968087..10968920
RNA-Seq Expression: Moc11g14690
Synteny: Moc11g14690
Gene Ontology terms: NA
InterPro domains: NA
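As a quick consistency check on the genome location listed above, the following Python sketch parses the location string and computes the span length. It assumes the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page). The names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of any CuGenDB API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:10968087..10968920")
length = span_length(start, end)
print(chrom, length, length // 3)
```

Under these assumptions the span is 834 bp, or 278 codons, which would be consistent with a single-exon gene encoding the 277-residue protein listed under Sequences plus a stop codon.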


Homology
GenBank top hits | e-value | % identity
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 6.4e-47 | 41.27
Query:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR
        I   GN+IS++KL D  +LLWKFQ+LT ++ +DLE F++  ++PP+++L +T +   ++T   NP Y  W +QD+L+SSWLLG+M+E+IL QM+ C S +
Subjt:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR

Query:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE
        EIWE+L+  +++   A+ MQ K +L N+KKG M +KEY  K    V+AL ++   V++ +HI++IL GL ++Y S++SVISA++    +QE+ +LL++ E
Subjt:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE

Query:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR
        ++ E  S +  +  +PS N+  Q  +K +    + N NN   N SY Q GGR
Subjt:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | 1.6e-42 | 37.88
Query:  SSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNS-TKEENPEYLSWIKQDKLLSSWL
        SSNS+ + V  +       I   GN+IS++KL+D N+LLWKFQ+LT ++ +DLE F +   +PP+++L +T     S T+  NPEY  W + ++L+S WL
Subjt:  SSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNS-TKEENPEYLSWIKQDKLLSSWL

Query:  LGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVIS
        LG+M+E+IL QMV C S +EIW +L+  +++   A+ MQ K +L N+KKG M++KEY  K +  V+AL ++   V++ +HI++IL+GL  +Y S++S+IS
Subjt:  LGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVIS

Query:  AKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDM---DQKNYNNQQY
        A++    +QE+ +LL++ E++ E  S +  +  +P   +  Q  +K +       Q NY+N  +
Subjt:  AKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDM---DQKNYNNQQY

KAE8652954.1 hypothetical protein Csa_017771 [Cucumis sativus] | 5.2e-41 | 50.82
Query:  MASSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQA-----------TNDEGNSTKE---ENPE
        MASS+ST  +   S     S+I NPG+ +++IKLTD NYL WK Q+L TI GH LE  I   +KP    L +            + E N+ +E   ENP+
Subjt:  MASSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQA-----------TNDEGNSTKE---ENPE

Query:  YLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNAL
        Y+SW++QD+L+  WL+G+M EDI+ QM+GC + REIW +L+QTY++SNTAKIMQLKG+LQNLKKG  +I++Y AK KNLV+AL
Subjt:  YLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNAL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 6.4e-47 | 41.27
Query:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR
        I   GN+IS++KL D  +LLWKFQ+LT ++ +DLE F++  ++PP+++L +T +   ++T   NP Y  W +QD+L+SSWLLG+M+E+IL QM+ C S +
Subjt:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR

Query:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE
        EIWE+L+  +++   A+ MQ K +L N+KKG M +KEY  K    V+AL ++   V++ +HI++IL GL ++Y S++SVISA++    +QE+ +LL++ E
Subjt:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE

Query:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR
        ++ E  S +  +  +PS N+  Q  +K +    + N NN   N SY Q GGR
Subjt:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | 1.5e-56 | 47.24
Query:  QGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTK-EENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGC
        Q S   NPG+++SI++L D N LLWKFQ+ T +QG+ LE +I      P QF+Q T DE +S+  ++NP Y  WIKQDKL+S+WLLG+M EDIL+QM+ C
Subjt:  QGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTK-EENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGC

Query:  TSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALL
         S REIW  L+  + +   A++MQLK +L+N KKG +++K+Y  K KNLV++L   G K++T++HI+ IL GL  E+D+I+SVI+A++ P+ LQE+ +LL
Subjt:  TSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALL

Query:  MSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNG
        +  E R ERN  IN DG++PS NLT  ++ K++     K +N  Q N S +  G
Subjt:  MSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNG

TrEMBL top hits | e-value | % identity
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.1e-47 | 41.27
Query:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR
        I   GN+IS++KL D  +LLWKFQ+LT ++ +DLE F++  ++PP+++L +T +   ++T   NP Y  W +QD+L+SSWLLG+M+E+IL QM+ C S +
Subjt:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR

Query:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE
        EIWE+L+  +++   A+ MQ K +L N+KKG M +KEY  K    V+AL ++   V++ +HI++IL GL ++Y S++SVISA++    +QE+ +LL++ E
Subjt:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE

Query:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR
        ++ E  S +  +  +PS N+  Q  +K +    + N NN   N SY Q GGR
Subjt:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR

A0A5A7UB21 Keratin, type II cytoskeletal 1-like | 7.9e-43 | 37.88
Query:  SSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNS-TKEENPEYLSWIKQDKLLSSWL
        SSNS+ + V  +       I   GN+IS++KL+D N+LLWKFQ+LT ++ +DLE F +   +PP+++L +T     S T+  NPEY  W + ++L+S WL
Subjt:  SSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNS-TKEENPEYLSWIKQDKLLSSWL

Query:  LGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVIS
        LG+M+E+IL QMV C S +EIW +L+  +++   A+ MQ K +L N+KKG M++KEY  K +  V+AL ++   V++ +HI++IL+GL  +Y S++S+IS
Subjt:  LGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVIS

Query:  AKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDM---DQKNYNNQQY
        A++    +QE+ +LL++ E++ E  S +  +  +P   +  Q  +K +       Q NY+N  +
Subjt:  AKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDM---DQKNYNNQQY

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.1e-47 | 41.27
Query:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR
        I   GN+IS++KL D  +LLWKFQ+LT ++ +DLE F++  ++PP+++L +T +   ++T   NP Y  W +QD+L+SSWLLG+M+E+IL QM+ C S +
Subjt:  ISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQAT-NDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTR

Query:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE
        EIWE+L+  +++   A+ MQ K +L N+KKG M +KEY  K    V+AL ++   V++ +HI++IL GL ++Y S++SVISA++    +QE+ +LL++ E
Subjt:  EIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE

Query:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR
        ++ E  S +  +  +PS N+  Q  +K +    + N NN   N SY Q GGR
Subjt:  NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR

A0A6J1DLT9 uncharacterized protein LOC111021757 | 7.3e-57 | 47.24
Query:  QGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTK-EENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGC
        Q S   NPG+++SI++L D N LLWKFQ+ T +QG+ LE +I      P QF+Q T DE +S+  ++NP Y  WIKQDKL+S+WLLG+M EDIL+QM+ C
Subjt:  QGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTK-EENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGC

Query:  TSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALL
         S REIW  L+  + +   A++MQLK +L+N KKG +++K+Y  K KNLV++L   G K++T++HI+ IL GL  E+D+I+SVI+A++ P+ LQE+ +LL
Subjt:  TSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALL

Query:  MSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNG
        +  E R ERN  IN DG++PS NLT  ++ K++     K +N  Q N S +  G
Subjt:  MSHENRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNG

A0A6J1DSS1 uncharacterized protein LOC111023586 | 9.6e-41 | 46.31
Query:  KFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEG---NSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIM
        KFQVLT IQGH LE++I    +PP++F+Q  N +G   ++T++ NPEY  WIKQDKL+S WLLG+M+E+IL+QM+ C   +EIW  L+ T+ + N A++M
Subjt:  KFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEG---NSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESLKQTYTTSNTAKIM

Query:  QLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSAN
        QLK +L+N+KKG MN+K Y  K KNLV++L   G ++ T +HI+ IL  L  E+DSIVSVIS +  P+ +QE  +   SH    +  SS     +   A 
Subjt:  QLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERNSSINLDGTIPSAN

Query:  LTF
          F
Subjt:  LTF

SwissProt top hits | e-value | % identity
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 9.0e-20 | 27.38
Query:  NQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESL
        N  ++ KLT  NYL+W  QV     G++L  F+      P   +       ++    NP+Y  W +QDKL+ S +LGA++  +   +   T+  +IWE+L
Subjt:  NQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESL

Query:  KQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERN
        ++ Y   +   + QL+ +L+   KG   I +Y+       + L  +G  +   E +  +L  L  EY  ++  I+AK  P  L EI+  L++HE+++   
Subjt:  KQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERN

Query:  SS---INLDGTIPSANLTFQNNDKRSGDMDQK--NYNNQQYNKSYKQNGGRF
        SS   I +     S   T   N+  +G+ + +  N NN   +K ++Q+   F
Subjt:  SS---INLDGTIPSANLTFQNNDKRSGDMDQK--NYNNQQYNKSYKQNGGRF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.5e-14 | 25.91
Query:  NQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESL
        N  ++ KLT  NYL+W  QV     G++L  F+ +G+ P       T    ++    NP+Y  W +QDKL+ S +LGA++  +   +   T+  +IWE+L
Subjt:  NQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWESL

Query:  KQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERN
        ++ Y   +   + QL+               ++ +     + L  +G  +   E +  +L  L  +Y  ++  I+AK  P  L EI+  L++ E++L   
Subjt:  KQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERN

Query:  SSINLDGTIP-SANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR
         ++N    +P +AN+    N   +     +N NN+  N++Y  N  R
Subjt:  SSINLDGTIP-SANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGR

Arabidopsis top hits | e-value | % identity
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 4.0e-07 | 24.82
Query:  NQISIIKLT--DINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWE
        +  SI KL+  + NY+ WK +  + ++      FI      P  F              +P Y  W + + ++  WL+ +MT+ +L  ++   +  ++WE
Subjt:  NQISIIKLT--DINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQMVGCTSTREIWE

Query:  SLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAK
         L++ +      KI QL+  L  L++GG +++EY  K
Subjt:  SLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAK

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.0e-10 | 24.54
Query:  VVGSLPGQGSVISNPGNQISI-IKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMT-ED
        VV   P Q   +SN  + I + + + + NY  W+   LT     D+   I +G   PT                N   ++W K+D ++   L G +T + 
Subjt:  VVGSLPGQGSVISNPGNQISI-IKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMT-ED

Query:  ILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKP
             V  +++R+IW  +K  +  +  A+ ++L  EL+    G M + +Y  K K L ++L  V   VT +  ++++L GL  ++D+I++VI  +     
Subjt:  ILAQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKP

Query:  LQEIYALLMSHENRLERNSSINLDGTIPSANLTF--------QNNDKRSGDMDQKNYNNQQYNKSYKQNGGRF
          +   +L   E+RL+R    N      S++ T           N +RSG          + N  ++  GGRF
Subjt:  LQEIYALLMSHENRLERNSSINLDGTIPSANLTF--------QNNDKRSGDMDQKNYNNQQYNKSYKQNGGRF

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 2.4e-12 | 27.72
Query:  EGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQM--VGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVG
        +G+ST     E   W ++D L+  W+ G +T+ +L  +  VGCT+ R++W SL+  +  +  A+ +Q + EL+      +++ EY  K K+L + L  V 
Subjt:  EGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDILAQM--VGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVG

Query:  CKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERNSSINLDGT----IPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQN
          ++ +  ++ +L GL  +YD I++VI  KS      E  ++L+  E+RL   S  +L  T    + +   T     +R       N +N    +S K+N
Subjt:  CKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHENRLERNSSINLDGT----IPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQN

Query:  GG
         G
Subjt:  GG


Sequences
CDS sequence
ATGGCGTCTTCCAATTCTACTCAAATTAATGTAGTAGGTTCTCTTCCGGGGCAAGGTTCGGTGATCTCCAACCCTGGCAATCAAATTTCGATCATAAAACTTACTGATAT
CAATTATCTATTATGGAAATTTCAAGTTCTAACAACTATTCAAGGTCACGATTTGGAGAAATTCATCAAAGAAGGTGCTAAGCCTCCAACGCAATTTCTTCAAGCCACCA
ATGACGAAGGAAATTCTACAAAAGAGGAGAATCCTGAATATCTTTCTTGGATAAAACAAGATAAATTGTTATCTTCTTGGCTTTTAGGAGCAATGACAGAAGATATATTA
GCACAGATGGTAGGTTGCACTTCAACAAGGGAAATATGGGAATCTCTTAAACAAACCTACACAACATCAAATACTGCTAAGATCATGCAATTGAAGGGTGAGTTACAAAA
TCTGAAGAAAGGAGGTATGAATATTAAAGAGTATGTTGCAAAAGCGAAAAATCTTGTTAATGCTCTTAATGCAGTAGGATGCAAAGTTACAACACAAGAACACATAGTCT
TTATCCTCTTAGGTTTATGTACAGAATATGACTCAATTGTTTCTGTTATCTCTGCAAAATCTAAACCCAAACCATTACAAGAAATTTATGCTTTACTCATGAGTCATGAA
AATAGATTAGAGAGGAATTCTTCTATTAATCTTGATGGAACTATACCTTCTGCGAATCTTACATTTCAGAATAATGATAAAAGGTCAGGTGATATGGATCAAAAGAATTA
CAATAATCAACAGTACAACAAGAGCTACAAACAAAATGGTGGACGGTTTGGTAAAAGGAGGTAA
mRNA sequence
ATGGCGTCTTCCAATTCTACTCAAATTAATGTAGTAGGTTCTCTTCCGGGGCAAGGTTCGGTGATCTCCAACCCTGGCAATCAAATTTCGATCATAAAACTTACTGATAT
CAATTATCTATTATGGAAATTTCAAGTTCTAACAACTATTCAAGGTCACGATTTGGAGAAATTCATCAAAGAAGGTGCTAAGCCTCCAACGCAATTTCTTCAAGCCACCA
ATGACGAAGGAAATTCTACAAAAGAGGAGAATCCTGAATATCTTTCTTGGATAAAACAAGATAAATTGTTATCTTCTTGGCTTTTAGGAGCAATGACAGAAGATATATTA
GCACAGATGGTAGGTTGCACTTCAACAAGGGAAATATGGGAATCTCTTAAACAAACCTACACAACATCAAATACTGCTAAGATCATGCAATTGAAGGGTGAGTTACAAAA
TCTGAAGAAAGGAGGTATGAATATTAAAGAGTATGTTGCAAAAGCGAAAAATCTTGTTAATGCTCTTAATGCAGTAGGATGCAAAGTTACAACACAAGAACACATAGTCT
TTATCCTCTTAGGTTTATGTACAGAATATGACTCAATTGTTTCTGTTATCTCTGCAAAATCTAAACCCAAACCATTACAAGAAATTTATGCTTTACTCATGAGTCATGAA
AATAGATTAGAGAGGAATTCTTCTATTAATCTTGATGGAACTATACCTTCTGCGAATCTTACATTTCAGAATAATGATAAAAGGTCAGGTGATATGGATCAAAAGAATTA
CAATAATCAACAGTACAACAAGAGCTACAAACAAAATGGTGGACGGTTTGGTAAAAGGAGGTAA
Protein sequence
MASSNSTQINVVGSLPGQGSVISNPGNQISIIKLTDINYLLWKFQVLTTIQGHDLEKFIKEGAKPPTQFLQATNDEGNSTKEENPEYLSWIKQDKLLSSWLLGAMTEDIL
AQMVGCTSTREIWESLKQTYTTSNTAKIMQLKGELQNLKKGGMNIKEYVAKAKNLVNALNAVGCKVTTQEHIVFILLGLCTEYDSIVSVISAKSKPKPLQEIYALLMSHE
NRLERNSSINLDGTIPSANLTFQNNDKRSGDMDQKNYNNQQYNKSYKQNGGRFGKRR