; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g02890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g02890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:2201760..2205045
RNA-Seq ExpressionMoc03g02890
SyntenyMoc03g02890
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]3.1e-4142.92Show/hide
Query:  ILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDH
        +LN+EY  W RQD LI++WLLGSMS  +L++ML   + +++W+ L   +SSR +A+ M  K+KL   KKG + L+EYF KI+  VDALA+  + IS +DH
Subjt:  ILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDH

Query:  VLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTVNNDPNRRG----------------KNSGQ
        +L+IL GLG+EY S++S+I+ +  SPS+Q   SLLL QE++IE  S I  + SLP++N+TTH     S    ++   RG                K+   
Subjt:  VLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTVNNDPNRRG----------------KNSGQ

Query:  KFNNRRFWNNNGRAQCQTCGRFGHTAARCYFRF
          +NR    N  + QCQ C +FGH A RCYFR+
Subjt:  KFNNRRFWNNNGRAQCQTCGRFGHTAARCYFRF

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.1e-4142.25Show/hide
Query:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH
        AT   N  Y  W RQD LI++WLLGSMS  +L++ML CK+ +E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+  + +S 
Subjt:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH

Query:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTV----------NNDPNRRGKNSGQKFN
        +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V SLLL QE++ E  S +  + +LPS+N+ T    K + +           N+  N+RG     + N
Subjt:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTV----------NNDPNRRGKNSGQKFN

Query:  NRRFWNNNGRAQCQTCGRFGHTAARCYFRF--ERNFQG--PNAH-SPHQNTNQFSQIS
          R  N N + QCQ C + G++A RC+FR+    N  G  PN+H + + N N   Q+S
Subjt:  NRRFWNNNGRAQCQTCGRFGHTAARCYFRF--ERNFQG--PNAH-SPHQNTNQFSQIS

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]2.5e-5447.22Show/hide
Query:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +L EM+ C T REVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YF+K+K LVD+LA AG+K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG
        SE++S VSVI+ +  + +LQ+VYSLLL  E R ER+S IN DG+LPS+NLT  T N     S     P   N R KNSG   N RR WN+N R QCQ  G
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG

Query:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH
        +FGHTA RCY RFE+ F GPN  S  Q+       S  +S+   F     AFT   + N  + + + +        T+  P +G   H
Subjt:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]2.5e-5447.22Show/hide
Query:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +L EM+ C T REVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YF+K+K LVD+LA AG+K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG
        SE++S VSVI+ +  + +LQ+VYSLLL  E R ER+S IN DG+LPS+NLT  T N     S     P   N R KNSG   N RR WN+N R QCQ  G
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG

Query:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH
        +FGHTA RCY RFE+ F GPN  S  Q+       S  +S+   F     AFT   + N  + + + +        T+  P +G   H
Subjt:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.4e-6251.07Show/hide
Query:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH
        ++L  N  Y  WI+QD LI+AWLLGSM+  +LS+MLDCK+ RE+W VL   F+SR +AR+M LK KLE  KKG L L++YF KIKNLVD+LA AG+K+S 
Subjt:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH

Query:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTH-----NPVKQSSTVN---NDPNRRGKNSGQKFNNR
        EDH++HIL GLG E+D+++SVIT +++  +LQ+V SLLL QE R ER + IN DGSLPS+NLT +     N + QS   N   ++ ++RG+ +  + +NR
Subjt:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTH-----NPVKQSSTVN---NDPNRRGKNSGQKFNNR

Query:  RFWNNNGRAQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSS---SQPPFNAFTLQHELNKENSLPS
        R W  N + QCQ CGRFGHTA RCY RFERNF GPN      N N FS     S    + P  NAF+     N   S+ S
Subjt:  RFWNNNGRAQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSS---SQPPFNAFTLQHELNKENSLPS

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-4142.25Show/hide
Query:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH
        AT   N  Y  W RQD LI++WLLGSMS  +L++ML CK+ +E+W+ L   FSSR +A+ M  K+KL   KKG + L+EYF KI   VDALA+  + +S 
Subjt:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH

Query:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTV----------NNDPNRRGKNSGQKFN
        +DH+L+IL GLGS+Y S++SVI+ +  SPS+Q+V SLLL QE++ E  S +  + +LPS+N+ T    K + +           N+  N+RG     + N
Subjt:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTV----------NNDPNRRGKNSGQKFN

Query:  NRRFWNNNGRAQCQTCGRFGHTAARCYFRF--ERNFQG--PNAH-SPHQNTNQFSQIS
          R  N N + QCQ C + G++A RC+FR+    N  G  PN+H + + N N   Q+S
Subjt:  NRRFWNNNGRAQCQTCGRFGHTAARCYFRF--ERNFQG--PNAH-SPHQNTNQFSQIS

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon1.5e-4142.92Show/hide
Query:  ILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDH
        +LN+EY  W RQD LI++WLLGSMS  +L++ML   + +++W+ L   +SSR +A+ M  K+KL   KKG + L+EYF KI+  VDALA+  + IS +DH
Subjt:  ILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDH

Query:  VLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTVNNDPNRRG----------------KNSGQ
        +L+IL GLG+EY S++S+I+ +  SPS+Q   SLLL QE++IE  S I  + SLP++N+TTH     S    ++   RG                K+   
Subjt:  VLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTVNNDPNRRG----------------KNSGQ

Query:  KFNNRRFWNNNGRAQCQTCGRFGHTAARCYFRF
          +NR    N  + QCQ C +FGH A RCYFR+
Subjt:  KFNNRRFWNNNGRAQCQTCGRFGHTAARCYFRF

A0A6J1C6N9 dr1-associated corepressor homolog isoform X11.2e-5447.22Show/hide
Query:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +L EM+ C T REVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YF+K+K LVD+LA AG+K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG
        SE++S VSVI+ +  + +LQ+VYSLLL  E R ER+S IN DG+LPS+NLT  T N     S     P   N R KNSG   N RR WN+N R QCQ  G
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG

Query:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH
        +FGHTA RCY RFE+ F GPN  S  Q+       S  +S+   F     AFT   + N  + + + +        T+  P +G   H
Subjt:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH

A0A6J1C8R2 dr1-associated corepressor homolog isoform X21.2e-5447.22Show/hide
Query:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG
        ++QD LIT+WL  SM   +L EM+ C T REVWQ+L   ++SRN+AR+M LKSKLE  KKG L L++YF+K+K LVD+LA AG+K++ EDH++HIL GL 
Subjt:  IRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQGLG

Query:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG
        SE++S VSVI+ +  + +LQ+VYSLLL  E R ER+S IN DG+LPS+NLT  T N     S     P   N R KNSG   N RR WN+N R QCQ  G
Subjt:  SEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLT--THNPVKQSSTVNNDP---NRRGKNSGQKFNNRRFWNNNGRAQCQTCG

Query:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH
        +FGHTA RCY RFE+ F GPN  S  Q+       S  +S+   F     AFT   + N  + + + +        T+  P +G   H
Subjt:  RFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPF----NAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQH

A0A6J1DLT9 uncharacterized protein LOC1110217576.9e-6351.07Show/hide
Query:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH
        ++L  N  Y  WI+QD LI+AWLLGSM+  +LS+MLDCK+ RE+W VL   F+SR +AR+M LK KLE  KKG L L++YF KIKNLVD+LA AG+K+S 
Subjt:  ATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISH

Query:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTH-----NPVKQSSTVN---NDPNRRGKNSGQKFNNR
        EDH++HIL GLG E+D+++SVIT +++  +LQ+V SLLL QE R ER + IN DGSLPS+NLT +     N + QS   N   ++ ++RG+ +  + +NR
Subjt:  EDHVLHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTH-----NPVKQSSTVN---NDPNRRGKNSGQKFNNR

Query:  RFWNNNGRAQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSS---SQPPFNAFTLQHELNKENSLPS
        R W  N + QCQ CGRFGHTA RCY RFERNF GPN      N N FS     S    + P  NAF+     N   S+ S
Subjt:  RFWNNNGRAQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSS---SQPPFNAFTLQHELNKENSLPS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.3e-1725.81Show/hide
Query:  LNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHV
        +N +Y+ W RQD LI + +LG++S S+   +    T  ++W+ L   +++ +   +  L+++L+   KG   +++Y + +    D LA  G+ + H++ V
Subjt:  LNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHV

Query:  LHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRI--ERHSTINP--DGSLPSINLTTHNPVKQSSTVNNDPNRRGKNSGQKF--NNRRFWNNN
          +L+ L  EY  V+  I  KD  P+L +++  LL  E++I     +T+ P    ++   N TT N     +  N   NR   N+ + +  ++  F  NN
Subjt:  LHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRI--ERHSTINP--DGSLPSINLTTHNPVKQSSTVNNDPNRRGKNSGQKF--NNRRFWNNN

Query:  GRA-----QCQTCGRFGHTAARC-----YFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPFNAFTLQHELNKENSL
         ++     +CQ CG  GH+A RC     +     + Q P+  +P Q     +  S  SS+    ++    H  +  N+L
Subjt:  GRA-----QCQTCGRFGHTAARC-----YFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPFNAFTLQHELNKENSL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1124.53Show/hide
Query:  LNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHV
        +N +Y+ W RQD LI + +LG++S S+   +    T  ++W+ L   +++ +   +  L                   +     D LA  G+ + H++ V
Subjt:  LNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHV

Query:  LHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLP-SINLTTHNPVKQSSTVNNDPNRRGKNSGQKFNNRR--FWNNNGR-
          +L+ L  +Y  V+  I  KD  PSL +++  L+ +E+++     +N    +P + N+ TH    +++  N + N RG N     NN R   W  +   
Subjt:  LHILQGLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLP-SINLTTHNPVKQSSTVNNDPNRRGKNSGQKFNNRR--FWNNNGR-

Query:  ------------AQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPFN
                     +CQ C   GH+A RC          P  H     TNQ    S  +  QP  N
Subjt:  ------------AQCQTCGRFGHTAARCYFRFERNFQGPNAHSPHQNTNQFSQISVQSSSQPPFN

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.9e-0527.85Show/hide
Query:  YSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNL
        Y  W + ++++  WL+ SM++ LL  ++  +T  ++W+ L   F      ++  L+ +L T ++GG  +EEYF K+  +
Subjt:  YSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.8e-1023.38Show/hide
Query:  WIRQDSLITAWLLGSMS-NSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQG
        W ++D ++   L G+++        +   T+R++W  +  +F +   AR + L S+L T   G +++ +Y++K+K L D+L      ++  + V+++L G
Subjt:  WIRQDSLITAWLLGSMS-NSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQG

Query:  LGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINP---DGSLPSINLTTHNPVKQSSTVNNDPNRRGKNSGQKFNNRRFWNNNGRAQCQTCG
        L  ++D++++VI  +   PS     ++L  +E+R++R    NP   D S  S  L        ++   +  N+ G   G+   N  F    GR       
Subjt:  LGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINP---DGSLPSINLTTHNPVKQSSTVNNDPNRRGKNSGQKFNNRRFWNNNGRAQCQTCG

Query:  RFGH-TAARCYFRFERNFQGPNAHSPHQNTN
         F        Y    + +  P  + P+ NTN
Subjt:  RFGH-TAARCYFRFERNFQGPNAHSPHQNTN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.1e-1529.13Show/hide
Query:  WIRQDSLITAWLLGSMSNSLLSEMLDCK-TTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQG
        W  +D L+  W+ G++++SLL  ++    T R++W  L   F     AR +  +++L TT    L + EY +K+K+L D L      IS    V+H+L G
Subjt:  WIRQDSLITAWLLGSMSNSLLSEMLDCK-TTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQG

Query:  LGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHS------TINPDGSLPSINLTT------------HNPVKQSSTVNNDPNRRGKNSGQKFNNR
        L  +YD +++VI  K   PS  +  S+LL++E+R+   S      T +P  SL ++  T             +N        +   NR G +S  ++NN 
Subjt:  LGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHS------TINPDGSLPSINLTT------------HNPVKQSSTVNNDPNRRGKNSGQKFNNR

Query:  RFWNNN
          W  N
Subjt:  RFWNNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACTCTGATTCTAAATACTGAGTACTCTTACTGGATTCGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCCTATCAGAAATGTT
AGACTGCAAGACTACTAGAGAGGTATGGCAAGTTTTGAATGCTCGGTTTTCTTCACGAAATATGGCGAGACTAATGGATTTGAAATCCAAACTCGAAACAACGAAGAAAG
GAGGTCTGAAACTTGAAGAGTATTTTAAAAAGATTAAAAATCTGGTTGATGCCTTGGCTACGGCTGGACGCAAAATTTCGCATGAGGATCATGTATTACATATTTTGCAA
GGCTTAGGTTCTGAGTATGATTCTGTAGTCTCTGTAATTACCGATAAAGATATCTCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGATTCAAGAAAATAGAATTGA
ACGTCACTCTACCATCAATCCCGATGGTTCTCTGCCTTCAATAAACCTTACCACTCACAATCCAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGGGA
AAAATTCGGGACAGAAGTTCAATAATCGACGATTCTGGAACAACAATGGTAGGGCCCAATGTCAAACTTGTGGACGCTTTGGCCACACTGCTGCACGTTGCTACTTCAGA
TTTGAGCGAAATTTTCAAGGTCCGAATGCTCACTCACCTCATCAGAATACCAATCAATTTTCTCAAATATCAGTGCAAAGTTCTTCTCAACCTCCTTTCAATGCCTTTAC
TCTACAACATGAATTGAACAAGGAAAATTCGTTGCCATCTCCCATTCTTTCTCCTCCACCAGCTGCTTCGACATCTGCTTCTCCAACAAATGGTGGTAACCAACATCCCA
TGGTTACTCGGAGAGAGAAGTTTTCTGATGTTCGGCTGTATCGTAGTACTATGGGTGCTTTACAATATGCTACTCTTGTTGGGTTTTGGCAAAATATTGGAATTAAAGTT
GTCCTAGAATACAAGGATGGTCTCCATGTGCTAGAAGTTGTCCCACATCGGAAAAGTACAAGGAGGAGATGTGTTTCAACTACTATAAAAGGAGCTCATGAGCTTGGGTT
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACTCTGATTCTAAATACTGAGTACTCTTACTGGATTCGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCCTATCAGAAATGTT
AGACTGCAAGACTACTAGAGAGGTATGGCAAGTTTTGAATGCTCGGTTTTCTTCACGAAATATGGCGAGACTAATGGATTTGAAATCCAAACTCGAAACAACGAAGAAAG
GAGGTCTGAAACTTGAAGAGTATTTTAAAAAGATTAAAAATCTGGTTGATGCCTTGGCTACGGCTGGACGCAAAATTTCGCATGAGGATCATGTATTACATATTTTGCAA
GGCTTAGGTTCTGAGTATGATTCTGTAGTCTCTGTAATTACCGATAAAGATATCTCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGATTCAAGAAAATAGAATTGA
ACGTCACTCTACCATCAATCCCGATGGTTCTCTGCCTTCAATAAACCTTACCACTCACAATCCAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGGGA
AAAATTCGGGACAGAAGTTCAATAATCGACGATTCTGGAACAACAATGGTAGGGCCCAATGTCAAACTTGTGGACGCTTTGGCCACACTGCTGCACGTTGCTACTTCAGA
TTTGAGCGAAATTTTCAAGGTCCGAATGCTCACTCACCTCATCAGAATACCAATCAATTTTCTCAAATATCAGTGCAAAGTTCTTCTCAACCTCCTTTCAATGCCTTTAC
TCTACAACATGAATTGAACAAGGAAAATTCGTTGCCATCTCCCATTCTTTCTCCTCCACCAGCTGCTTCGACATCTGCTTCTCCAACAAATGGTGGTAACCAACATCCCA
TGGTTACTCGGAGAGAGAAGTTTTCTGATGTTCGGCTGTATCGTAGTACTATGGGTGCTTTACAATATGCTACTCTTGTTGGGTTTTGGCAAAATATTGGAATTAAAGTT
GTCCTAGAATACAAGGATGGTCTCCATGTGCTAGAAGTTGTCCCACATCGGAAAAGTACAAGGAGGAGATGTGTTTCAACTACTATAAAAGGAGCTCATGAGCTTGGGTT
TTAA
Protein sequenceShow/hide protein sequence
MATLILNTEYSYWIRQDSLITAWLLGSMSNSLLSEMLDCKTTREVWQVLNARFSSRNMARLMDLKSKLETTKKGGLKLEEYFKKIKNLVDALATAGRKISHEDHVLHILQ
GLGSEYDSVVSVITDKDISPSLQKVYSLLLIQENRIERHSTINPDGSLPSINLTTHNPVKQSSTVNNDPNRRGKNSGQKFNNRRFWNNNGRAQCQTCGRFGHTAARCYFR
FERNFQGPNAHSPHQNTNQFSQISVQSSSQPPFNAFTLQHELNKENSLPSPILSPPPAASTSASPTNGGNQHPMVTRREKFSDVRLYRSTMGALQYATLVGFWQNIGIKV
VLEYKDGLHVLEVVPHRKSTRRRCVSTTIKGAHELGF