; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g26470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g26470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:19774043..19776227
RNA-Seq ExpressionMoc09g26470
SyntenyMoc09g26470
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]4.5e-8742.26Show/hide
Query:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL
        AP F + N  I  P I+A +FELKP MFQMLQ +G F G+ +EDPH HLR FM+++D FK  GV ++A+RLKLF YS+RD  R WL+SLPA S+T+WNDL
Subjt:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL

Query:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH
          +FL +Y PP+ NA+LRN+IN+FQQ   ESL D+W+RFK LL+KC HHGI   IQ+ET+YNG+N  T++V+DAS NGALLS  Y +A+++LE I+   +
Subjt:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH

Query:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVT----TNEVVGSKAKVAAVQNNLCTYCEGQHHFENC-----------------------
        QWS S+ A T     G+   D +  + ++++ + + ++KN++     ++     +++   +N  C +C   H +++C                       
Subjt:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVT----TNEVVGSKAKVAAVQNNLCTYCEGQHHFENC-----------------------

Query:  ---------FSGQNQWN-----------------TQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIG
                 FS  NQ                   +Q+ P+S SL+ M K Y+IKN+A+     AL+QSQAASLRN+E Q+GQLA EL+NRP G LPSD  
Subjt:  ---------FSGQNQWN-----------------TQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIG

Query:  QPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEPKTNAEELTDPAKAQ
        +PK  G + CKA+TL+SGK L       K  +    EP  N EE+ D  +++
Subjt:  QPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEPKTNAEELTDPAKAQ

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus]6.9e-8842.73Show/hide
Query:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL
        AP F + N  I  P I+A +FELKP MFQMLQ +G F G+ +EDPH HLR FM+++D FK  GV ++A+RLKLF YS+RD  R WL+SLPA S+T+WNDL
Subjt:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL

Query:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH
          +FL +Y PP+ NA+L N+IN+FQQ   ESL D+W+RFK LL+KC HHGI   IQ+ET+YNG+N  T++V+DAS NGALLS  Y +A+++LE I+ N +
Subjt:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH

Query:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVT----TNEVVGSKAKVAAVQNNLCTYCEGQHHFENC-----------------------
        QW  S+ A T     G+   D +  + ++++ + + ++KN++     ++     +++   +N  C +C   H +++C                       
Subjt:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVT----TNEVVGSKAKVAAVQNNLCTYCEGQHHFENC-----------------------

Query:  ---------FSGQNQWN-----------------TQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIG
                 FS  NQ                   +Q+ P+S SL+ M K Y+IKN+A+     AL+QSQAASLRN+E Q+GQL  EL+NRP G LPSD  
Subjt:  ---------FSGQNQWN-----------------TQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIG

Query:  QPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEPKTNAEELTD
        +PK DG + CKA+TL+SGK L       K H++++ EP  N EE++D
Subjt:  QPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEPKTNAEELTD

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]7.6e-8741.46Show/hide
Query:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL
        AP F + N  I  P I+A +FELKP MFQMLQ VG F G+ +EDPH HLR FM+V+D FK  GV+++A+RLKLF YSLRD  RAWL+SLP+ S+T+W +L
Subjt:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL

Query:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH
        A +FLM+Y PP+ NA+LR +I +FQQ   ESL ++W+RFK LL+KC HHGIP  IQ+ET+YNG+N  T++V+DAS NGALL+  Y EA+D++ERIS N +
Subjt:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH

Query:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSK---AKVAAVQNNLCTYCEGQHHFENC------------------------
        QW  ++         G++  D +  L+++++ ++++I KN++  + +G +   + V  ++   C +C   H F+NC                        
Subjt:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSK---AKVAAVQNNLCTYCEGQHHFENC------------------------

Query:  -------------------------FSGQNQWNTQKQPESMSLKEMFKIYMIK-------NDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGV
                                 FS   Q   Q+  ++ SL+ M + +M K        ++ +      +QSQA S+R +E Q+GQLA EL+NRP+G 
Subjt:  -------------------------FSGQNQWNTQKQPESMSLKEMFKIYMIK-------NDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGV

Query:  LPSDIGQPKRDGKKQCKALTLRSGKALTPIQQQEKDHEE
        LPSD   P+RDGK+ CKA+ LRSGK L   ++  KD  E
Subjt:  LPSDIGQPKRDGKKQCKALTLRSGKALTPIQQQEKDHEE

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]5.1e-9145.02Show/hide
Query:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL
        AP F + N  I  P I+A  FELKP MFQMLQ VG F G  +EDPH H+R F++V+D FK  GVS+EA+RLKLF +SLRD  RAWL++LP +S+T+WNDL
Subjt:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL

Query:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH
        A +FL +Y PP+ NA+ R++I +FQQ   E+ SD+W+RFK LL+KC HHGIP  IQ+ET+YNG+N  +++V+DAS NGA+LS  Y EAF++LERI+ N +
Subjt:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH

Query:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQ--NNLCTYCEGQHHFEN--------CFSGQNQWNTQKQP----
        QWS ++ A T+    G++  D +  L ++++ + + I+KN+      GS    AA+Q   N C YC   H FEN        C+ G   +N    P    
Subjt:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQ--NNLCTYCEGQHHFEN--------CFSGQNQWNTQKQP----

Query:  ----------------------------------------ESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDI
                                                ++ SL+ + + YM KND        ++QSQAASLRN+EVQ+GQLA +LKNRP+G LPSD 
Subjt:  ----------------------------------------ESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDI

Query:  GQPKRDGKKQCKALTLRSGKAL
          P+RDGK+ CKA+TLRSGK +
Subjt:  GQPKRDGKKQCKALTLRSGKAL

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]2.6e-8743.54Show/hide
Query:  FYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQ
        F + N  I  P I+A  FELKP MFQMLQ VG F G  +EDPH H+  F++V+D FK  GVS+EA+RLKLF +SLRD  RAWL++LP++S+T+WNDLA  
Subjt:  FYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQ

Query:  FLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWS
        FL +Y PP+ NA+ R++I +FQQL  E+ SD+W+RFK LL+KC HHGIP  IQ+ET+YNG+N  +++V+DAS NGA+LS  Y EAF++LERI+ N +QWS
Subjt:  FLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWS

Query:  KSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQNN--LCTYCEGQHHFEN--------CFSGQNQWNTQKQP-------
         ++ A T+    G++  D +  L ++++ + + I+KN+      GS    AA+Q     C YC   H FEN        C+ G   +N    P       
Subjt:  KSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQNN--LCTYCEGQHHFEN--------CFSGQNQWNTQKQP-------

Query:  ------------------------------------------ESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPS
                                                  ++ SL+ + + YM KND       A++QSQAASLRN+EVQ+GQLA +LKNRP+G LPS
Subjt:  ------------------------------------------ESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPS

Query:  DIGQPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEP
        D   P+RD K+ CKA+TLRSGK    I +      EA+ EP
Subjt:  DIGQPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEP

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129458.8e-6536.26Show/hide
Query:  VAPAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWND
        V P     +  I  P I A  FE+KPA  QM+Q    F GL S+DP+ HL  F+++ D FK  GV+ +A+RL+LF +SLRD  ++WL+SLP  SIT+W D
Subjt:  VAPAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWND

Query:  LAGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNK
        LA +FL ++ PP+  A++RN I +F Q   ESL ++W+RFK LL++C HHGIP ++Q++T+YNG+    + +IDA+  GAL+S    +A+++LE ++ N 
Subjt:  LAGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNK

Query:  HQWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQNNL--CTYCEGQHHFENC--------FSG-------------
        +QW   ++ S  A   G    D +  L ++++ L+    K + T         V AVQN+L  C  C   H ++ C        F G             
Subjt:  HQWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQNNL--CTYCEGQHHFENC--------FSG-------------

Query:  ------------------------------QNQWNTQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDI
                                      Q Q   Q   +   L+E+   Y+ K D       A++QSQ ASLRN+E Q+GQLA  + NRP+G LPSD 
Subjt:  ------------------------------QNQWNTQKQPESMSLKEMFKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDI

Query:  GQPKRDGKKQCKALTLRSGKALTPIQQQEKDHE
         Q    GK+QC+A+TLRSGK +  + Q+  + E
Subjt:  GQPKRDGKKQCKALTLRSGKALTPIQQQEKDHE

A0A6J1DTD1 uncharacterized protein LOC1110241367.5e-6441.53Show/hide
Query:  MFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQFLMQYIPPSNNAELRNKINNFQQ
        MFQMLQ VG F+G A+EDPH+HL++ M V + FK+ G+SK+ MRLKLF +SLRD  R WL+SLP+ESITSW+DLA +FLM+Y PP+ NA+ RN+INNFQQ
Subjt:  MFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQFLMQYIPPSNNAELRNKINNFQQ

Query:  LPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWSKSQAASTTASPTGLVTEDVVADL
           ES + +                                                   EAF++LERIS N H W   +A    +S   LV  +    L
Subjt:  LPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWSKSQAASTTASPTGLVTEDVVADL

Query:  NSKISHLADIIMKNVTTNEVVGSKA---KVAAVQNNLCTYCEGQHHFENC---------FSGQN---------QWNTQKQPESMSLKEMFKIYMIKNDAN
        NSKI +L D++M+++T     G+      V  +Q   C++ EG HH+ NC           G N         Q   Q +    SL+++ K YM  NDA 
Subjt:  NSKISHLADIIMKNVTTNEVVGSKA---KVAAVQNNLCTYCEGQHHFENC---------FSGQN---------QWNTQKQPESMSLKEMFKIYMIKNDAN

Query:  VQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIGQPKRDGKKQCKALTLRSGKALTPI
        V       + Q + LRN+E+Q+GQLAT+L +RP G LPSD   PKRDGK+QCKALTL SGKAL P+
Subjt:  VQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIGQPKRDGKKQCKALTLRSGKALTPI

A0A6J1DWK1 uncharacterized protein LOC1110250531.8e-6544.41Show/hide
Query:  MFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQFLMQYIPPSNNAELRNKINNFQQ
        MFQM+ IVG F+G A+E PH+HL++FM V + FK+ G+SK  +RLKLFSYSLR   R WL+SL +E ITSW+DL  +FLM+Y  PS              
Subjt:  MFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLAGQFLMQYIPPSNNAELRNKINNFQQ

Query:  LPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWSKSQAASTTASPTGLVTEDVVADL
                     K L Q+C +HGIP  IQIETYY G++  T+LVIDAS NGALL  PY +A ++LERIS + H WS  +A    +S   LV  +    L
Subjt:  LPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQWSKSQAASTTASPTGLVTEDVVADL

Query:  NSKISHLADIIMKNVTTNEVVGSKAK------VAAVQNNLCTYCEGQHHFEN--------CFSGQNQWNTQKQPESMSLKEMFKIYMIKNDANVQSQAAL
        NSKI  L D+  +N + +     + +       +  Q    T       F+          + GQ   + Q +    SL+ + K YM  NDA V      
Subjt:  NSKISHLADIIMKNVTTNEVVGSKAK------VAAVQNNLCTYCEGQHHFEN--------CFSGQNQWNTQKQPESMSLKEMFKIYMIKNDANVQSQAAL

Query:  LQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIGQPKRDGKKQCKALTLRSGKALTP
         QSQAASLRN+E+Q+GQLA +LK+RP G LPSD   PKRD K+QC ALTLRSGKAL P
Subjt:  LQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIGQPKRDGKKQCKALTLRSGKALTP

A0A6J1G7Q6 uncharacterized protein LOC1114515981.3e-6837.5Show/hide
Query:  PAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLA
        PA  + N  I  P ++A  FELKP MFQMLQ +G F+GL+S+DPH HL+ F+ V+D F+  GV K+ +RL  FSYSLRD  ++WL+ L    I SWN LA
Subjt:  PAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDLA

Query:  GQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQ
         +FL +Y PP+ +A  RN+I  FQ+   E+LS++W+RFK  L+KC HHG+P  IQIET+YNG+N  T+ V+DAS NG +LS  Y EA+++LERI+ N  Q
Subjt:  GQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKHQ

Query:  WSKSQAASTTASPTGLVTE-DVVADLNSKISHLADIIMKNVTTNE--VVGSKAKVAAVQ----NNLCTYCEGQHHFENCFS-------------------
        W      S     T  V E D ++ +N++++ + + I++N+   +  ++ + A  A V        C YC  +H F+ C S                   
Subjt:  WSKSQAASTTASPTGLVTE-DVVADLNSKISHLADIIMKNVTTNE--VVGSKAKVAAVQ----NNLCTYCEGQHHFENCFS-------------------

Query:  --------------------GQNQWNTQKQPES----------------------------------MSLKEMFKIYMIKNDANVQSQAALLQSQAASLR
                            GQ  +N Q  P++                                    L+ + K YM +ND       A++QSQ  SLR
Subjt:  --------------------GQNQWNTQKQPES----------------------------------MSLKEMFKIYMIKNDANVQSQAALLQSQAASLR

Query:  NMEVQIGQLATELKNRPKGVLPSDIGQPKRDG
        N+EVQ+GQLA EL+NRP G LP+D   PKR+G
Subjt:  NMEVQIGQLATELKNRPKGVLPSDIGQPKRDG

U5CUI2 Retrotrans_gag domain-containing protein2.0e-6944.75Show/hide
Query:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL
        AP F + N  I  P I+A +FELKP MFQMLQ VG F G+ +EDPH HLR F++V+D FK  GVS+E +RLKLF +SLRD  R+WL++LP +S+T+WNDL
Subjt:  APAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLFSYSLRDSVRAWLDSLPAESITSWNDL

Query:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH
        A +FL +Y PP+ NA+ R++I +FQQL  ES SD+W+RFK LL+KC HHGIP  IQ+ET+YNG+N  +++V+DAS NGA+LS  Y EAF++LE I+ N +
Subjt:  AGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIPYTEAFDMLERISRNKH

Query:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKV---AAVQNN--LCTYCEGQHHFENC---------FSGQNQWNTQKQP
        QWS ++ A T+    G++  D +  L ++++      M NV  N  +G+   +   AA+Q++   C +C   H FE C            QN   T    
Subjt:  QWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKV---AAVQNN--LCTYCEGQHHFENC---------FSGQNQWNTQKQP

Query:  ESMSLKEMFKIYMIKNDANVQSQA
        +++++K    I +       Q+QA
Subjt:  ESMSLKEMFKIYMIKNDANVQSQA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGTTGTTACCACGACGGGAGTTGGGTTAGAACATTTTCTGGGTAACTCGGAAGAGAGCCGTGACGATCTGGGTTCTAAGAGTTTCATGTGTTTTTCTTCAGTAGC
TCCAGCATTCTATGACTTCAACCATGTCATTACTGATCCAATCATCGAAGCAGGGAGATTTGAGCTAAAGCCTGCAATGTTTCAAATGTTGCAAATAGTGGGACCATTTT
ATGGACTGGCATCAGAAGACCCGCACAAACACTTAAGATATTTTATGCAAGTAGCTGATTTGTTTAAGGAAGTTGGAGTTAGTAAGGAAGCAATGAGATTGAAGCTATTT
TCTTACTCGTTGAGGGACTCTGTTAGAGCATGGTTGGACTCCTTGCCGGCCGAATCAATTACTTCATGGAACGACTTAGCAGGGCAATTTTTGATGCAGTACATCCCACC
TTCGAATAATGCTGAACTCAGAAACAAGATTAACAACTTTCAGCAGTTACCAAGAGAATCCTTAAGTGATTCTTGGAAAAGATTCAAAGGGTTGTTGCAAAAATGCCTCC
ATCATGGAATACCTCGCTACATTCAGATTGAGACTTATTATAATGGGGTAAACGAGGTCACACAATTGGTAATTGATGCCTCAACTAATGGAGCTTTGCTGTCAATACCG
TATACAGAAGCTTTTGACATGTTGGAAAGAATTTCAAGGAATAAACATCAATGGTCGAAATCACAAGCTGCATCAACAACAGCAAGTCCCACAGGGTTAGTAACAGAGGA
TGTAGTGGCAGACCTCAATTCCAAAATTTCACACTTGGCCGATATTATCATGAAGAATGTCACTACAAATGAGGTCGTAGGGTCTAAGGCGAAGGTGGCAGCTGTACAGA
ACAACCTGTGCACATACTGTGAAGGGCAACATCATTTTGAAAACTGTTTTTCTGGACAAAATCAATGGAATACACAAAAGCAACCAGAATCAATGAGCTTAAAGGAGATG
TTTAAAATCTACATGATTAAGAATGATGCCAACGTGCAAAGTCAGGCAGCTCTCTTGCAGAGTCAGGCAGCATCACTGCGAAACATGGAAGTCCAAATAGGACAGCTGGC
GACAGAACTGAAGAATAGACCAAAAGGTGTGCTGCCAAGTGATATAGGGCAACCTAAAAGAGATGGTAAAAAGCAGTGCAAAGCATTGACACTACGGAGTGGCAAAGCTT
TAACACCAATACAGCAACAGGAAAAAGATCACGAAGAAGCACTTGCAGAGCCCAAAACCAATGCAGAAGAGTTGACAGACCCAGCTAAGGCACAAGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTGTTGTTACCACGACGGGAGTTGGGTTAGAACATTTTCTGGGTAACTCGGAAGAGAGCCGTGACGATCTGGGTTCTAAGAGTTTCATGTGTTTTTCTTCAGTAGC
TCCAGCATTCTATGACTTCAACCATGTCATTACTGATCCAATCATCGAAGCAGGGAGATTTGAGCTAAAGCCTGCAATGTTTCAAATGTTGCAAATAGTGGGACCATTTT
ATGGACTGGCATCAGAAGACCCGCACAAACACTTAAGATATTTTATGCAAGTAGCTGATTTGTTTAAGGAAGTTGGAGTTAGTAAGGAAGCAATGAGATTGAAGCTATTT
TCTTACTCGTTGAGGGACTCTGTTAGAGCATGGTTGGACTCCTTGCCGGCCGAATCAATTACTTCATGGAACGACTTAGCAGGGCAATTTTTGATGCAGTACATCCCACC
TTCGAATAATGCTGAACTCAGAAACAAGATTAACAACTTTCAGCAGTTACCAAGAGAATCCTTAAGTGATTCTTGGAAAAGATTCAAAGGGTTGTTGCAAAAATGCCTCC
ATCATGGAATACCTCGCTACATTCAGATTGAGACTTATTATAATGGGGTAAACGAGGTCACACAATTGGTAATTGATGCCTCAACTAATGGAGCTTTGCTGTCAATACCG
TATACAGAAGCTTTTGACATGTTGGAAAGAATTTCAAGGAATAAACATCAATGGTCGAAATCACAAGCTGCATCAACAACAGCAAGTCCCACAGGGTTAGTAACAGAGGA
TGTAGTGGCAGACCTCAATTCCAAAATTTCACACTTGGCCGATATTATCATGAAGAATGTCACTACAAATGAGGTCGTAGGGTCTAAGGCGAAGGTGGCAGCTGTACAGA
ACAACCTGTGCACATACTGTGAAGGGCAACATCATTTTGAAAACTGTTTTTCTGGACAAAATCAATGGAATACACAAAAGCAACCAGAATCAATGAGCTTAAAGGAGATG
TTTAAAATCTACATGATTAAGAATGATGCCAACGTGCAAAGTCAGGCAGCTCTCTTGCAGAGTCAGGCAGCATCACTGCGAAACATGGAAGTCCAAATAGGACAGCTGGC
GACAGAACTGAAGAATAGACCAAAAGGTGTGCTGCCAAGTGATATAGGGCAACCTAAAAGAGATGGTAAAAAGCAGTGCAAAGCATTGACACTACGGAGTGGCAAAGCTT
TAACACCAATACAGCAACAGGAAAAAGATCACGAAGAAGCACTTGCAGAGCCCAAAACCAATGCAGAAGAGTTGACAGACCCAGCTAAGGCACAAGCCTAG
Protein sequenceShow/hide protein sequence
MLVVTTTGVGLEHFLGNSEESRDDLGSKSFMCFSSVAPAFYDFNHVITDPIIEAGRFELKPAMFQMLQIVGPFYGLASEDPHKHLRYFMQVADLFKEVGVSKEAMRLKLF
SYSLRDSVRAWLDSLPAESITSWNDLAGQFLMQYIPPSNNAELRNKINNFQQLPRESLSDSWKRFKGLLQKCLHHGIPRYIQIETYYNGVNEVTQLVIDASTNGALLSIP
YTEAFDMLERISRNKHQWSKSQAASTTASPTGLVTEDVVADLNSKISHLADIIMKNVTTNEVVGSKAKVAAVQNNLCTYCEGQHHFENCFSGQNQWNTQKQPESMSLKEM
FKIYMIKNDANVQSQAALLQSQAASLRNMEVQIGQLATELKNRPKGVLPSDIGQPKRDGKKQCKALTLRSGKALTPIQQQEKDHEEALAEPKTNAEELTDPAKAQA