CuGenDBv2

Moc03g18460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g18460
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr3:12179682..12185160
RNA-Seq Expression: Moc03g18460
Synteny: Moc03g18460
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
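The genome location field in this record has the form chromosome:start..end. As a minimal sketch, the snippet below splits such a string and computes the length of the locus. The helper name `parse_location` is hypothetical, and the snippet assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr3:12179682..12185160")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr3 12179682 12185160 5479
```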


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e-value 1.8e-217, identity 90.11%)
Query:  MPPRRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVPQSEARFIKDFKRYGPPTFDGESERATAPEEWIRELEALYAYL
        MPPR SMRLRAD DPAPGGV G   PPP   +                         PQSEARFIKDFKRYGPPTFDGESERATA EEWIRELEALYAYL
Subjt:  MPPRRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVPQSEARFIKDFKRYGPPTFDGESERATAPEEWIRELEALYAYL

Query:  GCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR
        GCEDQFKVKGAVFMLRGEALNWWDSVAAAED+ANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR
Subjt:  GCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR

Query:  FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGCFR
        FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYAD VLRAPQRQ QHQGMPPVC TCQKRHTGQCWTGSKGCFR
Subjt:  FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGCFR

Query:  CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSV
        CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTG VLVHDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSV
Subjt:  CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSV

Query:  STPSGSILIASQKVRACELSFDNQTLRARLIQLDM
        STPSGSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  STPSGSILIASQKVRACELSFDNQTLRARLIQLDM

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] (e-value 5.6e-163, identity 56.86%)
Query:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAAR----------------------------ERADPPVPPAVPQSEARFIKDFKRYGPPT
        RR+ R     DP   G N  DP   P     GVVPP P AA +                             +   P    +PQ E +FI+DFK +GPP 
Subjt:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAAR----------------------------ERADPPVPPAVPQSEARFIKDFKRYGPPT

Query:  FDGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQ
        F+G SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P   ++ K  EFL L QG+L+VAQ
Subjt:  FDGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQ

Query:  YERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQG
        YERKFTELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SGVKRKF S  A    R  Q   Q Q 
Subjt:  YERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQG

Query:  MPPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSS
         PPVC +C+K H   CW G K CF+C +EGHF REC M+ +NTQ L Q+ P   +TQG  Q ARVFALTR +   AE VVTG +L+  +PAY LFDSGSS
Subjt:  MPPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSS

Query:  HTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKG
        H+FI+S FVR A LELE  GF LSVSTPSGS+L+ SQ V+  +LSF  QTL   LIQL+MQDFDVI+GMDWLA N+ANINCS++EVSF L SG+NFTFKG
Subjt:  HTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKG

Query:  VTGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE
        V   VPR VSALKA  LLQ G W YLA+VVD  K  P+I+ V VV E
Subjt:  VTGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] (e-value 8.6e-156, identity 55.86%)
Query:  RRSMRLRADVDPAPGGVNGTDP-----------PPPPAGNQAGVVPPFPPAAARERA----------------DPPVPPAVPQSEARFIKDFKRYGPPTF
        RR+ R     DP P G    DP           PP P     GV    P  A    A                  P    +PQ E +FI+DFKR+GPP F
Subjt:  RRSMRLRADVDPAPGGVNGTDP-----------PPPPAGNQAGVVPPFPPAAARERA----------------DPPVPPAVPQSEARFIKDFKRYGPPTF

Query:  DGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQY
        +G SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDH NVP+ WARFK+LLY+YY+P TV++ K AEFL L QG+L+VAQY
Subjt:  DGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQY

Query:  ERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGM
        ERKFTELSRF ++ IPTE LKI +F+ GLR  I+G + ++ PTTYA A+R ALVMDK +    S    +GSSSGVKRKF    +    R  Q   Q Q  
Subjt:  ERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGM

Query:  PPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSH
        PPVC +C+K H G CW G + CFRC                     Q+ P   + QG  QRARVFALTR +   AE VVTG +LV  +PAY LFDSGSSH
Subjt:  PPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSH

Query:  TFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGV
        +FI+S FVR A LELE LGFLLSVSTPSGS+L+ SQ V+  +LSFD QT   +LIQLDMQDFDVI+GMDWLA N+ANINCS++EVSF+LPSG+NFTFK V
Subjt:  TFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGV

Query:  TGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE
           VPR VSALKA  LLQ GAW YLA+VVD  K  P+I++V VV E
Subjt:  TGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] (e-value 1.3e-159, identity 58.75%)
Query:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVP-QSEARFI------------KDFKRYGPPTFDGESERATAPEEWI
        RR+ R     DP P G    DP  PPA    GV PP P AA+  +  P V P V   +EA  +                R+     +  SER TA EEW+
Subjt:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVP-QSEARFI------------KDFKRYGPPTFDGESERATAPEEWI

Query:  RELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALEL
        RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P TV++ K  EFL L QG+L+VA+YERKFTELSRF ++ 
Subjt:  RELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALEL

Query:  IPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQ
        IPT+ LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GSSSGVKRKF S  +    R  Q   Q Q  PPVC +C+K H G 
Subjt:  IPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQ

Query:  CWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLE
        CW G + C+RC +EGHFARECPM+ +NTQ LGQRIP   + QG   RARVFALTR +   AE VVT  VLV  +PAY LFDSGSSH+FI+S FV  A LE
Subjt:  CWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLE

Query:  LEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKAR
        LE LGFLLSVSTPSGS+L+ SQ V+  +LSFD QTL  +LIQLDMQDFDVI+GMDWLA N+ANI+CS+++VSF+LPSG+NFTFKGV   VPR V ALKA 
Subjt:  LEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKAR

Query:  RLLQNGAWGYLANVVDISKTPPNIDS
         LLQ GAW YLA+VVD  K  P+I++
Subjt:  RLLQNGAWGYLANVVDISKTPPNIDS

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] (e-value 1.5e-184, identity 78.25%)
Query:  YLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGEALNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH   GTL+VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSGVKRK    YAD   RAPQR  Q QG+PPVC +CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGC

Query:  FRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL
        FRCGREGHFAREC M+AANTQRLGQR  P VSTQG                       G  LVH+VPAYVLFD GSSHTFIS+AFVRQATLELEPLGFLL
Subjt:  FRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL

Query:  SVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKARRLLQNGAW
        SVSTPSGS+LIASQ VRA ELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSFQLPSGR+FTFKGV+G VPR VSALKARRLL NGAW
Subjt:  SVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKARRLLQNGAW

Query:  GYLANVVDISKTPPNIDSVPVVR
         YLA+VVDIS TPP+IDS  VV+
Subjt:  GYLANVVDISKTPPNIDSVPVVR

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DQB9 Reverse transcriptase (e-value 2.7e-163, identity 56.86%)
Query:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAAR----------------------------ERADPPVPPAVPQSEARFIKDFKRYGPPT
        RR+ R     DP   G N  DP   P     GVVPP P AA +                             +   P    +PQ E +FI+DFK +GPP 
Subjt:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAAR----------------------------ERADPPVPPAVPQSEARFIKDFKRYGPPT

Query:  FDGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQ
        F+G SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P   ++ K  EFL L QG+L+VAQ
Subjt:  FDGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQ

Query:  YERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQG
        YERKFTELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SGVKRKF S  A    R  Q   Q Q 
Subjt:  YERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQG

Query:  MPPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSS
         PPVC +C+K H   CW G K CF+C +EGHF REC M+ +NTQ L Q+ P   +TQG  Q ARVFALTR +   AE VVTG +L+  +PAY LFDSGSS
Subjt:  MPPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSS

Query:  HTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKG
        H+FI+S FVR A LELE  GF LSVSTPSGS+L+ SQ V+  +LSF  QTL   LIQL+MQDFDVI+GMDWLA N+ANINCS++EVSF L SG+NFTFKG
Subjt:  HTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKG

Query:  VTGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE
        V   VPR VSALKA  LLQ G W YLA+VVD  K  P+I+ V VV E
Subjt:  VTGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE

A0A6J1DTA8 uncharacterized protein LOC111024114 (e-value 4.2e-156, identity 55.86%)
Query:  RRSMRLRADVDPAPGGVNGTDP-----------PPPPAGNQAGVVPPFPPAAARERA----------------DPPVPPAVPQSEARFIKDFKRYGPPTF
        RR+ R     DP P G    DP           PP P     GV    P  A    A                  P    +PQ E +FI+DFKR+GPP F
Subjt:  RRSMRLRADVDPAPGGVNGTDP-----------PPPPAGNQAGVVPPFPPAAARERA----------------DPPVPPAVPQSEARFIKDFKRYGPPTF

Query:  DGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQY
        +G SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDH NVP+ WARFK+LLY+YY+P TV++ K AEFL L QG+L+VAQY
Subjt:  DGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQY

Query:  ERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGM
        ERKFTELSRF ++ IPTE LKI +F+ GLR  I+G + ++ PTTYA A+R ALVMDK +    S    +GSSSGVKRKF    +    R  Q   Q Q  
Subjt:  ERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGM

Query:  PPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSH
        PPVC +C+K H G CW G + CFRC                     Q+ P   + QG  QRARVFALTR +   AE VVTG +LV  +PAY LFDSGSSH
Subjt:  PPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSH

Query:  TFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGV
        +FI+S FVR A LELE LGFLLSVSTPSGS+L+ SQ V+  +LSFD QT   +LIQLDMQDFDVI+GMDWLA N+ANINCS++EVSF+LPSG+NFTFK V
Subjt:  TFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGV

Query:  TGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE
           VPR VSALKA  LLQ GAW YLA+VVD  K  P+I++V VV E
Subjt:  TGRVPRTVSALKARRLLQNGAWGYLANVVDISKTPPNIDSVPVVRE

A0A6J1DUM2 uncharacterized protein LOC111023247 (e-value 8.5e-218, identity 90.11%)
Query:  MPPRRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVPQSEARFIKDFKRYGPPTFDGESERATAPEEWIRELEALYAYL
        MPPR SMRLRAD DPAPGGV G   PPP   +                         PQSEARFIKDFKRYGPPTFDGESERATA EEWIRELEALYAYL
Subjt:  MPPRRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVPQSEARFIKDFKRYGPPTFDGESERATAPEEWIRELEALYAYL

Query:  GCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR
        GCEDQFKVKGAVFMLRGEALNWWDSVAAAED+ANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR
Subjt:  GCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKIKR

Query:  FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGCFR
        FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYAD VLRAPQRQ QHQGMPPVC TCQKRHTGQCWTGSKGCFR
Subjt:  FVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGCFR

Query:  CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSV
        CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTG VLVHDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSV
Subjt:  CGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSV

Query:  STPSGSILIASQKVRACELSFDNQTLRARLIQLDM
        STPSGSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  STPSGSILIASQKVRACELSFDNQTLRARLIQLDM

A0A6J1DWP4 uncharacterized protein LOC111025215 (e-value 6.2e-160, identity 58.75%)
Query:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVP-QSEARFI------------KDFKRYGPPTFDGESERATAPEEWI
        RR+ R     DP P G    DP  PPA    GV PP P AA+  +  P V P V   +EA  +                R+     +  SER TA EEW+
Subjt:  RRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVPPAVP-QSEARFI------------KDFKRYGPPTFDGESERATAPEEWI

Query:  RELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALEL
        RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P TV++ K  EFL L QG+L+VA+YERKFTELSRF ++ 
Subjt:  RELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALEL

Query:  IPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQ
        IPT+ LKI +F+ GLR+ I+G + L+ PTTYA AVR ALVMDK +    S    +GSSSGVKRKF S  +    R  Q   Q Q  PPVC +C+K H G 
Subjt:  IPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQ

Query:  CWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLE
        CW G + C+RC +EGHFARECPM+ +NTQ LGQRIP   + QG   RARVFALTR +   AE VVT  VLV  +PAY LFDSGSSH+FI+S FV  A LE
Subjt:  CWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLE

Query:  LEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKAR
        LE LGFLLSVSTPSGS+L+ SQ V+  +LSFD QTL  +LIQLDMQDFDVI+GMDWLA N+ANI+CS+++VSF+LPSG+NFTFKGV   VPR V ALKA 
Subjt:  LEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKAR

Query:  RLLQNGAWGYLANVVDISKTPPNIDS
         LLQ GAW YLA+VVD  K  P+I++
Subjt:  RLLQNGAWGYLANVVDISKTPPNIDS

A0A6J1DYU5 uncharacterized protein LOC111025517 (e-value 7.3e-185, identity 78.25%)
Query:  YLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGEALNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH   GTL+VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQGTLSVAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSGVKRK    YAD   RAPQR  Q QG+PPVC +CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQGMPPVCLTCQKRHTGQCWTGSKGC

Query:  FRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL
        FRCGREGHFAREC M+AANTQRLGQR  P VSTQG                       G  LVH+VPAYVLFD GSSHTFIS+AFVRQATLELEPLGFLL
Subjt:  FRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL

Query:  SVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKARRLLQNGAW
        SVSTPSGS+LIASQ VRA ELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSFQLPSGR+FTFKGV+G VPR VSALKARRLL NGAW
Subjt:  SVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKARRLLQNGAW

Query:  GYLANVVDISKTPPNIDSVPVVR
         YLA+VVDIS TPP+IDS  VV+
Subjt:  GYLANVVDISKTPPNIDSVPVVR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
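In the alignments above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the hypothetical helper `midline_identity` below counts identical positions in one alignment block. The reported % identity for a hit is computed by the search tool over the whole alignment, so a single block will not generally reproduce it.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A position is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)

# Last block of the first GenBank hit listed above.
query = "STPSGSILIASQKVRACELSFDNQTLRARLIQLDM"
midline = "STPSGSILIASQKVRA ELSFDNQTLRARLIQLDM"
identical, length = midline_identity(query, midline)
print(f"{identical}/{length} = {100 * identical / length:.2f}%")  # 34/35 = 97.14%
```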

Sequences
CDS sequence
ATGACATGGAGAGGGCTTGACCGAGGAAGGATCACTAGCTTAGTTAGTGAGGATAGATGGAAGGGTCGTTGGGGAATTGCTTATCTCGTGTTTAGAGTTAAGACTGAGTT
AGAAACTAGCATACGTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCATCACCAGACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGTCGACCCCGCTCCCG
GAGGTGTGAACGGAACGGATCCACCGCCCCCTCCTGCTGGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCAGCAGCCGCTCGAGAGCGGGCAGATCCTCCAGTTCCC
CCAGCAGTTCCTCAGAGCGAGGCTCGCTTCATCAAGGATTTCAAGCGCTACGGACCCCCAACCTTTGACGGCGAGAGTGAAAGAGCGACTGCACCGGAAGAGTGGATCAG
AGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGCGCGGTTTTTATGTTGAGAGGCGAGGCCCTGAACTGGTGGGACTCAGTAGCAG
CGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCAAGGTTCAAGAACTTGTTGTACGACTACTACTATCCGGAGACTGTGAAAGACATGAAGGAGGCAGAATTCCTG
CATCTAGTCCAAGGAACTTTATCGGTGGCACAGTATGAAAGGAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCTTTAAAGATCAAGAGGTT
TGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACA
AAGCCTCACCTCTGCCAGAGGTCGGATCATCTTCAGGTGTGAAAAGGAAGTTTCCTTCGACTTATGCCGACCCGGTTTTGAGAGCACCCCAGCGCCAGACTCAACACCAG
GGCATGCCGCCAGTATGCCTCACCTGCCAGAAAAGACATACAGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGGGGCATTTTGCAAGGGAATG
TCCCATGTCGGCCGCAAATACGCAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAG
CGGCGGATGCCGAAACAGTTGTCACAGGTATTGTTTTAGTCCATGATGTGCCTGCGTATGTATTGTTTGATTCAGGGTCGAGCCACACCTTCATCTCTTCTGCGTTTGTT
CGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATAGCTAGCCAAAAGGTGAGGGCATGTGAGTTGTCTTT
TGATAATCAGACTCTAAGGGCAAGACTGATTCAGCTGGACATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGA
GAAGAGAAGTCTCCTTCCAACTACCTTCGGGTCGGAACTTTACGTTTAAAGGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAG
AATGGAGCTTGGGGATATTTAGCCAACGTCGTCGACATTAGTAAGACTCCACCTAATATCGACTCCGTTCCCGTGGTCAGAGAAACGAACCTTGAGAAGAGACTGGTAGA
GGAAGTTGAAATGAATCTCAACGAACAAAACCCACCTGAGGATGGAGTAGTGAGGATACCAGGGGATAGAGCTGCCATTCCAGCACCTTCCCTGAACACTGTTCTGCTCG
CTGACGACACAAAGCAGGAAATTAAAGCATACGCGACTCCTGCCTTTTATGATTTCAATCCTGTCTTTGTTGATCCAATCATCGAGGCAGGAAGATTTAAGCTGAAGCCT
GCAATGTTTCAGATTAAGGAAGCTATGAGATTAAATTTGTTCCCTTACTCTTTAAGGGATAATGCCAGAGCATGGTTGGACTCCTTACCTGCTAAATCGATCACTTCGTG
GAATGACTTAGTAGAAAAATTTCGAAAGCAATATTTCCCACCTTCGAAGAATGCTGAACTTAGGAACAAGGTTAACAACTTTCAGCAACTACCAGGAGAATCCTTGATGA
TTGATGCTTCAACCAATGGAGCTTTGCTATCTAAACCATACGCAGAAGCCTTTGACATTTTGCAGAGGATTTCACGGAACAAACATCAATGGTCAAAGTCACGATCAATA
TTAACAGTAGGAAGCCTCACGGGGTTAGTAACAGATGATGTAGTAGCAGATCTTAACTCAAAGATTTCATGCCTAGCTGACATCGTTATGAAAAGCGCAACCGCGAATGA
GGCTGTAGCATCCAAAGCAAAGGTGGCTGCTGTGCAAACCAGTCTTTGCCCATACTATGAAGGGAGGGCACCATTTTGA
mRNA sequence
ATGACATGGAGAGGGCTTGACCGAGGAAGGATCACTAGCTTAGTTAGTGAGGATAGATGGAAGGGTCGTTGGGGAATTGCTTATCTCGTGTTTAGAGTTAAGACTGAGTT
AGAAACTAGCATACGTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCATCACCAGACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGTCGACCCCGCTCCCG
GAGGTGTGAACGGAACGGATCCACCGCCCCCTCCTGCTGGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCAGCAGCCGCTCGAGAGCGGGCAGATCCTCCAGTTCCC
CCAGCAGTTCCTCAGAGCGAGGCTCGCTTCATCAAGGATTTCAAGCGCTACGGACCCCCAACCTTTGACGGCGAGAGTGAAAGAGCGACTGCACCGGAAGAGTGGATCAG
AGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGCGCGGTTTTTATGTTGAGAGGCGAGGCCCTGAACTGGTGGGACTCAGTAGCAG
CGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCAAGGTTCAAGAACTTGTTGTACGACTACTACTATCCGGAGACTGTGAAAGACATGAAGGAGGCAGAATTCCTG
CATCTAGTCCAAGGAACTTTATCGGTGGCACAGTATGAAAGGAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCTTTAAAGATCAAGAGGTT
TGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACA
AAGCCTCACCTCTGCCAGAGGTCGGATCATCTTCAGGTGTGAAAAGGAAGTTTCCTTCGACTTATGCCGACCCGGTTTTGAGAGCACCCCAGCGCCAGACTCAACACCAG
GGCATGCCGCCAGTATGCCTCACCTGCCAGAAAAGACATACAGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGGGGCATTTTGCAAGGGAATG
TCCCATGTCGGCCGCAAATACGCAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAG
CGGCGGATGCCGAAACAGTTGTCACAGGTATTGTTTTAGTCCATGATGTGCCTGCGTATGTATTGTTTGATTCAGGGTCGAGCCACACCTTCATCTCTTCTGCGTTTGTT
CGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATAGCTAGCCAAAAGGTGAGGGCATGTGAGTTGTCTTT
TGATAATCAGACTCTAAGGGCAAGACTGATTCAGCTGGACATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGA
GAAGAGAAGTCTCCTTCCAACTACCTTCGGGTCGGAACTTTACGTTTAAAGGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAG
AATGGAGCTTGGGGATATTTAGCCAACGTCGTCGACATTAGTAAGACTCCACCTAATATCGACTCCGTTCCCGTGGTCAGAGAAACGAACCTTGAGAAGAGACTGGTAGA
GGAAGTTGAAATGAATCTCAACGAACAAAACCCACCTGAGGATGGAGTAGTGAGGATACCAGGGGATAGAGCTGCCATTCCAGCACCTTCCCTGAACACTGTTCTGCTCG
CTGACGACACAAAGCAGGAAATTAAAGCATACGCGACTCCTGCCTTTTATGATTTCAATCCTGTCTTTGTTGATCCAATCATCGAGGCAGGAAGATTTAAGCTGAAGCCT
GCAATGTTTCAGATTAAGGAAGCTATGAGATTAAATTTGTTCCCTTACTCTTTAAGGGATAATGCCAGAGCATGGTTGGACTCCTTACCTGCTAAATCGATCACTTCGTG
GAATGACTTAGTAGAAAAATTTCGAAAGCAATATTTCCCACCTTCGAAGAATGCTGAACTTAGGAACAAGGTTAACAACTTTCAGCAACTACCAGGAGAATCCTTGATGA
TTGATGCTTCAACCAATGGAGCTTTGCTATCTAAACCATACGCAGAAGCCTTTGACATTTTGCAGAGGATTTCACGGAACAAACATCAATGGTCAAAGTCACGATCAATA
TTAACAGTAGGAAGCCTCACGGGGTTAGTAACAGATGATGTAGTAGCAGATCTTAACTCAAAGATTTCATGCCTAGCTGACATCGTTATGAAAAGCGCAACCGCGAATGA
GGCTGTAGCATCCAAAGCAAAGGTGGCTGCTGTGCAAACCAGTCTTTGCCCATACTATGAAGGGAGGGCACCATTTTGA
Protein sequence
MTWRGLDRGRITSLVSEDRWKGRWGIAYLVFRVKTELETSIRRSSDLGRPPRHHQTMPPRRSMRLRADVDPAPGGVNGTDPPPPPAGNQAGVVPPFPPAAARERADPPVP
PAVPQSEARFIKDFKRYGPPTFDGESERATAPEEWIRELEALYAYLGCEDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFL
HLVQGTLSVAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGVKRKFPSTYADPVLRAPQRQTQHQ
GMPPVCLTCQKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGIVLVHDVPAYVLFDSGSSHTFISSAFV
RQATLELEPLGFLLSVSTPSGSILIASQKVRACELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPSGRNFTFKGVTGRVPRTVSALKARRLLQ
NGAWGYLANVVDISKTPPNIDSVPVVRETNLEKRLVEEVEMNLNEQNPPEDGVVRIPGDRAAIPAPSLNTVLLADDTKQEIKAYATPAFYDFNPVFVDPIIEAGRFKLKP
AMFQIKEAMRLNLFPYSLRDNARAWLDSLPAKSITSWNDLVEKFRKQYFPPSKNAELRNKVNNFQQLPGESLMIDASTNGALLSKPYAEAFDILQRISRNKHQWSKSRSI
LTVGSLTGLVTDDVVADLNSKISCLADIVMKSATANEAVASKAKVAAVQTSLCPYYEGRAPF
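The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code applies, which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative. It translates only the first 30 bases of the CDS, which should match the start of the protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases of the CDS sequence listed in this record.
print(translate("ATGACATGGAGAGGGCTTGACCGAGGAAGG"))  # MTWRGLDRGR
```

Joining the full CDS lines into one string and passing it to `translate` should give the protein followed by a terminal `*` for the TGA stop codon.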