; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04490 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04490
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr7:3868804..3880023
RNA-Seq ExpressionMoc07g04490
SyntenyMoc07g04490
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]9.1e-13262.03Show/hide
Query:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR
        MPPR SMRLRADADP+PG                                                             GVG VQAPPPQHLHTP SEAR
Subjt:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR

Query:  FIKDFRRYGPPSFD-----------------ALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF
        FIKDF+RYGPP+FD                 ALYAYLGCEDQFKVKGAVFMLRGE LNWWDSVAAAED+ANVPIPWARFKNLLYDYYYPETVKDMKEAEF
Subjt:  FIKDFRRYGPPSFD-----------------ALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF

Query:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE----------
        LHLVQ TLSVAQYERKFTELSRFALELIPTEALKIKRFVKG RKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG K           
Subjt:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE----------

Query:  ------------VP---------------------FDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARVFALTRKEVADAETVVTGTVLVHD
                    +P                     F   R G      P S          +P        + +ARVFALTRKE ADAETVVTGTVLVHD
Subjt:  ------------VP---------------------FDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARVFALTRKEVADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA
        VPAYVLFDSGSSHTFISS FVRQ TLE+EPLGFLLSVSTP GSILIASQKVRA
Subjt:  VPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.9e-9343.15Show/hide
Query:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK
        RR+ R     DP+  GEN ADP   P     GVVPP P AA         P  VPQVNPQ+ LL EALQ +++NA G G  Q   P+    P  E +FI+
Subjt:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK

Query:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL
        DF+ +GPP F                 +ALY YLGC D FKV+GAVFMLRGE +NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P   ++ K  EFL L
Subjt:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL

Query:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE--VPFDLCRSG--
         Q +L+VAQYERKFTELSRF  + +PTE LKI +F+ G R+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SG K     F   +S   
Subjt:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE--VPFDLCRSG--

Query:  ----IENTPAPGSTP-----------------------GH--------AVSMPHLPEKTYGAVL----DGKARVFALTRKEVADAETVVTGTVLVHDVPA
             +   AP   P                       GH          +   L +KT  A         ARVFALTR +V  AE VVTGT+L+  +PA
Subjt:  ----IENTPAPGSTP-----------------------GH--------AVSMPHLPEKTYGAVL----DGKARVFALTRKEVADAETVVTGTVLVHDVPA

Query:  YVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA--------------------------------------------------
        Y LFDSGSSH+FI+S FVR   LE+E  GF LSVSTP GS+L+ SQ V+                                                   
Subjt:  YVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA--------------------------------------------------

Query:  --------GVTGRVPRTVSALKARRLLQNGAWGYLANVVD
                GV   VPR VSALKA  LLQ G W YLA+VVD
Subjt:  --------GVTGRVPRTVSALKARRLLQNGAWGYLANVVD

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]6.6e-0558.73Show/hide
Query:  ELDRFEVELAVEDVSAVLAQLSVKPTLRQRIIAAQKGDSSLSKGPKTMVNHPFPTLLTDQIVL
        EL+  EVEL V+DVSA+LA+LSV+P+LRQRII AQK D SL+KG  +MV H   TL  +  +L
Subjt:  ELDRFEVELAVEDVSAVLAQLSVKPTLRQRIIAAQKGDSSLSKGPKTMVNHPFPTLLTDQIVL

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]9.5e-10570.59Show/hide
Query:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR
        MPPRRSMRLRAD DP+P GEN ADPPPPP G+QAGVVPPFPPAAA+E                         A+INN AGVG  Q  PP+HLHTP SEA+
Subjt:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR

Query:  FIKDFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF
        FIKDF+RYGPP+F                 +ALYAYLGCEDQFKVKGAVFMLR E LNWWDSVAA EDHANVP+PWARFKNLLYD+YY ETV+DMKE EF
Subjt:  FIKDFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF

Query:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEK
        LHLVQ TL+VAQYERKFTELS FALELIPTEA+KIKRFVKG  KGIRG VDLQRP TYAEAVRG L+MDKDVSN+  PL EVGSS G K
Subjt:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]5.6e-9745.38Show/hide
Query:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK
        RR+ R     DP+P GE  ADP  P       V PP P AA         P  VPQVNPQ+ LL EALQ +++NA G G  Q   P+    P  E +FI+
Subjt:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK

Query:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL
        DF+R+GPP F                 +ALY YLGC D FKV+GAVFMLRGE +NWW+SVAAAEDH NVP+ WARFK+LLY+YY+P TV++ K AEFL L
Subjt:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL

Query:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKEVPFDLCRSG----
         Q +L+VAQYERKFTELSRF ++ IPTE LKI +F+ G R  I+G + ++ PTTYA A+R ALVMDK +    S    +GSSSG K   F L  S     
Subjt:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKEVPFDLCRSG----

Query:  -----IENTPAPGSTP------------GHAVSMPHLPEKTYGAVLDG--KARVFALTRKEVADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQ
             ++   AP   P            G  +           A   G  +ARVFALTR +V  AE VVTGT+LV  +PAY LFDSGSSH+FI+S FVR 
Subjt:  -----IENTPAPGSTP------------GHAVSMPHLPEKTYGAVLDG--KARVFALTRKEVADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQ

Query:  TTLEIEPLGFLLSVSTPLGSILIASQKVRAG----------------------------------------------------------VTGRVPRTVSA
          LE+E LGFLLSVSTP GS+L+ SQ V+ G                                                          V   VPR VSA
Subjt:  TTLEIEPLGFLLSVSTPLGSILIASQKVRAG----------------------------------------------------------VTGRVPRTVSA

Query:  LKARRLLQNGAWGYLANVVD
        LKA  LLQ GAW YLA+VVD
Subjt:  LKARRLLQNGAWGYLANVVD

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]1.5e-9750.9Show/hide
Query:  YLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQETLSVAQYERKFTELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGE LNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH    TL+VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQETLSVAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG-EKEVPFDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARV
        KRFVKG RKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSG +++V              P    G     P   ++  G    G    
Subjt:  KRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG-EKEVPFDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARV

Query:  FALTRKEVADAETVVT-------------------GTVLVHDVPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA-----
        F   R+     E  +T                   GT LVH+VPAYVLFD GSSHTFIS+AFVRQ TLE+EPLGFLLSVSTP GS+LIASQ VRA     
Subjt:  FALTRKEVADAETVVT-------------------GTVLVHDVPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA-----

Query:  -----------------------------------------------------GVTGRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGPVLDELDRFE
                                                             GV+G VPR VSALKARRLL NGAW YLA+VVDI    P +D     +
Subjt:  -----------------------------------------------------GVTGRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGPVLDELDRFE

Query:  V--ELAVEDVSAVLAQLSVKPTLRQR--IIAAQKGDSSLSKGPKTM
        V  ++  ED+  +       P +R+    I    G + +SK P  M
Subjt:  V--ELAVEDVSAVLAQLSVKPTLRQR--IIAAQKGDSSLSKGPKTM

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase2.4e-9343.15Show/hide
Query:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK
        RR+ R     DP+  GEN ADP   P     GVVPP P AA         P  VPQVNPQ+ LL EALQ +++NA G G  Q   P+    P  E +FI+
Subjt:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK

Query:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL
        DF+ +GPP F                 +ALY YLGC D FKV+GAVFMLRGE +NWW+SVAAAEDHANVP+ WARFK+LLY+YY+P   ++ K  EFL L
Subjt:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL

Query:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE--VPFDLCRSG--
         Q +L+VAQYERKFTELSRF  + +PTE LKI +F+ G R+ I+G + L+ PTTYA AVR ALVMDK +    S    +GS+SG K     F   +S   
Subjt:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE--VPFDLCRSG--

Query:  ----IENTPAPGSTP-----------------------GH--------AVSMPHLPEKTYGAVL----DGKARVFALTRKEVADAETVVTGTVLVHDVPA
             +   AP   P                       GH          +   L +KT  A         ARVFALTR +V  AE VVTGT+L+  +PA
Subjt:  ----IENTPAPGSTP-----------------------GH--------AVSMPHLPEKTYGAVL----DGKARVFALTRKEVADAETVVTGTVLVHDVPA

Query:  YVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA--------------------------------------------------
        Y LFDSGSSH+FI+S FVR   LE+E  GF LSVSTP GS+L+ SQ V+                                                   
Subjt:  YVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA--------------------------------------------------

Query:  --------GVTGRVPRTVSALKARRLLQNGAWGYLANVVD
                GV   VPR VSALKA  LLQ G W YLA+VVD
Subjt:  --------GVTGRVPRTVSALKARRLLQNGAWGYLANVVD

A0A6J1DQB9 Reverse transcriptase3.2e-0558.73Show/hide
Query:  ELDRFEVELAVEDVSAVLAQLSVKPTLRQRIIAAQKGDSSLSKGPKTMVNHPFPTLLTDQIVL
        EL+  EVEL V+DVSA+LA+LSV+P+LRQRII AQK D SL+KG  +MV H   TL  +  +L
Subjt:  ELDRFEVELAVEDVSAVLAQLSVKPTLRQRIIAAQKGDSSLSKGPKTMVNHPFPTLLTDQIVL

A0A6J1DTA8 uncharacterized protein LOC1110241142.7e-9745.38Show/hide
Query:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK
        RR+ R     DP+P GE  ADP  P       V PP P AA         P  VPQVNPQ+ LL EALQ +++NA G G  Q   P+    P  E +FI+
Subjt:  RRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIK

Query:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL
        DF+R+GPP F                 +ALY YLGC D FKV+GAVFMLRGE +NWW+SVAAAEDH NVP+ WARFK+LLY+YY+P TV++ K AEFL L
Subjt:  DFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHL

Query:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKEVPFDLCRSG----
         Q +L+VAQYERKFTELSRF ++ IPTE LKI +F+ G R  I+G + ++ PTTYA A+R ALVMDK +    S    +GSSSG K   F L  S     
Subjt:  VQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKEVPFDLCRSG----

Query:  -----IENTPAPGSTP------------GHAVSMPHLPEKTYGAVLDG--KARVFALTRKEVADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQ
             ++   AP   P            G  +           A   G  +ARVFALTR +V  AE VVTGT+LV  +PAY LFDSGSSH+FI+S FVR 
Subjt:  -----IENTPAPGSTP------------GHAVSMPHLPEKTYGAVLDG--KARVFALTRKEVADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQ

Query:  TTLEIEPLGFLLSVSTPLGSILIASQKVRAG----------------------------------------------------------VTGRVPRTVSA
          LE+E LGFLLSVSTP GS+L+ SQ V+ G                                                          V   VPR VSA
Subjt:  TTLEIEPLGFLLSVSTPLGSILIASQKVRAG----------------------------------------------------------VTGRVPRTVSA

Query:  LKARRLLQNGAWGYLANVVD
        LKA  LLQ GAW YLA+VVD
Subjt:  LKARRLLQNGAWGYLANVVD

A0A6J1DUM2 uncharacterized protein LOC1110232474.4e-13262.03Show/hide
Query:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR
        MPPR SMRLRADADP+PG                                                             GVG VQAPPPQHLHTP SEAR
Subjt:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR

Query:  FIKDFRRYGPPSFD-----------------ALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF
        FIKDF+RYGPP+FD                 ALYAYLGCEDQFKVKGAVFMLRGE LNWWDSVAAAED+ANVPIPWARFKNLLYDYYYPETVKDMKEAEF
Subjt:  FIKDFRRYGPPSFD-----------------ALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF

Query:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE----------
        LHLVQ TLSVAQYERKFTELSRFALELIPTEALKIKRFVKG RKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG K           
Subjt:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKE----------

Query:  ------------VP---------------------FDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARVFALTRKEVADAETVVTGTVLVHD
                    +P                     F   R G      P S          +P        + +ARVFALTRKE ADAETVVTGTVLVHD
Subjt:  ------------VP---------------------FDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARVFALTRKEVADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA
        VPAYVLFDSGSSHTFISS FVRQ TLE+EPLGFLLSVSTP GSILIASQKVRA
Subjt:  VPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA

A0A6J1DVA0 uncharacterized protein LOC1110234244.6e-10570.59Show/hide
Query:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR
        MPPRRSMRLRAD DP+P GEN ADPPPPP G+QAGVVPPFPPAAA+E                         A+INN AGVG  Q  PP+HLHTP SEA+
Subjt:  MPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEAR

Query:  FIKDFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF
        FIKDF+RYGPP+F                 +ALYAYLGCEDQFKVKGAVFMLR E LNWWDSVAA EDHANVP+PWARFKNLLYD+YY ETV+DMKE EF
Subjt:  FIKDFRRYGPPSF-----------------DALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEF

Query:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEK
        LHLVQ TL+VAQYERKFTELS FALELIPTEA+KIKRFVKG  KGIRG VDLQRP TYAEAVRG L+MDKDVSN+  PL EVGSS G K
Subjt:  LHLVQETLSVAQYERKFTELSRFALELIPTEALKIKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEK

A0A6J1DYU5 uncharacterized protein LOC1110255177.1e-9850.9Show/hide
Query:  YLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQETLSVAQYERKFTELSRFALELIPTEALKI
        YL CE+QFKVKG VFMLRGE LNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH    TL+VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQETLSVAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG-EKEVPFDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARV
        KRFVKG RKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSSG +++V              P    G     P   ++  G    G    
Subjt:  KRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSG-EKEVPFDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARV

Query:  FALTRKEVADAETVVT-------------------GTVLVHDVPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA-----
        F   R+     E  +T                   GT LVH+VPAYVLFD GSSHTFIS+AFVRQ TLE+EPLGFLLSVSTP GS+LIASQ VRA     
Subjt:  FALTRKEVADAETVVT-------------------GTVLVHDVPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRA-----

Query:  -----------------------------------------------------GVTGRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGPVLDELDRFE
                                                             GV+G VPR VSALKARRLL NGAW YLA+VVDI    P +D     +
Subjt:  -----------------------------------------------------GVTGRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGPVLDELDRFE

Query:  V--ELAVEDVSAVLAQLSVKPTLRQR--IIAAQKGDSSLSKGPKTM
        V  ++  ED+  +       P +R+    I    G + +SK P  M
Subjt:  V--ELAVEDVSAVLAQLSVKPTLRQR--IIAAQKGDSSLSKGPKTM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCGCTTCGGCTCCGGGATTCACATCTCTTCAGATATATATGGTAATAGGCCTGTAAACTCTGATAATATGGCATTTACTTCAAAAAATATAATTACTGAATTGAA
CAAAGGGGAGAAGCTTAATGGGGATAACTATGAGATTTGGTCCATGAAAATCCAGTATGTATTAGAAGAGCAAGAGGCTGTAGAAGTACTAAGCAATGTGTTGACTGAGT
CTGAGGAAGGCAATAGTGCTCAACATAGAAGGGACAAAGAGGCTTATGAAGTATGGAAGAAGAAAAACTCACTTGCTCGCATAACGTTGCTTCGCAGCATGGACAACAAT
GTCATGCATGAGTTTTGTTCTTATGATCTTGCCCAGGACGTGTGGGAAGCGTTAAAGTTGAAATTTGGTGGGACCTCTACCATCAAAGTTAGGCAACTTACTATTAAGTT
TGATATTTATAAAAAACGCTCTAATCATAATATGAGACAACACATGAGGGAGATGTCTAATATGATTAGTGAGCTCAAAGAAGCTGGTCATGTCCTGACTGATGAACAAC
AAGTTCAAGCCGCTACTCGATCTCTTCCCCATAATTGGGAACACATGAAGGTTCAAGCCTTGAGTAGTGGCCTTGGGAATAAATGCAAGGCCGAACGTCAAATTTCTGTA
AACGGGTATTGGGGTCTTGGGGATAAATGTCAAGGACCAATACGCTGGGAAGTCATCGGTACCTTGGGGATAAACGGCAAGGACCGGTGCACAGTTCAGGCCTTGGGAAT
AAATGGCAAGGCCGAACGTCCAGTTTCTGTAGAGGAGTGTGGACCGAGGAGGAATTTTGTATTCTTTAGAACTACTTTATTTGTTATGGATGTTGCTGGTAAACGCGTCA
CTGTTGTCATTGGAGGAGTTGCTTATCTTGTGTTTAGAGTTAAGGCTAGCCAGACTGAGTTAGAAACTAGCATACTTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCAG
CACCAAACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGCCGACCCCTCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCTGCTGGGAATCAGGC
AGGAGTAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCTCCAGTTCCCCCAACAGTTCCTCAGGTGAACCCCCAATTGGTGTTGCTTGTGGAGGCTT
TGCAAGCAGTGATCAATAACGCCGCAGGGGTGGGCAGGGTCCAAGCTCCGCCACCCCAACACCTTCATACTCCCCTGAGCGAGGCTCGCTTCATCAAGGATTTCAGGCGC
TACGGACCCCCATCCTTTGACGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGTGCAGTTTTTATGTTGAGGGGCGAGACCCTGAACTGGTGGGA
CTCAGTAGCAGCGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCGAGGTTCAAGAACTTATTGTACGACTACTACTATCCTGAGACTGTGAAAGACATGAAGGAGG
CAGAATTCCTGCATCTAGTCCAAGAAACTTTATCTGTGGCACAATATGAAAGGAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTAAAG
ATCAAGAGGTTTGTTAAAGGCTCGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGA
TGTTTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTCAGGTGAAAAGGAAGTTCCCTTCGACTTATGCCGATCCGGTATTGAGAACACCCCAGCGCCAGGCT
CAACACCAGGGCATGCCGTCAGTATGCCCCACCTGCCAGAAAAGACATACGGGGCAGTGCTGGATGGGAAGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGTGGCGGAT
GCCGAAACAGTGGTCACAGGTACTGTTTTAGTCCATGATGTGCCTGCGTATGTACTGTTTGATTCGGGGTCGAGCCACACTTTCATCTCTTCTGCGTTTGTTCGTCAGAC
AACCCTCGAAATAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATTAGGGTCGATTTTGATCGCTAGTCAAAAGGTGAGGGCAGGGGTTACGGGTAGAGTCCCAA
GGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCAAATGTCGTCGACATTATACCTCCTGGACCAGTGTTAGATGAGTTGGAC
CGTTTTGAGGTGGAGTTAGCGGTAGAAGATGTTTCGGCAGTGCTAGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATTGCTGCACAAAAGGGAGACTCCAG
TCTAAGCAAGGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTTTGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTACACGTATGCATCGG
TTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCAGTTTTTCTGAAGGCACGCAGGCGCCTGGCGCCGCCCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCCGCTTCGGCTCCGGGATTCACATCTCTTCAGATATATATGGTAATAGGCCTGTAAACTCTGATAATATGGCATTTACTTCAAAAAATATAATTACTGAATTGAA
CAAAGGGGAGAAGCTTAATGGGGATAACTATGAGATTTGGTCCATGAAAATCCAGTATGTATTAGAAGAGCAAGAGGCTGTAGAAGTACTAAGCAATGTGTTGACTGAGT
CTGAGGAAGGCAATAGTGCTCAACATAGAAGGGACAAAGAGGCTTATGAAGTATGGAAGAAGAAAAACTCACTTGCTCGCATAACGTTGCTTCGCAGCATGGACAACAAT
GTCATGCATGAGTTTTGTTCTTATGATCTTGCCCAGGACGTGTGGGAAGCGTTAAAGTTGAAATTTGGTGGGACCTCTACCATCAAAGTTAGGCAACTTACTATTAAGTT
TGATATTTATAAAAAACGCTCTAATCATAATATGAGACAACACATGAGGGAGATGTCTAATATGATTAGTGAGCTCAAAGAAGCTGGTCATGTCCTGACTGATGAACAAC
AAGTTCAAGCCGCTACTCGATCTCTTCCCCATAATTGGGAACACATGAAGGTTCAAGCCTTGAGTAGTGGCCTTGGGAATAAATGCAAGGCCGAACGTCAAATTTCTGTA
AACGGGTATTGGGGTCTTGGGGATAAATGTCAAGGACCAATACGCTGGGAAGTCATCGGTACCTTGGGGATAAACGGCAAGGACCGGTGCACAGTTCAGGCCTTGGGAAT
AAATGGCAAGGCCGAACGTCCAGTTTCTGTAGAGGAGTGTGGACCGAGGAGGAATTTTGTATTCTTTAGAACTACTTTATTTGTTATGGATGTTGCTGGTAAACGCGTCA
CTGTTGTCATTGGAGGAGTTGCTTATCTTGTGTTTAGAGTTAAGGCTAGCCAGACTGAGTTAGAAACTAGCATACTTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCAG
CACCAAACAATGCCTCCCCGTCGTAGTATGAGATTGCGAGCTGACGCCGACCCCTCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCTGCTGGGAATCAGGC
AGGAGTAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCTCCAGTTCCCCCAACAGTTCCTCAGGTGAACCCCCAATTGGTGTTGCTTGTGGAGGCTT
TGCAAGCAGTGATCAATAACGCCGCAGGGGTGGGCAGGGTCCAAGCTCCGCCACCCCAACACCTTCATACTCCCCTGAGCGAGGCTCGCTTCATCAAGGATTTCAGGCGC
TACGGACCCCCATCCTTTGACGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGTGCAGTTTTTATGTTGAGGGGCGAGACCCTGAACTGGTGGGA
CTCAGTAGCAGCGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCGAGGTTCAAGAACTTATTGTACGACTACTACTATCCTGAGACTGTGAAAGACATGAAGGAGG
CAGAATTCCTGCATCTAGTCCAAGAAACTTTATCTGTGGCACAATATGAAAGGAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTAAAG
ATCAAGAGGTTTGTTAAAGGCTCGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGA
TGTTTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTCAGGTGAAAAGGAAGTTCCCTTCGACTTATGCCGATCCGGTATTGAGAACACCCCAGCGCCAGGCT
CAACACCAGGGCATGCCGTCAGTATGCCCCACCTGCCAGAAAAGACATACGGGGCAGTGCTGGATGGGAAGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGTGGCGGAT
GCCGAAACAGTGGTCACAGGTACTGTTTTAGTCCATGATGTGCCTGCGTATGTACTGTTTGATTCGGGGTCGAGCCACACTTTCATCTCTTCTGCGTTTGTTCGTCAGAC
AACCCTCGAAATAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATTAGGGTCGATTTTGATCGCTAGTCAAAAGGTGAGGGCAGGGGTTACGGGTAGAGTCCCAA
GGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCAAATGTCGTCGACATTATACCTCCTGGACCAGTGTTAGATGAGTTGGAC
CGTTTTGAGGTGGAGTTAGCGGTAGAAGATGTTTCGGCAGTGCTAGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATTGCTGCACAAAAGGGAGACTCCAG
TCTAAGCAAGGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTTTGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTACACGTATGCATCGG
TTGGAACCACCAAAACGCATTGGAAGACGCGTTTCATTAAGGGCAAAGTTGGAAGAAAACCAGTTTTTCTGAAGGCACGCAGGCGCCTGGCGCCGCCCATGTAG
Protein sequenceShow/hide protein sequence
MVRFGSGIHISSDIYGNRPVNSDNMAFTSKNIITELNKGEKLNGDNYEIWSMKIQYVLEEQEAVEVLSNVLTESEEGNSAQHRRDKEAYEVWKKKNSLARITLLRSMDNN
VMHEFCSYDLAQDVWEALKLKFGGTSTIKVRQLTIKFDIYKKRSNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAATRSLPHNWEHMKVQALSSGLGNKCKAERQISV
NGYWGLGDKCQGPIRWEVIGTLGINGKDRCTVQALGINGKAERPVSVEECGPRRNFVFFRTTLFVMDVAGKRVTVVIGGVAYLVFRVKASQTELETSILRSSDLGRPPRQ
HQTMPPRRSMRLRADADPSPGGENGADPPPPPAGNQAGVVPPFPPAAARERADPPVPPTVPQVNPQLVLLVEALQAVINNAAGVGRVQAPPPQHLHTPLSEARFIKDFRR
YGPPSFDALYAYLGCEDQFKVKGAVFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVQETLSVAQYERKFTELSRFALELIPTEALK
IKRFVKGSRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLPEVGSSSGEKEVPFDLCRSGIENTPAPGSTPGHAVSMPHLPEKTYGAVLDGKARVFALTRKEVAD
AETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQTTLEIEPLGFLLSVSTPLGSILIASQKVRAGVTGRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGPVLDELD
RFEVELAVEDVSAVLAQLSVKPTLRQRIIAAQKGDSSLSKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTHWKTRFIKGKVGRKPVFLKARRRLAPPM