; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g18800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g18800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag protease polyprotein
Genome locationchr9:14642913..14647488
RNA-Seq ExpressionMoc09g18800
SyntenyMoc09g18800
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.6e-9453.49Show/hide
Query:  MPPHRSMRLRADVDPAPRGENVADPPPPPAGDHVEEFLQFLQQQLRTGHWSI---TQRGLAVHK-LSHLDTFILPRARLSSSRISSVTDPLALTGEARKR
        MPP  SMRLRAD DPAP G      PPP      +   +F++   R G  +    ++R  AV + +  L+            ++        L GEA   
Subjt:  MPPHRSMRLRADVDPAPRGENVADPPPPPAGDHVEEFLQFLQQQLRTGHWSI---TQRGLAVHK-LSHLDTFILPRARLSSSRISSVTDPLALTGEARKR

Query:  QQQKSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI-----------
            S +AAED+ANVP+ WARFK+LLYDYYYP T KDMKEAEFLHL QGTL VAQYERKF ELSRFALELIPTEA+KIKRFVKGL KGI           
Subjt:  QQQKSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI-----------

Query:  -------------------------------------------------------------------------------RCGREGHFARECSMTAANTQR
                                                                                       RCGREGHFAREC M+AANTQR
Subjt:  -------------------------------------------------------------------------------RCGREGHFARECSMTAANTQR

Query:  LGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELS
        LGQR PP VSTQG NQRARVFALTRKEAADAE VVTGTVLVH+VPAYVLFDSG SHTFIS+ FVRQATLELEPLGFLLSVSTPSGS+LI SQ VRA ELS
Subjt:  LGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELS

Query:  FDNQTLKARLIQLDM
        FDNQTL+ARLIQLDM
Subjt:  FDNQTLKARLIQLDM

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]8.0e-10246.99Show/hide
Query:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL------------------
        +S +AAEDHANVPVTWARFKDLLY+YY+P  A++ K  EFL L+QG+L VAQYERKF ELSRF  + +PTE +KI +F+ GL                  
Subjt:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL------------------

Query:  ----------------------------------------------------------CKG-------------IRCGREGHFARECSMTAANTQRLGQR
                                                                  CK               +C +EGHF REC MT +NTQ L Q+
Subjt:  ----------------------------------------------------------CKG-------------IRCGREGHFARECSMTAANTQRLGQR

Query:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ
         P   +TQGG Q ARVFALTR +   AE VVTGT+L+ ++PAY LFDSG SH+FI++ FVR A LELE  GF LSVSTPSGSVL+ SQ+V+  +LSF  Q
Subjt:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ

Query:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP
        TL+  LIQL+M+DFDVILGMDWLA N+ANINCS++E SF L SG++FTFK V  GV R VSALKA  LLQ G W YLASVVD     PSI+ V VV EF 
Subjt:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP

Query:  DVFPEDLPGLPP---------VLP-RRPLDE-------LDRSEVELAVEDVLAVLARLSVEPTLRQW---VIAAQKRDPSL
        DVFPEDLPGLPP         +LP   P+ E        +  E++L +E+   +L R  + P++  W   V+  +K+D S+
Subjt:  DVFPEDLPGLPP---------VLP-RRPLDE-------LDRSEVELAVEDVLAVLARLSVEPTLRQW---VIAAQKRDPSL

XP_022156985.1 uncharacterized protein LOC111023814 [Momordica charantia]1.4e-9863.44Show/hide
Query:  AAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGIRCGREGHFARECSMTAAN
        A EDHAN+ +TWARFKDLLYDYY+P T KD+KE EFL L+QG++ V +YE+KF ELSRFA  +I TEA+KIKRFVKGLCKGI+    G    +   T A 
Subjt:  AAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGIRCGREGHFARECSMTAAN

Query:  TQRLGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRAS
          +        V T+          L+++  +     ++  VLVHNVPAY LFDSG SHTFISTAFV QA L LEPLGFLLSVSTPSGS +  SQMVR  
Subjt:  TQRLGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRAS

Query:  ELSFDNQTLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSV
        +LS  + TL ARLIQLDM+DFD+ILGMDWLATNQA+IN  RRE SFQLPSG+ FTFK VTG V + VSALKAR+LLQ+GAWGYL SVVD S   PS+DSV
Subjt:  ELSFDNQTLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSV

Query:  HVVKEFPDVFPEDLPGLPPV
         V  EFPDVFPEDLPGLPPV
Subjt:  HVVKEFPDVFPEDLPGLPPV

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]1.7e-9653.26Show/hide
Query:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL----------------CK
        +S +AAEDH NVPVTWARFKDLLY+YY+P T ++ K AEFL L+QG+L VAQYERKF ELSRF ++ IPTE +KI +F+ GL                  
Subjt:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL----------------CK

Query:  GIRC-------------------------------------GREGHFARE--------CSMTAANTQRLG-------QRAPPIVSTQGGNQRARVFALTR
         IRC                                     G + H  R+        C    A    LG       Q+ P   + QGG QRARVFALTR
Subjt:  GIRC-------------------------------------GREGHFARE--------CSMTAANTQRLG-------QRAPPIVSTQGGNQRARVFALTR

Query:  KEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQTLKARLIQLDMRDFDVILGMD
         +   AE VVTGT+LV ++PAY LFDSG SH+FI++ FVR A LELE LGFLLSVSTPSGSVL++SQ+V+  +LSFD QT + +LIQLDM+DFDVILGMD
Subjt:  KEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQTLKARLIQLDMRDFDVILGMD

Query:  WLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFPDVFP
        WLA N+ANINCS++E SF+LPSG++FTFK V  GV R VSALKA  LLQ GAW YLASVVD     PSI++V VV EF DVFP
Subjt:  WLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFPDVFP

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]1.1e-12758.16Show/hide
Query:  SGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI---------------
        S + AEDHANVP+TWARFKDLLYDYYYP T KDMKEAEFLH S GTL VAQYERKF ELS FA ELIPTEAMKIKRFVKGL KGI               
Subjt:  SGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI---------------

Query:  ---------------------------------------------------------------------------RCGREGHFARECSMTAANTQRLGQR
                                                                                   RCGREGHFARECSMTAANTQRLGQR
Subjt:  ---------------------------------------------------------------------------RCGREGHFARECSMTAANTQRLGQR

Query:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ
        A P VSTQG                       GT LVHNVPAYVLFD G SHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLI SQMVRA ELSFDNQ
Subjt:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ

Query:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP
        TL+ARLIQLDMRDFDVILGMDWLATNQANINCS+RE SFQLPSGRSFTFK V+GGV RAVSALKARRLL NGAW YLASVVDIS+TPPSIDS HVVK F 
Subjt:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP

Query:  DVFPEDLPGLPPV---------LP-RRPLDELDRSEVELAVEDVLAVLARL----SVEPTLRQW---VIAAQKRDPSL
        DVFPEDL GLPP+         LP   P+ +       + ++++ A L  L     + P++  W   V+  +K+D S+
Subjt:  DVFPEDLPGLPPV---------LP-RRPLDELDRSEVELAVEDVLAVLARL----SVEPTLRQW---VIAAQKRDPSL

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase3.9e-10246.99Show/hide
Query:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL------------------
        +S +AAEDHANVPVTWARFKDLLY+YY+P  A++ K  EFL L+QG+L VAQYERKF ELSRF  + +PTE +KI +F+ GL                  
Subjt:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL------------------

Query:  ----------------------------------------------------------CKG-------------IRCGREGHFARECSMTAANTQRLGQR
                                                                  CK               +C +EGHF REC MT +NTQ L Q+
Subjt:  ----------------------------------------------------------CKG-------------IRCGREGHFARECSMTAANTQRLGQR

Query:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ
         P   +TQGG Q ARVFALTR +   AE VVTGT+L+ ++PAY LFDSG SH+FI++ FVR A LELE  GF LSVSTPSGSVL+ SQ+V+  +LSF  Q
Subjt:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ

Query:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP
        TL+  LIQL+M+DFDVILGMDWLA N+ANINCS++E SF L SG++FTFK V  GV R VSALKA  LLQ G W YLASVVD     PSI+ V VV EF 
Subjt:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP

Query:  DVFPEDLPGLPP---------VLP-RRPLDE-------LDRSEVELAVEDVLAVLARLSVEPTLRQW---VIAAQKRDPSL
        DVFPEDLPGLPP         +LP   P+ E        +  E++L +E+   +L R  + P++  W   V+  +K+D S+
Subjt:  DVFPEDLPGLPP---------VLP-RRPLDE-------LDRSEVELAVEDVLAVLARLSVEPTLRQW---VIAAQKRDPSL

A0A6J1DRW8 uncharacterized protein LOC1110238146.8e-9963.44Show/hide
Query:  AAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGIRCGREGHFARECSMTAAN
        A EDHAN+ +TWARFKDLLYDYY+P T KD+KE EFL L+QG++ V +YE+KF ELSRFA  +I TEA+KIKRFVKGLCKGI+    G    +   T A 
Subjt:  AAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGIRCGREGHFARECSMTAAN

Query:  TQRLGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRAS
          +        V T+          L+++  +     ++  VLVHNVPAY LFDSG SHTFISTAFV QA L LEPLGFLLSVSTPSGS +  SQMVR  
Subjt:  TQRLGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRAS

Query:  ELSFDNQTLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSV
        +LS  + TL ARLIQLDM+DFD+ILGMDWLATNQA+IN  RRE SFQLPSG+ FTFK VTG V + VSALKAR+LLQ+GAWGYL SVVD S   PS+DSV
Subjt:  ELSFDNQTLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSV

Query:  HVVKEFPDVFPEDLPGLPPV
         V  EFPDVFPEDLPGLPPV
Subjt:  HVVKEFPDVFPEDLPGLPPV

A0A6J1DTA8 uncharacterized protein LOC1110241148.4e-9753.26Show/hide
Query:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL----------------CK
        +S +AAEDH NVPVTWARFKDLLY+YY+P T ++ K AEFL L+QG+L VAQYERKF ELSRF ++ IPTE +KI +F+ GL                  
Subjt:  KSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGL----------------CK

Query:  GIRC-------------------------------------GREGHFARE--------CSMTAANTQRLG-------QRAPPIVSTQGGNQRARVFALTR
         IRC                                     G + H  R+        C    A    LG       Q+ P   + QGG QRARVFALTR
Subjt:  GIRC-------------------------------------GREGHFARE--------CSMTAANTQRLG-------QRAPPIVSTQGGNQRARVFALTR

Query:  KEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQTLKARLIQLDMRDFDVILGMD
         +   AE VVTGT+LV ++PAY LFDSG SH+FI++ FVR A LELE LGFLLSVSTPSGSVL++SQ+V+  +LSFD QT + +LIQLDM+DFDVILGMD
Subjt:  KEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQTLKARLIQLDMRDFDVILGMD

Query:  WLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFPDVFP
        WLA N+ANINCS++E SF+LPSG++FTFK V  GV R VSALKA  LLQ GAW YLASVVD     PSI++V VV EF DVFP
Subjt:  WLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFPDVFP

A0A6J1DUM2 uncharacterized protein LOC1110232477.8e-9553.49Show/hide
Query:  MPPHRSMRLRADVDPAPRGENVADPPPPPAGDHVEEFLQFLQQQLRTGHWSI---TQRGLAVHK-LSHLDTFILPRARLSSSRISSVTDPLALTGEARKR
        MPP  SMRLRAD DPAP G      PPP      +   +F++   R G  +    ++R  AV + +  L+            ++        L GEA   
Subjt:  MPPHRSMRLRADVDPAPRGENVADPPPPPAGDHVEEFLQFLQQQLRTGHWSI---TQRGLAVHK-LSHLDTFILPRARLSSSRISSVTDPLALTGEARKR

Query:  QQQKSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI-----------
            S +AAED+ANVP+ WARFK+LLYDYYYP T KDMKEAEFLHL QGTL VAQYERKF ELSRFALELIPTEA+KIKRFVKGL KGI           
Subjt:  QQQKSGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI-----------

Query:  -------------------------------------------------------------------------------RCGREGHFARECSMTAANTQR
                                                                                       RCGREGHFAREC M+AANTQR
Subjt:  -------------------------------------------------------------------------------RCGREGHFARECSMTAANTQR

Query:  LGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELS
        LGQR PP VSTQG NQRARVFALTRKEAADAE VVTGTVLVH+VPAYVLFDSG SHTFIS+ FVRQATLELEPLGFLLSVSTPSGS+LI SQ VRA ELS
Subjt:  LGQRAPPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELS

Query:  FDNQTLKARLIQLDM
        FDNQTL+ARLIQLDM
Subjt:  FDNQTLKARLIQLDM

A0A6J1DYU5 uncharacterized protein LOC1110255175.4e-12858.16Show/hide
Query:  SGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI---------------
        S + AEDHANVP+TWARFKDLLYDYYYP T KDMKEAEFLH S GTL VAQYERKF ELS FA ELIPTEAMKIKRFVKGL KGI               
Subjt:  SGSAAEDHANVPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGI---------------

Query:  ---------------------------------------------------------------------------RCGREGHFARECSMTAANTQRLGQR
                                                                                   RCGREGHFARECSMTAANTQRLGQR
Subjt:  ---------------------------------------------------------------------------RCGREGHFARECSMTAANTQRLGQR

Query:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ
        A P VSTQG                       GT LVHNVPAYVLFD G SHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLI SQMVRA ELSFDNQ
Subjt:  APPIVSTQGGNQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQ

Query:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP
        TL+ARLIQLDMRDFDVILGMDWLATNQANINCS+RE SFQLPSGRSFTFK V+GGV RAVSALKARRLL NGAW YLASVVDIS+TPPSIDS HVVK F 
Subjt:  TLKARLIQLDMRDFDVILGMDWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFP

Query:  DVFPEDLPGLPPV---------LP-RRPLDELDRSEVELAVEDVLAVLARL----SVEPTLRQW---VIAAQKRDPSL
        DVFPEDL GLPP+         LP   P+ +       + ++++ A L  L     + P++  W   V+  +K+D S+
Subjt:  DVFPEDLPGLPPV---------LP-RRPLDELDRSEVELAVEDVLAVLARL----SVEPTLRQW---VIAAQKRDPSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCCCATCGTAGTATGAGGCTGCGTGCAGACGTCGACCCGGCTCCCAGAGGTGAGAATGTGGCGGACCCACCACCCCCTCCTGCTGGTGATCATGTAGAGGAGTT
CCTCCAATTCCTCCAGCAGCAGCTCAGGACCGGGCACTGGTCAATAACACAGCGGGGGTTGGCGGTGCACAAACTCAGCCACCTCGACACTTTCATACTCCCTAGAGCGA
GGCTCAGTTCATCAAGGATTTCAAGCGTTACGGACCCCCTAGCTTTGACGGGGGAAGCGAGAAAGCGACAGCAGCAGAAGAGTGGTAGTGCGGCAGAAGATCATGCTAAT
GTACCAGTTACGTGGGCAAGGTTCAAGGACTTGTTGTACGACTACTATTACCCGGGGACCGCGAAAGATATGAAAGAGGCAGAATTCCTTCATCTCAGCCAGGGAACCTT
AATGGTAGCACAGTATGAAAGGAAATTTATGGAGCTCTCTCGTTTTGCTCTGGAATTAATTCCCACTGAAGCAATGAAGATCAAGAGGTTTGTTAAGGGCTTGTGCAAGG
GAATAAGGTGCGGAAGAGAAGGGCATTTTGCGAGGGAATGTTCTATGACGGCCGCGAATACTCAGAGGCTAGGCCAGAGGGCTCCCCCAATAGTTTCGACGCAGGGAGGT
AATCAGAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGCGGATGCCGAAAACGTTGTCACAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGA
CTCGGGATTGAGTCACACCTTCATCTCTACTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTCTGTTGTCAGTTTCTACACCTTCAGGGTCGGTTT
TGATTGTTAGTCAAATGGTGAGAGCAAGTGAGTTATCTTTTGACAATCAGACCCTAAAGGCAAGATTGATCCAACTGGACATGCGGGATTTTGACGTCATTTTGGGCATG
GATTGGCTAGCTACCAACCAAGCCAATATCAATTGCTCGAGGAGGGAATTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGAAGTTACGGGTGGAGTTGC
AAGGGCAGTCTCAGCGTTGAAGGCAAGACGCCTTTTACAGAATGGTGCCTGGGGATATTTGGCCAGTGTCGTCGACATTAGTATGACTCCACCCAGCATCGACTCCGTCC
ATGTGGTCAAAGAGTTTCCAGATGTGTTCCCTGAAGACCTCCCGGGGCTACCTCCTGTCCTGCCTCGTAGACCACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCG
GTGGAAGACGTCTTGGCAGTGTTAGCTCGACTCTCGGTTGAACCCACTTTAAGACAGTGGGTCATCGCTGCACAGAAGAGAGATCCCAGCCTGAGCAAGGGTTTCAGTAT
GGTAGATGAAACCTTGTTTTATAAGGAGGTACCCATTGAGATCGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGGAATC
ACCAAGTGGAGGAAGCCACCTGGGAAAGGGAAGACGAGATTAGAGCCCGGTACCCTGAATTATTCGAACGAAGAACTTTCGAGGACGAAAGTTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCCCATCGTAGTATGAGGCTGCGTGCAGACGTCGACCCGGCTCCCAGAGGTGAGAATGTGGCGGACCCACCACCCCCTCCTGCTGGTGATCATGTAGAGGAGTT
CCTCCAATTCCTCCAGCAGCAGCTCAGGACCGGGCACTGGTCAATAACACAGCGGGGGTTGGCGGTGCACAAACTCAGCCACCTCGACACTTTCATACTCCCTAGAGCGA
GGCTCAGTTCATCAAGGATTTCAAGCGTTACGGACCCCCTAGCTTTGACGGGGGAAGCGAGAAAGCGACAGCAGCAGAAGAGTGGTAGTGCGGCAGAAGATCATGCTAAT
GTACCAGTTACGTGGGCAAGGTTCAAGGACTTGTTGTACGACTACTATTACCCGGGGACCGCGAAAGATATGAAAGAGGCAGAATTCCTTCATCTCAGCCAGGGAACCTT
AATGGTAGCACAGTATGAAAGGAAATTTATGGAGCTCTCTCGTTTTGCTCTGGAATTAATTCCCACTGAAGCAATGAAGATCAAGAGGTTTGTTAAGGGCTTGTGCAAGG
GAATAAGGTGCGGAAGAGAAGGGCATTTTGCGAGGGAATGTTCTATGACGGCCGCGAATACTCAGAGGCTAGGCCAGAGGGCTCCCCCAATAGTTTCGACGCAGGGAGGT
AATCAGAGGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGCGGATGCCGAAAACGTTGTCACAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGA
CTCGGGATTGAGTCACACCTTCATCTCTACTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTCTGTTGTCAGTTTCTACACCTTCAGGGTCGGTTT
TGATTGTTAGTCAAATGGTGAGAGCAAGTGAGTTATCTTTTGACAATCAGACCCTAAAGGCAAGATTGATCCAACTGGACATGCGGGATTTTGACGTCATTTTGGGCATG
GATTGGCTAGCTACCAACCAAGCCAATATCAATTGCTCGAGGAGGGAATTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGAAGTTACGGGTGGAGTTGC
AAGGGCAGTCTCAGCGTTGAAGGCAAGACGCCTTTTACAGAATGGTGCCTGGGGATATTTGGCCAGTGTCGTCGACATTAGTATGACTCCACCCAGCATCGACTCCGTCC
ATGTGGTCAAAGAGTTTCCAGATGTGTTCCCTGAAGACCTCCCGGGGCTACCTCCTGTCCTGCCTCGTAGACCACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCG
GTGGAAGACGTCTTGGCAGTGTTAGCTCGACTCTCGGTTGAACCCACTTTAAGACAGTGGGTCATCGCTGCACAGAAGAGAGATCCCAGCCTGAGCAAGGGTTTCAGTAT
GGTAGATGAAACCTTGTTTTATAAGGAGGTACCCATTGAGATCGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGGAATC
ACCAAGTGGAGGAAGCCACCTGGGAAAGGGAAGACGAGATTAGAGCCCGGTACCCTGAATTATTCGAACGAAGAACTTTCGAGGACGAAAGTTTCTAA
Protein sequenceShow/hide protein sequence
MPPHRSMRLRADVDPAPRGENVADPPPPPAGDHVEEFLQFLQQQLRTGHWSITQRGLAVHKLSHLDTFILPRARLSSSRISSVTDPLALTGEARKRQQQKSGSAAEDHAN
VPVTWARFKDLLYDYYYPGTAKDMKEAEFLHLSQGTLMVAQYERKFMELSRFALELIPTEAMKIKRFVKGLCKGIRCGREGHFARECSMTAANTQRLGQRAPPIVSTQGG
NQRARVFALTRKEAADAENVVTGTVLVHNVPAYVLFDSGLSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIVSQMVRASELSFDNQTLKARLIQLDMRDFDVILGM
DWLATNQANINCSRREFSFQLPSGRSFTFKEVTGGVARAVSALKARRLLQNGAWGYLASVVDISMTPPSIDSVHVVKEFPDVFPEDLPGLPPVLPRRPLDELDRSEVELA
VEDVLAVLARLSVEPTLRQWVIAAQKRDPSLSKGFSMVDETLFYKEVPIEIVARETKVLRNRAIDLVKVLWRNHQVEEATWEREDEIRARYPELFERRTFEDESF