; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g10640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g10640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr1:6557721..6559994
RNA-Seq ExpressionMoc01g10640
SyntenyMoc01g10640
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]9.9e-6556.49Show/hide
Query:  EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------------------
        E PV+K V+   E++ PPP PQ+LQ+K+QD QF  FL+VLKQLHINIPL+EALEQMPNY  FLKD+L KKR                             
Subjt:  EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------------------

Query:  ----------------------------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSI
                                           LGIGEARP T+TLQLAD SIT+ EGKIEDVLVQVDKFIFP DFIILDYEAD +IPIILG PFLS 
Subjt:  ----------------------------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSI

Query:  DIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLDRLE
          ALIDV N ELT+ VNDQQVTLS+FNS+K+P DVEECS+LR+ADDLMS EIQ E LL++LE
Subjt:  DIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLDRLE

XP_022158490.1 uncharacterized protein LOC111024970 [Momordica charantia]1.2e-9143.32Show/hide
Query:  MDQLPKNPNLVVSSLEVDEINRGEVVVPAVIPPLNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHL
        MDQLP+ PN V     V+ IN GEVVV A  PPLN ILL DDG + IRAYAAP  HGFHPVI    IE ERF+LK +MFQMLQT+GQF+GN  E  HLHL
Subjt:  MDQLPKNPNLVVSSLEVDEINRGEVVVPAVIPPLNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHL

Query:  RYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP-----TDDKGVND-------------------------
        RY LEVSDSF MQ VSKEAL LKLFPY L DK RDWLNSMPAESITSWNDLAEK     F P     +D K VN+                         
Subjt:  RYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP-----TDDKGVND-------------------------

Query:  -----------------------------------------------------AGASN------------------------------------------
                                                             A +SN                                          
Subjt:  -----------------------------------------------------AGASN------------------------------------------

Query:  -MQMPMTEAQSEPEISIITE--YESM---EESYQR------------EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEAL
         +Q    E  +EP++  I E   E+M   E+  +R            E  V K V+   EYR PP  PQ+LQ+K+QD QF  FL+VLKQLHINIPLVEAL
Subjt:  -MQMPMTEAQSEPEISIITE--YESM---EESYQR------------EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEAL

Query:  EQMPNYANFLKDVLTKKRSLG---------------------------------------------------------------IGEARPTTMTLQLADW
        EQMPNY  FLKD+L KKR L                                                                IG+ARPTT+TLQLAD 
Subjt:  EQMPNYANFLKDVLTKKRSLG---------------------------------------------------------------IGEARPTTMTLQLADW

Query:  SITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDV
        SITHPEGKIEDVLVQVDKFIFP DFIILDYEAD +IPIILG PFL    ALIDV
Subjt:  SITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDV

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.1e-5532.96Show/hide
Query:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP---------------
        MFQMLQT+ QF+G+A E  H HL++ + V +SFK +G+S E L LKLFPYSLRD+AR WL S+P ESITSW+DLAEK LM YFPP               
Subjt:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------TDDKGVNDAGAS----------------NMQMPMTEAQSEPEISII
                                                              + D+G ++AG S                N    +   QSE  I+ +
Subjt:  ------------------------------------------------------TDDKGVNDAGAS----------------NMQMPMTEAQSEPEISII

Query:  TEYESMEESY--QREAPVQKSVETL----------------------------------GEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEA
           E + + Y    +A VQ    +L                                   EY P PP P++LQ+K ++ QF  FLDVLKQLH+NIPLVEA
Subjt:  TEYESMEESY--QREAPVQKSVETL----------------------------------GEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEA

Query:  LEQMPNYANFLKDVLTKKRS----------------------------LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADN
        LEQMPNY  FLK++L KKR+                            LGIGEARPT +TLQLAD SITHPEGKIEDVLV VDKF FP DFIILDY+AD 
Subjt:  LEQMPNYANFLKDVLTKKRS----------------------------LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADN

Query:  KIPIILGNPFLSIDIALIDVQNAELTMIVND
        ++PIILG PFL+   AL+DV   ELTM V D
Subjt:  KIPIILGNPFLSIDIALIDVQNAELTMIVND

XP_030497888.1 uncharacterized protein LOC115713544 [Cannabis sativa]3.1e-7435.23Show/hide
Query:  FHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEK
        F+  +P I   +I+   F+LKPVMFQMLQT+GQF G+  E  HLH+   LEVS+SFK++GVS+EAL LKLFP+SLRD+AR WLN++P +S+T+WNDLAEK
Subjt:  FHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEK

Query:  LL-----------------------------------MTYFPPTDDK-----------------------------------------------------
         L                                    T   PT  K                                                     
Subjt:  LL-----------------------------------MTYFPPTDDK-----------------------------------------------------

Query:  ---------------------GVNDAGASNM----------QMPMTEAQSEPEISIITEYESMEESY--QREAPVQKSVETL------------------
                             G + +GA             Q P  +   +P+ S  +  ES+   Y  + +A +Q  V +L                  
Subjt:  ---------------------GVNDAGASNM----------QMPMTEAQSEPEISIITEYESMEESY--QREAPVQKSVETL------------------

Query:  ------------------------------------------------GE-----------------------------YRPPPPCPQKLQEKSQDNQFK
                                                        GE                              +PPPP P + +++  D QF 
Subjt:  ------------------------------------------------GE-----------------------------YRPPPPCPQKLQEKSQDNQFK

Query:  HFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFII
         FLDVLKQLHINIPLVEALEQMP Y  FLKD+LTKKR                  LGIGEARPTT+TLQL D S+ HPEGKIEDV VQVDKFIFP DFII
Subjt:  HFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFII

Query:  LDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLD
        LDYEAD ++PIILG PFL+    LIDVQN ELTM VNDQ+VT +VFN+++FP ++EECS L + D ++++   KE   D
Subjt:  LDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]5.1e-6128.37Show/hide
Query:  LNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKA
        ++ I+L DD  + IR YAAP F+  +P I   +I+  +F+LKPVMFQMLQT+GQF     E  HLHLR  LE+SDSFK+QGVS+E   LKLFP+SLRD+A
Subjt:  LNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKA

Query:  RDWLNSMPAESITSWNDLAEKLLMTYFPPT---------------DDKGVNDA-----------------------------------------------
        R WLN++  +S+T+WND AEK L  YFPPT               +D+  +DA                                               
Subjt:  RDWLNSMPAESITSWNDLAEKLLMTYFPPT---------------DDKGVNDA-----------------------------------------------

Query:  -------------GASNMQMPMTEAQSEPEISIITE----------------------------------------------------------------
                      ++N Q   T A    +++ + E                                                                
Subjt:  -------------GASNMQMPMTEAQSEPEISIITE----------------------------------------------------------------

Query:  -----------------------------------------------------------------YESMEESYQ--------------------------
                                                                          ES+   Y                           
Subjt:  -----------------------------------------------------------------YESMEESYQ--------------------------

Query:  -----------------------------------------------------------------------------REAPVQKSVETLGEYRPPPPCPQ
                                                                                     +++  Q+S       +PP P PQ
Subjt:  -----------------------------------------------------------------------------REAPVQKSVETLGEYRPPPPCPQ

Query:  KLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKK----------------------------------------RSLGIGEARPTT
        + Q++ QD QFK FLDVLKQLHINIPLVEALEQMPNY  FLKD+LTKK                                        R LGIGEARPTT
Subjt:  KLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKK----------------------------------------RSLGIGEARPTT

Query:  MTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLAD
        +TLQLAD S+ HP+GKIEDVLVQVDKFIFP DFIILDYE D ++PIIL  PFL+    LIDV+  ELTM   D+Q T  VF  ++ P  + EC  +   D
Subjt:  MTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLAD

Query:  DLM--------SKEIQKETLLDRLEK
          M        SK+++K+  L  + K
Subjt:  DLM--------SKEIQKETLLDRLEK

TrEMBL top hitse value%identityAlignment
A0A5B6X3P1 Retrovirus-related Pol polyprotein from transposon opus2.2e-5446.49Show/hide
Query:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPPTDDKGVNDAGASNMQ
        M QMLQT+GQF G   E  HLHL   +EVSDSFK+ GVS+ AL LKLF YSL+D+AR WLNS+P  S++ W   +EK+L                    +
Subjt:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPPTDDKGVNDAGASNMQ

Query:  MPMTEAQSEPEISIITEYESMEESYQREAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKK-RS
          + E + +P +               +  VQ  VET     P                          LH+NIPLVEALE+M NY   +KD+L+KK R 
Subjt:  MPMTEAQSEPEISIITEYESMEESYQREAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKK-RS

Query:  LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPAD
        LGIGE R T +TLQLAD  I HPEGKIED+LV VDK+IFP DF+ILD+EAD  +PIILG PFL+  I LIDVQ  ELTM V D Q  LS      FP D
Subjt:  LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPAD

A0A6J1DV77 uncharacterized protein LOC1110238184.8e-6556.49Show/hide
Query:  EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------------------
        E PV+K V+   E++ PPP PQ+LQ+K+QD QF  FL+VLKQLHINIPL+EALEQMPNY  FLKD+L KKR                             
Subjt:  EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR-----------------------------

Query:  ----------------------------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSI
                                           LGIGEARP T+TLQLAD SIT+ EGKIEDVLVQVDKFIFP DFIILDYEAD +IPIILG PFLS 
Subjt:  ----------------------------------SLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSI

Query:  DIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLDRLE
          ALIDV N ELT+ VNDQQVTLS+FNS+K+P DVEECS+LR+ADDLMS EIQ E LL++LE
Subjt:  DIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLDRLE

A0A6J1DVZ9 uncharacterized protein LOC1110249706.0e-9243.32Show/hide
Query:  MDQLPKNPNLVVSSLEVDEINRGEVVVPAVIPPLNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHL
        MDQLP+ PN V     V+ IN GEVVV A  PPLN ILL DDG + IRAYAAP  HGFHPVI    IE ERF+LK +MFQMLQT+GQF+GN  E  HLHL
Subjt:  MDQLPKNPNLVVSSLEVDEINRGEVVVPAVIPPLNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHL

Query:  RYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP-----TDDKGVND-------------------------
        RY LEVSDSF MQ VSKEAL LKLFPY L DK RDWLNSMPAESITSWNDLAEK     F P     +D K VN+                         
Subjt:  RYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP-----TDDKGVND-------------------------

Query:  -----------------------------------------------------AGASN------------------------------------------
                                                             A +SN                                          
Subjt:  -----------------------------------------------------AGASN------------------------------------------

Query:  -MQMPMTEAQSEPEISIITE--YESM---EESYQR------------EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEAL
         +Q    E  +EP++  I E   E+M   E+  +R            E  V K V+   EYR PP  PQ+LQ+K+QD QF  FL+VLKQLHINIPLVEAL
Subjt:  -MQMPMTEAQSEPEISIITE--YESM---EESYQR------------EAPVQKSVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEAL

Query:  EQMPNYANFLKDVLTKKRSLG---------------------------------------------------------------IGEARPTTMTLQLADW
        EQMPNY  FLKD+L KKR L                                                                IG+ARPTT+TLQLAD 
Subjt:  EQMPNYANFLKDVLTKKRSLG---------------------------------------------------------------IGEARPTTMTLQLADW

Query:  SITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDV
        SITHPEGKIEDVLVQVDKFIFP DFIILDYEAD +IPIILG PFL    ALIDV
Subjt:  SITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDV

A0A6J1E1F3 uncharacterized protein LOC1110250655.3e-5632.96Show/hide
Query:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP---------------
        MFQMLQT+ QF+G+A E  H HL++ + V +SFK +G+S E L LKLFPYSLRD+AR WL S+P ESITSW+DLAEK LM YFPP               
Subjt:  MFQMLQTIGQFYGNAYEYSHLHLRYLLEVSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPP---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------TDDKGVNDAGAS----------------NMQMPMTEAQSEPEISII
                                                              + D+G ++AG S                N    +   QSE  I+ +
Subjt:  ------------------------------------------------------TDDKGVNDAGAS----------------NMQMPMTEAQSEPEISII

Query:  TEYESMEESY--QREAPVQKSVETL----------------------------------GEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEA
           E + + Y    +A VQ    +L                                   EY P PP P++LQ+K ++ QF  FLDVLKQLH+NIPLVEA
Subjt:  TEYESMEESY--QREAPVQKSVETL----------------------------------GEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEA

Query:  LEQMPNYANFLKDVLTKKRS----------------------------LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADN
        LEQMPNY  FLK++L KKR+                            LGIGEARPT +TLQLAD SITHPEGKIEDVLV VDKF FP DFIILDY+AD 
Subjt:  LEQMPNYANFLKDVLTKKRS----------------------------LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADN

Query:  KIPIILGNPFLSIDIALIDVQNAELTMIVND
        ++PIILG PFL+   AL+DV   ELTM V D
Subjt:  KIPIILGNPFLSIDIALIDVQNAELTMIVND

A0A6J1EQ90 uncharacterized protein LOC1114364114.7e-5227.98Show/hide
Query:  NAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEV-------SDSFKMQGVSKEALHLKLFPY
        N I LADD  + IRAYA P     +P I   +I+   F+LKPVMFQMLQTIGQF+G   E  HLHL+  L V       SDSF+ QGV K+ + L LFPY
Subjt:  NAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLEV-------SDSFKMQGVSKEALHLKLFPY

Query:  SLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPPTDD------------------------------------------------------KGVNDAGA
         LRD A+ WLN++   +I SWN LAE  L+ YFPPT +                                                      K V DA A
Subjt:  SLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPPTDD------------------------------------------------------KGVNDAGA

Query:  S---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  S---------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------NMQMP----------------------------MTEAQSEPEISI---ITEY-
                                                       N QMP                             T+AQ   E SI   I EY 
Subjt:  -----------------------------------------------NMQMP----------------------------MTEAQSEPEISI---ITEY-

Query:  ----------------------------------ESMEESYQR--EAPVQK--------------------SVETLGEYRPPPPCPQKLQEKSQDNQFKH
                                          +   ++ QR  EA VQK                    S +    Y P PP PQ+++ K ++  F+ 
Subjt:  ----------------------------------ESMEESYQR--EAPVQK--------------------SVETLGEYRPPPPCPQKLQEKSQDNQFKH

Query:  FLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR---------------------------------------------------------------S
        F+D+LK++HINIPLVEAL+QMPNY  FLKDVL  +R                                                                
Subjt:  FLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKR---------------------------------------------------------------S

Query:  LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADV
        LGIGEARPTT+TLQLAD SIT+PEGKIED+L+QVDKFIF  DFIILDYE D+ +PIILG PFL I   L+DV    +T+ ++ Q+V  ++ +S+K+P  +
Subjt:  LGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIFPTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADV

Query:  EECS
        EECS
Subjt:  EECS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAACTACCGAAAAATCCTAACCTTGTGGTTTCTAGTCTAGAAGTTGACGAAATTAATAGAGGAGAAGTGGTTGTACCAGCAGTTATTCCTCCACTTAAT
GCAATTTTGTTGGCTGATGATGGACATAAGGGTATTAGAGCTTATGCTGCTCCTACATTTCATGGCTTCCATCCAGTTATTACAGAACTACAAATCGAAGTAGAA
AGGTTTAAATTAAAGCCTGTCATGTTCCAAATGCTCCAAACAATAGGTCAGTTCTATGGAAATGCATATGAATACTCCCATCTACACTTGAGGTACTTATTGGAA
GTGAGTGATTCTTTCAAAATGCAAGGAGTATCTAAGGAGGCACTGCATTTGAAATTGTTTCCTTACTCATTGAGGGATAAGGCTAGAGATTGGTTGAATTCTATG
CCAGCTGAATCTATTACTTCCTGGAATGATCTTGCTGAAAAATTATTGATGACGTATTTTCCTCCAACTGATGATAAAGGAGTGAACGATGCTGGAGCATCGAAT
ATGCAGATGCCAATGACAGAAGCTCAGTCTGAACCTGAAATTAGCATAATAACAGAATATGAATCCATGGAGGAATCATATCAAAGAGAAGCACCAGTCCAGAAA
AGTGTGGAAACACTAGGGGAATATAGACCACCACCTCCCTGTCCTCAAAAGCTTCAAGAGAAAAGTCAAGATAATCAATTCAAGCATTTTTTGGATGTGTTGAAG
CAGCTCCACATCAATATACCCTTAGTTGAGGCTCTTGAGCAAATGCCTAACTATGCAAATTTTTTGAAGGATGTCTTGACTAAGAAAAGGAGCTTGGGTATTGGT
GAAGCAAGGCCCACCACGATGACACTACAGCTAGCTGACTGGTCAATCACTCACCCAGAGGGCAAGATTGAAGATGTGTTAGTGCAGGTGGACAAGTTCATTTTC
CCAACTGACTTCATCATCCTCGACTATGAAGCAGACAACAAGATCCCAATAATTTTGGGGAATCCTTTTCTATCCATTGATATAGCACTAATAGATGTGCAGAAT
GCGGAGTTAACCATGATAGTGAACGATCAACAGGTTACCTTATCTGTTTTTAATTCTGTTAAATTTCCTGCTGATGTAGAAGAATGCTCTTTCTTAAGGCTTGCA
GATGACTTGATGAGTAAAGAGATACAAAAGGAGACCTTGTTGGATCGCTTGGAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAACTACCGAAAAATCCTAACCTTGTGGTTTCTAGTCTAGAAGTTGACGAAATTAATAGAGGAGAAGTGGTTGTACCAGCAGTTATTCCTCCACTTAAT
GCAATTTTGTTGGCTGATGATGGACATAAGGGTATTAGAGCTTATGCTGCTCCTACATTTCATGGCTTCCATCCAGTTATTACAGAACTACAAATCGAAGTAGAA
AGGTTTAAATTAAAGCCTGTCATGTTCCAAATGCTCCAAACAATAGGTCAGTTCTATGGAAATGCATATGAATACTCCCATCTACACTTGAGGTACTTATTGGAA
GTGAGTGATTCTTTCAAAATGCAAGGAGTATCTAAGGAGGCACTGCATTTGAAATTGTTTCCTTACTCATTGAGGGATAAGGCTAGAGATTGGTTGAATTCTATG
CCAGCTGAATCTATTACTTCCTGGAATGATCTTGCTGAAAAATTATTGATGACGTATTTTCCTCCAACTGATGATAAAGGAGTGAACGATGCTGGAGCATCGAAT
ATGCAGATGCCAATGACAGAAGCTCAGTCTGAACCTGAAATTAGCATAATAACAGAATATGAATCCATGGAGGAATCATATCAAAGAGAAGCACCAGTCCAGAAA
AGTGTGGAAACACTAGGGGAATATAGACCACCACCTCCCTGTCCTCAAAAGCTTCAAGAGAAAAGTCAAGATAATCAATTCAAGCATTTTTTGGATGTGTTGAAG
CAGCTCCACATCAATATACCCTTAGTTGAGGCTCTTGAGCAAATGCCTAACTATGCAAATTTTTTGAAGGATGTCTTGACTAAGAAAAGGAGCTTGGGTATTGGT
GAAGCAAGGCCCACCACGATGACACTACAGCTAGCTGACTGGTCAATCACTCACCCAGAGGGCAAGATTGAAGATGTGTTAGTGCAGGTGGACAAGTTCATTTTC
CCAACTGACTTCATCATCCTCGACTATGAAGCAGACAACAAGATCCCAATAATTTTGGGGAATCCTTTTCTATCCATTGATATAGCACTAATAGATGTGCAGAAT
GCGGAGTTAACCATGATAGTGAACGATCAACAGGTTACCTTATCTGTTTTTAATTCTGTTAAATTTCCTGCTGATGTAGAAGAATGCTCTTTCTTAAGGCTTGCA
GATGACTTGATGAGTAAAGAGATACAAAAGGAGACCTTGTTGGATCGCTTGGAGAAATAA
Protein sequenceShow/hide protein sequence
MDQLPKNPNLVVSSLEVDEINRGEVVVPAVIPPLNAILLADDGHKGIRAYAAPTFHGFHPVITELQIEVERFKLKPVMFQMLQTIGQFYGNAYEYSHLHLRYLLE
VSDSFKMQGVSKEALHLKLFPYSLRDKARDWLNSMPAESITSWNDLAEKLLMTYFPPTDDKGVNDAGASNMQMPMTEAQSEPEISIITEYESMEESYQREAPVQK
SVETLGEYRPPPPCPQKLQEKSQDNQFKHFLDVLKQLHINIPLVEALEQMPNYANFLKDVLTKKRSLGIGEARPTTMTLQLADWSITHPEGKIEDVLVQVDKFIF
PTDFIILDYEADNKIPIILGNPFLSIDIALIDVQNAELTMIVNDQQVTLSVFNSVKFPADVEECSFLRLADDLMSKEIQKETLLDRLEK