CuGenDBv2

Moc09g01130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g01130
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: p-loop containing nucleoside triphosphate hydrolases superfamily protein, putative
Genome location: chr9:979028..984701
RNA-Seq Expression: Moc09g01130
Synteny: Moc09g01130
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005525 - GTP binding (molecular function)
InterPro domains:
  IPR010264 - Plant self-incompatibility S1
  IPR027417 - P-loop containing nucleoside triphosphate hydrolase
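The genome location field uses a `seqid:start..end` string. The following is a minimal sketch of how such a string could be parsed, assuming the coordinates are 1-based and inclusive (the record itself does not state this). The names `parse_location` and `GeneLocation` are hypothetical helpers introduced for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GeneLocation(NamedTuple):
    """Hypothetical container for a parsed location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


# Matches strings such as "chr9:979028..984701".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GeneLocation:
    """Parse a 'seqid:start..end' location string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GeneLocation(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("chr9:979028..984701")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is the genomic extent of the gene model, which can be longer than the CDS listed further down in this record.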


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151793.1 uncharacterized protein LOC111019687 [Momordica charantia] (e value: 2.5e-247; % identity: 100)
Query:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL
        MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL
Subjt:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL

Query:  RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN
        RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN
Subjt:  RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN

Query:  IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI
        IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI
Subjt:  IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI

Query:  THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH
        THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH
Subjt:  THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH

Query:  RLPEQVEAARDDSKRLEIIWSEIRHMWLDE
        RLPEQVEAARDDSKRLEIIWSEIRHMWLDE
Subjt:  RLPEQVEAARDDSKRLEIIWSEIRHMWLDE

XP_022954210.1 uncharacterized protein LOC111456535 [Cucurbita moschata] (e value: 1.2e-177; % identity: 75.06)
Query:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK
        MGGDTVPLSTPSI HD          D SQI+TQLP  LRG N       K D+ D+ENG I+GEFD  ESQFSS+ L+VEI RRR+NNV REI ES D+
Subjt:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK

Query:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS
        LRIRSE+LNQAKRKIL YSPG+WIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRISKVF+ED F  +RAQVS NSSGEDGTFFLQEYMI R SKS
Subjt:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS

Query:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF
        FCLYDTRGLSDDS +NIE+LKQWMTKGV HGELV RKSDASSLINRMRCK++  + F LSR +RMINFV+FVVDG SV KSMDG DDIEKDY + ITTAF
Subjt:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF

Query:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV
        N PYLSYGDDKPVVVITHGDLLSF DRVRVR HLGNLLGIP  KQIFDIP+SYDPVTEL+IIDMLHYCLEH+DK L  K WTV KDH+S +SA  YF+T+
Subjt:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV

Query:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
        MVIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD
Subjt:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
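In these pairwise alignments the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. As a rough sketch, the identical columns of a single block can be counted as below. The function name `block_identity` is a hypothetical helper, and the figure it returns covers only the one block passed in, so it will generally differ from the whole-alignment % identity reported for the hit.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for one alignment block.

    `query` is the residue string from a 'Query:' line and `midline` is the
    match line beneath it, both with the leading label or padding removed.
    """
    # Trailing spaces on the match line are often stripped; pad it back out.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# Final block of the XP_022954210.1 alignment above.
query = "MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD"
midline = "MVIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD"

identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical columns ({100 * identical / columns:.2f}%)")
```

Summing the two counts over every block of a hit, and then dividing, would give a figure comparable to the reported % identity, assuming the tool defines identity as identical columns over alignment length.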

XP_022991684.1 uncharacterized protein LOC111488225 [Cucurbita maxima] (e value: 6.7e-176; % identity: 74.77)
Query:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPIL------LRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKL
        MGGDTV LSTPSI HD          D SQI+TQLP L      L    KDD+ D+ENG I+GEFD  ESQFSS+ L+VEI RRR+NNV REI ES D+L
Subjt:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPIL------LRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKL

Query:  RIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSF
        RIRSE+LNQAKRKIL YSPG+WIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRI KVF+ED F  +RAQVS NSSGEDGTFFLQEYMI R SKSF
Subjt:  RIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSF

Query:  CLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFN
        CLYDTRGLSDDS +NIE+LKQWMTKGV HGELV RKSDASSLINRM CK++  + F LSR +RMINFVIFVVDG SV KSMDG DDIEKDY   ITTAFN
Subjt:  CLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFN

Query:  SPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVM
         PYLSYGDDKPVVVITHGDLLSF DRVRVR +LGNLLGIP  KQIFDIP+SYDPVTEL+IIDMLHYCLEH+DK L  K WTV KDHVS +SA  YF+TVM
Subjt:  SPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVM

Query:  VIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
        VIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD
Subjt:  VIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD

XP_023548122.1 uncharacterized protein LOC111806854 [Cucurbita pepo subsp. pepo] (e value: 5.5e-178; % identity: 75.51)
Query:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK
        MGGDTV LSTPSI HD          D SQI+TQLP  LRG N       KDD+ D+ENG I+GEFD  ESQFSS+ L+VEI RRR+NNV REI ES D+
Subjt:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK

Query:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS
        LRIRSE+LNQAKRKIL YSPG+WIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRISKVF+ED F  +RAQVS NSSGEDGTFFLQEYMI R SKS
Subjt:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS

Query:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF
        FCLYDTRGLSDDSS+NIE+LKQWMTKGV HGELV RKSDASSLINRMRCK++  + F LSR +RMINFVIFVVDG SV KSMDG +DIEKDY + ITTAF
Subjt:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF

Query:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV
        N PYLSYGDDKPVVVITHGDLLSF DRVRVR HLGNLLGIP  KQIFDIP+SYDPVTEL+IIDMLHYCLEH+DK L  K WTV KDHVS +SA  YF+T 
Subjt:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV

Query:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
        MVIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD
Subjt:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD

XP_038899526.1 uncharacterized protein LOC120086808 isoform X1 [Benincasa hispida] (e value: 1.3e-182; % identity: 76.58)
Query:  MGGDTVPLSTPSIPHDSDS------SQISTQLPILLRGYN------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIR
        MGGDTVPLSTPSI HD DS      SQISTQLP LLRG N      +DDQ DVENG IIGEFDEIES++SSA L+V+ICRRR+N V REI ESYD+LR R
Subjt:  MGGDTVPLSTPSIPHDSDS------SQISTQLPILLRGYN------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIR

Query:  SEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLY
        SE+ NQAK+KIL YSPGAWIEQVGGMKLSDYDIP+TTSL+L+GPKGSGKSSLINRISKVFEEDHFT +RAQVS NSSGEDGTFFLQEYMI R SKSFCLY
Subjt:  SEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLY

Query:  DTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPY
        DTRGLSDD SDNIE+LKQWMTKGVRHGELVTRKSDASSLINRMRCK++  +SF  SR IRMINFVIFVVDG SV +S+DGDD  +KDY + ITTAFN PY
Subjt:  DTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPY

Query:  LSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFIS-ARIYFITVMVI
        LSYGDDKPVVV+THGDLLSF DRVRVR HLGNLLGIPPTKQIFDIP+ YDPVTEL+IID+LHYCLEH+DK LP KGWTV+KDH+  IS A I F+ +MVI
Subjt:  LSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFIS-ARIYFITVMVI

Query:  VIIAAYLYLAY-VHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE
         II+AY+Y  Y VHR PEQ    ++    LEI+W EIRH+WL+E
Subjt:  VIIAAYLYLAY-VHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6S8 Uncharacterized protein (e value: 4.1e-171; % identity: 72.69)
Query:  MGGDTVPLST-PSIPHDSDS------SQISTQLPILLRGYNK------DDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRI
        MGGD +PL T  SI HD  S      SQISTQLP LLRG NK      DDQ +VENG IIGEF+EIES++SSA LDV+ICR R N V REI ESYD+LRI
Subjt:  MGGDTVPLST-PSIPHDSDS------SQISTQLPILLRGYNK------DDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRI

Query:  RSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCL
        RSE+LNQAK+KIL YSPGAWIEQVGGMKLSDYDIP+TTSL+L+GPKGSGKSSLINRISKVFEEDHF  +RAQVS NSSGEDGTFFL EYMI R SKSFCL
Subjt:  RSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCL

Query:  YDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSP
        YDTRGLS+D SDNIEMLKQWM+KGV HG+LVTRKSDASSLINRMRCK++  +SF  SR +R+INFVIFVVDG SV KS+DGDD  +KDY + ITTAFN P
Subjt:  YDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSP

Query:  YLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVI
        YLSYGDDKPVVV+THGDLLSF + VRVR HLGNLLGIP TKQIFDIP+ YDPVTEL+IIDMLHYCLEH+DK LP K WTV+KD  S  +A IYF+ +++I
Subjt:  YLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVI

Query:  VIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE
        V I+A LY  YVH   EQ    +     +EI+W EIRH+WLDE
Subjt:  VIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE

A0A1S3C7I6 uncharacterized protein LOC103497586 isoform X2 (e value: 1.2e-170; % identity: 72.69)
Query:  MGGDTVPLST-PSIPHDSDS------SQISTQLPILLRGYNK------DDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRI
        MGGDT+PLST  SI HD  S      SQISTQ P LLRGYNK      DDQ +VENG IIGEFDEIE ++SSA LDV+ICRRR++ V REI ESYD+LR 
Subjt:  MGGDTVPLST-PSIPHDSDS------SQISTQLPILLRGYNK------DDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRI

Query:  RSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCL
        RSE+L QAK+K L YSPGAWIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRISKVFEEDHF  +RAQVS NSSGE GTFFL EYMI R SKSFCL
Subjt:  RSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCL

Query:  YDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSP
        YDTRGLSDD SDNIE LKQWM+KGVRHGELVTRKSDAS+ INRM+CK++  +SF  SR IR+INFVIFVVDG SV KS+DGDD  +KDY + ITTAFN P
Subjt:  YDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSP

Query:  YLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVI
        YLSYGDDKPVVV+THGDLLSF DRVRVR HLGNLLGIP TKQIFDIP+ YDPVTEL+IIDMLHYCLEH+DK LP K W V+KD  S  +A IYF+ +M+I
Subjt:  YLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVI

Query:  VIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE
        V I+A LY  YVHR  EQ +    +    EI+W EIRH+WLDE
Subjt:  VIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLDE

A0A6J1DD58 uncharacterized protein LOC111019687 (e value: 1.2e-247; % identity: 100)
Query:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL
        MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL
Subjt:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKLRIRSEDLNQAKRKIL

Query:  RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN
        RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN
Subjt:  RYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCLYDTRGLSDDSSDN

Query:  IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI
        IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI
Subjt:  IEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVI

Query:  THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH
        THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH
Subjt:  THGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVH

Query:  RLPEQVEAARDDSKRLEIIWSEIRHMWLDE
        RLPEQVEAARDDSKRLEIIWSEIRHMWLDE
Subjt:  RLPEQVEAARDDSKRLEIIWSEIRHMWLDE

A0A6J1GRU3 uncharacterized protein LOC111456535 (e value: 5.9e-178; % identity: 75.06)
Query:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK
        MGGDTVPLSTPSI HD          D SQI+TQLP  LRG N       K D+ D+ENG I+GEFD  ESQFSS+ L+VEI RRR+NNV REI ES D+
Subjt:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPILLRGYN-------KDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDK

Query:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS
        LRIRSE+LNQAKRKIL YSPG+WIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRISKVF+ED F  +RAQVS NSSGEDGTFFLQEYMI R SKS
Subjt:  LRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKS

Query:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF
        FCLYDTRGLSDDS +NIE+LKQWMTKGV HGELV RKSDASSLINRMRCK++  + F LSR +RMINFV+FVVDG SV KSMDG DDIEKDY + ITTAF
Subjt:  FCLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAF

Query:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV
        N PYLSYGDDKPVVVITHGDLLSF DRVRVR HLGNLLGIP  KQIFDIP+SYDPVTEL+IIDMLHYCLEH+DK L  K WTV KDH+S +SA  YF+T+
Subjt:  NSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITV

Query:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
        MVIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD
Subjt:  MVIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD

A0A6J1JVI8 uncharacterized protein LOC111488225 (e value: 3.3e-176; % identity: 74.77)
Query:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPIL------LRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKL
        MGGDTV LSTPSI HD          D SQI+TQLP L      L    KDD+ D+ENG I+GEFD  ESQFSS+ L+VEI RRR+NNV REI ES D+L
Subjt:  MGGDTVPLSTPSIPHD---------SDSSQISTQLPIL------LRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRREIFESYDKL

Query:  RIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSF
        RIRSE+LNQAKRKIL YSPG+WIEQVGGMKLSDYDIP+T SL+L+GPKGSGKSSLINRI KVF+ED F  +RAQVS NSSGEDGTFFLQEYMI R SKSF
Subjt:  RIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSF

Query:  CLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFN
        CLYDTRGLSDDS +NIE+LKQWMTKGV HGELV RKSDASSLINRM CK++  + F LSR +RMINFVIFVVDG SV KSMDG DDIEKDY   ITTAFN
Subjt:  CLYDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFN

Query:  SPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVM
         PYLSYGDDKPVVVITHGDLLSF DRVRVR +LGNLLGIP  KQIFDIP+SYDPVTEL+IIDMLHYCLEH+DK L  K WTV KDHVS +SA  YF+TVM
Subjt:  SPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVM

Query:  VIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD
        VIVII+AYLY  YVH  PEQ E  ++ SK +EI+W EIRHMWLD
Subjt:  VIVIIAAYLYLAYVHRLPEQVEAARDDSKRLEIIWSEIRHMWLD

SwissProt top hits (e value, % identity, alignment)
F4JLQ5 S-protein homolog 2 (e value: 1.1e-08; % identity: 33.65)
Query:  ATPLLLPFERWHIHVLNGLSN-ATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNCIWIA
        +T  + P  +  + + N L N  TL  HCKSKDDDLG+   L  G+ + ++F   F+  TLY+C    PN   SFD  + + R    + +C    C+W  
Subjt:  ATPLLLPFERWHIHVLNGLSN-ATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNCIWIA

Query:  KDDG
        + +G
Subjt:  KDDG

F4JLS0 S-protein homolog 1 (e value: 9.9e-21; % identity: 42.02)
Query:  GAVATPLLLP-FERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNC
        G     +++P    W + V+NGL+   TLF+HCKSK+DDLG+ NL  R + F W F  N   +T +WC+M+K N  ++ + FW +   + L +RC  KNC
Subjt:  GAVATPLLLP-FERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNC

Query:  IWIAKDDGIYLRNNPDNYD
        IW AK DG+YL N+    D
Subjt:  IWIAKDDGIYLRNNPDNYD

P0DN93 S-protein homolog 29 (e value: 2.4e-06; % identity: 40.28)
Query:  PFERWHIHVLNGLS-NATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSF
        PF +  + V N +S   TL + C+SKDDDLG+H LL  G  F W F+ +++ TTL+ C     N    FD++
Subjt:  PFERWHIHVLNGLS-NATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSF

Q2HQ46 S-protein homolog 74 (e value: 4.2e-19; % identity: 41.53)
Query:  VLVRPGAVATPLLLPFERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCT
        VL R       ++     W + V NGL+   TLF+HCKSK++DLGD NL    D F W F  N   +TL+WC+M K +  ++   FW +   + L +RC 
Subjt:  VLVRPGAVATPLLLPFERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCT

Query:  DKNCIWIAKDDGIYLRNN
         KNC+W AK+DG+YL N+
Subjt:  DKNCIWIAKDDGIYLRNN

Q9LW22 S-protein homolog 21 (e value: 1.1e-06; % identity: 33.68)
Query:  LNGLSNATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNCI-WIAKDDGIYLRNN
        LN  +   L VHCKSK++D+G    L  G+   ++FKTNFW TT +WC ++K      +  +     +  +     D +   W+A+DDGIY   +
Subjt:  LNGLSNATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNCI-WIAKDDGIYLRNN

Arabidopsis top hits (e value, % identity, alignment)
AT4G13030.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein (e value: 4.8e-79; % identity: 48.73)
Query:  DEIESQFSSADLDVEICRRRMNNVRREIFESYD-KLRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFE
        D  E  F++    V   RRR     +EI +S+D  LR     L QA+ +IL Y+PG+W +    +KLSDY+IP+TTS++LVGPKG+GKSSL+N+I++V E
Subjt:  DEIESQFSSADLDVEICRRRMNNVRREIFESYD-KLRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFE

Query:  EDHFTLDRAQVSYNSSGEDGTFFLQEYMIPR-HSKSFCLYDTRGLSD-DSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGI
        +D F LDRAQ S+ +  + GT+F+QEYMI R  S SFCLYDTRGLS   SSDN  M++QWMT+GV HGE V   SD+S L +R+     TG         
Subjt:  EDHFTLDRAQVSYNSSGEDGTFFLQEYMIPR-HSKSFCLYDTRGLSD-DSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGI

Query:  RMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIID
        R +N +IFVV+   + KSM    + E  YA  ITTAFNSP L + DDKP VV+THGD+LS  +R RVR+ LG LLGIPP KQIFDIPES D  T ++I +
Subjt:  RMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPVVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIID

Query:  MLHYCLEHSDK---YLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLA
        +L Y L+H+DK   +LP K +T+ K     ++    +I+++ I+ IA  L++A
Subjt:  MLHYCLEHSDK---YLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLA

AT4G13030.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein (e value: 7.4e-80; % identity: 45.79)
Query:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEI-ESQFSSADLDVEICRRRMNNVRREIFESYD-KLRIRSEDLNQAKRK
        MGGDT          D +SS+ S+  P     ++ DD         +G  D++ E  F++    V   RRR     +EI +S+D  LR     L QA+ +
Subjt:  MGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEI-ESQFSSADLDVEICRRRMNNVRREIFESYD-KLRIRSEDLNQAKRK

Query:  ILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPR-HSKSFCLYDTRGLSD-D
        IL Y+PG+W +    +KLSDY+IP+TTS++LVGPKG+GKSSL+N+I++V E+D F LDRAQ S+ +  + GT+F+QEYMI R  S SFCLYDTRGLS   
Subjt:  ILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPR-HSKSFCLYDTRGLSD-D

Query:  SSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKP
        SSDN  M++QWMT+GV HGE V   SD+S L +R+     TG         R +N +IFVV+   + KSM    + E  YA  ITTAFNSP L + DDKP
Subjt:  SSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKP

Query:  VVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDK---YLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAY
         VV+THGD+LS  +R RVR+ LG LLGIPP KQIFDIPES D  T ++I ++L Y L+H+DK   +LP K +T+ K     ++    +I+++ I+ IA  
Subjt:  VVVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDK---YLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAY

Query:  LYLA
        L++A
Subjt:  LYLA

AT4G16295.1 S-protein homologue 1 (e value: 7.0e-22; % identity: 42.02)
Query:  GAVATPLLLP-FERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNC
        G     +++P    W + V+NGL+   TLF+HCKSK+DDLG+ NL  R + F W F  N   +T +WC+M+K N  ++ + FW +   + L +RC  KNC
Subjt:  GAVATPLLLP-FERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCTDKNC

Query:  IWIAKDDGIYLRNNPDNYD
        IW AK DG+YL N+    D
Subjt:  IWIAKDDGIYLRNNPDNYD

AT4G29035.1 Plant self-incompatibility protein S1 family (e value: 3.0e-20; % identity: 41.53)
Query:  VLVRPGAVATPLLLPFERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCT
        VL R       ++     W + V NGL+   TLF+HCKSK++DLGD NL    D F W F  N   +TL+WC+M K +  ++   FW +   + L +RC 
Subjt:  VLVRPGAVATPLLLPFERWHIHVLNGLSNA-TLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRCT

Query:  DKNCIWIAKDDGIYLRNN
         KNC+W AK+DG+YL N+
Subjt:  DKNCIWIAKDDGIYLRNN

AT5G04347.1 Plant self-incompatibility protein S1 family (e value: 9.5e-11; % identity: 38)
Query:  VLNGLSNATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKP---NADVSFDSFWIEKRHMWLNYRCTDKNCIWIAKDDGIYLRNNPD
        V N L+N  L V C+SKDD+LGDH +L  G   +  F  N W  TL+WC + K       V+FD++    R  W          +WIA++DGIY   +P+
Subjt:  VLNGLSNATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKP---NADVSFDSFWIEKRHMWLNYRCTDKNCIWIAKDDGIYLRNNPD


Sequences
CDS sequence
ATGACCAGGGTTCTCTTAATTTTCTTGGTGGCTCTAGTGCTGGTTCGACCCGGTGCGGTGGCGACCCCTCTTCTGTTGCCATTCGAAAGATGGCATATCCATGTGCTTAA
TGGGTTGAGCAACGCCACCTTGTTTGTGCATTGTAAGTCGAAGGACGATGATTTGGGTGACCATAACCTACTTGGACGTGGTGATGAGTTTCAATGGACTTTTAAGACTA
ACTTTTGGATGACAACATTGTATTGGTGTTTTATGCACAAGCCGAATGCTGATGTGTCATTTGATTCATTCTGGATTGAGAAGAGGCATATGTGGCTCAATTATAGATGC
ACGGATAAAAATTGCATTTGGATTGCTAAAGATGACGGAATTTACCTGAGAAACAATCCCGATAATTATGATGAGTTTAACCTAAGCCGTGGAGTAGCCATGGGTGGCGA
TACGGTTCCTCTTTCTACTCCCTCCATCCCTCACGATTCCGATTCTTCTCAAATTTCCACTCAACTGCCCATTCTCCTCAGGGGTTATAATAAGGATGATCAAGAGGATG
TGGAAAATGGATGGATCATTGGAGAATTTGATGAGATTGAGTCACAGTTCTCTTCTGCTGATTTGGATGTGGAAATCTGTCGAAGGAGGATGAATAACGTACGCAGAGAG
ATCTTTGAGAGCTATGATAAATTGCGGATTCGTAGTGAGGACTTGAACCAGGCTAAGAGAAAAATTTTGAGATATTCTCCTGGAGCATGGATTGAGCAGGTAGGTGGAAT
GAAATTATCTGACTACGATATCCCACGAACAACGTCACTTCTACTGGTTGGTCCAAAAGGATCTGGTAAAAGTAGTCTTATTAATAGGATCTCCAAGGTGTTTGAGGAGG
ACCATTTTACTCTAGACAGAGCACAAGTATCATATAATTCATCTGGTGAAGATGGAACATTTTTCCTTCAGGAATATATGATTCCCAGGCACTCAAAGTCTTTCTGTTTA
TATGACACCCGTGGTCTCTCTGATGACTCGTCAGATAACATTGAAATGTTGAAGCAGTGGATGACCAAGGGCGTTCGTCATGGGGAACTTGTCACCAGGAAATCTGATGC
TTCAAGTTTAATAAATAGAATGAGATGTAAAAGTAAAACTGGCCGGAGCTTCTCTCTTTCTAGGGGGATCAGAATGATTAATTTTGTCATATTCGTTGTTGATGGGGCTT
CGGTTTGTAAATCAATGGATGGTGATGATGACATAGAGAAAGATTATGCCCAAGCGATTACTACTGCATTCAACTCTCCTTATTTATCATATGGAGATGACAAACCTGTT
GTCGTAATAACTCATGGAGATCTACTTTCCTTTTGGGACCGTGTTCGTGTACGTATCCATTTAGGAAACTTGTTAGGTATTCCACCAACAAAACAAATATTTGACATCCC
AGAAAGTTATGATCCAGTTACCGAGTTGTCGATAATTGATATGTTACATTACTGTCTGGAGCATTCTGATAAATACCTTCCTCTCAAGGGCTGGACGGTGCTCAAGGATC
ATGTGTCCTTCATATCAGCAAGAATCTACTTTATAACCGTCATGGTGATCGTTATTATCGCAGCCTATCTCTACCTAGCGTATGTTCATCGTCTTCCCGAGCAAGTGGAG
GCCGCTCGAGACGATTCAAAAAGGTTGGAGATAATTTGGTCTGAGATTCGTCACATGTGGTTAGATGAGTAA
mRNA sequence
ATGACCAGGGTTCTCTTAATTTTCTTGGTGGCTCTAGTGCTGGTTCGACCCGGTGCGGTGGCGACCCCTCTTCTGTTGCCATTCGAAAGATGGCATATCCATGTGCTTAA
TGGGTTGAGCAACGCCACCTTGTTTGTGCATTGTAAGTCGAAGGACGATGATTTGGGTGACCATAACCTACTTGGACGTGGTGATGAGTTTCAATGGACTTTTAAGACTA
ACTTTTGGATGACAACATTGTATTGGTGTTTTATGCACAAGCCGAATGCTGATGTGTCATTTGATTCATTCTGGATTGAGAAGAGGCATATGTGGCTCAATTATAGATGC
ACGGATAAAAATTGCATTTGGATTGCTAAAGATGACGGAATTTACCTGAGAAACAATCCCGATAATTATGATGAGTTTAACCTAAGCCGTGGAGTAGCCATGGGTGGCGA
TACGGTTCCTCTTTCTACTCCCTCCATCCCTCACGATTCCGATTCTTCTCAAATTTCCACTCAACTGCCCATTCTCCTCAGGGGTTATAATAAGGATGATCAAGAGGATG
TGGAAAATGGATGGATCATTGGAGAATTTGATGAGATTGAGTCACAGTTCTCTTCTGCTGATTTGGATGTGGAAATCTGTCGAAGGAGGATGAATAACGTACGCAGAGAG
ATCTTTGAGAGCTATGATAAATTGCGGATTCGTAGTGAGGACTTGAACCAGGCTAAGAGAAAAATTTTGAGATATTCTCCTGGAGCATGGATTGAGCAGGTAGGTGGAAT
GAAATTATCTGACTACGATATCCCACGAACAACGTCACTTCTACTGGTTGGTCCAAAAGGATCTGGTAAAAGTAGTCTTATTAATAGGATCTCCAAGGTGTTTGAGGAGG
ACCATTTTACTCTAGACAGAGCACAAGTATCATATAATTCATCTGGTGAAGATGGAACATTTTTCCTTCAGGAATATATGATTCCCAGGCACTCAAAGTCTTTCTGTTTA
TATGACACCCGTGGTCTCTCTGATGACTCGTCAGATAACATTGAAATGTTGAAGCAGTGGATGACCAAGGGCGTTCGTCATGGGGAACTTGTCACCAGGAAATCTGATGC
TTCAAGTTTAATAAATAGAATGAGATGTAAAAGTAAAACTGGCCGGAGCTTCTCTCTTTCTAGGGGGATCAGAATGATTAATTTTGTCATATTCGTTGTTGATGGGGCTT
CGGTTTGTAAATCAATGGATGGTGATGATGACATAGAGAAAGATTATGCCCAAGCGATTACTACTGCATTCAACTCTCCTTATTTATCATATGGAGATGACAAACCTGTT
GTCGTAATAACTCATGGAGATCTACTTTCCTTTTGGGACCGTGTTCGTGTACGTATCCATTTAGGAAACTTGTTAGGTATTCCACCAACAAAACAAATATTTGACATCCC
AGAAAGTTATGATCCAGTTACCGAGTTGTCGATAATTGATATGTTACATTACTGTCTGGAGCATTCTGATAAATACCTTCCTCTCAAGGGCTGGACGGTGCTCAAGGATC
ATGTGTCCTTCATATCAGCAAGAATCTACTTTATAACCGTCATGGTGATCGTTATTATCGCAGCCTATCTCTACCTAGCGTATGTTCATCGTCTTCCCGAGCAAGTGGAG
GCCGCTCGAGACGATTCAAAAAGGTTGGAGATAATTTGGTCTGAGATTCGTCACATGTGGTTAGATGAGTAA
Protein sequence
MTRVLLIFLVALVLVRPGAVATPLLLPFERWHIHVLNGLSNATLFVHCKSKDDDLGDHNLLGRGDEFQWTFKTNFWMTTLYWCFMHKPNADVSFDSFWIEKRHMWLNYRC
TDKNCIWIAKDDGIYLRNNPDNYDEFNLSRGVAMGGDTVPLSTPSIPHDSDSSQISTQLPILLRGYNKDDQEDVENGWIIGEFDEIESQFSSADLDVEICRRRMNNVRRE
IFESYDKLRIRSEDLNQAKRKILRYSPGAWIEQVGGMKLSDYDIPRTTSLLLVGPKGSGKSSLINRISKVFEEDHFTLDRAQVSYNSSGEDGTFFLQEYMIPRHSKSFCL
YDTRGLSDDSSDNIEMLKQWMTKGVRHGELVTRKSDASSLINRMRCKSKTGRSFSLSRGIRMINFVIFVVDGASVCKSMDGDDDIEKDYAQAITTAFNSPYLSYGDDKPV
VVITHGDLLSFWDRVRVRIHLGNLLGIPPTKQIFDIPESYDPVTELSIIDMLHYCLEHSDKYLPLKGWTVLKDHVSFISARIYFITVMVIVIIAAYLYLAYVHRLPEQVE
AARDDSKRLEIIWSEIRHMWLDE
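
As a consistency check between the CDS and protein sequences above, the CDS can be translated codon by codon and compared with the listed protein. The sketch below assumes the standard genetic code (the record does not say which table was used) and only handles unambiguous A/C/G/T codons. `translate` is a hypothetical helper written for this example, not a database or library function. The two test strings are the first 30 and the last 12 nucleotides of the CDS sequence above.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons are written as '*'."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted in directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Raises KeyError for codons containing anything other than A, C, G or T.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGACCAGGGTTCTCTTAATTTTCTTGGTG"  # first 30 nt of the CDS
cds_end = "TTAGATGAGTAA"                      # last 12 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

Translating the full CDS and dropping the trailing `*` should reproduce the protein sequence if the standard-code assumption holds.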