CuGenDBv2

Moc10g26580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g26580
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr10:19525075..19526109
RNA-Seq Expression: Moc10g26580
Synteny: Moc10g26580
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
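
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the `chr:start..end` coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:19525075..19526109")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 1035 bp, which is a multiple of three (345 codons).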


Homology
GenBank top hits | e value | %identity | Alignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] | 3.4e-58 | 40.96
Query:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG---TGANATTLIANPAFDSW
        T   SSS+  T  V  +   +N++  S+PF N L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +      G + +   +NP ++ W
Subjt:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG---TGANATTLIANPAFDSW

Query:  STTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLD
           DQ L+ WLY SMT +VA  ++   T   +WKALE+L+GA +K +   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N+L GLD
Subjt:  STTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLD

Query:  AEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRGGQNTYQRG
        +EY+PI   I  +E+ TWQE++ TLL++++ L H+N V+   N++   +A +     N + N    S+ QN N+G  R   RG  RG GGR      +  
Subjt:  AEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRGGQNTYQRG

Query:  NSKPTCQVCGKFRHSVAICYHRLDENYMGNTP
        NS+PTCQVCGKF HS ++CY R D+NYMG+ P
Subjt:  NSKPTCQVCGKFRHSVAICYHRLDENYMGNTP

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense] | 1.9e-56 | 40
Query:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG-----------TGANATTLI
        T   SSS+  T  V  +   +N++  S+PF N L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +              G + +   
Subjt:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG-----------TGANATTLI

Query:  ANPAFDSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLI
        +NP ++ W   DQ L+ WLY SMT +VA  ++   T   +WKALE+L+GA +K +   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L 
Subjt:  ANPAFDSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLI

Query:  TNVLMGLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRG
         N L GLD+EY+PI   I  +E+ TWQE++ TLL++++ L H+N V+   N++   +A +     N + N    S+ QN N+G  R   RG  RG GGR 
Subjt:  TNVLMGLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRG

Query:  GQNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENYMGNTP
             +  NS+PTCQVCGKF HS ++CY R D+NYMG+ P
Subjt:  GQNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENYMGNTP

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia] | 2.1e-63 | 59.39
Query:  MTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKE
        M+P +ACD+L++ T RDVWKALEDLY   NK RI QLK +LQ TRKNQLKMS+YLSTMKQLA  L LAGEP+S +SL+++VL GL+AEYL I CQIN KE
Subjt:  MTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKE

Query:  NMTWQEMHATLLAFENTLIHLNVVTNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN-SKPTCQVCGKFRHSVAI
        N++WQE+HATL+ FEN LIHLN V +I DV+  S NY  N+S +     H Q Q RGQGRN RG N     RGG+   QR N S+PTCQVCGK  H   +
Subjt:  NMTWQEMHATLLAFENTLIHLNVVTNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN-SKPTCQVCGKFRHSVAI

Query:  CYHRLDENYMGNTPQTKQQGSWSFYGNPK
        CYHRL+  YMGNTPQ   Q   ++   P+
Subjt:  CYHRLDENYMGNTPQTKQQGSWSFYGNPK

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] | 2.1e-76 | 49.1
Query:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF
        M TE + +S++    V  +A  T N     +  F +PL TVL VKLD+KNY LW+ M+ A L GQK DGY++GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF

Query:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM
          W   DQ+LL WL+GSMTPS+ACD+++  + R+VWKALEDLYGAT+K RI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL 
Subjt:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM

Query:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG
        GL+AEYLPI CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY  ++  ++  R  +Q+Q+ +GQGR      + + N RGR GRG 
Subjt:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG

Query:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY
         + Y+  NSKP+CQ+CGK+ H  A+CY R DEN+
Subjt:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] | 2.1e-76 | 49.1
Query:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF
        M TE + +S++    V  +A  T N     +  F +PL TVL VKLD+KNY LW+ M+ A L GQK DGY++GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF

Query:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM
          W   DQ+LL WL+GSMTPS+ACD+++  + R+VWKALEDLYGAT+K RI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL 
Subjt:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM

Query:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG
        GL+AEYLPI CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY  ++  ++  R  +Q+Q+ +GQGR      + + N RGR GRG 
Subjt:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG

Query:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY
         + Y+  NSKP+CQ+CGK+ H  A+CY R DEN+
Subjt:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY

TrEMBL top hits | e value | %identity | Alignment
A0A5C7HHE9 Uncharacterized protein | 1.7e-58 | 40.96
Query:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG---TGANATTLIANPAFDSW
        T   SSS+  T  V  +   +N++  S+PF N L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +      G + +   +NP ++ W
Subjt:  TEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQG---TGANATTLIANPAFDSW

Query:  STTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLD
           DQ L+ WLY SMT +VA  ++   T   +WKALE+L+GA +K +   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N+L GLD
Subjt:  STTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLD

Query:  AEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRGGQNTYQRG
        +EY+PI   I  +E+ TWQE++ TLL++++ L H+N V+   N++   +A +     N + N    S+ QN N+G  R   RG  RG GGR      +  
Subjt:  AEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVT---NIIDVTNASV-NYASNRSFNLRGRSHYQNQNRGQGR-NQRGNNRGRGGRGGQNTYQRG

Query:  NSKPTCQVCGKFRHSVAICYHRLDENYMGNTP
        NS+PTCQVCGKF HS ++CY R D+NYMG+ P
Subjt:  NSKPTCQVCGKFRHSVAICYHRLDENYMGNTP

A0A6J1CLV9 uncharacterized protein LOC111012809 | 1.0e-63 | 59.39
Query:  MTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKE
        M+P +ACD+L++ T RDVWKALEDLY   NK RI QLK +LQ TRKNQLKMS+YLSTMKQLA  L LAGEP+S +SL+++VL GL+AEYL I CQIN KE
Subjt:  MTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKE

Query:  NMTWQEMHATLLAFENTLIHLNVVTNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN-SKPTCQVCGKFRHSVAI
        N++WQE+HATL+ FEN LIHLN V +I DV+  S NY  N+S +     H Q Q RGQGRN RG N     RGG+   QR N S+PTCQVCGK  H   +
Subjt:  NMTWQEMHATLLAFENTLIHLNVVTNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN-SKPTCQVCGKFRHSVAI

Query:  CYHRLDENYMGNTPQTKQQGSWSFYGNPK
        CYHRL+  YMGNTPQ   Q   ++   P+
Subjt:  CYHRLDENYMGNTPQTKQQGSWSFYGNPK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 | 1.0e-76 | 49.1
Query:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF
        M TE + +S++    V  +A  T N     +  F +PL TVL VKLD+KNY LW+ M+ A L GQK DGY++GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF

Query:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM
          W   DQ+LL WL+GSMTPS+ACD+++  + R+VWKALEDLYGAT+K RI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL 
Subjt:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM

Query:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG
        GL+AEYLPI CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY  ++  ++  R  +Q+Q+ +GQGR      + + N RGR GRG 
Subjt:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG

Query:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY
         + Y+  NSKP+CQ+CGK+ H  A+CY R DEN+
Subjt:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 | 1.0e-76 | 49.1
Query:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF
        M TE + +S++    V  +A  T N     +  F +PL TVL VKLD+KNY LW+ M+ A L GQK DGY++GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNLNTLNVMPIATATNNTIS--SNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATT--LIANPAF

Query:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM
          W   DQ+LL WL+GSMTPS+ACD+++  + R+VWKALEDLYGAT+K RI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL 
Subjt:  DSWSTTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLM

Query:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG
        GL+AEYLPI CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY  ++  ++  R  +Q+Q+ +GQGR      + + N RGR GRG 
Subjt:  GLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNII--DVTNASVNYASNRSFNLRGRSHYQNQN-RGQGR------NQRGNNRGRGGRGG

Query:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY
         + Y+  NSKP+CQ+CGK+ H  A+CY R DEN+
Subjt:  QNTYQRGNSKPTCQVCGKFRHSVAICYHRLDENY

A0A803R2Q2 Uncharacterized protein | 2.4e-57 | 40.84
Query:  GSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQ----GTGANATTLIANPAFDSWS
        G ++   +T+ + P  +++N +  S PFSN LS   ++KLD  N+ LWK+M+   + G +LDG+I G    PPE I+       A  + +  NP +++W 
Subjt:  GSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQ----GTGANATTLIANPAFDSWS

Query:  TTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDA
          DQ L+ WLYGSMT ++A +++   +   +W ALE+LYGA ++  + +L+  +Q TRK    M+EYL   +  A SLALAGEP  E  L++NVL GLD 
Subjt:  TTDQSLLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDA

Query:  EYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNIID-VTNASVNYASN-RSFNLRGRSHY---QNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN
        EYL I   I  +E+ +WQ++ + LL+F+  L  LN ++     V NAS N+A    S N  G+  +   QNQ RGQ +   G  RG G + G+   +   
Subjt:  EYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTNIID-VTNASVNYASN-RSFNLRGRSHY---QNQNRGQGRNQRGNNRGRGGRGGQNTYQRGN

Query:  SKPTCQVCGKFRHSVAICYHRLDENYMGNTPQT
         KPTCQVCGK+ HS AICY+R DE+YMGN P T
Subjt:  SKPTCQVCGKFRHSVAICYHRLDENYMGNTPQT

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 2.2e-07 | 26
Query:  LDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATTLIANPAFDSWSTTDQSLLAWLYGSMTP-SVACDILNLHTFRDVWKALEDLYGAT
        ++E NY  W+ +     L   + G+I GT+         T AN           +W   D  +   LYG++TP       +   T RD+W  +++ +   
Subjt:  LDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATTLIANPAFDSWSTTDQSLLAWLYGSMTP-SVACDILNLHTFRDVWKALEDLYGAT

Query:  NKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTN--I
           R  +L   L+      +++++Y   MK+LA SL     PV++ +L+  VL GL+ ++  I   I  ++     +  AT+L  E   +   +  N   
Subjt:  NKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKENMTWQEMHATLLAFENTLIHLNVVTN--I

Query:  IDVTNASVNYASNRS---FNLRGRSHYQNQNRGQGRNQRGNN--RGRGGR
        +D +++S   A + +    N +     Q   RG+G   RGNN  RGRGGR
Subjt:  IDVTNASVNYASNRS---FNLRGRSHYQNQNRGQGRNQRGNN--RGRGGR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.8e-09 | 26.02
Query:  LAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATTLIANPAFDSWSTTDQSLLAWLYGSMTPSVACDILNLH-TFRDVWKALEDL
        + + L++ NY +W+ +     L   + G+I G+ + P  M +                 W   D  +  W+YG++T S+   I+ +  T RD+W +LE+L
Subjt:  LAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATTLIANPAFDSWSTTDQSLLAWLYGSMTPSVACDILNLH-TFRDVWKALEDL

Query:  YGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKENM-TWQEMHATLLAFENTLIHLNVV
        +    + R  Q +  L+ T  + L + EY   +K L+  L     P+S+  L+ ++L GL  +Y  I   I  K    ++ E  + LL  E+ L + +  
Subjt:  YGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQINGKENM-TWQEMHATLLAFENTLIHLNVV

Query:  ----TNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRG---GRGGQNTYQRGNSKPT
            TN   ++N        +        H  N N G+GR+++  NRG G   GR   N   R N  PT
Subjt:  ----TNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRG---GRGGQNTYQRGNSKPT


Sequences
CDS sequence
ATGGACACCGAAGGTTCTTCCTCCTCAAATCTTAATACCCTAAATGTAATGCCAATCGCCACTGCAACTAACAACACCATATCATCAAATCCCTTTAGCAACCCA
CTTAGTACAGTATTGGCAGTCAAGTTGGATGAAAAGAATTATCTTCTCTGGAAATCCATGATTACTGCTGCTCTCCTTGGACAGAAGCTTGATGGCTACATTATG
GGAACAATTGCTCAACCTCCAGAAATGATTCAAGGTACCGGTGCAAATGCCACCACACTCATTGCAAATCCTGCGTTTGATTCATGGTCTACCACAGATCAATCG
CTCCTAGCCTGGTTGTATGGATCCATGACTCCATCTGTGGCTTGTGACATCCTCAATCTACACACGTTTAGAGATGTATGGAAAGCACTAGAAGATCTCTATGGA
GCAACAAACAAGGTCCGAATCACCCAACTGAAAAGAAACCTTCAAATGACGAGGAAAAATCAGTTGAAAATGAGCGAATATCTTTCAACAATGAAGCAGCTTGCT
CACAGTCTTGCTCTAGCAGGCGAACCGGTAAGTGAAAATTCTCTCATCACTAATGTTCTCATGGGTCTTGATGCAGAATATTTACCAATAGCTTGCCAAATCAAT
GGAAAAGAAAATATGACCTGGCAAGAAATGCATGCCACATTGCTAGCTTTTGAAAACACACTCATTCATCTTAATGTTGTAACCAACATCATTGATGTCACAAAT
GCATCAGTCAACTATGCCTCAAATAGGAGCTTTAATCTAAGAGGAAGGTCTCACTACCAAAATCAAAATCGCGGACAAGGAAGAAATCAAAGAGGAAACAACCGT
GGCAGAGGAGGCAGAGGAGGACAGAACACATACCAAAGAGGAAACTCCAAACCAACTTGCCAGGTTTGTGGAAAATTTAGGCACTCTGTTGCAATTTGCTACCAT
AGACTTGATGAAAATTACATGGGAAACACACCACAAACAAAACAACAAGGCTCCTGGAGCTTTTATGGCAACCCCAAATGTTGTGAATGA
mRNA sequence
ATGGACACCGAAGGTTCTTCCTCCTCAAATCTTAATACCCTAAATGTAATGCCAATCGCCACTGCAACTAACAACACCATATCATCAAATCCCTTTAGCAACCCA
CTTAGTACAGTATTGGCAGTCAAGTTGGATGAAAAGAATTATCTTCTCTGGAAATCCATGATTACTGCTGCTCTCCTTGGACAGAAGCTTGATGGCTACATTATG
GGAACAATTGCTCAACCTCCAGAAATGATTCAAGGTACCGGTGCAAATGCCACCACACTCATTGCAAATCCTGCGTTTGATTCATGGTCTACCACAGATCAATCG
CTCCTAGCCTGGTTGTATGGATCCATGACTCCATCTGTGGCTTGTGACATCCTCAATCTACACACGTTTAGAGATGTATGGAAAGCACTAGAAGATCTCTATGGA
GCAACAAACAAGGTCCGAATCACCCAACTGAAAAGAAACCTTCAAATGACGAGGAAAAATCAGTTGAAAATGAGCGAATATCTTTCAACAATGAAGCAGCTTGCT
CACAGTCTTGCTCTAGCAGGCGAACCGGTAAGTGAAAATTCTCTCATCACTAATGTTCTCATGGGTCTTGATGCAGAATATTTACCAATAGCTTGCCAAATCAAT
GGAAAAGAAAATATGACCTGGCAAGAAATGCATGCCACATTGCTAGCTTTTGAAAACACACTCATTCATCTTAATGTTGTAACCAACATCATTGATGTCACAAAT
GCATCAGTCAACTATGCCTCAAATAGGAGCTTTAATCTAAGAGGAAGGTCTCACTACCAAAATCAAAATCGCGGACAAGGAAGAAATCAAAGAGGAAACAACCGT
GGCAGAGGAGGCAGAGGAGGACAGAACACATACCAAAGAGGAAACTCCAAACCAACTTGCCAGGTTTGTGGAAAATTTAGGCACTCTGTTGCAATTTGCTACCAT
AGACTTGATGAAAATTACATGGGAAACACACCACAAACAAAACAACAAGGCTCCTGGAGCTTTTATGGCAACCCCAAATGTTGTGAATGA
Protein sequence
MDTEGSSSSNLNTLNVMPIATATNNTISSNPFSNPLSTVLAVKLDEKNYLLWKSMITAALLGQKLDGYIMGTIAQPPEMIQGTGANATTLIANPAFDSWSTTDQS
LLAWLYGSMTPSVACDILNLHTFRDVWKALEDLYGATNKVRITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLMGLDAEYLPIACQIN
GKENMTWQEMHATLLAFENTLIHLNVVTNIIDVTNASVNYASNRSFNLRGRSHYQNQNRGQGRNQRGNNRGRGGRGGQNTYQRGNSKPTCQVCGKFRHSVAICYH
RLDENYMGNTPQTKQQGSWSFYGNPKCCE
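
The CDS and protein listings above can be cross-checked by translating the nucleotide sequence. This is a minimal sketch, not part of the database. It assumes the standard genetic code (NCBI translation table 1), which the page does not state. The function name `translate` and the constants `CDS_FIRST_LINE` and `CDS_LAST_LINE` are hypothetical, and only the first and last lines of the CDS listing are included to keep the example short.

```python
# Standard genetic code (NCBI table 1), built from the conventional
# TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the CDS listing above. The last line is
# treated as in frame, which holds if every preceding line is a
# multiple of three nucleotides long.
CDS_FIRST_LINE = (
    "ATGGACACCGAAGGTTCTTCCTCCTCAAATCTTAATACCCTAAATGTAATGCCAATCGCC"
    "ACTGCAACTAACAACACCATATCATCAAATCCCTTTAGCAACCCA"
)
CDS_LAST_LINE = (
    "AGACTTGATGAAAATTACATGGGAAACACACCACAAACAAAACAACAAGGCTCCTGGAGC"
    "TTTTATGGCAACCCCAAATGTTGTGAATGA"
)

print(translate(CDS_FIRST_LINE))
print(translate(CDS_LAST_LINE))
```

The two translated fragments correspond to the start and end of the protein sequence listed above. Concatenating all CDS lines before calling `translate` would reproduce the full protein.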