CuGenDBv2

Sgr018691 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018691
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF674)
Genome location: tig00153207:544562..545513
RNA-Seq Expression: Sgr018691
Synteny: Sgr018691
Gene Ontology terms: NA
InterPro domains: IPR007750 - Protein of unknown function DUF674
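The genome location above follows a `contig:start..end` pattern. A minimal sketch of splitting it into its parts is shown here; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus start and end coordinates."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# contig name, a colon, then two integers separated by ".."
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153207:544562..545513'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153207:544562..545513")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span of this record is 952 bp.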


Homology
GenBank top hits: e value, % identity, alignment
XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo] (e value: 1.6e-52; % identity: 57.33)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI
        VRLKLLIDS+ +RVL+GEA+KN IDFLFNLLSLPLG VIRLLKK+GMVG L NLYES+E LN+TYLQPNQS D +LKPKVSF+ ST LLP I++      
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI

Query:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV
                   +A +P+ +      +      + +  ++++      GE  GFVKG+ TY+VMDDL+VK +S  S  TLL KFNIKEV +LEEKVITLDV
Subjt:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV

Query:  DEGVELLEASLQSKTVLTNAFLKRR
        ++GV+LL ASLQSKTVLT+ FL R+
Subjt:  DEGVELLEASLQSKTVLTNAFLKRR

XP_008465479.1 PREDICTED: uncharacterized protein LOC103503094 [Cucumis melo] (e value: 2.2e-70; % identity: 62.36)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA
        M QT+V LKLLID K +RVLYGEA+K FIDFL N+LSLPLG VI LLKK GMVGCLGNLYES+ETLN++YLQPNQS DT+LKPK+ FN  TKLLP +   
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA

Query:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS
                     VPA A  P+    +G T SS    R    +SSS++                      QP + G   G VK LATYIVMDDLTVKHIS
Subjt:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS

Query:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV
        DFSITTLLKKFNIK+VDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRR  ID+D+KLS  +  SV
Subjt:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia] (e value: 6.7e-56; % identity: 57.81)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSF--NCSTKLLPYID
        M   NVRLKLLIDSK QRVL+GEA+KN IDFLFNLLSLPLG VIRLLKK+GMVGCLGNLYES+ETLN+TYLQPNQS D +LKPKVSF  + ST LLP ID
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSF--NCSTKLLPYID

Query:  -TALATSIITFTAPAPV---PALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETV-GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEV
         +A AT+     + A      +++  P+ +     +    +       S+S+     + E   GFVKG+ TY+VMDDL+VK +S  S   LL KFN+KEV
Subjt:  -TALATSIITFTAPAPV---PALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETV-GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEV

Query:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR
         +LEEKV+TLDV+EGV+LL+ASL SKTVLT+ F++R+
Subjt:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR

XP_031741245.1 uncharacterized protein LOC105435653 [Cucumis sativus] (e value: 1.3e-70; % identity: 64.82)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA
        M QT+V LKLLID K +RVLYGEA+K FIDFL N+LSLPLG VIRLLKK+GMVGCLGNLYES+ETLN +YLQPNQS D +LKPK+ FN  TKL+P +D  
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA

Query:  LATSIITFTAPAPVPALALSPSPLSSGFTLSS--PSIQRITSGFSSSSIQPITSGETV---GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDS
         A +    T PA    +  S       ++ S+  PS  +  S        P    +     GFVK LATYIV DDLTVKHISDFSITTLLKKFNIK+VDS
Subjt:  LATSIITFTAPAPVPALALSPSPLSSGFTLSS--PSIQRITSGFSSSSIQPITSGETV---GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDS

Query:  LEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV
        LEEKVITLDV+EGVELLEASLQSKTVLTNAFLKRRRS IDND+KLS  +  SV
Subjt:  LEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV

XP_038891339.1 uncharacterized protein LOC120080784 [Benincasa hispida] (e value: 4.4e-63; % identity: 57.35)
Query:  TNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNET-YLQPNQSIDTILKPKVSFNCSTKLLPYI-----
        TNVRLKLLID++ + VLYGEA+K+FIDFLFNLLSLPLGAVIRLL K+ M+GCLGNLYESIETLNET +++P QS +T+L+PKVS NCSTKLLPYI     
Subjt:  TNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNET-YLQPNQSIDTILKPKVSFNCSTKLLPYI-----

Query:  ------------------------DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSS--SSIQPITSGET--------------VG
                                 T L+ S   F A +   +   S  P SS F+L S     ++SGFSS   S  PI SG T              VG
Subjt:  ------------------------DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSS--SSIQPITSGET--------------VG

Query:  FVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDN
        FVKGLATYIVMDDLTVKHISDFSI +L +KFNIK+  +LEEKVITL+VDEGVELL A+LQSK VLT+ FL+R R  IDN
Subjt:  FVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDN

TrEMBL top hits: e value, % identity, alignment
A0A1S3CGQ2 uncharacterized protein LOC103500268 (e value: 7.5e-53; % identity: 57.33)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI
        VRLKLLIDS+ +RVL+GEA+KN IDFLFNLLSLPLG VIRLLKK+GMVG L NLYES+E LN+TYLQPNQS D +LKPKVSF+ ST LLP I++      
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI

Query:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV
                   +A +P+ +      +      + +  ++++      GE  GFVKG+ TY+VMDDL+VK +S  S  TLL KFNIKEV +LEEKVITLDV
Subjt:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV

Query:  DEGVELLEASLQSKTVLTNAFLKRR
        ++GV+LL ASLQSKTVLT+ FL R+
Subjt:  DEGVELLEASLQSKTVLTNAFLKRR

A0A1S3CPD6 uncharacterized protein LOC103503094 (e value: 1.0e-70; % identity: 62.36)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA
        M QT+V LKLLID K +RVLYGEA+K FIDFL N+LSLPLG VI LLKK GMVGCLGNLYES+ETLN++YLQPNQS DT+LKPK+ FN  TKLLP +   
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA

Query:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS
                     VPA A  P+    +G T SS    R    +SSS++                      QP + G   G VK LATYIVMDDLTVKHIS
Subjt:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS

Query:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV
        DFSITTLLKKFNIK+VDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRR  ID+D+KLS  +  SV
Subjt:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV

A0A5A7U8V2 DUF674 domain-containing protein (e value: 7.5e-53; % identity: 57.33)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI
        VRLKLLIDS+ +RVL+GEA+KN IDFLFNLLSLPLG VIRLLKK+GMVG L NLYES+E LN+TYLQPNQS D +LKPKVSF+ ST LLP I++      
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSI

Query:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV
                   +A +P+ +      +      + +  ++++      GE  GFVKG+ TY+VMDDL+VK +S  S  TLL KFNIKEV +LEEKVITLDV
Subjt:  ITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDV

Query:  DEGVELLEASLQSKTVLTNAFLKRR
        ++GV+LL ASLQSKTVLT+ FL R+
Subjt:  DEGVELLEASLQSKTVLTNAFLKRR

A0A5A7V731 Putative DNA polymerase zeta catalytic subunit (e value: 1.0e-70; % identity: 62.36)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA
        M QT+V LKLLID K +RVLYGEA+K FIDFL N+LSLPLG VI LLKK GMVGCLGNLYES+ETLN++YLQPNQS DT+LKPK+ FN  TKLLP +   
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTA

Query:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS
                     VPA A  P+    +G T SS    R    +SSS++                      QP + G   G VK LATYIVMDDLTVKHIS
Subjt:  LATSIITFTAPAPVPALALSPSPL-SSGFTLSSPSIQRITSGFSSSSI----------------------QPITSGETVGFVKGLATYIVMDDLTVKHIS

Query:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV
        DFSITTLLKKFNIK+VDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRR  ID+D+KLS  +  SV
Subjt:  DFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDIKLSNDVKSSV

A0A6J1CBJ8 uncharacterized protein LOC111010013 (e value: 3.3e-56; % identity: 57.81)
Query:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSF--NCSTKLLPYID
        M   NVRLKLLIDSK QRVL+GEA+KN IDFLFNLLSLPLG VIRLLKK+GMVGCLGNLYES+ETLN+TYLQPNQS D +LKPKVSF  + ST LLP ID
Subjt:  MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSF--NCSTKLLPYID

Query:  -TALATSIITFTAPAPV---PALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETV-GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEV
         +A AT+     + A      +++  P+ +     +    +       S+S+     + E   GFVKG+ TY+VMDDL+VK +S  S   LL KFN+KEV
Subjt:  -TALATSIITFTAPAPV---PALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETV-GFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEV

Query:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR
         +LEEKV+TLDV+EGV+LL+ASL SKTVLT+ F++R+
Subjt:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
AT3G09110.1 Protein of unknown function (DUF674) (e value: 9.8e-13; % identity: 29.26)
Query:  LKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKK-----EGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPY-IDTAL
        L+LLID +  RV+  EA K+F+D L +LL+LP+G ++RLL+K       +VGCL NLY+S+  ++    +       +L P+ +     + L   ID   
Subjt:  LKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKK-----EGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPY-IDTAL

Query:  ATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVG-FVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKV
        AT    F  P  V   A            S+ S  +   G S     P+   +  G F     ++++ DDL V   S   +  +L  F     D L+E +
Subjt:  ATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVG-FVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKV

Query:  ITLDVDEGVELLEASLQSKTVLTNAFLKR
        I +  +E + LL     S+  LT+ FL++
Subjt:  ITLDVDEGVELLEASLQSKTVLTNAFLKR

AT3G09120.1 Protein of unknown function (DUF674) (e value: 8.3e-12; % identity: 27.39)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKE----GMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLP-YIDTA
        + LKLL+D K  +V+  EA ++F+D LF LL+ P+G + RLL+K      ++GC  NL  S+  +     +       +L PK S     + L  +ID  
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKE----GMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLP-YIDTA

Query:  LATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKV
         AT     +      +   S +   S  +  S  I +I        +  + + E V FV   +++I+ DDL V   S   I  +L       ++ L+E +
Subjt:  LATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKV

Query:  ITLDVDEGVELLEASLQSKTVLTNAFLKRR
        I +  +E + LL     S++ LT+ FL ++
Subjt:  ITLDVDEGVELLEASLQSKTVLTNAFLKRR

AT5G01120.1 Protein of unknown function (DUF674) (e value: 1.1e-11; % identity: 29.72)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLK-----KEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFN---CSTKLLPYI
        + LKLLID +  +V++ EA  +F+D LF+  +LP+G ++RLL+     +   +GC  N+Y S+ ++   +         +L P  S N   C    L  I
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLK-----KEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFN---CSTKLLPYI

Query:  DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSG-FSSSSIQPITSG------ETVGFVKGLAT-YIVMDDLTVKHISDFSITTLLKKFN
        D + AT    F  P     + +     S G+  S+    R + G F    IQ    G      +   FV+G  T +I+ DDL V+  S  S   +LK   
Subjt:  DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSG-FSSSSIQPITSG------ETVGFVKGLAT-YIVMDDLTVKHISDFSITTLLKKFN

Query:  IKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDI
          + D L E ++ +++ E   LL     S T LT+ FLK++ S   N I
Subjt:  IKEVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQIDNDI

AT5G01130.1 Protein of unknown function (DUF674) (e value: 2.2e-12; % identity: 29.75)
Query:  QTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEG-----MVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYI
        +  V L+L ID +  +V+  EA K F+D LF+LL+LP+G +IRLL++        VGC  NLY S+  +     + +     +L P+     S + L Y 
Subjt:  QTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEG-----MVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYI

Query:  DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSG------FSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIK
           L  +      P  V    L   P+S  F  S  S  R   G      F    + P+ S      V G+  +I+ DDL V   S   +   LK     
Subjt:  DTALATSIITFTAPAPVPALALSPSPLSSGFTLSSPSIQRITSG------FSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIK

Query:  EVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQ
        ++  L E ++ +  +E + LLE    SK  LTN FL ++  Q
Subjt:  EVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSQ

AT5G43240.1 Protein of unknown function (DUF674) (e value: 5.4e-11; % identity: 27.62)
Query:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLK-----KEGMVGCLGNLYESIETLNETYLQPNQSIDTILKP-KVSFNCSTKLLPYIDT
        ++LKLLID +  +V++ EA K+F+D LF+  +LP+G ++RLL+     ++  +GC  N+Y S+ ++   +         +L P  ++      L   +D 
Subjt:  VRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLK-----KEGMVGCLGNLYESIETLNETYLQPNQSIDTILKP-KVSFNCSTKLLPYIDT

Query:  ALATSIITFTAPAPVPALALSPSPLSSGFTLSSPS----IQRITSGFSSSSIQPITSGETVG-FVKGLAT-YIVMDDLTVKHISDFSITTLLKKFNIKEV
        + AT    F  P  V     + S   S F  S  S    +  +T       +    +G   G FV+   T +++ DDL V+  S      +LK     + 
Subjt:  ALATSIITFTAPAPVPALALSPSPLSSGFTLSSPS----IQRITSGFSSSSIQPITSGETVG-FVKGLAT-YIVMDDLTVKHISDFSITTLLKKFNIKEV

Query:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRS
        + L+EK+  ++++E   LLE    S   LT+ FLK++ S
Subjt:  DSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRS


Sequences
CDS sequence
ATGACACAAACTAATGTGAGATTGAAGCTTTTGATCGACTCAAAAGCTCAAAGAGTTCTTTACGGCGAAGCAGAAAAGAATTTCATCGACTTTCTTTTCAATCTACTTTC
TCTCCCACTTGGAGCTGTAATTAGGCTCCTGAAAAAGGAAGGCATGGTGGGGTGCTTGGGAAATCTGTACGAGAGTATAGAAACCTTGAACGAGACGTATTTACAGCCAA
ACCAAAGCATAGATACGATCTTAAAACCCAAAGTCTCATTCAATTGTTCGACCAAGCTTTTGCCTTATATCGATACCGCCTTAGCTACATCTATAATAACATTTACAGCT
CCAGCTCCAGTTCCAGCTCTAGCTCTATCTCCATCTCCACTTTCTTCTGGCTTTACTCTTTCAAGTCCAAGCATCCAACGAATTACTAGTGGTTTTTCAAGTTCAAGCAT
CCAACCAATTACTAGTGGGGAAACGGTGGGATTTGTGAAGGGTTTGGCAACTTACATTGTGATGGATGACCTAACTGTGAAGCACATTTCTGACTTCTCCATTACTACCC
TTTTGAAAAAGTTCAATATCAAGGAAGTGGATTCTTTGGAGGAGAAAGTTATCACTTTGGATGTTGATGAGGGTGTGGAGTTACTAGAGGCCTCTCTGCAGTCGAAGACA
GTTCTTACGAATGCTTTTCTTAAGAGACGCAGATCACAGATTGACAATGATATTAAGTTGTCTAATGATGTTAAGTCTTCTGTCTAA
mRNA sequence
ATGACACAAACTAATGTGAGATTGAAGCTTTTGATCGACTCAAAAGCTCAAAGAGTTCTTTACGGCGAAGCAGAAAAGAATTTCATCGACTTTCTTTTCAATCTACTTTC
TCTCCCACTTGGAGCTGTAATTAGGCTCCTGAAAAAGGAAGGCATGGTGGGGTGCTTGGGAAATCTGTACGAGAGTATAGAAACCTTGAACGAGACGTATTTACAGCCAA
ACCAAAGCATAGATACGATCTTAAAACCCAAAGTCTCATTCAATTGTTCGACCAAGCTTTTGCCTTATATCGATACCGCCTTAGCTACATCTATAATAACATTTACAGCT
CCAGCTCCAGTTCCAGCTCTAGCTCTATCTCCATCTCCACTTTCTTCTGGCTTTACTCTTTCAAGTCCAAGCATCCAACGAATTACTAGTGGTTTTTCAAGTTCAAGCAT
CCAACCAATTACTAGTGGGGAAACGGTGGGATTTGTGAAGGGTTTGGCAACTTACATTGTGATGGATGACCTAACTGTGAAGCACATTTCTGACTTCTCCATTACTACCC
TTTTGAAAAAGTTCAATATCAAGGAAGTGGATTCTTTGGAGGAGAAAGTTATCACTTTGGATGTTGATGAGGGTGTGGAGTTACTAGAGGCCTCTCTGCAGTCGAAGACA
GTTCTTACGAATGCTTTTCTTAAGAGACGCAGATCACAGATTGACAATGATATTAAGTTGTCTAATGATGTTAAGTCTTCTGTCTAA
Protein sequence
MTQTNVRLKLLIDSKAQRVLYGEAEKNFIDFLFNLLSLPLGAVIRLLKKEGMVGCLGNLYESIETLNETYLQPNQSIDTILKPKVSFNCSTKLLPYIDTALATSIITFTA
PAPVPALALSPSPLSSGFTLSSPSIQRITSGFSSSSIQPITSGETVGFVKGLATYIVMDDLTVKHISDFSITTLLKKFNIKEVDSLEEKVITLDVDEGVELLEASLQSKT
VLTNAFLKRRRSQIDNDIKLSNDVKSSV
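The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last four codons of the CDS listed above.
print(translate("ATGACACAAACTAATGTGAGATTGAAG"))  # MTQTNVRLK
print(translate("TCTTCTGTCTAA"))                 # SSV
```

Passing the full CDS (with its line breaks) to `translate` should give a sequence that can be compared directly with the protein sequence listed above.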