; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020410 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020410
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNDR1/HIN1-like protein 12
Genome locationtig00153490:923093..923695
RNA-Seq ExpressionSgr020410
SyntenySgr020410
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600429.1 hypothetical protein SDJN03_05662, partial [Cucurbita argyrosperma subsp. sororia]3.2e-8883.5Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS+AK RTHPL+WFAAVLCTLVSIAVII GV VFIGYLVIHPRIPTISVI AHLDNF+NDIAGRLEV LTI+VEAENDNAKAHASFSDTS FLSF
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LG+NIA+LVA PFDVRKNSS +F YAV+S  IPLNPEQMEEVDFALKTDL +FDL GNTR QWRVGL GSVK+ C +HC LKFH SNGT+L  PCSSRAK
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

KAG7031080.1 hypothetical protein SDJN02_05119, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-8883.5Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS+AK RTHPL+WFAAVLCTLVSIAVII GV VFIGYLVIHPRIPTISVI AHLDNF+NDIAGRLEV LTI+VEA NDNAKAHASFSDTS FLSF
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LG+NIA+LVA PFDVRKNSS +FHYAV+S  IPLNPEQMEEVDFALKTDL +FDL GNTR QWRVGL GSVK+ C +HC LKFH SNGT+L  PCSSRAK
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

XP_004150550.1 uncharacterized protein LOC101214190 [Cucumis sativus]2.7e-8782Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK  THPLVW AA+LCT+VSIAVIIGG+VVFIGYLVIHPRIPTIS++ AHLDNF+ DIAGRLEVQLTI++EA+NDNAKAHASFSD+SFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LGI IAQLVA PF+VRKNSS +F YAV S+SIPLNPEQME VD  LK DLSRFDL GNTR QWRVGLLGSVKY CHLHC LKFHPSNGT+LS PCSSR K
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

XP_008451870.1 PREDICTED: NDR1/HIN1-like protein 12 [Cucumis melo]3.5e-8782Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK RTHPLVW AA+LCT+VSIAVIIGG+VVFIGYLVIHPRIPTIS++ AHLDNF+ DIAGRLEVQLTIVVEA+NDNAKAHASFSD+SFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        L I IAQLVA PF+V+KNSS +FHYAV S+SIPLNPEQME VD+ LK DLS F L GNTR QWRVGLLGSVKY CHLHCELKFHPSNGT+L  PCSSR K
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

XP_022136394.1 uncharacterized protein LOC111008115 [Momordica charantia]3.5e-9586Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPP+LIS AK RTHPLVWFAAVLCT++SIAVIIGG+VVF+GYLVIHPRIPTISVIGAHLD F+ D+AGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LGI IA+LVA PF VRKNSS +F YA+ SSSIPLNPEQMEE D+ALK+DLSRFDLKGNTR QWRVG+LGSVKYWCHLHCELKFHPSNGT+LS+PCSS+AK
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

TrEMBL top hitse value%identityAlignment
A0A0A0KX91 Uncharacterized protein1.3e-8782Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK  THPLVW AA+LCT+VSIAVIIGG+VVFIGYLVIHPRIPTIS++ AHLDNF+ DIAGRLEVQLTI++EA+NDNAKAHASFSD+SFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LGI IAQLVA PF+VRKNSS +F YAV S+SIPLNPEQME VD  LK DLSRFDL GNTR QWRVGLLGSVKY CHLHC LKFHPSNGT+LS PCSSR K
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

A0A1S3BSH0 NDR1/HIN1-like protein 121.7e-8782Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK RTHPLVW AA+LCT+VSIAVIIGG+VVFIGYLVIHPRIPTIS++ AHLDNF+ DIAGRLEVQLTIVVEA+NDNAKAHASFSD+SFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        L I IAQLVA PF+V+KNSS +FHYAV S+SIPLNPEQME VD+ LK DLS F L GNTR QWRVGLLGSVKY CHLHCELKFHPSNGT+L  PCSSR K
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

A0A5D3D1V8 NDR1/HIN1-like protein 121.7e-8782Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK RTHPLVW AA+LCT+VSIAVIIGG+VVFIGYLVIHPRIPTIS++ AHLDNF+ DIAGRLEVQLTIVVEA+NDNAKAHASFSD+SFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        L I IAQLVA PF+V+KNSS +FHYAV S+SIPLNPEQME VD+ LK DLS F L GNTR QWRVGLLGSVKY CHLHCELKFHPSNGT+L  PCSSR K
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

A0A6J1C468 uncharacterized protein LOC1110081151.7e-9586Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPP+LIS AK RTHPLVWFAAVLCT++SIAVIIGG+VVF+GYLVIHPRIPTISVIGAHLD F+ D+AGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LGI IA+LVA PF VRKNSS +F YA+ SSSIPLNPEQMEE D+ALK+DLSRFDLKGNTR QWRVG+LGSVKYWCHLHCELKFHPSNGT+LS+PCSS+AK
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

A0A6J1JD00 uncharacterized protein LOC1114833752.2e-8781Show/hide
Query:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF
        MPPKLIS AK RTHPLVW AA+LCT+VSIAVIIGG+V+FIGYLVIHPR+P I +I AHLDNF+NDIAGRLEVQLT+VV+AENDNAKAHASFSDTSFFL F
Subjt:  MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSF

Query:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
        LGI IAQLVA  FDVRKN S   HY V S+SIPL PEQMEEVD+ALK+D+SRFDL G+TR QWRVGLLGSVKY CHLHC LKFHPSNGT+ S PCSSRAK
Subjt:  LGINIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13050.1 unknown protein1.7e-1527.66Show/hide
Query:  RTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAG
        RT P+   A + C ++ I +I+ G+++ + YL   PR P   +  A L+    D+   L   L +VV   N + K+   FS   F L F    IA     
Subjt:  RTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAG

Query:  PFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR
        PF V K  S    + + SS + +   Q +++   L T     +L+G   A+  +G L    YW H  C +  + P  GT  +  C+++
Subjt:  PFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR

AT1G13050.2 unknown protein4.2e-1427.22Show/hide
Query:  AAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAGPFDVRKNS
        A + C ++ I +I+ G+++ + YL   PR P   +  A L+    D+   L   L +VV   N + K+   FS   F L F    IA     PF V K  
Subjt:  AAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAGPFDVRKNS

Query:  SREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR
        S    + + SS + +   Q +++   L T     +L+G   A+  +G L    YW H  C +  + P  GT  +  C+++
Subjt:  SREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR

AT3G26350.1 LOCATED IN: chloroplast6.9e-1728.34Show/hide
Query:  THPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAGP
        T+ + W AA  C +  + +I+GG+++ I YLV  PR P + +  A+L+    D+   L   LTI+    N + K+   FS  +F L +    IA     P
Subjt:  THPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVAGP

Query:  FDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR
        F V K +S   +  + SS + L   Q  E+   ++T     +L+G   A+  +G L    Y  H HC +  + P  G   +  C+++
Subjt:  FDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFH-PSNGTFLSMPCSSR

AT4G26490.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.3e-1125.4Show/hide
Query:  SRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVA
        SRT   +W  A  C + S+ +I   +   I +L I PRIP   +  A+L     D        L+++V   N N K    F      L F    IA  V 
Subjt:  SRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVA

Query:  GPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKF-HPSNGTFLSMPCSSR
         PF  +K+ +R     + SS + L      E+   L+ +   ++++G  + +   G++    Y  H  C+L+   P  G  +S  C+++
Subjt:  GPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKF-HPSNGTFLSMPCSSR

AT5G45320.1 FUNCTIONS IN: molecular_function unknown2.9e-5553.03Show/hide
Query:  PKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLG
        P+L S  +  T P +W AA++C ++SI VI+GG++VF+GYLVIHPR+P ISV  AHLD  K DI G L+ QLTIV+  ENDNAKAHA F +T F LS+ G
Subjt:  PKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLG

Query:  INIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK
          IA L A  F+V K  S    Y V S  IPLNP  M+ VD+A+K D+  F+LKG +R +WRVG LGSVK+ C+L C+L+F PS+ +++  PC+S  K
Subjt:  INIAQLVAGPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCGAAGTTGATCTCCCACGCAAAGAGCCGCACGCACCCGCTGGTCTGGTTCGCCGCCGTCCTCTGCACTCTCGTATCCATCGCCGTCATAATCGGAGGCGTCGT
CGTCTTCATCGGCTACTTAGTGATCCACCCGAGGATTCCGACGATCAGCGTCATCGGCGCGCATCTCGACAACTTCAAGAACGACATCGCCGGCCGCCTCGAAGTCCAGT
TGACGATCGTCGTCGAGGCGGAGAATGACAACGCCAAAGCGCACGCGAGCTTCTCCGATACGAGCTTCTTCCTCAGCTTCCTAGGAATCAACATCGCGCAGTTAGTGGCG
GGTCCGTTCGATGTGAGGAAAAACAGCTCTCGCGAGTTCCATTACGCCGTCGATTCGTCGTCGATTCCGCTGAATCCGGAGCAGATGGAGGAAGTTGATTTCGCGTTGAA
GACGGACCTCAGTCGGTTCGATTTGAAGGGTAATACGAGGGCTCAATGGCGAGTGGGACTCCTTGGCTCGGTGAAGTACTGGTGTCACCTCCATTGCGAGCTCAAATTCC
ATCCCTCCAATGGAACATTCTTGAGTATGCCCTGTAGCTCCAGGGCCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCGAAGTTGATCTCCCACGCAAAGAGCCGCACGCACCCGCTGGTCTGGTTCGCCGCCGTCCTCTGCACTCTCGTATCCATCGCCGTCATAATCGGAGGCGTCGT
CGTCTTCATCGGCTACTTAGTGATCCACCCGAGGATTCCGACGATCAGCGTCATCGGCGCGCATCTCGACAACTTCAAGAACGACATCGCCGGCCGCCTCGAAGTCCAGT
TGACGATCGTCGTCGAGGCGGAGAATGACAACGCCAAAGCGCACGCGAGCTTCTCCGATACGAGCTTCTTCCTCAGCTTCCTAGGAATCAACATCGCGCAGTTAGTGGCG
GGTCCGTTCGATGTGAGGAAAAACAGCTCTCGCGAGTTCCATTACGCCGTCGATTCGTCGTCGATTCCGCTGAATCCGGAGCAGATGGAGGAAGTTGATTTCGCGTTGAA
GACGGACCTCAGTCGGTTCGATTTGAAGGGTAATACGAGGGCTCAATGGCGAGTGGGACTCCTTGGCTCGGTGAAGTACTGGTGTCACCTCCATTGCGAGCTCAAATTCC
ATCCCTCCAATGGAACATTCTTGAGTATGCCCTGTAGCTCCAGGGCCAAATGA
Protein sequenceShow/hide protein sequence
MPPKLISHAKSRTHPLVWFAAVLCTLVSIAVIIGGVVVFIGYLVIHPRIPTISVIGAHLDNFKNDIAGRLEVQLTIVVEAENDNAKAHASFSDTSFFLSFLGINIAQLVA
GPFDVRKNSSREFHYAVDSSSIPLNPEQMEEVDFALKTDLSRFDLKGNTRAQWRVGLLGSVKYWCHLHCELKFHPSNGTFLSMPCSSRAK