CuGenDBv2

MC03g1162 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1162
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: MC03:17782762..17783881
RNA-Seq Expression: MC03g1162
Synteny: MC03g1162
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR040344 - Uncharacterized protein At3g17950-like
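
The genome location field above follows a chromosome:start..end pattern. A minimal Python sketch for splitting it into its parts is given here; the names Location and parse_location are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'MC03:17782762..17783881'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("MC03:17782762..17783881")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out as 1120 bases.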


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584313.1 hypothetical protein SDJN03_20245, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.64e-111; % identity: 92.59
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQHA A AAA G SR+SKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV  DQQRNGR LFADGRVLPP  QT+ED SAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

XP_022137456.1 uncharacterized protein At3g17950 [Momordica charantia]; e value: 9.00e-125; % identity: 100
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

XP_022924210.1 uncharacterized protein At3g17950 [Cucurbita moschata]; e value: 5.75e-112; % identity: 93.12
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQHA A AAA G SR+SKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

XP_023520118.1 uncharacterized protein At3g17950-like [Cucurbita pepo subsp. pepo]; e value: 1.64e-111; % identity: 92.59
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQH+ A AAA G SR+SKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

XP_031739825.1 uncharacterized protein At3g17950 [Cucumis sativus]; e value: 6.53e-116; % identity: 73.5
Query:  RTKLRINKREEERKQTQKGCTNNHF--------------NPFSLSLSLSNSKSKFLILSLSLL---SNFNLSFSESLEREIERERERERESRGEMLNPAN
        RTKLRI KR+ ++K+ +KGCTNNHF              + FSLS S S+S S     S S+    SNF LSFS      I+RER+       EMLNPAN
Subjt:  RTKLRINKREEERKQTQKGCTNNHF--------------NPFSLSLSLSNSKSKFLILSLSLL---SNFNLSFSESLEREIERERERERESRGEMLNPAN

Query:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD
        DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQH  A   A G SR+SKK KRKTT APALVADRKRRWWRLCRDD
Subjt:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD

Query:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED S A  LCRFSVSLTGICSGGAG
Subjt:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CPP8 uncharacterized protein At3g17950 isoform X1; e value: 2.32e-109; % identity: 90.1
Query:  RGEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKR
        R EMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQH  A   A G SR+SKK KRKTT APALVADRKR
Subjt:  RGEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKR

Query:  RWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SA GALCRFSVSLTGICSGGAG
Subjt:  RWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

A0A1S4DUB1 uncharacterized protein At3g17950 isoform X2; e value: 1.25e-108; % identity: 90.48
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQH  A   A G SR+SKK KRKTT APALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SA GALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

A0A6J1C6N6 uncharacterized protein At3g17950; e value: 4.36e-125; % identity: 100
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

A0A6J1E8Y2 uncharacterized protein At3g17950; e value: 2.78e-112; % identity: 93.12
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQHA A AAA G SR+SKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

A0A6J1KNQ0 uncharacterized protein At3g17950-like; e value: 1.08e-109; % identity: 91.53
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFR PSQNRDQH  A  AA G SR+SKKPKRKTTAAPALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+ NAVDLEGVV  DQQRNGR LFADGRVLPP  QTEED SAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV--DQQRNGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG

SwissProt top hits (e value, % identity, alignment)
Q6DR24 Uncharacterized protein At3g17950; e value: 1.1e-35; % identity: 51.22
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FRA S+         + A++  +RR+ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V     DQQ     R LFADGRVLPP      T E    A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

Arabidopsis top hits (e value, % identity, alignment)
AT3G17950.1 unknown protein; e value: 7.9e-37; % identity: 51.22
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FRA S+         + A++  +RR+ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V     DQQ     R LFADGRVLPP      T E    A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

AT3G17950.2 unknown protein; e value: 2.1e-21; % identity: 44.19
Query:  MGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG
        MG SF A   + FRA S+         + A++  +RR+ + KR  + +      R+R+WWR CRDD       G+       K +SLGE+LEVERRFGD 
Subjt:  MGVSFPA---ITFRAPSQNR-DQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG

Query:  AFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGICSGGAG
        A Y +A  +LE  V     DQQ     R LFADGRVLPP      T E    A +LCRF VSLTGICSGG G
Subjt:  AFYGNA-VDLEGVV-----DQQ--RNGRMLFADGRVLPPPPQ---TEEDASAAGALCRFSVSLTGICSGGAG

AT5G02440.1 unknown protein; e value: 2.1e-05; % identity: 50.91
Query:  SPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAA
        SP+ SS SSSDLD++S GSFF DRS +LG L+G+S      R  ++ R+   GAA
Subjt:  SPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAA
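
Each alignment above has a middle line that repeats the residue where the two sequences agree, shows "+" for a similar residue, and is blank otherwise. The following Python sketch counts the identical columns in one alignment block. It is an illustration only: the function name percent_identity is hypothetical, the denominator is assumed to be the number of aligned columns (gaps included), and the database may compute its reported % identity differently.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns where the midline repeats the query residue.

    Assumes `query` and `midline` are the aligned strings with the
    'Query:' label and leading padding already removed.
    """
    if not query:
        raise ValueError("empty alignment")
    # Trailing blanks are often stripped from the midline; pad it back.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Toy example: two identical columns (A and E) out of four.
print(percent_identity("ACDE", "A+ E"))
```

For an alignment printed in several blocks, the query and midline segments would need to be concatenated before calling the function.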


Sequences
CDS sequence
AGGACAAAATTGAGAATAAATAAAAGAGAAGAAGAAAGGAAACAGACACAAAAAGGCTGCACTAACAATCATTTTAACCCATTCTCTCTCTCTCTCTCTCTCTCCAATTC
CAAATCCAAATTCTTAATCCTTTCTCTCTCTCTTCTTTCGAATTTCAATCTCAGTTTCTCTGAATCTCTAGAGAGAGAGATAGAGAGAGAGAGAGAGAGAGAGAGAGAGA
GCAGAGGAGAGATGTTGAATCCGGCGAACGATCTGTTACCGCCGCCGTCTTCTCCAACCAATTCATCCATCTCCTCCTCCGATCTCGACACTGAGTCTACAGGTTCGTTT
TTCCATGATAGGAGCACGAGCTTGGGGACTCTAATGGGCGTCAGTTTTCCGGCGATAACCTTCCGAGCGCCGTCACAGAACCGAGACCAACACGCCGGCGCAGCCGCCGC
CGCCGCCGGCCCTTCCCGCAGGAGCAAGAAGCCGAAGAGGAAAACCACGGCGGCGCCGGCACTCGTCGCGGATCGGAAGCGGAGGTGGTGGCGGCTGTGCCGGGACGACG
GCGTTAAGCCGGCATCTCTCGGCGAGTTTCTCGAGGTCGAACGGAGATTTGGGGACGGTGCTTTCTACGGTAACGCGGTGGATCTGGAAGGCGTGGTAGATCAACAGAGG
AATGGGCGAATGTTGTTCGCCGACGGGAGAGTGCTTCCGCCGCCGCCGCAAACGGAGGAAGACGCGTCGGCGGCCGGCGCTCTCTGCCGGTTTTCCGTATCGCTCACCGG
GATTTGCAGCGGCGGTGCCGGCTAA
mRNA sequence
AGGACAAAATTGAGAATAAATAAAAGAGAAGAAGAAAGGAAACAGACACAAAAAGGCTGCACTAACAATCATTTTAACCCATTCTCTCTCTCTCTCTCTCTCTCCAATTC
CAAATCCAAATTCTTAATCCTTTCTCTCTCTCTTCTTTCGAATTTCAATCTCAGTTTCTCTGAATCTCTAGAGAGAGAGATAGAGAGAGAGAGAGAGAGAGAGAGAGAGA
GCAGAGGAGAGATGTTGAATCCGGCGAACGATCTGTTACCGCCGCCGTCTTCTCCAACCAATTCATCCATCTCCTCCTCCGATCTCGACACTGAGTCTACAGGTTCGTTT
TTCCATGATAGGAGCACGAGCTTGGGGACTCTAATGGGCGTCAGTTTTCCGGCGATAACCTTCCGAGCGCCGTCACAGAACCGAGACCAACACGCCGGCGCAGCCGCCGC
CGCCGCCGGCCCTTCCCGCAGGAGCAAGAAGCCGAAGAGGAAAACCACGGCGGCGCCGGCACTCGTCGCGGATCGGAAGCGGAGGTGGTGGCGGCTGTGCCGGGACGACG
GCGTTAAGCCGGCATCTCTCGGCGAGTTTCTCGAGGTCGAACGGAGATTTGGGGACGGTGCTTTCTACGGTAACGCGGTGGATCTGGAAGGCGTGGTAGATCAACAGAGG
AATGGGCGAATGTTGTTCGCCGACGGGAGAGTGCTTCCGCCGCCGCCGCAAACGGAGGAAGACGCGTCGGCGGCCGGCGCTCTCTGCCGGTTTTCCGTATCGCTCACCGG
GATTTGCAGCGGCGGTGCCGGCTAAAATGTGGGATTTCTGCAGTTTATTTATCATATTATATTTATTATATATTAATTAATCAAATGCTTTAATTAACGTGGAAGAAAGA
AAGGGAAAAAAAAAACGTCACCTTTATCTTGTAGGTGACCCTCTTTTATTTGATTTCCTTTAATCATCTCACACATTACTCTTCTCTTTTTTCTTTAAAAAAAAAGTAAC
ACTCAGATCTTCATCTTTGAAGCTTCCGGAGATTTTTC
Protein sequence
RTKLRINKREEERKQTQKGCTNNHFNPFSLSLSLSNSKSKFLILSLSLLSNFNLSFSESLEREIERERERERESRGEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSF
FHDRSTSLGTLMGVSFPAITFRAPSQNRDQHAGAAAAAAGPSRRSKKPKRKTTAAPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVDQQR
NGRMLFADGRVLPPPPQTEEDASAAGALCRFSVSLTGICSGGAG
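
The protein sequence above appears to be a direct translation of the listed CDS starting at its first base: the first five codons (AGG ACA AAA TTG AGA) give RTKLR. A minimal Python sketch of that translation is shown here, assuming the standard genetic code and reading frame 1; the names CODON_TABLE and translate are hypothetical and the page does not say which tool produced the protein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored, codons containing unexpected
    characters become 'X', and translation ends at the first stop codon
    when `to_stop` is true.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 15 bases of the CDS listed above.
print(translate("AGGACAAAATTGAGA"))
```

Pasting the full CDS, line breaks included, into translate should allow a comparison against the protein sequence listed on this page.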