CuGenDBv2

Moc09g30880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g30880
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr9:23316287..23317402
RNA-Seq Expression: Moc09g30880
Synteny: Moc09g30880
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia] (e value: 7.5e-140; % identity: 70.8)
Query:  GQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQ--VPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPKERQGSGS
        GQWN+ GTVYE GVMG LNVDEGITYRDLV+AIFRMTRINPD+FNIVLQCIYKFE+Q  VPNFYI DDTSLGFYLMGPPHP QVPLYVSVVPKERQ SGS
Subjt:  GQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQ--VPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPKERQGSGS

Query:  NSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVG-----------------------
        NS D  LYPQTET ASFP QVEQNVPSP PLNH+HS I TV+PP SV+FMTPLTDN+VPCNLGDDE +HFGQWDDVG                       
Subjt:  NSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVG-----------------------

Query:  -------------------------VQSVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSV
                                 VQSVSRNAPCAT  G HASLEQMDTIG +D  V DIA+GS+FRSKDELRF LAVFAI KNFE+++KKST  LLSV
Subjt:  -------------------------VQSVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSV

Query:  ACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
        AC E GC+WALRAR+IKGSDTFLISTFSE+H  +R TL HDH+QAGSWVVGQLIK+N ED+SR
Subjt:  ACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia] (e value: 3.2e-50; % identity: 39.13)
Query:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPK
        M RLFVIY G+WN+ GTVYE G MG L+VDE ITY +LV+A+  +TRI+ D F++++QC+Y   +++                        PL  +V+  
Subjt:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPK

Query:  ERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVGVQSVSRNAPCATTNGQ
              ++  +G LY + E    F    E +          +   D V       +     D     ++ D+  ++ GQ       +VS NAP  T    
Subjt:  ERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVGVQSVSRNAPCATTNGQ

Query:  HASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHD
             Q    GN    + +IAV  +FRSK+ELRFKL+V A+  NF++K+KKST  L +V CTE GCKW LRA+ I+G D+F+IS F++ H C+R  L HD
Subjt:  HASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHD

Query:  HRQAGSWVVGQLIKSNFEDVSR
        HRQA SWVVGQL+KSN EDVSR
Subjt:  HRQAGSWVVGQLIKSNFEDVSR

XP_022156834.1 uncharacterized protein LOC111023667 [Momordica charantia] (e value: 8.7e-64; % identity: 40.42)
Query:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFE--YQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVV
        M RLFVIY G+WN+ GT+YE GVMG L+VDE ITY +LV+A+  +TRI+PD F++++QC+Y+F+  Y+VPN+ I DD+SL FYL GPP P QVPLYV+V+
Subjt:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFE--YQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVV

Query:  PKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQ-----------------WD
        PK   GSG  SR G  + +T+T +SFP    QN P       + SP+D V  P  +  +TPL DN++PCNL DDE  ++GQ                 +D
Subjt:  PKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQ-----------------WD

Query:  DV----GVQ------------------------------------SVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAI
        +     GV+                                    ++S NAP  T     +   Q    GN    + +IAV  +F SK ELRFKL     
Subjt:  DV----GVQ------------------------------------SVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAI

Query:  TKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
                                    LRA+ I+G D+F+IS F++ H C+R  L HDHRQA SWVVGQL+KSN EDVSR
Subjt:  TKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

XP_022157237.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia] (e value: 4.3e-55; % identity: 93.97)
Query:  MDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGS
        MDTIGNEDG VHDIAVGSVFRSKDELRFKLAVFAITKNFEYK+KKSTTKLLSVACTENGCKWALR RRIKGS+TFLISTFSENHSC+R TLAHDHRQAGS
Subjt:  MDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGS

Query:  WVVGQLIKSNFEDVSR
        WVVGQLIKSNFE+VSR
Subjt:  WVVGQLIKSNFEDVSR

XP_022158743.1 PKS-NRPS hybrid synthetase CHGG_01239-like [Momordica charantia] (e value: 5.0e-43; % identity: 40.07)
Query:  VVPKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWD---------------
        ++PK+R G GS+S++  + P  +   SFP Q+ Q+VP PIP+   HS +  +   CSV  +TPLTDN+V  NLGDDE  +  QWD               
Subjt:  VVPKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWD---------------

Query:  -------------------DVGVQS----------------------VSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFA
                           DVG+ +                      VS NAPCA TN    S +   T+     +   I V S F+S  EL+F  +VFA
Subjt:  -------------------DVGVQS----------------------VSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFA

Query:  ITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
        +  NFEY++KKST  LL+V C  +GCKW + ARRI+GSDTFLIS F   H+C+   + HDHRQA S +VGQ+IK+NFED SR
Subjt:  ITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DLB0 uncharacterized protein LOC111021969 (e value: 3.6e-140; % identity: 70.8)
Query:  GQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQ--VPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPKERQGSGS
        GQWN+ GTVYE GVMG LNVDEGITYRDLV+AIFRMTRINPD+FNIVLQCIYKFE+Q  VPNFYI DDTSLGFYLMGPPHP QVPLYVSVVPKERQ SGS
Subjt:  GQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQ--VPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPKERQGSGS

Query:  NSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVG-----------------------
        NS D  LYPQTET ASFP QVEQNVPSP PLNH+HS I TV+PP SV+FMTPLTDN+VPCNLGDDE +HFGQWDDVG                       
Subjt:  NSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVG-----------------------

Query:  -------------------------VQSVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSV
                                 VQSVSRNAPCAT  G HASLEQMDTIG +D  V DIA+GS+FRSKDELRF LAVFAI KNFE+++KKST  LLSV
Subjt:  -------------------------VQSVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSV

Query:  ACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
        AC E GC+WALRAR+IKGSDTFLISTFSE+H  +R TL HDH+QAGSWVVGQLIK+N ED+SR
Subjt:  ACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

A0A6J1DSY0 uncharacterized protein LOC111023635 (e value: 1.6e-50; % identity: 39.13)
Query:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPK
        M RLFVIY G+WN+ GTVYE G MG L+VDE ITY +LV+A+  +TRI+ D F++++QC+Y   +++                        PL  +V+  
Subjt:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPK

Query:  ERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVGVQSVSRNAPCATTNGQ
              ++  +G LY + E    F    E +          +   D V       +     D     ++ D+  ++ GQ       +VS NAP  T    
Subjt:  ERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVGVQSVSRNAPCATTNGQ

Query:  HASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHD
             Q    GN    + +IAV  +FRSK+ELRFKL+V A+  NF++K+KKST  L +V CTE GCKW LRA+ I+G D+F+IS F++ H C+R  L HD
Subjt:  HASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHD

Query:  HRQAGSWVVGQLIKSNFEDVSR
        HRQA SWVVGQL+KSN EDVSR
Subjt:  HRQAGSWVVGQLIKSNFEDVSR

A0A6J1DU12 protein FAR-RED ELONGATED HYPOCOTYL 3-like (e value: 2.1e-55; % identity: 93.97)
Query:  MDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGS
        MDTIGNEDG VHDIAVGSVFRSKDELRFKLAVFAITKNFEYK+KKSTTKLLSVACTENGCKWALR RRIKGS+TFLISTFSENHSC+R TLAHDHRQAGS
Subjt:  MDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGS

Query:  WVVGQLIKSNFEDVSR
        WVVGQLIKSNFE+VSR
Subjt:  WVVGQLIKSNFEDVSR

A0A6J1DUS4 uncharacterized protein LOC111023667 (e value: 4.2e-64; % identity: 40.42)
Query:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFE--YQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVV
        M RLFVIY G+WN+ GT+YE GVMG L+VDE ITY +LV+A+  +TRI+PD F++++QC+Y+F+  Y+VPN+ I DD+SL FYL GPP P QVPLYV+V+
Subjt:  MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFE--YQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVV

Query:  PKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQ-----------------WD
        PK   GSG  SR G  + +T+T +SFP    QN P       + SP+D V  P  +  +TPL DN++PCNL DDE  ++GQ                 +D
Subjt:  PKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQ-----------------WD

Query:  DV----GVQ------------------------------------SVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAI
        +     GV+                                    ++S NAP  T     +   Q    GN    + +IAV  +F SK ELRFKL     
Subjt:  DV----GVQ------------------------------------SVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFAI

Query:  TKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
                                    LRA+ I+G D+F+IS F++ H C+R  L HDHRQA SWVVGQL+KSN EDVSR
Subjt:  TKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

A0A6J1DWY9 PKS-NRPS hybrid synthetase CHGG_01239-like (e value: 2.4e-43; % identity: 40.07)
Query:  VVPKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWD---------------
        ++PK+R G GS+S++  + P  +   SFP Q+ Q+VP PIP+   HS +  +   CSV  +TPLTDN+V  NLGDDE  +  QWD               
Subjt:  VVPKERQGSGSNSRDGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWD---------------

Query:  -------------------DVGVQS----------------------VSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFA
                           DVG+ +                      VS NAPCA TN    S +   T+     +   I V S F+S  EL+F  +VFA
Subjt:  -------------------DVGVQS----------------------VSRNAPCATTNGQHASLEQMDTIGNEDGSVHDIAVGSVFRSKDELRFKLAVFA

Query:  ITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
        +  NFEY++KKST  LL+V C  +GCKW + ARRI+GSDTFLIS F   H+C+   + HDHRQA S +VGQ+IK+NFED SR
Subjt:  ITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCGCCTTTTTGTGATATATAGGGGTCAGTGGAATCAGACAGGCACTGTATATGAATGTGGAGTTATGGGTAGGTTAAATGTTGATGAAGGAATAACATATAGAGA
CCTTGTAAATGCGATCTTTAGGATGACTAGAATAAATCCTGATCTTTTCAATATTGTTCTACAATGTATCTACAAATTTGAGTACCAAGTGCCAAACTTTTATATATCCG
ATGATACTAGCCTTGGTTTTTACCTTATGGGCCCTCCACATCCCTTCCAAGTGCCTTTGTATGTATCTGTTGTACCAAAGGAAAGACAAGGTAGTGGTAGTAACTCACGT
GATGGTACATTATATCCCCAAACCGAAACATTGGCTTCATTTCCATGCCAAGTAGAACAAAACGTTCCTTCTCCTATCCCACTAAATCACATGCACTCACCTATAGATAC
TGTCAACCCGCCATGTTCTGTACAATTTATGACTCCATTGACGGACAATATTGTCCCTTGCAATTTAGGTGACGATGAACAAAAACACTTCGGGCAGTGGGATGATGTTG
GAGTACAGTCTGTAAGTAGGAATGCCCCTTGTGCAACTACTAATGGTCAACATGCGTCCTTAGAACAGATGGATACAATTGGTAATGAGGATGGTTCTGTTCACGACATT
GCAGTTGGCAGTGTTTTTCGGTCTAAGGATGAATTACGATTTAAACTGGCTGTTTTTGCTATCACCAAGAATTTTGAGTACAAGATTAAGAAATCTACCACGAAGTTATT
ATCTGTTGCATGCACGGAAAATGGGTGCAAATGGGCCCTACGTGCAAGAAGAATCAAAGGTTCGGATACCTTTCTCATCTCAACGTTTTCTGAGAATCACAGTTGCCAAC
GACCGACACTGGCACATGATCATCGACAAGCTGGTAGTTGGGTCGTGGGTCAGTTGATAAAGTCAAACTTTGAGGACGTCAGCCGTTGA
mRNA sequence
ATGGTTCGCCTTTTTGTGATATATAGGGGTCAGTGGAATCAGACAGGCACTGTATATGAATGTGGAGTTATGGGTAGGTTAAATGTTGATGAAGGAATAACATATAGAGA
CCTTGTAAATGCGATCTTTAGGATGACTAGAATAAATCCTGATCTTTTCAATATTGTTCTACAATGTATCTACAAATTTGAGTACCAAGTGCCAAACTTTTATATATCCG
ATGATACTAGCCTTGGTTTTTACCTTATGGGCCCTCCACATCCCTTCCAAGTGCCTTTGTATGTATCTGTTGTACCAAAGGAAAGACAAGGTAGTGGTAGTAACTCACGT
GATGGTACATTATATCCCCAAACCGAAACATTGGCTTCATTTCCATGCCAAGTAGAACAAAACGTTCCTTCTCCTATCCCACTAAATCACATGCACTCACCTATAGATAC
TGTCAACCCGCCATGTTCTGTACAATTTATGACTCCATTGACGGACAATATTGTCCCTTGCAATTTAGGTGACGATGAACAAAAACACTTCGGGCAGTGGGATGATGTTG
GAGTACAGTCTGTAAGTAGGAATGCCCCTTGTGCAACTACTAATGGTCAACATGCGTCCTTAGAACAGATGGATACAATTGGTAATGAGGATGGTTCTGTTCACGACATT
GCAGTTGGCAGTGTTTTTCGGTCTAAGGATGAATTACGATTTAAACTGGCTGTTTTTGCTATCACCAAGAATTTTGAGTACAAGATTAAGAAATCTACCACGAAGTTATT
ATCTGTTGCATGCACGGAAAATGGGTGCAAATGGGCCCTACGTGCAAGAAGAATCAAAGGTTCGGATACCTTTCTCATCTCAACGTTTTCTGAGAATCACAGTTGCCAAC
GACCGACACTGGCACATGATCATCGACAAGCTGGTAGTTGGGTCGTGGGTCAGTTGATAAAGTCAAACTTTGAGGACGTCAGCCGTTGA
Protein sequence
MVRLFVIYRGQWNQTGTVYECGVMGRLNVDEGITYRDLVNAIFRMTRINPDLFNIVLQCIYKFEYQVPNFYISDDTSLGFYLMGPPHPFQVPLYVSVVPKERQGSGSNSR
DGTLYPQTETLASFPCQVEQNVPSPIPLNHMHSPIDTVNPPCSVQFMTPLTDNIVPCNLGDDEQKHFGQWDDVGVQSVSRNAPCATTNGQHASLEQMDTIGNEDGSVHDI
AVGSVFRSKDELRFKLAVFAITKNFEYKIKKSTTKLLSVACTENGCKWALRARRIKGSDTFLISTFSENHSCQRPTLAHDHRQAGSWVVGQLIKSNFEDVSR
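
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The names CODON_TABLE and translate are hypothetical helpers introduced for illustration, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the record
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon ends the protein
            break
        protein.append(amino)
    return "".join(protein)


# First 42 nt and last 15 nt of the CDS listed in this record.
print(translate("ATGGTTCGCCTTTTTGTGATATATAGGGGTCAGTGGAATCAG"))  # MVRLFVIYRGQWNQ
print(translate("GACGTCAGCCGTTGA"))  # DVSR
```

Passing the full CDS (with its line breaks) to translate should reproduce the protein sequence listed above if the assumed table is the right one.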