; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038687 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038687
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:6613755..6622345
RNA-Seq ExpressionSpg038687
SyntenySpg038687
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025322 - Protein of unknown function DUF4228, plant
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022948492.1 uncharacterized protein LOC111452155 isoform X1 [Cucurbita moschata]1.8e-5266.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

XP_022948495.1 uncharacterized protein LOC111452155 isoform X2 [Cucurbita moschata]1.8e-5266.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

XP_022998311.1 uncharacterized protein LOC111492985 isoform X1 [Cucurbita maxima]1.0e-5065.14Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        ++A SSWLCS GAKSKLVRIVHP GHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLML  KY LLPEN+VGN QK     ENE LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         + G CMM     EKGER VR           ESG+DL+GN   +RG+F +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

XP_023523993.1 uncharacterized protein LOC111788070 isoform X1 [Cucurbita pepo subsp. pepo]2.4e-5266.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

XP_023523994.1 uncharacterized protein LOC111788070 isoform X2 [Cucurbita pepo subsp. pepo]2.4e-5266.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

TrEMBL top hitse value%identityAlignment
A0A2N9H1N4 RNase H domain-containing protein7.8e-4935.45Show/hide
Query:  IQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAI-DELHEASSSADSNSCS-WWKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCG
        I  IP++  +  D   W   K G YSV+SGY L + D   E S  +D +  S  WK++W LN+P K + F WRA  N LPTR NL +R +  DPRC  C 
Subjt:  IQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAI-DELHEASSSADSNSCS-WWKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCG

Query:  KANETTLHALWDCKKVKKHWALWSPVQSFLNTHCDDTLDLFCRFQEKLRSDDLAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQ
        K  E+T+HALW CKKV+  W   S  Q    +   D +DL  +    L + +L +F  I WS+W  RN +  Q +      +   ++RA   ++EF EAQ
Subjt:  KANETTLHALWDCKKVKKHWALWSPVQSFLNTHCDDTLDLFCRFQEKLRSDDLAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQ

Query:  AKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDASL--ANGEMGTCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDS
         + Q      +   +S   +W PP    +K+N D ++   +   G  +IIRN  G+ M +         SVE  EA A    +Q A D G + +  E DS
Subjt:  AKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDASL--ANGEMGTCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDS

Query:  KIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEVWLEECPLWAQSSLAHD
        K V   L  RE        +I     +A       FL T REGN +AH LA  AR  K  E WLE  P    S+L +D
Subjt:  KIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEVWLEECPLWAQSSLAHD

A0A6J1G9G0 uncharacterized protein LOC111452155 isoform X18.9e-5366.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

A0A6J1GA04 uncharacterized protein LOC111452155 isoform X28.9e-5366.86Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        A+A SSWLCS GAKSKLVRIVHPGGHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLMLG KYYLLPEN+VGN QK     EN  LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         +GG CMM     EKGER VR           ESG+DL+GN    + VF +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

A0A6J1K7L4 uncharacterized protein LOC111492985 isoform X24.9e-5165.14Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        ++A SSWLCS GAKSKLVRIVHP GHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLML  KY LLPEN+VGN QK     ENE LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         + G CMM     EKGER VR           ESG+DL+GN   +RG+F +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

A0A6J1K9X0 uncharacterized protein LOC111492985 isoform X14.9e-5165.14Show/hide
Query:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC
        ++A SSWLCS GAKSKLVRIVHP GHIELHD P+ AAEIMLRNP+F LTHSQS + PWAIVSPD+TLML  KY LLPEN+VGN QK     ENE LES+C
Subjt:  AAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNC

Query:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS
         + G CMM     EKGER VR           ESG+DL+GN   +RG+F +NGN RGSPKRPFGSSDNW+P+ +S
Subjt:  FSGGNCMMFLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein1.9e-2325.95Show/hide
Query:  KYSVRSGYKLAIDE--LHEASSSADSNSCSWWKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHWA--
        K  +RSGY +A  E  L E +      S    + +W L++  K+K F WR     L T   L +R +  DP C +C    ET  H +++C   +  W   
Subjt:  KYSVRSGYKLAIDE--LHEASSSADSNSCSWWKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHWA--

Query:  ------LWSPVQSFLNTHCDDTLDLFCRFQEKLRSDDLAIFVA--ILWSLWNTRNEIVFQGNRIYHQQEYDP---IERASTYITEFVEAQAKHQGDCDRS
               W P  SF     +D L+   +  +   ++ L  F+   I+W LW +RN  +FQ  +     +Y+    I+ A+ ++      +  +       
Subjt:  ------LWSPVQSFLNTHCDDTLDLFCRFQEKLRSDDLAIFVA--ILWSLWNTRNEIVFQGNRIYHQQEYDP---IERASTYITEFVEAQAKHQGDCDRS

Query:  RDTRTSRPTEWTPPPPNHFKLNVDASLANGEMGT--CMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDR
          T     ++W PPP    K N D+    G   T     IR  NG  +    +  ++S     AEA   +  +Q+    GL  VW E DSK +  L+++ 
Subjt:  RDTRTSRPTEWTPPPPNHFKLNVDASLANGEMGT--CMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDR

Query:  EQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATH
        E H + +  LI+ +     K          RE N  A  LA+H
Subjt:  EQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATH

AT3G09510.1 Ribonuclease H-like superfamily protein3.6e-3026.68Show/hide
Query:  KGNRFTWQNNQPGTAFIRKRLDRCDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAIDELHEASSSADS-----NSCSWWKTLWLLNIPNKLK
        KG+ + W +++     I + +D+ D    I  I +A    PD   W+Y+ TG+Y+VRSGY L     H+ S++  +      S      +W L I  KLK
Subjt:  KGNRFTWQNNQPGTAFIRKRLDRCDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAIDELHEASSSADS-----NSCSWWKTLWLLNIPNKLK

Query:  IFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHWALWSPV---QSFLNTHCDDTLDLFCRFQEKLRSDDL--AIFVAILWSL
         F WRA    L T   L  R ++ DP CP+C + NE+  HAL+ C      W L          ++   ++ +     F +     D    + V ++W +
Subjt:  IFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHWALWSPV---QSFLNTHCDDTLDLFCRFQEKLRSDDL--AIFVAILWSL

Query:  WNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQAKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDA--SLANGEMGTCMIIRNSNGQTMA-AAE
        W  RN +VF     + +     +  A     +++ A   H+     +R    ++  EW  PP  + K N DA   +   E     IIRN  G  ++  + 
Subjt:  WNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQAKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDA--SLANGEMGTCMIIRNSNGQTMA-AAE

Query:  SFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEV
            TS  +E AE KAL+  +Q     G + V+ E D + + NL++    H + +A  +  +   A K     F    R+GNK+AH LA +  T      
Subjt:  SFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEV

Query:  WLEECPLWAQSSLAHD
             P+W      +D
Subjt:  WLEECPLWAQSSLAHD

AT3G25270.1 Ribonuclease H-like superfamily protein9.8e-2026.43Show/hide
Query:  LWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHW-ALWSPVQSFLNT--HCDDTLDLF---CRFQEKLRSD
        +W L    K+K F W+     L T  NL  R ++  P+C +C + +ET+ H  +DC   ++ W A   P Q    T    +  ++L    C    + +  
Subjt:  LWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHW-ALWSPVQSFLNT--HCDDTLDLF---CRFQEKLRSD

Query:  DLAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQAKHQGDCDRSRDTRTSRP----TEWTPPPPNHFKLNVDASL----ANGEMG
        +LAI+  ILW LW +RN++VFQ   I  Q   + ++RA   + E+ +     Q    +   +R  +P    T+W  PP    K N D +      N + G
Subjt:  DLAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQAKHQGDCDRSRDTRTSRP----TEWTPPPPNHFKLNVDASL----ANGEMG

Query:  TCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNK
           ++R+ NG  M + ++   T+     +E +AL+  +Q A   G   V  E DSK V  L+++ + +       I +      +   + F   PR  N+
Subjt:  TCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNK

Query:  VAHQLATHARTTKHSEVWLEECPLWAQSSLAHD
         A  LA H      S  +    P +  S+L +D
Subjt:  VAHQLATHARTTKHSEVWLEECPLWAQSSLAHD

AT3G31430.1 unknown protein6.0e-0932.08Show/hide
Query:  PEEMIFEEAQFWVQFHNIPLGLRNEMVAQTLGSRLGIVQRVETNDDDECWGRFLRVRIKMKINEPLCRGLTLNGGPKGKILVAIKYERLPDFCYYCGMID
        P+  +F    FWVQ   IP    N  V + +G  LG V   + N +      F RV +   I  PL              L+  +YERL  FC  CGM+ 
Subjt:  PEEMIFEEAQFWVQFHNIPLGLRNEMVAQTLGSRLGIVQRVETNDDDECWGRFLRVRIKMKINEPLCRGLTLNGGPKGKILVAIKYERLPDFCYYCGMID

Query:  HNDGSC
        H+ G+C
Subjt:  HNDGSC

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-2325.26Show/hide
Query:  IQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAIDELHEASSSADSNSCSW---WKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKC
        I ++   G  + D++ W Y  +G Y+V+SGY +    +++ SS  + +  S    ++ +W      K++ F W+   N LP    L  R +  +  C +C
Subjt:  IQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAIDELHEASSSADSNSCSW---WKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKC

Query:  GKANETTLHALWDCKKVKKHWALWSPVQSFLNTHCDDTLDLFCRFQEKLRSDD------LAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYI
            ET  H L+ C   +  WA+ S +   L     D++ +   +   L + +        +   +LW LW  RNE+VF+G R ++ QE   + RA   +
Subjt:  GKANETTLHALWDCKKVKKHWALWSPVQSFLNTHCDDTLDLFCRFQEKLRSDD------LAIFVAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYI

Query:  TEF-VEAQAKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDA--SLANGEMGTCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLS
         E+ +  +A+    C        S    W PPP    K N DA  +  N   G   ++RN  G+             SV  AE +A+   V        +
Subjt:  TEF-VEAQAKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDA--SLANGEMGTCMIIRNSNGQTMAAAESFRRTSCSVEWAEAKALVEGVQLALDSGLS

Query:  PVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEVWLEE-CPLWAQSSL
         V  E DS+++  +L++ ++    + P I  L  L ++     F+  PREGN +A ++A  + +  + +  L    P WA+SS+
Subjt:  PVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEVWLEE-CPLWAQSSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGCCGCCCTCTCAAGCTGGCTCTGCAGCAGTGGCGCCAAGTCCAAGCTCGTCCGAATCGTCCATCCCGGCGGCCACATCGAGCTCCACGACGTCCCCGTCCC
GGCGGCGGAGATCATGCTCCGAAACCCCCAATTCCGCCTCACCCACTCCCAATCTTATCATCAACCCTGGGCCATCGTCTCGCCTGACTCCACGCTCATGCTCGGCCGGA
AGTACTACCTCCTACCGGAAAACGCCGTCGGAAACTTCCAGAAGAGGCCGACGCCGCCGGAAAATGAAGATCTGGAATCGAATTGTTTCAGCGGCGGAAACTGTATGATG
TTTTTAGCTTCGAACGAGAAGGGCGAGAGAATTGTGAGGTCGGAAAACGGAAATAGTTGCAGAGAGAAGTCGGAATCCGGCGAGGATTTGGAGGGGAATAGGGTTAAAAG
GAGAGGGGTATTTTCGGAAAATGGGAATTCTAGAGGGTCTCCTAAGAGACCGTTTGGATCCTCTGATAATTGGCAACCAAGTGGGTCCTCAACGACTTTTGATTTGCCTG
AGGAAATGATTTTTGAAGAGGCTCAGTTCTGGGTTCAATTCCATAACATACCTCTTGGCCTTAGGAACGAAATGGTAGCACAAACTCTTGGAAGCCGTCTAGGGATAGTT
CAGAGAGTAGAAACTAACGATGATGATGAATGTTGGGGACGTTTTCTTCGAGTACGAATCAAGATGAAGATAAACGAACCATTGTGCCGAGGTCTAACTCTGAATGGCGG
ACCAAAGGGGAAAATTTTGGTAGCGATAAAGTATGAGAGATTACCGGATTTTTGTTACTATTGTGGAATGATTGACCACAATGATGGATCGTGCCCCCAACTTAAATTAC
AGACGAAACCAGAGAAGCAGTTTGGTTCATGGATGAGAGCTGCAACTCCTCCAAGACCGAACCGTTCCCACGCAAGCGGCAGAGGTTTCGATGGATTTAGAGGGGGAAGA
GGACGTTTTGGTAACAGACAAAGGAGCAGGCAACCATGGTGTTTCAATGAGGAGGAAGAAGAAGCCAACCATGGAGATTCAAGAGGTTCTCAAAGCTCCGACGGAAGTGG
TGATGAACATCAACCTGAATCCAATCGCCGGCAGCCTGAACTCGAGGACTCTCCACCGCAATCAATGCAATCAACGGAAATTACGGGTATTAAGTGTCCCCTTTCCTTTG
ATAACGTTACTAGTGTGGATTCTCTCCCTATTAATGCAGAAGACGCAAATCACGGCAGAGAGGAAGGAGAAATTAATGGCGGCAACGTTTTAGGGAGGTTTCAAAATCGT
GAGGCTACACAGGGTGAAGGGAAGAGACGAAACGGCAAGGAGATCTTAGGGCAAGGAGGGTTAGTGGAGAAAAATATGTTTGTTGGGCCCATGTCTAATAATATCACCCT
TTCAGCCCATTTACATGTTCCAACCGAAAATCCACCCAATAGCCACCAAACTAAAATTCCAAAATCCATAACCTCATGCAAAATGATCCAAACAGATATTTTTCCAACTA
AAGACCAGCCCATGATCCACCAAACCAAAATTCCAATTTTCACCCAGTTGGATTCTAGTCTGAATAGCCCATCAAGCAAAATCCAACACCACGTACAAACTAATCCTCGG
ACGTCAAGCACAGACCACTCTCATATAGAAGACATGATGATTGAAGACCTCCCCAAAATTGACCACATTCCGACAGAAATTAATAAGAATGTGGACTTTGCGGAATTTCA
GACTCAAATGGATTTTAATCCTGGGGGCTTGGCGGATGGAAAAGGGAAGAGTGTAGCCACGGAAAAAACAAGTGTTTGTGAGAAAAAGCAATTAAAAAGCTGGAAGCGAG
CCAATAGACAGAAGGATGAAATCAAGATGGATCACCCCTCCTTTCAACTACCAGAGCAGTTTGCTTACAAAAAGAGGGCGGCGGAAGATGATGATTTGGACAGAGATCTA
AATGCAATAACCCAAGTAAATGAAAAGGATGGAGGAGGAGAATTTGAGAGATTACATAGCCAACAATTTTTAAATGCCTTAAATGATTGCGCTCTTCGAGATCTTGGTTT
CAAGGGCAACCGATTCACATGGCAGAATAATCAACCTGGAACAGCCTTTATTCGAAAAAGGCTAGACCGGTGTGATGACGTGGAAAAAATTCAGGACATTCCAATTGCAG
GTGATAGCCTCCCGGACACTTTTTTTTGGCATTATGACAAAACAGGGAAGTATTCAGTCCGAAGCGGTTACAAACTTGCAATAGATGAACTTCATGAGGCGTCCTCTTCA
GCGGATTCCAATAGTTGTTCTTGGTGGAAAACACTATGGCTGCTAAACATCCCAAACAAACTCAAGATCTTCGCTTGGCGGGCAAGCCTGAATATACTCCCGACCAGAAT
GAATCTTTTCAACCGTAAAGTTCAATGCGACCCCCGTTGTCCAAAGTGTGGGAAAGCGAATGAAACAACTCTTCATGCTTTGTGGGATTGTAAGAAGGTAAAAAAGCACT
GGGCCCTCTGGAGCCCAGTGCAATCTTTTCTCAATACACATTGTGATGATACTCTTGATCTTTTTTGTAGGTTTCAGGAGAAATTAAGGTCCGATGACTTGGCAATCTTT
GTGGCCATTCTATGGTCGTTATGGAACACTAGAAATGAAATTGTTTTCCAGGGCAACCGAATCTATCACCAACAGGAATATGATCCTATAGAACGGGCATCAACTTATAT
AACAGAGTTCGTGGAGGCTCAAGCAAAGCACCAAGGAGACTGTGATCGATCAAGAGATACTAGAACCTCCCGCCCAACGGAGTGGACACCGCCGCCTCCCAACCATTTCA
AACTAAACGTGGATGCATCCTTAGCAAATGGTGAGATGGGAACATGCATGATTATCAGAAACTCAAATGGTCAAACGATGGCAGCTGCAGAAAGTTTCAGGAGAACGAGT
TGCTCTGTAGAGTGGGCAGAAGCAAAGGCACTGGTGGAAGGTGTGCAACTAGCTTTGGATTCGGGCTTATCACCAGTTTGGGCAGAGATAGATTCCAAAATTGTGTGGAA
TTTACTCCATGATCGCGAACAACATTTAAATGAGATTGCCCCCCTAATCCACCAACTCCATCTTTTGGCCACAAAGCACCTTATAAGTGGGTTTCTCCTAACTCCGAGGG
AAGGCAACAAAGTAGCACACCAGTTAGCTACTCATGCTCGCACAACAAAACATTCTGAAGTCTGGCTTGAAGAATGTCCTCTGTGGGCTCAAAGCAGTTTAGCTCATGAT
TTGCGTCATATGGACGCCCTAGCCATTTCTTTTGATTCGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGCCGCCCTCTCAAGCTGGCTCTGCAGCAGTGGCGCCAAGTCCAAGCTCGTCCGAATCGTCCATCCCGGCGGCCACATCGAGCTCCACGACGTCCCCGTCCC
GGCGGCGGAGATCATGCTCCGAAACCCCCAATTCCGCCTCACCCACTCCCAATCTTATCATCAACCCTGGGCCATCGTCTCGCCTGACTCCACGCTCATGCTCGGCCGGA
AGTACTACCTCCTACCGGAAAACGCCGTCGGAAACTTCCAGAAGAGGCCGACGCCGCCGGAAAATGAAGATCTGGAATCGAATTGTTTCAGCGGCGGAAACTGTATGATG
TTTTTAGCTTCGAACGAGAAGGGCGAGAGAATTGTGAGGTCGGAAAACGGAAATAGTTGCAGAGAGAAGTCGGAATCCGGCGAGGATTTGGAGGGGAATAGGGTTAAAAG
GAGAGGGGTATTTTCGGAAAATGGGAATTCTAGAGGGTCTCCTAAGAGACCGTTTGGATCCTCTGATAATTGGCAACCAAGTGGGTCCTCAACGACTTTTGATTTGCCTG
AGGAAATGATTTTTGAAGAGGCTCAGTTCTGGGTTCAATTCCATAACATACCTCTTGGCCTTAGGAACGAAATGGTAGCACAAACTCTTGGAAGCCGTCTAGGGATAGTT
CAGAGAGTAGAAACTAACGATGATGATGAATGTTGGGGACGTTTTCTTCGAGTACGAATCAAGATGAAGATAAACGAACCATTGTGCCGAGGTCTAACTCTGAATGGCGG
ACCAAAGGGGAAAATTTTGGTAGCGATAAAGTATGAGAGATTACCGGATTTTTGTTACTATTGTGGAATGATTGACCACAATGATGGATCGTGCCCCCAACTTAAATTAC
AGACGAAACCAGAGAAGCAGTTTGGTTCATGGATGAGAGCTGCAACTCCTCCAAGACCGAACCGTTCCCACGCAAGCGGCAGAGGTTTCGATGGATTTAGAGGGGGAAGA
GGACGTTTTGGTAACAGACAAAGGAGCAGGCAACCATGGTGTTTCAATGAGGAGGAAGAAGAAGCCAACCATGGAGATTCAAGAGGTTCTCAAAGCTCCGACGGAAGTGG
TGATGAACATCAACCTGAATCCAATCGCCGGCAGCCTGAACTCGAGGACTCTCCACCGCAATCAATGCAATCAACGGAAATTACGGGTATTAAGTGTCCCCTTTCCTTTG
ATAACGTTACTAGTGTGGATTCTCTCCCTATTAATGCAGAAGACGCAAATCACGGCAGAGAGGAAGGAGAAATTAATGGCGGCAACGTTTTAGGGAGGTTTCAAAATCGT
GAGGCTACACAGGGTGAAGGGAAGAGACGAAACGGCAAGGAGATCTTAGGGCAAGGAGGGTTAGTGGAGAAAAATATGTTTGTTGGGCCCATGTCTAATAATATCACCCT
TTCAGCCCATTTACATGTTCCAACCGAAAATCCACCCAATAGCCACCAAACTAAAATTCCAAAATCCATAACCTCATGCAAAATGATCCAAACAGATATTTTTCCAACTA
AAGACCAGCCCATGATCCACCAAACCAAAATTCCAATTTTCACCCAGTTGGATTCTAGTCTGAATAGCCCATCAAGCAAAATCCAACACCACGTACAAACTAATCCTCGG
ACGTCAAGCACAGACCACTCTCATATAGAAGACATGATGATTGAAGACCTCCCCAAAATTGACCACATTCCGACAGAAATTAATAAGAATGTGGACTTTGCGGAATTTCA
GACTCAAATGGATTTTAATCCTGGGGGCTTGGCGGATGGAAAAGGGAAGAGTGTAGCCACGGAAAAAACAAGTGTTTGTGAGAAAAAGCAATTAAAAAGCTGGAAGCGAG
CCAATAGACAGAAGGATGAAATCAAGATGGATCACCCCTCCTTTCAACTACCAGAGCAGTTTGCTTACAAAAAGAGGGCGGCGGAAGATGATGATTTGGACAGAGATCTA
AATGCAATAACCCAAGTAAATGAAAAGGATGGAGGAGGAGAATTTGAGAGATTACATAGCCAACAATTTTTAAATGCCTTAAATGATTGCGCTCTTCGAGATCTTGGTTT
CAAGGGCAACCGATTCACATGGCAGAATAATCAACCTGGAACAGCCTTTATTCGAAAAAGGCTAGACCGGTGTGATGACGTGGAAAAAATTCAGGACATTCCAATTGCAG
GTGATAGCCTCCCGGACACTTTTTTTTGGCATTATGACAAAACAGGGAAGTATTCAGTCCGAAGCGGTTACAAACTTGCAATAGATGAACTTCATGAGGCGTCCTCTTCA
GCGGATTCCAATAGTTGTTCTTGGTGGAAAACACTATGGCTGCTAAACATCCCAAACAAACTCAAGATCTTCGCTTGGCGGGCAAGCCTGAATATACTCCCGACCAGAAT
GAATCTTTTCAACCGTAAAGTTCAATGCGACCCCCGTTGTCCAAAGTGTGGGAAAGCGAATGAAACAACTCTTCATGCTTTGTGGGATTGTAAGAAGGTAAAAAAGCACT
GGGCCCTCTGGAGCCCAGTGCAATCTTTTCTCAATACACATTGTGATGATACTCTTGATCTTTTTTGTAGGTTTCAGGAGAAATTAAGGTCCGATGACTTGGCAATCTTT
GTGGCCATTCTATGGTCGTTATGGAACACTAGAAATGAAATTGTTTTCCAGGGCAACCGAATCTATCACCAACAGGAATATGATCCTATAGAACGGGCATCAACTTATAT
AACAGAGTTCGTGGAGGCTCAAGCAAAGCACCAAGGAGACTGTGATCGATCAAGAGATACTAGAACCTCCCGCCCAACGGAGTGGACACCGCCGCCTCCCAACCATTTCA
AACTAAACGTGGATGCATCCTTAGCAAATGGTGAGATGGGAACATGCATGATTATCAGAAACTCAAATGGTCAAACGATGGCAGCTGCAGAAAGTTTCAGGAGAACGAGT
TGCTCTGTAGAGTGGGCAGAAGCAAAGGCACTGGTGGAAGGTGTGCAACTAGCTTTGGATTCGGGCTTATCACCAGTTTGGGCAGAGATAGATTCCAAAATTGTGTGGAA
TTTACTCCATGATCGCGAACAACATTTAAATGAGATTGCCCCCCTAATCCACCAACTCCATCTTTTGGCCACAAAGCACCTTATAAGTGGGTTTCTCCTAACTCCGAGGG
AAGGCAACAAAGTAGCACACCAGTTAGCTACTCATGCTCGCACAACAAAACATTCTGAAGTCTGGCTTGAAGAATGTCCTCTGTGGGCTCAAAGCAGTTTAGCTCATGAT
TTGCGTCATATGGACGCCCTAGCCATTTCTTTTGATTCGTTGTAG
Protein sequenceShow/hide protein sequence
MAAAALSSWLCSSGAKSKLVRIVHPGGHIELHDVPVPAAEIMLRNPQFRLTHSQSYHQPWAIVSPDSTLMLGRKYYLLPENAVGNFQKRPTPPENEDLESNCFSGGNCMM
FLASNEKGERIVRSENGNSCREKSESGEDLEGNRVKRRGVFSENGNSRGSPKRPFGSSDNWQPSGSSTTFDLPEEMIFEEAQFWVQFHNIPLGLRNEMVAQTLGSRLGIV
QRVETNDDDECWGRFLRVRIKMKINEPLCRGLTLNGGPKGKILVAIKYERLPDFCYYCGMIDHNDGSCPQLKLQTKPEKQFGSWMRAATPPRPNRSHASGRGFDGFRGGR
GRFGNRQRSRQPWCFNEEEEEANHGDSRGSQSSDGSGDEHQPESNRRQPELEDSPPQSMQSTEITGIKCPLSFDNVTSVDSLPINAEDANHGREEGEINGGNVLGRFQNR
EATQGEGKRRNGKEILGQGGLVEKNMFVGPMSNNITLSAHLHVPTENPPNSHQTKIPKSITSCKMIQTDIFPTKDQPMIHQTKIPIFTQLDSSLNSPSSKIQHHVQTNPR
TSSTDHSHIEDMMIEDLPKIDHIPTEINKNVDFAEFQTQMDFNPGGLADGKGKSVATEKTSVCEKKQLKSWKRANRQKDEIKMDHPSFQLPEQFAYKKRAAEDDDLDRDL
NAITQVNEKDGGGEFERLHSQQFLNALNDCALRDLGFKGNRFTWQNNQPGTAFIRKRLDRCDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYSVRSGYKLAIDELHEASSS
ADSNSCSWWKTLWLLNIPNKLKIFAWRASLNILPTRMNLFNRKVQCDPRCPKCGKANETTLHALWDCKKVKKHWALWSPVQSFLNTHCDDTLDLFCRFQEKLRSDDLAIF
VAILWSLWNTRNEIVFQGNRIYHQQEYDPIERASTYITEFVEAQAKHQGDCDRSRDTRTSRPTEWTPPPPNHFKLNVDASLANGEMGTCMIIRNSNGQTMAAAESFRRTS
CSVEWAEAKALVEGVQLALDSGLSPVWAEIDSKIVWNLLHDREQHLNEIAPLIHQLHLLATKHLISGFLLTPREGNKVAHQLATHARTTKHSEVWLEECPLWAQSSLAHD
LRHMDALAISFDSL